pdf-to-text

Standalone .NET Converter library, not require Adobe Acrobat component nor Microsoft Office Interop Assemblies, to convert PDF, DOCX, XLSX, HTML, Image, CSV, RTF, TXT in .NET framework

Updated Nov 5, 2018
C#

zevio / pcu_io

Star

IO management for PCU project

python pdf parser json text pdf-to-text input-output pcu pcu-io json-to-text

Updated Nov 28, 2018
Python

zevio / pcu_pdf

Star

PDF parser component (Apache Tika) for PCU project

python pdf parser component tika apache pdf-to-text pcu pdf-parser-component

Updated Nov 28, 2018
Python

amitbd1508 / Blind-EYE

Star

A book reader with voice control functionality for blind people

windows pdf csharp winforms voice-recognition pdf-to-text voice-assistant

Updated Jun 29, 2020
C#

Academic-Hammer / SciTSR

Star

Table structure recognition dataset of the paper: Complicated Table Structure Recognition

pdf-to-text pdf2txt table-structure-recognition

Updated Jul 7, 2020
Python

pashaq / PdfToText-Converter

Star

Converting the Pdf and Fb2 documents to text or to the list of articles.

pdf csharp lib pdf-to-text itext pdf2txt fb2-to-text

Updated Aug 23, 2020
C#

bytescout / pdfco-rails

Star

PDF.co Gem plugin for Ruby on Rails

ruby rails api pdf parser api-wrapper pdf-files pdf-document pdf-generator pdf-generation pdf-to-text pdf-reader pdf-manipulation pdf-merge pdf-extractor pdf-document-processor

Updated Oct 21, 2020
Ruby

shine-jayakumar / Extract-Data-From-PDF-In-Python

Star

Batch-convert pdf to text, extract data from pdf in python

Updated Sep 29, 2021
Python

mfakca / pdf2text

Star

PDF'leri metne dönüştürür

pdf-converter pdf-to-text

Updated Oct 9, 2021
Roff

selectpdf / selectpdf-api-perl-client

Star

Perl client for SelectPdf Online REST API

html-to-pdf pdf-generator pdf-generation pdf-to-text pdf-merge pdf-generator-api html-to-pdf-converter search-pdf html-to-pdf-api

Updated Nov 17, 2021
Perl

selectpdf / selectpdf-api-ruby-client

Star

Ruby client for SelectPdf Online REST API

html-to-pdf pdf-to-text pdf-merge pdf-api html-to-pdf-api html-to-pdf-ruby

Updated Nov 17, 2021
Ruby

selectpdf / selectpdf-api-nodejs-client

Star

Node.js client for SelectPdf Online REST API

pdf pdf-converter html-to-pdf pdf-to-text pdf-merge html-to-pdf-converter html-to-pdf-api pdf-merge-api pdf-to-text-api

Updated Nov 23, 2021
JavaScript

Directorman9 / Optical-character-recognition

Star

The notebook in this repository uses pytesseract to extract text from a pdf document. The script can be used to automate text acquisition from a large body of printed resources such as books. The acquired text can then be used for dowstream tasks, such as training language models, topic models, document summarization etc

ocr pdf-to-text pytesseract

Updated Apr 30, 2022

NanoNets / ocr-python

Star

OCR library to extract text & tables from PDF files and images. Convert any image or PDF to CSV / TXT / JSON / Searchable PDF.

python pdf ocr tesseract pdf-to-text image-to-text textract pdf-to-csv pdf-to-json searchable-pdf pytesseract-ocr extract-table table-extract image-to-text-converter extract-text-from-image extract-text-from-pdf

Updated Dec 2, 2022
Jupyter Notebook

Improve this page

Add a description, image, and links to the pdf-to-text topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the pdf-to-text topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pdf-to-text

Here are 58 public repositories matching this topic...

orijtech / tikago

madnight / pdf-layout-text-stripper

mic-kul / pdf-textstream

kanishk-mehta / PDFBox-get-Coordinates-of-text

iditect / pdf-tutorial

AshkanAbd / pdf2word-GUI

iditectweb / converter

zevio / pcu_io

zevio / pcu_pdf

amitbd1508 / Blind-EYE

Academic-Hammer / SciTSR

pashaq / PdfToText-Converter

bytescout / pdfco-rails

shine-jayakumar / Extract-Data-From-PDF-In-Python

mfakca / pdf2text

selectpdf / selectpdf-api-perl-client

selectpdf / selectpdf-api-ruby-client

selectpdf / selectpdf-api-nodejs-client

Directorman9 / Optical-character-recognition

NanoNets / ocr-python

Improve this page

Add this topic to your repo