tesseract

Here are 1,085 public repositories matching this topic...

Akylas / OSS-DocumentScanner

Android document scanning app

android pdf opencv scanner image-processing tesseract document document-scanner document-scanner-app document-scanning document-scan document-scan-to-text zxingcpp

Updated Jun 3, 2024
C++

hertzg / tesseract-server

A small lightweight HTTP server that converts photos, images and scanned documents to text using optical character recognition by utilizing the power of Google Tesseract.

api docker typescript ocr docker-compose containers rest-api docker-image container image-processing tesseract http-server hacktoberfest tesseract-server

Updated Jun 3, 2024
TypeScript

koreader / koreader-base

Base framework offering a Lua scriptable environment for creating document readers

emulator pdf lua ubuntu sdl ffi luajit tesseract epub mupdf leptonica djvu koreader

Updated Jun 3, 2024
Lua

stscoundrel / old-danish-dictionary-builder

Build "Dictionary of the Old Danish Language" into easier-to-use data formats

kotlin python typescript spring-boot tesseract medieval-studies danish-language medieval-languages old-danish otto-kalkar

Updated Jun 2, 2024
Python

goksenpasli / GpScanner

TWAIN scanner application

pdf scanner wpf tesseract udf eyp win10 twain tarayici win7 win11

Updated Jun 2, 2024
C#

tesseract-ocr / tesseract

Tesseract Open Source OCR Engine (main repository)

machine-learning ocr tesseract lstm tesseract-ocr hacktoberfest ocr-engine

Updated Jun 2, 2024
C++

shelfio / aws-lambda-tesseract

6 MB Tesseract (with English training data) to fit inside AWS Lambda

nodejs ocr aws-lambda serverless npm-package tesseract node-module optical-character-recognition

Updated Jun 2, 2024
Shell

scribeocr / scribeocr

Web interface for recognizing text, proofreading OCR, and creating fully-digitized documents.

ocr abbyy tesseract proofreading

Updated Jun 1, 2024
JavaScript

hamidurrk / epaper-scraper

Web scraper for extracting data from online newspapers

python tesseract asynchronous-programming sqlite3 lxml webscraping cuda-toolkit selenium-python beautifulsoup4 dataminig

Updated Jun 1, 2024
Python

pymupdf / PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

python pdf font data-science ocr tesseract epub mupdf text-processing pdf-documents extract-data table-extraction text-shaping xps pymupdf

Updated Jun 1, 2024
Python

CCExtractor / ccextractor

CCExtractor - Official version maintained by the core team

c rust image ocr video image-processing tesseract subtitles tesseract-ocr dvb teletext hacktoberfest cea-608 cea-708 hacktoberfest2021

Updated Jun 1, 2024
C

ocrmypdf / OCRmyPDF

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

python pdf ocr image-processing tesseract

Updated Jun 1, 2024
Python

JMatoso / Imatex

Optical Character Recognition, Document Image Extractor and Video Downloader

ocr tesseract video-downloader youtube-downloader image-extractor blazor tiktok-downloader

Updated Jun 1, 2024
C#

ndavd / ncube

A Generalized Hypercube Visualizer

rust simulation mathematics tesseract hypercube bevy

Updated May 31, 2024
Rust

danpla / dpscreenocr

Program to recognize text on screen

ocr tesseract tesseract-ocr

Updated May 30, 2024
C++

Franky1 / Tesseract-OCR-5-Docker

Docker Image with latest Tesseract OCR Version 5.x.x built from sources

docker ocr tesseract tesseract-ocr tesseract-5

Updated May 29, 2024
Python

jankstar / pydocu

FastAPI server for classifying documents and extracting data

transformers tesseract torch data-extraction document-classification parsing-library bert fastapi

Updated May 28, 2024
Python

SkeathyTomas / genshin_artifact_auxiliary

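A Genshin Impact artifact rater that overlays the game window. Keqing's Desk | Genshin Impact | artifact rating. An artifact export and rating tool integrated on top of the game window: no switching back and forth between the game and outside tools to compare, with results calculated and looked up quickly in-game.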

python ocr tesseract paddleocr genshin-impact pyside6 rapidocr

Updated May 28, 2024
Python

SubhamTyagi / android-ocr

Tesseract-based OCR for Android

android ocr foss tesseract reader fdroid image-reader ocr-android ocr-recognition ocr-text-reader math-ocr

Updated May 28, 2024
Java

nonwill / GoldenDict-OCR

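GoldenDict++: includes many fixes for issues in the official version. Early on, a simple plugin mechanism was added, and several OCR word-capture and audio playback engines were integrated through it. Later, building on usability improvements, the entire implementation was rewritten to improve lookup efficiency, reduce runtime CPU and memory usage, and lower the cost of code maintenance. The future goal is to abstract feature extensions and dictionary-format handling into full plugin implementations, further improving the application's extensibility and maintainability.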

Updated May 28, 2024
