Android document document scanning app
-
Updated
Jun 3, 2024 - C++
Android document document scanning app
A small lightweight HTTP server that converts photos, images and scanned documents to text using optical character recognition by utilizing the power of Google Tesseract.
Build "Dictionary of the Old Danish Language" into easier-to-use data formats
Tesseract Open Source OCR Engine (main repository)
6 MB Tesseract (with English training data) to fit inside AWS Lambda
Web interface for recognizing text, proofreading OCR, and creating fully-digitized documents.
Web scraper for extracting data from online newspapers
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
CCExtractor - Official version maintained by the core team
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Optical Character Recognition, Document Image Extractor and Video Downloader
A Generalized Hypercube Visualizer
Docker Image with latest Tesseract OCR Version 5.x.x built from sources
fastapi server for classification of documents and extraction of data
Tesseract based OCR for android
GoldenDict++:内置大量的官方版本问题的修正;先期添加了一个简单的插件机制,并基于该机制接入了多个 OCR 划词 和 音频播放 引擎;后期在增强易用性的基础上为提高查询效率、减少运行时 CPU 及 内存 占用、降低代码维护难度,完全重构了所有的实现;将来的目标是将功能扩展和词典格式处理抽象为完整的插件实现,以进一步增强应用的扩展性和可维护性。
Add a description, image, and links to the tesseract topic page so that developers can more easily learn about it.
To associate your repository with the tesseract topic, visit your repo's landing page and select "manage topics."