Read and extract text and other content from PDFs in C# (port of PDFBox)
Updated Jun 2, 2024 - C#
Main repository of the CGPG project for OCR and Text Analysis of the Patrologia Graeca
An Open-Source Python3 tool for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported.
OCR engine for all the languages
Curating a dataset of British patents
YOLO models trained on DocLayNet - power your document intelligence with layout analysis
A toolbox of OCR models, algorithms, and pipelines based on MindSpore
HTR ground truth of the Chi-Know-Po project (Collex Persée)
PdfDet aims to simplify PDF layout detection tasks for users.
A Python package to structure files using visual and style information
A Unified Toolkit for Deep Learning Based Document Image Analysis
Nordrassil is a keyboard layout that provides an elegant and balanced typing experience by its use of a thumb-alpha, emphasis on middle fingers, and de-prioritisation of pinkies.
An official implementation of the paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"
[ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral)
Document layout analysis resources repository for development with PdfPig.
Doc2Graph transforms documents into graphs and exploits a GNN to solve several tasks.
A web application for PDF content and table extraction, featuring image-based visual layout analysis, indexed document search, batch processing and extraction result annotation.
Layout detection implemented with java-yolov8 (Chinese layout detection); java-yolov8 is used to detect the layout of Chinese document images
Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset