Generic framework for historical document processing
-
Updated
Jan 9, 2020 - Python
Generic framework for historical document processing
Proof of concept of training a simple Region Classifier using PdfPig and ML.NET (LightGBM). The objective is to classify each text block in a pdf document page as either title, text, list, table and image.
Detectron2 for Document Layout Analysis
Tools for extract figure, table, text, .. from a pdf document.
Awesome historical newspaper analysis tools and literature
document layout analysis results
ICDAR 2019: MaskRCNN on PubLayNet datasets. Paragraph detection, table detection, figure detection,...
Project for Deep Learning and its application
An end to end deep learning approach to extract information from shipping records
Learning to Sort Handwritten Text Lines in Reading Order through Estimated Binary Order Relations
Vision Based Document Layout Detection, Segmentation and context classification using MaskRCNN on Tensorflow-Keras, PyTorch & Detectron2.
GloSAT Historical Measurement Table Dataset
Simple docker deployment of document layout analysis using detectron2
BoundaryNet - A Semi-Automatic Layout Annotation Tool
Page to PAGE Layout Analysis Tool
Get the number of columns for a document image
Proof of concept of a simple SVM Region Classifier using PdfPig and Accord.Net. The objective is to classify each text block in a pdf document page as either title, text, list, table and image.
A Bottom-Up Instance Segmentation Strategy for segmenting document instances using Transformers
Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset
Add a description, image, and links to the document-layout-analysis topic page so that developers can more easily learn about it.
To associate your repository with the document-layout-analysis topic, visit your repo's landing page and select "manage topics."