document-understanding

Here are 27 public repositories matching this topic...

jpWang / LiLT

Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)

nlp information-extraction document-analysis document-understanding multilingual-models document-ai multimodal-pre-trained-model

Updated Oct 31, 2022
Python

irgroup / labelstudio-to-fonduer

Star

This small module connects Label Studio with Fonduer by creating a fonduer labeling function for gold labels from a label studio export. Documentation: https://irgroup.github.io/labelstudio-to-fonduer/

data-annotation knowledge-base-construction document-understanding label-studio fonduer

Updated Feb 14, 2023
Python

uakarsh / TiLT-Implementation

Star

Implementation of the paper: Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer.

deep-learning transformers pytorch-implementation document-understanding pytorch-lightning

Updated Apr 23, 2023
Jupyter Notebook

andreagemelli / doc2graph

Star

Doc2Graph transforms documents into graphs and exploit a GNN to solve several tasks.

nlp deep-learning pytorch layout-analysis geometric-deep-learning table-detection gnn document-understanding key-information-extraction

Updated May 23, 2023
Jupyter Notebook

tstanislawek / awesome-document-understanding

Star

A curated list of resources for Document Understanding (DU) topic

Updated Jun 2, 2023

dhorvay / document-understanding-ebook

Star

(WIP) ✨ A comprehensive resource for understanding the world of software used in the Document Understanding field. 🧙✨

ocr ebook document-understanding document-ai awesome-document-understanding

Updated Jun 23, 2023
Markdown

ExtrieveTechnologies / QuickCapture_IOS

Star

QuickCapture Mobile Scanning SDK Specially designed for native IOS

swift ios objective-c document-classification document-scanner-app document-understanding document-scanning-sdk

Updated Jul 29, 2023
Objective-C

javier-marti-isasi / OCR-free-Document-Understanding-with-Donut-Transformer

Star

This project tackles a real-world challenge of automating client document processing, with a focus on enhancing document classification, error detection, data extraction, and validation.

ocr document-classification document-understanding

Updated Oct 1, 2023
Jupyter Notebook

LynnHaDo / Document-Layout-Analysis

Star

Object Detection Model for Scanned Documents

python object-detection document-understanding yolov8

Updated Oct 4, 2023
Jupyter Notebook

bwnyasse / dart-documentai-samples

Star

A hands-on CLI tool sample showcasing the integration of Dart with Google Cloud's DocumentAI.

dart machine-learning google-cloud samples dartlang document-understanding document-ai

Updated Nov 18, 2023
Dart

mycielski / textract_study

Star

Analysing expense reports/invoices with AWS Textract and boto3.

shell aws script invoices aws-cli expenses boto3 textract document-understanding

Updated Nov 27, 2023
Python

LynnHaDo / Checkbox-Detection

Star

Checkbox Detection Model for Scanned Documents

python computer-vision deep-learning object-detection copy-paste document-understanding yolov8

Updated Jan 25, 2024
Jupyter Notebook

NExTplusplus / TAT-DQA

Star

TAT-DQA: Towards Complex Document Understanding By Discrete Reasoning

vqa question-answering document-understanding

Updated Mar 1, 2024

ExtrieveTechnologies / QuickCapture_Android

Star

QuickCapture Mobile Scanning SDK Specially designed for native ANDROID from Extrieve

android java document-scanner kotllin document-scanner-app document-understanding document-scanning-sdk

Updated Mar 13, 2024
Kotlin

SCUT-DLVCLab / RFUND

Star

Official release of RFUND introduced in the paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction" (arXiv:2401.03472).

ocr document-understanding key-information-extraction document-ai visual-information-extraction

Updated Mar 22, 2024

microsoft / CompHRDoc

Star

Datasets and Evaluation Scripts for CompHRDoc

document-understanding document-structure-analysis rag-related

Updated Mar 28, 2024
Python

huggingface / chug

Star

Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.

computer-vision pdf-document datasets distributed-training dataloading document-understanding multi-modal-learning webdataset

Updated Apr 3, 2024
Python

jacobmarks / pytesseract-ocr-plugin

Star

Run optical character recognition with PyTesseract from the FiftyOne App!

python plugin nlp ocr computer-vision tesseract tesseract-ocr document-understanding fiftyone

Updated Apr 5, 2024
Python

AlibabaResearch / AdvancedLiterateMachinery

Star

A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.

ocr computer-vision artificial-intelligence text-recognition document text-detection document-analysis end-to-end-ocr multimodal scene-text-recognition multimodal-deep-learning scene-text-detection vision-language document-understanding scene-text-detection-recognition document-recognition document-intelligence documentai vision-language-transformer vision-language-model

Updated Apr 23, 2024
C++

wenwenyu / PICK-pytorch

Star

Code for the paper "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks" (ICPR 2020)

document-analysis graph-convolutional-network graph-learning graph-neural-networks document-understanding key-information-extraction

Updated May 3, 2024
Python

Improve this page

Add a description, image, and links to the document-understanding topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the document-understanding topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

document-understanding

Here are 27 public repositories matching this topic...

jpWang / LiLT

irgroup / labelstudio-to-fonduer

uakarsh / TiLT-Implementation

andreagemelli / doc2graph

tstanislawek / awesome-document-understanding

dhorvay / document-understanding-ebook

ExtrieveTechnologies / QuickCapture_IOS

javier-marti-isasi / OCR-free-Document-Understanding-with-Donut-Transformer

LynnHaDo / Document-Layout-Analysis

bwnyasse / dart-documentai-samples

mycielski / textract_study

LynnHaDo / Checkbox-Detection

NExTplusplus / TAT-DQA

ExtrieveTechnologies / QuickCapture_Android

SCUT-DLVCLab / RFUND

microsoft / CompHRDoc

huggingface / chug

jacobmarks / pytesseract-ocr-plugin

AlibabaResearch / AdvancedLiterateMachinery

wenwenyu / PICK-pytorch

Improve this page

Add this topic to your repo