Python PDF parser for scientific publications: content and figures
-
Updated
Mar 21, 2024 - Python
Python PDF parser for scientific publications: content and figures
A set of tools to allow PDF to XML conversion, utilising Apache Beam and other tools. The aim of this project is to bring multiple tools together to generate a full XML document.
A tool written in Python to perform a bibliographic analysis of the NIME proceedings archive and other similar corpora.
Grobid module for superconductor material and properties extraction
Training datasets for GROBID sale catalogues models.
ENLIT is a tool that supports scholars in exploring new literature
A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GROBID, LangChain, listen as podcast. Customize your own pipelines.
Python library for serializing GROBID TEI XML to dataclass
Un conteneur docker destiné à l'entraînement de modèles Grobid
A Python CLI program for batch renaming academic article PDFs to their titles.
A set of tools to allow PDF to XML conversion, utilising Apache Beam and other tools. The aim of this project is to bring multiple tools together to generate a full XML document. It is now mainly used for evaluation purpose of external tools.
Final project as Computer Science Student at Telkom University || Stay tune guys at https://skripsi.fanzru.dev.
PaperAnalizer takes research papers an processes them, creating a word cloud based on key words that can be found in the abstract, a list of all the links that can be found in the selected papers and a file that shows the number of figures per paper and the sum of all of them.
Author Entity disambiguation for the new ACL Anthology
Add a description, image, and links to the grobid topic page so that developers can more easily learn about it.
To associate your repository with the grobid topic, visit your repo's landing page and select "manage topics."