Skip to content

creative-graphic-design/huggingface-datasets_PubLayNet

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

annotations_creators language language_creators license multilinguality pretty_name size_categories source_datasets tags task_categories task_ids
machine-generated
en
found
cdla-permissive-1.0
monolingual
PubLayNet
original
graphic design
layout-generation
image-classification
image-segmentation
image-to-text
question-answering
other
multiple-choice
token-classification
tabular-to-text
object-detection
table-question-answering
text-classification
table-to-text
multi-label-image-classification
multi-class-image-classification
semantic-segmentation
image-captioning
extractive-qa
closed-domain-qa
multiple-choice-qa
named-entity-recognition

Dataset Card for PubLayNet

CI

Table of Contents

Dataset Description

Dataset Summary

PubLayNet is a dataset for document layout analysis. It contains images of research papers and articles and annotations for various elements in a page such as "text", "list", "figure" etc in these research paper images. The dataset was obtained by automatically matching the XML representations and the content of over 1 million PDF articles that are publicly available on PubMed Central.

Supported Tasks and Leaderboards

[More Information Needed]

Languages

[More Information Needed]

Dataset Structure

Data Instances

import datasets as ds

dataset = ds.load_dataset(
    path="shunk031/PubLayNet",
    decode_rle=True, # True if Run-length Encoding (RLE) is to be decoded and converted to binary mask.
)

Data Fields

[More Information Needed]

Data Splits

[More Information Needed]

Dataset Creation

Curation Rationale

[More Information Needed]

Source Data

[More Information Needed]

Initial Data Collection and Normalization

[More Information Needed]

Who are the source language producers?

[More Information Needed]

Annotations

[More Information Needed]

Annotation process

[More Information Needed]

Who are the annotators?

[More Information Needed]

Personal and Sensitive Information

[More Information Needed]

Considerations for Using the Data

Social Impact of Dataset

[More Information Needed]

Discussion of Biases

[More Information Needed]

Other Known Limitations

[More Information Needed]

Additional Information

Dataset Curators

[More Information Needed]

Licensing Information

Citation Information

@inproceedings{zhong2019publaynet,
  title={Publaynet: largest dataset ever for document layout analysis},
  author={Zhong, Xu and Tang, Jianbin and Yepes, Antonio Jimeno},
  booktitle={2019 International Conference on Document Analysis and Recognition (ICDAR)},
  pages={1015--1022},
  year={2019},
  organization={IEEE}
}

Contributions

Thanks to ibm-aur-nlp/PubLayNet for creating this dataset.