Skip to content

Issues: Unstructured-IO/unstructured

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Milestones
Filter by milestone
Assignee
Filter by who’s assigned
Sort

Issues list

feat/docx-field-codes docx Related to Microsoft Word (.docx) file format enhancement New feature or request
#2944 opened Apr 27, 2024 by erik-squared
Text Extraction Issue: Greek Language PDFs Rendered with Incorrect Alphabet bug Something isn't working ocr Related to optical character recognition (OCR).
#2939 opened Apr 26, 2024 by DarioBernardo
feat/partition_metadata enhancement New feature or request html
#2933 opened Apr 25, 2024 by Falven
Clarify orig_elements documentation documentation Improvements or additions to documentation enhancement New feature or request
#2929 opened Apr 25, 2024 by Marcell-Balint
chore: Update unstructured-client bug Something isn't working
#2924 opened Apr 23, 2024 by Coniferish
infer_table_structure lead Failed to initialize the model bug Something isn't working pdf
#2923 opened Apr 23, 2024 by spongxin
bug/Execution speed is very slow in AWS LAMBDA environment investigating Issues that require more information before they are actionable
#2916 opened Apr 22, 2024 by cds-code
Doc/Docx with Checkboxes docx Related to Microsoft Word (.docx) file format enhancement New feature or request
#2912 opened Apr 19, 2024 by Rob-Smith-HDT
Documentation for Partitioning table for email has wrong class type documentation Improvements or additions to documentation
#2907 opened Apr 19, 2024 by debasisdwivedy
bug: TesseractError: Estimating resolution as X bug Something isn't working ocr Related to optical character recognition (OCR).
#2900 opened Apr 17, 2024 by qued
Documentation for Ingestion of wikipedia documentation Improvements or additions to documentation
#2899 opened Apr 17, 2024 by debasisdwivedy
bug/partition_pdf removes spaces from the text bug Something isn't working pdf
#2896 opened Apr 16, 2024 by christinestraub
bug/executing partition_doc using concurrent futures investigating Issues that require more information before they are actionable
#2891 opened Apr 15, 2024 by salahaz
Unable to import unstructured.partition.xyz bug Something isn't working
#2888 opened Apr 14, 2024 by flaviobrienza
Chunk overlap prefix is on even word boundary >= overlap character count. chunking Related to element chunking. enhancement New feature or request
#2886 opened Apr 12, 2024 by scanny
bug/unexpected kwarg in MongoDB Destination Connector bug Something isn't working
#2878 opened Apr 11, 2024 by ron-unstructured
File Not Found Error nlp/english-words.txt bug Something isn't working
#2859 opened Apr 5, 2024 by taaha3244
bug/partion_pdf import statement not completing execution bug Something isn't working
#2847 opened Apr 4, 2024 by viboognesh
bug/will not pip install unstructured[pdf] on gitpod bug Something isn't working
#2831 opened Apr 2, 2024 by thams
bug/WriteConfig-API-Broken bug Something isn't working ingest
#2698 opened Mar 27, 2024 by jasonbot
feat/ extract style or font for Text elements. enhancement New feature or request
#2695 opened Mar 26, 2024 by LunaticMaestro
ProTip! Type g i on any issue or pull request to go back to the issue listing page.