Skip to content

Releases: Unstructured-IO/unstructured-api

0.0.68

12 May 18:51
20238c2
Compare
Choose a tag to compare

0.0.68

  • Fix list params such as extract_image_block_types not working via the python/js clients

0.0.67

  • Allow for a different server port with the PORT variable
  • Change pdf_infer_table_structure parameter from being disabled in auto strategy.

0.0.66

  • Add support for unique_element_ids parameter.
  • Add max lifetime, via MAX_LIFETIME_SECONDS env-var, to API containers
  • Bump unstructured to 0.13.5
  • Change default values for pdf_infer_table_structure and skip_infer_table_types. Mark pdf_infer_table_structure deprecated.
  • Add support for the starting_page_number param.

0.0.65

13 Mar 19:43
ba1010d
Compare
Choose a tag to compare

0.0.65

  • Bump unstructured to 0.12.4
  • Add support for both list[str] and str input formats for ocr_languages parameter
  • Adds support for additional MIME types from unstructured
  • Document the support for gzip files and add additional testing

0.0.64

  • Bump Pydantic to 2.5.x and remove it from explicit dependencies list (will be managed by fastapi)
  • Introduce Form params description in the code, which will form openapi and swagger documentation
  • Roll back some openapi customizations
  • Keep backward compatibility for passing parameters in form of list[str] (will not be shown in the documentation)

0.0.63

  • Bump unstructured to 0.12.2
  • Fix bug that ignored combine_under_n_chars chunking option argument.

0.0.61

04 Jan 15:09
4ffd8bc
Compare
Choose a tag to compare
  • Bump unstructured to 0.11.6
  • Handle invalid hi_res_model_name kwarg

0.0.59

20 Nov 20:41
6c01cb9
Compare
Choose a tag to compare

0.0.59

  • Bump unstructured to 0.11.0

0.0.58

16 Nov 18:16
ab7bc82
Compare
Choose a tag to compare

0.0.58

  • Bump unstructured to 0.10.30

0.0.57

07 Nov 21:47
daa6ee0
Compare
Choose a tag to compare

0.0.57

  • Make sure multipage_sections param defaults to true as per the readme
  • Bump unstructured to 0.10.29

0.0.56

03 Nov 21:24
5e04b1c
Compare
Choose a tag to compare

0.0.56

  • Add max_characters param for chunking This param gives users additional control to "chunk" elements into larger or smaller CompositeElements
  • Bump unstructured to 0.10.28
  • Make sure chipperv2 is called whien hi_res_model_name==chipper

0.0.55

26 Oct 20:32
c91d1b9
Compare
Choose a tag to compare
  • Bump unstructured to 0.10.26
  • Bring parent_id metadata field back after fixing a backwards compatibility bug
  • Restrict Chipper usage to one at a time. The model is very resource intense, and this will prevent issues while we improve it.

0.0.54

23 Oct 16:58
13c1760
Compare
Choose a tag to compare
  • Bump unstructured to 0.10.25
  • Use a generator when splitting pdfs in parallel mode
  • Add a default memory minimum for 503 check
  • Fix an UnboundLocalError when an invalid docx file is caught

0.0.53

16 Oct 18:31
c0b945e
Compare
Choose a tag to compare
  • Bump unstructured to 0.10.23
  • Simplify the error message for BadZipFile errors