tika
Here are 141 public repositories matching this topic...
A doc searcher of the documents on the local host that is based on: Tika+OCR, ElasticSearch and Kibana
-
Updated
Jan 23, 2021 - Java
WORK IN PROGRESS - Dataiku DSS plugin to extract text data from documents
-
Updated
Jan 11, 2021 - Makefile
The simple monolithic application demonstrates: the extraction of the images of the PDF document pages using Apache Tika, the storage of the images files into the local filesystem, the display of the pages using the ngx-swiper-wrapper library.
-
Updated
May 9, 2023 - Java
Information retrieval system for documents.
-
Updated
Feb 15, 2022 - HTML
Early Buddhist texts from the Tipitaka (Tripitaka). Suttas (sutras) with the Buddha's teachings on mindfulness, insight, wisdom, and meditation.
-
Updated
Jul 6, 2023 - JavaScript
Directory tree metadata parser using Apache Tika
-
Updated
May 3, 2024 - Python
A windows service wrapper for the tika JSR 311 network server.
-
Updated
Jan 29, 2024 - Batchfile
Extracts GPS coordinates from pdf files and Points/Polygons from kmz files to create a master kml file. 🌎
-
Updated
Jul 7, 2021 - HTML
The Information Retrieval Labolatories
-
Updated
Apr 16, 2018 - Java
Information Retrieval system for indexing and searching files stored on disk, with support for Romanian language
-
Updated
Mar 16, 2019 - Java
POC: azure-functions (kotlin, gradle, tika)
-
Updated
Feb 18, 2019 - Kotlin
Improve this page
Add a description, image, and links to the tika topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the tika topic, visit your repo's landing page and select "manage topics."