PDFIntellect: Smart PDF Data Retrieval

Introduction

PDFIntellect is a Streamlit app designed for smart PDF data retrieval. This app leverages Language Models (LLMs) to efficiently extract valuable information from PDF documents.

Features

Advanced PDF parsing.
Integration with pre-trained Language Models.
Customizable cascading LLMs.
Intelligent short answer generation.

Requirements

Python 3.7 or higher.
Streamlit, transformers, torch, and pdfplumber libraries.
Access to pre-trained Language Models.
PDF parsing libraries.
PDF documents for extraction.

Usage

Clone the repository.

git clone  https://github.com/jaywyawhare/PDFIntellect

Install the required libraries.
```
pip install -r requirements.txt
```
Run the Streamlit app.
```
streamlit run app.py
```

App will be available at http://localhost:8501.

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

License

This project is licensed under the Licence license.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
LICENCE		LICENCE
README.md		README.md
mvp.ipynb		mvp.ipynb
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LICENCE

LICENCE

README.md

README.md

mvp.ipynb

mvp.ipynb

requirements.txt

requirements.txt

Repository files navigation

PDFIntellect: Smart PDF Data Retrieval

Introduction

Features

Requirements

Usage

Contributing

License

About

Languages

License

jaywyawhare/PDFIntellect

Folders and files

Latest commit

History

Repository files navigation

PDFIntellect: Smart PDF Data Retrieval

Introduction

Features

Requirements

Usage

Contributing

License

About

Topics

Resources

License

Stars

Watchers

Forks

Languages