A realtime and indexing and structured extraction engine for Unstructured Data to build Generative AI Applications
-
Updated
Jun 8, 2024 - Rust
A realtime and indexing and structured extraction engine for Unstructured Data to build Generative AI Applications
Build and deploy a fully-featured, observable, user-facing RAG backend in minutes.
An implementation of the TaxRetrievalBenchmark task for the 🤗 Massive Text Embedding Benchmark (MTEB) framework.
Palladian is a Java-based toolkit with functionality for text processing, classification, information extraction, and data retrieval from the Web.
Fast, Accurate, Lightweight Python library to make State of the Art Embedding
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
MTEB: Massive Text Embedding Benchmark
All-in-One: Text Embedding, Retrieval, Rerank and RAG
Domain Adapted Language Modeling Toolkit - E2E RAG
Customizable Case-Based Reasoning (CBR) toolkit for Python with a built-in API and CLI.
Generative Representational Instruction Tuning
Retrieval Augmented Generative Engine
Atmospheric data Community Toolkit - A python based toolkit for exploring and analyzing time series atmospheric datasets
Epsilla is a high performance Vector Database Management System. Try out hosted Epsilla at https://cloud.epsilla.com/
Cottontail DB is a column store vector database aimed at multimedia retrieval. It allows for classical boolean as well as vector-space retrieval (nearest neighbour search) used in similarity search using a unified data and query model.
Neural Search
The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
Add a description, image, and links to the retrieval topic page so that developers can more easily learn about it.
To associate your repository with the retrieval topic, visit your repo's landing page and select "manage topics."