🍁 Sycamore is an LLM-powered search and analytics platform for unstructured data.
-
Updated
May 22, 2024 - Python
🍁 Sycamore is an LLM-powered search and analytics platform for unstructured data.
Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database.
cuVS - a library for vector search and clustering on the GPU
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Minimalist web-searching app with an AI assistant that runs directly from your browser. Uses Web-LLM, Ratchet-ML, Wllama and SearXNG. Demo: https://felladrin-minisearch.hf.space
Tevatron - A flexible toolkit for neural retrieval research and development.
Providing enterprise-grade LLM-based development framework, tools, and fine-tuned models.
MTEB: Massive Text Embedding Benchmark
RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing high performance applications.
Copmuter Science pds notes
Apache Lucene open-source search software
Gen-AI Chat for Teams - Think ChatGPT if it had access to your team's unique knowledge.
🔍 LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
Apache Solr open-source search software
A 2024 Reading List for Bilingual Lexicon Induction (BLI) / Word Translation. Frequently Updated.
Improve your Elasticsearch, OpenSearch, Solr, Vectara, Algolia and Custom Search search quality.
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
This repository is the work I write for Information Retrieval algorithm learnt in Dalhousie.
This app allows users to easily query a PDF document using OpenAI's GPT-3 language model in Google Colab, utilizing Google Drive for storage.
Add a description, image, and links to the information-retrieval topic page so that developers can more easily learn about it.
To associate your repository with the information-retrieval topic, visit your repo's landing page and select "manage topics."