RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
-
Updated
May 21, 2024 - Python
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
set of Data Science and Machine Learning tools
A repo containing the NLP pre-processing pipeline which cleans the EuroPages scraped products and services
PUMI: neuroimaging Pipelines Using Modular workflow Integration
🌈📊📈 The Zillow Home Value Prediction project employs linear regression models on Kaggle datasets to forecast house prices. 📉💰Using Apache Spark (PySpark) within a Docker setup enables efficient data preprocessing, exploration, analysis, visualization, and model building with distributed computing for parallel computation.
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Our streamlined Streamlit web app fetches and processes ESPN CricInfo data delivering dynamic graphs for a quick and engaging cricket experience. Deployed on AWS EC2 with CI/CD pipelines.
Welcome to the Credit Card Fraud Detection project repository. This project leverages machine learning techniques to identify fraudulent credit card transactions in a dataset
An open-source library for recognition of speech commands in the user dictionary using audiovisual data of the speaker
Data pre-processing with modular components for: normalizer/standarizer, unbiaser, trimmer and feature selector.
A simple ASCII format to represent music scores, and a music score editor
Pre- and post-processing Python library for TOUGH
"This repository hosts an implementation of the Singular Value Decomposition (SVD) algorithm tailored for data mining tasks. SVD is utilized for efficient dimensionality reduction, aiding in the extraction of key patterns and features from large and complex datasets."
Brain tumor classification using CNN and ML Techniques
Automated Time Series Forecasting
Automagic
Reports for the course "Elaborazione di dati scientifici"
log data pre processing in python
A tool for processing the input data for the DRYP hydrological model.
Add a description, image, and links to the preprocessing topic page so that developers can more easily learn about it.
To associate your repository with the preprocessing topic, visit your repo's landing page and select "manage topics."