Dolt – Git for Data
-
Updated
Jun 8, 2024 - Go
Dolt – Git for Data
Quilt is a data mesh for connecting people with actionable data
lakeFS - Data version control for your data lake | Git for data
🦉 ML Experiments and Data Management with Git
Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
Data version control with Makefile and DVC
An Git-like version control file system for data lineage & data collaboration.
Create, visualize, run & benchmark DVC pipelines in Python & Jupyter notebooks.
In this repository, an ML-Ops task is undertaken to practice configuring and storing data using DVC on GitHub. The goal is to explore how DVC seamlessly integrates for efficient data management, enhancing reproducibility and scalability in machine learning workflows.
Metrics Observability & Troubleshooting
A machine learning pipeline taking you from raw data to fully trained machine learning model - from data to model (d2m).
Python Data as Code core implementation
create a robust, simple, effecient, and modern end to end ML Batch Serving Pipeline Using set of modern open-source/free Platforms/Tools
Playground for learning DVC
Data version control for reproducible analysis pipelines in R with {targets}.
Declaratively create, transform, manage and version ML datasets.
sgr (command line client for Splitgraph) and the splitgraph Python library
Python framework for artificial text detection: NLP approaches to compare natural text against generated by neural networks.
Add a description, image, and links to the data-version-control topic page so that developers can more easily learn about it.
To associate your repository with the data-version-control topic, visit your repo's landing page and select "manage topics."