Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
-
Updated
May 19, 2024 - Python
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
🧙 Build, run, and manage data pipelines for integrating and transforming data.
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
Open-source BI for engineers
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
Database replication platform that leverages change data capture. Stream production data from databases to your data warehouse (Snowflake, BigQuery, Redshift) in real-time.
Efficient data transformation and modeling framework that is backwards compatible with dbt.
SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.
Flink CDC is a streaming data integration tool
The open source high performance ELT framework powered by Apache Arrow
PyAirbyte brings the power of Airbyte to every Python developer.
DataForge helps data teams write functional transformation pipelines by leveraging software engineering principles
Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.
Add a description, image, and links to the elt topic page so that developers can more easily learn about it.
To associate your repository with the elt topic, visit your repo's landing page and select "manage topics."