Load data from the Million Song Dataset into a final dimensional model in RedShift utilizing Apache Airflow.
-
Updated
Jun 2, 2020 - Python
Load data from the Million Song Dataset into a final dimensional model in RedShift utilizing Apache Airflow.
Cassandra ETL Pipeline
This is a project based on the Data Engineering Coding Challenge of Verve Company.
BigQuery data pipeline with dbt, Spark, Docker, Airflow, Terraform, GCP
Proof of concept to manage data warehouse data transformations
Repo for tracking content related to DBT cloud
Simplified blueprints for building data pipelines with dbt Cloud.
Simplified blueprints for building data pipelines with Domo.
Data Warehouse with ELT pipelines
ELT for New York City (NYC) Collision Dataset
Trata-se de um processo de ELT (Extração, Carga e Transformação) que integra um sistema legado com um banco de dados relacional (no exemplo, um MySQL) para um banco NoSQL (ElasticSearch) sem alterações significativas nos dados transferidos.
Project for IBM Data Engineering & Python course on Linux & Shell Scripts -- Wrote and executed bash scripts to manipulate folders and files to create a full directory backup with automation using crontab
Collection of data Extract, Transform, Load
Add a description, image, and links to the elt topic page so that developers can more easily learn about it.
To associate your repository with the elt topic, visit your repo's landing page and select "manage topics."