💸A python module for building portfolio assessment pipeline
-
Updated
Apr 20, 2018 - Jupyter Notebook
💸A python module for building portfolio assessment pipeline
This is a basic example of using a pipeline in data science.
This is the data pipeline for the url-shortner application. Deprecated in favor of https://github.com/Dukes-Wine-Co/request-parsing-api
ETL pipeline with PySpark on Dataproc for data lake on Google Cloud Storage
An easy to use, reliable and well designed python module that domain experts and data scientists can use to fetch, visualise, and transform publicly available satellite and LIDAR data.
This project aims to do exploratory data analysis of the listings on Airbnb NYC 2019 Dataset (48895, 16). Airbnb has a global reach, and data analysis plays a crucial role in its operations.
Checking the scalability of a data pipeline involving MySQL, Spark and Machine Learning Models using Latency.
This repository contains code for comparing the performance of three different ELT (Extract, Load, Transform) methods on CSV files of different sizes. The three methods are implemented in Python using different approaches and libraries, and their execution times are compared and plotted for analysis.
House prices dataset exploration and prediction. Workflow includes useful examples of Tensorflow pipelines including k-Nearest Neighbors imputer, Decision Tree Regression and XGBoost Regression
A custom Airbyte connector to fetch football data from the Football-Data.org API. It allows users to retrieve match results, league tables, and player statistics for specific leagues, making it a versatile tool for football data analysis.
Optimizing offers/discounts send to Starbucks Clients using Machine Learning models on historical data.
POC in Apache Kafka and Spark Streaming using Avro serialization.
ETL pipeline with AWS Redshift orchestrated with Airflow
Codes for data flow between models, data post-process, and visualization
Udacity Data Engineering Nanodegree - Project #2
Short course: Introduction to Machine Learning
Transformation airbnb data set using dbt and snowflake, then visualizing data using preset
Data pipeline to gather data from chess website APIs using Airflow.
Исследование продаж компьютерных игр
Add a description, image, and links to the data-pipeline topic page so that developers can more easily learn about it.
To associate your repository with the data-pipeline topic, visit your repo's landing page and select "manage topics."