apache-beam
Here are 245 public repositories matching this topic...
Serverless data ingest pipeline on Google Cloud Platform
Updated Dec 5, 2023 - Java
Pipeline for data ingestion and processing using Apache Beam
Updated Sep 28, 2021 - Python
Development of a data pipeline using Apache Beam to orchestrate the flow and Python to capture and process the data. Once the data was refined, the Pandas and Matplotlib libraries were used to carry out an exploratory data analysis.
Updated Feb 26, 2023 - Jupyter Notebook
Adtech log processing pipeline with Apache Beam, Cloud Dataflow, Java, and Protocol Buffers, with data analysis in BigQuery.
Updated Jun 11, 2021 - Java
This video presents a real-world use case developed with Apache Beam Java and launched with the serverless Dataflow runner on Google Cloud Platform. The job reads a JSON file from Cloud Storage, applies some transformations, and writes the result to a BigQuery table.
Updated Apr 27, 2023 - Java
The scripts in this repo will build the Apache Beam Java SDK packages, using Cloud Build and Artifact Registry, for a personal Beam fork.
Updated Feb 20, 2024 - HCL
Efficient Python data pipeline using Apache Beam and Google Cloud Dataflow to update a bucket with daily instrument prices extracted from the BMF website, serving as input for other data pipelines. The code generates a Dataflow template, which is then scheduled to run periodically using Cloud Scheduler and Cloud Functions.
Updated Feb 28, 2024
Evaluating Apache Beam batch, streaming, SQL, etc.
Updated Jan 27, 2019 - Java
Google Dataflow Flex Templates (in Python) for large-scale graph loading with GDS and Apache Arrow
Updated Feb 21, 2023 - Python
Connect Four Data Engineering Project: leveraging GCS for scalable and durable storage, Dataflow for data extraction and transformation, BigQuery as the data repository, Slack Integration for real-time sharing, Looker for insightful reports and visualizations, and Email Scheduler for automated report delivery.
Updated Jan 31, 2024 - Python
Code and notes for learning Apache Beam
Updated May 8, 2019 - Jupyter Notebook
Redshift sink for Apache Beam
Updated Jun 7, 2018 - Java
GCP streaming data pipeline for building energy consumption
Updated Feb 18, 2020 - Python