Skip to content
#

real-time-analytics

Here are 109 public repositories matching this topic...

This project serves as a thorough roadmap for constructing a complete data engineering pipeline. The implementation leverages a resilient technology stack, featuring Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. all components are containerized using Docker.

  • Updated Nov 12, 2023
  • Python

An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Kafka, Apache Zookeeper, Apache Spark, and Cassandra. All components are containerized with Docker for easy deployment and scalability.

  • Updated Nov 1, 2023
  • Python

Improve this page

Add a description, image, and links to the real-time-analytics topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the real-time-analytics topic, visit your repo's landing page and select "manage topics."

Learn more