data-ingestion

Here are 128 public repositories matching this topic...

apache / paimon

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.

big-data spark flink real-time-analytics data-ingestion table-store paimon streaming-datalake

Updated May 17, 2024
Java

apache / seatunnel

SeaTunnel is a next-generation, high-performance, distributed tool for massive data integration.

streaming real-time offline high-performance apache batch data-integration elt cdc change-data-capture data-ingestion

Updated May 17, 2024
Java

ConduitIO / conduit-site

documentation data data-integration data-ingestion

Updated May 16, 2024
TypeScript

SneaksAndData / arcane-operator

Kubernetes-native data streaming service based on Akka.NET

c-sharp streaming reactive akka actor data-ingestion arcane

Updated May 16, 2024
C#

robert-koch-institut / mex-drop

MEx data ingestion service

python data-ingestion research-data

Updated May 16, 2024
Python

SneaksAndData / arcane-stream-rest-api

REST API stream for Arcane Streaming Service

c-sharp streaming reactive akka actor data-ingestion arcane

Updated May 15, 2024
C#

Dynatrace / agent-nodejs

Dynatrace agent for PaaS environments

agent paas apm dynatrace rollout data-ingestion oneagent

Updated May 14, 2024
JavaScript

SneaksAndData / arcane-framework

Akka.NET-based framework for data streaming services using the Arcane Kubernetes Operator

c-sharp streaming reactive akka actor data-ingestion arcane

Updated May 13, 2024
C#

ialonsolinares / Data-Ingestion-and-Analysis-using-UK-Police-Crime-API

Data ingestion lab with NiFi, plus data analysis of the UK Police Crime API using PySpark and Pandas

python geospatial-data data-analytics api-rest crime-analysis nifi-processors data-ingestion police-data

Updated May 12, 2024
Jupyter Notebook

daq-tools / skeem

Infer SQL DDL statements from tabular data.

Updated May 13, 2024
Python

AbhishekRS4 / Data_Ingestion_Prefect

Data Ingestion pipeline orchestration with Prefect

data-ingestion prefect workflow-orchestration

Updated May 8, 2024
Python

merantix-momentum / squirrel-core

A Python library that enables ML teams to share, load, and transform data in a collaborative, flexible, and efficient way 🌰

Updated May 16, 2024
Python

abhayvikramnayak98 / CricketDataIngest

Data Ingestion Project on Cricket World Cups 2011-19.

sql pandas web-scraping data-ingestion beautifulsoup4

Updated May 8, 2024
Jupyter Notebook

gabriel-batistuta / amazon-tech-best-sellers

A simple search, extraction, and ingestion system for retrieving the best-selling tech products on Amazon

web-crawler web-scraping data-ingestion amazon-product-price-watcher amazon-product-scraper

Updated May 4, 2024
Python

sbl-sdsc / kg-import

kg-import automates the ingestion of heterogeneous datasets into a Knowledge Graph.

neo4j knowledge-graph data-integration property-graph data-ingestion datasets-preparation

Updated Apr 30, 2024
Jupyter Notebook

bruin-data / ingestr

ingestr is a CLI tool to copy data between any databases with a single command seamlessly.

bigquery postgresql snowflake mssql data-integration data-pipeline data-ingestion copy-database ingestion-pipeline duckdb

Updated Apr 24, 2024
Python

antononcube / Raku-Data-Importers

Various data importing routines with a unified interface (data-import, slurp).

data data-ingestion raku slurp rakulang

Updated Apr 19, 2024
Raku

agutiernc / data-eng-zoomcamp

Data Engineering Zoomcamp 2024

Updated Apr 8, 2024
Jupyter Notebook

pravega / pravega

Pravega - Streaming as a new software-defined storage primitive

streaming distributed-storage real-time-data streaming-data data-ingestion

Updated Apr 2, 2024
Java

linkedin / data-integration-library

The Data Integration Library project provides a library of generic components based on a multi-stage architecture for data ingress and egress.

data-integration data-ingestion gobblin data-ingest data-egress

Updated Mar 28, 2024
Java
