data-transformation

Star

Here are 421 public repositories matching this topic...

mahmoud / glom

Star

☄️ Python's nested data operator (and CLI), for all your declarative restructuring needs. Got data? Glom it! ☄️

python cli data dictionaries utilities declarative data-transformation nested-structures recursion apis

Updated Jan 30, 2024
Python

hi-primus / optimus

Star

🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark

data-science machine-learning spark bigdata data-transformation pyspark data-extraction data-analysis data-wrangling dask data-exploration data-preparation data-cleaning data-profiling data-cleansing big-data-cleaning data-cleaner cudf dask-cudf

Updated May 20, 2024
Python

2ndQuadrant / pglogical

Star

Logical Replication extension for PostgreSQL 15, 14, 13, 12, 11, 10, 9.6, 9.5, 9.4 (Postgres), providing much faster replication than Slony, Bucardo or Londiste, as well as cross-version upgrades.

subscription replication etl zero-downtime postgresql data-transformation publish-subscribe cdc logical-decoding data-transport database-replication

Updated May 18, 2024
C

zinggAI / zingg

Star

Scalable identity resolution, entity resolution, data mastering and deduplication using ML

Updated May 21, 2024
Java

mattt / TransformerKit

Star

A block-based API for NSValueTransformer, with a growing collection of useful examples.

swift objective-c data-transformation nsvaluetransformer

Updated Oct 1, 2021
Objective-C

raystack / optimus

Star

Optimus is an easy-to-use, reliable, and performant workflow orchestrator for data transformation, data modeling, pipelines, and data quality management.

golang bigquery airflow automation etl analytics data-transformation data-warehouse business-intelligence dataops elt workflows data-pipelines data-modelling analytics-engineering

Updated Oct 30, 2023
Go

Microsoft Program Synthesis using Examples SDK is a framework of technologies for the automatic generation of programs from input-output examples. This repo includes samples and sample data for the Microsoft Program Synthesis using Example SDK.

microsoft sdk csharp dotnet examples prose data-transformation program-synthesis synthesis data-wrangling

Updated Mar 26, 2024
C#

ScriptFUSION / Porter

Star

💄 Durable and asynchronous data imports for consuming data at scale and publishing testable SDKs.

library framework asynchronous php-development scalability fibers porter data-import data-transformation abstraction durability

Updated Aug 3, 2023
PHP

SebKrantz / collapse

Sponsor

Star

Advanced and Fast Data Transformation in R

data-science cran r statistics time-series high-performance data-transformation scientific-computing econometrics rstats data-analysis data-manipulation data-processing weights panel-data weighted data-aggregation

Updated May 20, 2024
C

dbohdan / sqawk

Star

Like awk but with SQL and table joins

cli tsv converter json csv sql delimited-files data-transformation awk data-wrangling

Updated May 10, 2024
Tcl

jupyter-naas / naas

Sponsor

Star

Low-code Python library to safely use notebooks in production: schedule workflows, generate assets, trigger webhooks, send notifications, build pipelines, manage secrets (Cloud-only)