lakehouse
Here are 74 public repositories matching this topic...
StarRocks, a Linux Foundation project, is a next-generation sub-second MPP OLAP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics, and ad-hoc queries. InfoWorld’s 2023 BOSSIE Award for best open source software.
-
Updated
May 15, 2024 - Java
LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.
-
Updated
May 11, 2024 - Java
YTsaurus is a scalable and fault-tolerant open-source big data platform.
-
Updated
May 14, 2024 - C++
World's most powerful data catalog service with providing a high-performance, geo-distributed and federated metadata lake.
-
Updated
May 15, 2024 - Java
Use SQL to build ELT pipelines on a data lakehouse.
-
Updated
May 25, 2022 - JavaScript
A modern data marketplace that makes collaboration among diverse users (like business, analysts and engineers) easier, increasing efficiency and agility in data projects on AWS.
-
Updated
May 14, 2024 - Python
The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for several lakehouse algorithms, data flows and utilities for Data Products.
-
Updated
Apr 11, 2024 - Python
Examples of using Terraform to deploy Databricks resources
-
Updated
Apr 3, 2024 - HCL
Unified storage framework for the entire machine learning lifecycle
-
Updated
Mar 3, 2024 - Python
A curated list of open source tools used in analytical stacks and data engineering ecosystem
-
Updated
May 7, 2024
Lakehouse storage system benchmark
-
Updated
Feb 22, 2023 - Scala
Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testing.
-
Updated
Sep 2, 2023 - Dockerfile
DeltaOMS is a solution that help build a centralized repository of Delta Transaction logs and associated operational metrics/statistics for your Delta Lakehouse. Unity Catalog supported in the v0.7.0-rc1 release.Documentation here - https://databrickslabs.github.io/delta-oms/v0.7.0-rc1/
-
Updated
Nov 27, 2023 - Scala
Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data
-
Updated
Dec 2, 2023 - Python
Improve this page
Add a description, image, and links to the lakehouse topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the lakehouse topic, visit your repo's landing page and select "manage topics."