Skip to content
#

databricks

Here are 750 public repositories matching this topic...

Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POCs, and other uses in Databricks environments including in Delta Live Tables pipelines

  • Updated May 28, 2024
  • Python

A cutting-edge data project leverages Azure's suite of services to seamlessly transform raw data from GitHub into actionable insights. Using Azure Data Factory for data ingestion, Databricks for PySpark transformations, Synapse Analytics for advanced analysis, and Power BI for intuitive visualization, this project navigates complex data workflows..

  • Updated May 27, 2024
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the databricks topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the databricks topic, visit your repo's landing page and select "manage topics."

Learn more