Reading the Avro files created by Event Hubs using Spark
Updated Mar 19, 2019 - Scala
Dublin Bus Trips Web App & Database
Notebook with data ETL use cases with Spark
A midterm on breadth-first search, MapReduce, and PySpark transformations
An Apache Spark course based on Spark: The Definitive Guide
I have forked this template to implement an end-to-end machine learning life cycle on the Databricks Lakehouse
Real estate sales predictions and analytics
Implementation of the "CCF: Fast and Scalable Connected Component Computation in MapReduce" paper with Spark. Study of its scalability on several datasets using various clusters' sizes on Databricks and Google Cloud Platform (GCP)
Data pipeline that processes Formula 1 data with Azure Databricks, Delta Lake, and Azure Data Factory
Ingestion of Olist data in CSV format into the Raw, Bronze, Silver, and Gold layers
In this repository we will work on data processing using Spark.
Batch and streaming data pipelines built on Databricks with PySpark, using Formula 1 racing data from multiple data sources and APIs, modeled into a star schema for analysis in Power BI.
This Repo Contains Azure Data Engineering Projects
Terraform module for creation of Databricks Unity Catalog Volumes
This repository contains big data mini-projects.
Summer Fellowship Project.
Templates for the azbasespace connector (Azure to Illumina BaseSpace)