Skip to content

Data pipeline that processes Formula1 data with Azure Databricks, DeltaLake, and Azure Data Factory

Notifications You must be signed in to change notification settings

khoinguyen19k8/formula1

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

31 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Introduction

The project processes Formula 1 data from 1950 - 2021 that supports both full and incremental load into a medallion architecture. Data is extracted to loaded into Azure Data Lake Gen 2 between layers. Delta Lake is utilized to provide ACID properties to Spark Tables. The pipeline is orchestrated using Azure Data Factory.

Architecture

Alt text

About

Data pipeline that processes Formula1 data with Azure Databricks, DeltaLake, and Azure Data Factory

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages