BarbaraJoebstl/data-engineering-nd

Udacity Data Engineering Nanodegree

About

This repo holds all exercises that are part of this online nanodegree.

Data engineering comprises all engineering and operational tasks required to make data available to the end user, whether for the purposes of analytics, model building, or app development.

This online class teaches how to solve common data engineering tasks:

  • ingest data from a data source
  • build and maintain a data warehouse
  • create a data pipeline
  • create an analytics table for a specific use case
  • migrate data to the cloud
  • schedule and automate pipelines
  • backfill data
  • debug data quality issues
  • optimize queries
  • design a database

Chapters

Data Modeling

Cloud Data Warehouses

Data Lakes with Spark

Data Pipelines with Airflow

Capstone Project