Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Updated May 13, 2024 - Python
Airflow extensions for communicating with Wherobots Cloud
Data Engineering Workbook (Notes, Exercises, Projects, etc.)
Workflow Engine for Kubernetes
Apache DolphinScheduler is a modern data orchestration platform for quickly creating high-performance workflows with low code
AWS Summit 2022 ASEAN --- COM203 Using IaC with Terraform to provision Big Data Platform on Amazon EMR
Production Grade Terraform for Provisioning Infrastructure
Apply Data Engineering to Personal Finance
Collection of Docker images
Personal Spotify Wrapped using Airflow, Django and Docker
Content related to open-source-based development
A Python package to submit and manage Apache Spark applications on Kubernetes.
This repository is a curated collection of projects and tools that exemplify best practices in data engineering. It serves as a resource for data professionals seeking to enhance their data infrastructure, optimize data pipelines, and implement cutting-edge data processing techniques.
User-friendly, open-source platform for workflow creation and monitoring
This project demonstrates how to build and automate an ETL pipeline using DAGs in Airflow and load the transformed data into BigQuery. The project uses tools such as Astro, dbt, GCP, Airflow, and Metabase.
A sample repository featuring Apache Airflow deployed on an Azure Kubernetes Service (AKS) cluster with predefined budget constraints.