etl

Here are 3,672 public repositories matching this topic...

Dina-Hosny / ETL-Data-Pipeline-using-AirFlow

An ETL Data Pipelines Project that uses AirFlow DAGs to extract employees' data from PostgreSQL Schemas, load it in AWS Data Lake, Transform it with Python script, and Finally load it into SnowFlake Data warehouse using SCD type 2.

python airflow etl aws-s3 snowflake pandas datawarehouse airflow-dags

Updated May 26, 2023
Python

thomd / twitter-etl-pipeline-with-airflow

Star

ETL Pipeline for Twitter Data using Apache Airflow

python airflow etl postgresql

Updated Apr 5, 2023
Python

olahsymbo / etl-gcs-postgres-bigquery

Star

ETL Pipeline (postgres, bigquery, csv, json, google storage)

python bigquery flask etl postgresql data-pipeline google-cloud-scheduler

Updated Jul 26, 2023
Python

leonardohss0 / etl-sql-s3-redshift

Star

Keywords: Python, Airflow, AWS, S3, Redshift, ETL

airflow etl data-engineering

Updated Apr 29, 2023
Python

aimee0317 / ETL-Data-Pipelines

Star

Python ETL Data Pipeline with AWS Glue and Athena

aws etl aws-s3 aws-ec2 aws-athena etl-pipeline aws-glue

Updated Aug 2, 2023

manali146 / airflow-etl-pipeline-docker

Star

This repository showcases an end-to-end data engineering pipeline built with Airflow and Docker. The pipeline extracts data from a .tsv file, performs transformation operations, and loads the transformed data into a CSV file.

docker airflow etl

Updated May 17, 2023
Jupyter Notebook

niyotham / data-engineering-ETL-ibm

Star

extract transform and load and transfrom

json airflow etl logging python3 wget requests xml-parser webscraping beautifulsoup4

Updated Oct 23, 2023
Jupyter Notebook

juliaobenauer / Data-Pipelines-with-Airflow

Star

Udacity project within the Data Engineer Nanodegree

python airflow sql etl data-engineering

Updated Nov 26, 2022
Python

inuwamobarak / ETL-data-pipeline

Star

Implementation ETL with Python for data integration workflows.

python data database etl data-engineering datawarehousing etl-pipeline

Updated May 16, 2023
Python

khushal2405 / ETL-pipeline-using-Airflow-and-AWS-EMR

Star

We Build an ETL pipeline using Airflow that accomplishes the following: Downloads data from an AWS S3 bucket, Runs a Spark/Spark SQL job on the downloaded data producing a cleaned-up dataset of delivery deadline missing orders and then Upload the cleaned-up dataset back to the same S3 bucket in a folder primed for higher level analytics