Welcome to Data-Engineering-Projects, a comprehensive repository dedicated to housing innovative and scalable data engineering solutions.
This repository is a curated collection of projects and tools that exemplify best practices in data engineering. It serves as a resource for data professionals seeking to enhance their data infrastructure, optimize data pipelines, and implement cutting-edge data processing techniques.
Each project within this repository is self-contained with its own set of instructions, documentation, and necessary scripts or code.
- Project 1: Retail Data Pipeline - Airflow
- Project 2: Uber Data Pipeline - Mage
The projects in this repository leverage a variety of technologies, including:
- Apache Spark
- Apache Airflow
- Amazon Redshift
- Google BigQuery
- Snowflake
- Docker
- Mage
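At their core, pipelines built with these tools orchestrate variations of the same extract-transform-load pattern. The sketch below shows that pattern in plain, dependency-free Python; the record fields and function names are illustrative placeholders, not code from any project in this repository:

```python
# Minimal ETL sketch. All names and data are illustrative placeholders.

def extract():
    """Stand-in source: a real pipeline might read from an API or database."""
    return [
        {"order_id": 1, "amount": "19.99"},
        {"order_id": 2, "amount": "5.00"},
    ]

def transform(records):
    """Normalize types so downstream consumers see a consistent schema."""
    return [{**r, "amount": float(r["amount"])} for r in records]

def load(records, sink):
    """Stand-in sink: a real pipeline might write to a warehouse table."""
    sink.extend(records)
    return len(records)

warehouse = []
loaded = load(transform(extract()), warehouse)
print(f"Loaded {loaded} rows")  # prints "Loaded 2 rows"
```

Tools like Airflow and Mage add scheduling, retries, and dependency management around steps like these, while Spark, Redshift, BigQuery, and Snowflake handle the extract, transform, and load stages at scale.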
Each project includes instructions for installing and configuring its environment and dependencies. For example:

```shell
# Example installation code
pip install -r requirements.txt
```
Each project's documentation includes usage examples. For example, to run Project 1:

```shell
# Example usage code
python project_1/main.py
```
We welcome contributions from the data engineering community. Please read our CONTRIBUTING.md for details on our code of conduct and the process for submitting pull requests.
This project is licensed under the MIT License - see the LICENSE.md file for details.
For any inquiries or contributions, please open an issue or contact the repository maintainers.
Thank you for visiting Data-Engineering-Projects. We hope this repository empowers you to build robust and efficient data solutions.