Nexus is a distributed workflow system designed for use by data engineers to move data around their organisations.
-
Updated
Feb 15, 2017 - Python
Nexus is a distributed workflow system designed for use by data engineers to move data around their organisations.
Curated List of Data Engineering tools and frameworks
Code, Examples, Templates and Scripts for DataWorksSummit 2017 Sydney Talk
For the Coursera specialization https://www.coursera.org/specializations/gcp-data-machine-learning
A Python library for extracting information via XPaths
Coursera Specialization :Big Data for Data Engineers
Here's how to get DataQuest's Data Engineering Track missions' content to work on your localhost. Using data from my Valenbisi ARIMA modeling project, I document my steps using PostgreSQL, Postico, and the Command Line to get our DataQuest exercises running out of a Jupyter Notebook.
Wraps the DB by opening a REST API for storing and retrieving documents info & recommendations
Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups
Data Engineering knowledge as a readable tutorial (collaboratively).
Data Quest - Data Engineer Learning and Projects
Students will build an ETL pipeline that extracts data from S3, stages them in Redshift, and transforms data into a set of dimensional tables for their analytics team.
Udacity Data Engineer Nanodegree: Project Data Lake
An operational description of ML at Scale
Content for architecting a data science platform for products using Luigi, Spark & Flask.
Data Modeling with Postgres project for the Udacity Data Engineer Nanodegree program
Add a description, image, and links to the data-engineer topic page so that developers can more easily learn about it.
To associate your repository with the data-engineer topic, visit your repo's landing page and select "manage topics."