control-center

Here are 25 public repositories matching this topic...

Ecommerce Sales Analytics Data Generation: a real-time data streaming system built with Apache Flink, Kafka, Elasticsearch, and Docker. The scalable pipeline streams transaction data through Flink, aggregates transactions in Postgres and Elasticsearch, and ends with a dynamic streaming dashboard.

  • Updated Feb 20, 2024

A data engineering pipeline that orchestrates data workflows with Apache Airflow, Python, Kafka, Zookeeper, Spark, and Cassandra, containerized with Docker for easy deployment and scaling. This Etsy API Data Pipeline extracts, processes, and analyzes Etsy marketplace data, retrieving product listings, shop details, and reviews.

  • Updated Jan 6, 2024

The goal of this project is to build a Docker cluster that gives access to Hadoop, HDFS, Hive, PySpark, Sqoop, Airflow, Kafka, Flume, Postgres, Cassandra, Hue, Zeppelin, Kadmin, Kafka Control Center and pgAdmin. This cluster is intended solely for use in a development environment. Do not use it to run any production workloads.

  • Updated Feb 27, 2023
  • Shell
