Skip to content

Docker Big Data Tools: This docker-compose file is configured to run multiple nodes. This is a Hadoop Cluster that contains the necessary tools that can be used in the BigData domain, It's a collection of docker containers that you can use directly.

ven2day/Bigdata-docker-sandbox

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 
 
 

Repository files navigation

Docker Big Data Tools

ℹ️ This docker-compose file is configured to run multiple nodes.

This is a Hadoop Cluster that contains the necessary tools that can be used in the BigData domain, It's a collection of docker containers that you can use directly to have the maximum of tools like :

  • Hive
  • Hue
  • MySql
  • Zookeeper
  • Kafka
  • Hbase
  • Mongo
  • Metabase
  • Streamsets
  • Sqoop
  • Storm
  • NiFi

Docker Images Used


Installations

git clone https://github.com/ven2day/Bigdata-docker-sandbox.git

cd docker-bigdata-tools

sudo docker-compose up -d

⚠️ It takes some time for launch and configure all the images

Screenshots

Namenode

👁️ You can see here 3 Live Nodes**

Datanode 1

Datanode 2

Datanode 3

Hue

Username : admin Password : admin

After click in Sign In

Now you can use Hive

  • Simple Query for test
CREATE TABLE IF NOT EXISTS users(id INT, name VARCHAR(45), website VARCHAR(45));

INSERT INTO users VALUES(1,"mahmoud zakaria","www.mahmoud.ma");
  • After insert data you can execute select query.
SELECT *FROM users;
  • Hue Dashboard

kafka Manager

Cluster Overview

Hbase

Storm UI

Jupyter

Mongo Express

StreamSets

Username : admin Password : admin

Nifi


Modified by Ven2Day

About

Docker Big Data Tools: This docker-compose file is configured to run multiple nodes. This is a Hadoop Cluster that contains the necessary tools that can be used in the BigData domain, It's a collection of docker containers that you can use directly.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published