Data Mining Projects 2017
-
Updated
Jan 28, 2018 - Python
Data Mining Projects 2017
Source code for the talk at Techorama Antwerp 2019
Describes the map-reduce concept used in Data Processing - Data Engineering
Implementation of Map reduce in parallel using Java 8 streams
A model classifying word pairs by their semantic similarity, using AWS, Hadoop and WEKA
A Big Data related project. Part 1: Write JAVA code to download and decompress files, upload them onto HDFS, and write WordCount Algorithm to count each word in all the six books. Part 2: Use Twitter Search API to gather tweets about a topic on 6 different timelines and write a WordCount Algorithm to count each occurrence of a hashtag. Technolog…
Distributed Map-reduce ( Distributed systems )
Big Data Processing with Hadoop
Program do równoległej analizy logów na 7 laboratoria z Programowania Równoległego
This repository contains an implementation of a map-reduce job for Hazelcast, created as part of a Distributed Systems course @ ITBA. The code is designed to facilitate distributed processing of large data sets, and can be used as a starting point for further exploration of distributed computing techniques.
Laboratory exercise created with Apache Spark, in the context of the "Advanced Topics in Database Systems" course in NTUA
Running CouchDB in docker
A collection of mandatory exercises in "Introduction to Big Data Projects" - 1st semester master @ Vorarlberg University of Applied Sciences (FHV)
📙 Versione di map-reduce implementata tramite MPI per il conteggio delle occorrenze di ogni parola contenuta in uno o più file.
Utilized Python, and TCP protocols to mimic Hadoop's Map Reduce architecture
Big Data Analysis of datasets for taking into account the character occurrences.
Add a description, image, and links to the map-reduce topic page so that developers can more easily learn about it.
To associate your repository with the map-reduce topic, visit your repo's landing page and select "manage topics."