Pyspark Notebook With Docker
Updated Aug 18, 2015 - Python
A PySpark ETL example using a jupyter/pyspark-notebook Docker container
Apache Spark-based implementation of the research paper "N-gram-based text categorization"
Implementation of the triangle counting problem in Apache Spark
Eurecom Advanced Machine Learning coursework
Distributed Keras model for predicting the sentiment of Spanish sentences in a streaming context using Spark Streaming and Apache Kafka
Regression analysis to predict interest rates for Lending Club.
Unsupervised sentiment analysis on GitHub data using PySpark
A PySpark course to get started with the basics for a Data Engineer
PySpark notebooks
A study of an ALS-based recommendation system using MovieLens movie data.
Spatial Database Final Project - Coastal and Offshore Marine Zones with GeoPandas and PySpark
Research and development on distributed Keras with Spark
Big data analytics performed with Spark and Hadoop on RITA airlines dataset (8.3 GB)