pushshift/imdb_to_json


This code fetches the data for an IMDB title, identified by its title code, and converts it to JSON format.

Example of usage:

./convert_imdb_to_json.py tt0117731
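If you want to call the script for many titles, a small wrapper can check each code first. This is a minimal sketch, not part of the repository: `is_valid_title_code` and `build_command` are hypothetical helpers, and the pattern assumes title codes are "tt" followed by at least seven digits, as in the example above.

```python
import re
from typing import List

# Assumption: title codes look like "tt0117731" ("tt" plus 7 or more digits).
TITLE_CODE_PATTERN = re.compile(r"tt\d{7,}")

# Script path as shown in the usage example above.
SCRIPT = "./convert_imdb_to_json.py"


def is_valid_title_code(code: str) -> bool:
    """Return True if the string matches the assumed title code format."""
    return TITLE_CODE_PATTERN.fullmatch(code) is not None


def build_command(code: str) -> List[str]:
    """Build the argument list for one script invocation."""
    if not is_valid_title_code(code):
        raise ValueError(f"not a title code: {code!r}")
    return [SCRIPT, code]


print(build_command("tt0117731"))
```

The resulting list can be passed to `subprocess.run`. How the JSON output should be captured depends on how the script emits it.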

The movie_data.ndjson.zst file contains movie / episode data for over 1 million shows. The data is in ndjson format (one JSON object per line), compressed with Zstandard, and sorted by the number of votes. Each record also contains basic metadata for the show: genres, start/end year, title, adult classification and run time. You can pass the "titleCode" from any movie object to the provided code to download that title's data from IMDB in JSON format.
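As a rough sketch of how the title codes could be pulled out of the ndjson data, the function below reads one JSON object per line and yields its "titleCode". The function name `iter_title_codes` is hypothetical, and the sample records are made up for illustration. Only the "titleCode" field name comes from the description above.

```python
import json
from typing import Iterable, Iterator


def iter_title_codes(lines: Iterable[str]) -> Iterator[str]:
    """Yield the "titleCode" of each ndjson record, skipping blank lines."""
    for line in lines:
        line = line.strip()
        if not line:
            continue
        record = json.loads(line)
        code = record.get("titleCode")
        if code:
            yield code


# Made-up sample lines standing in for the decompressed file.
sample = [
    '{"titleCode": "tt0117731"}\n',
    "\n",
    '{"titleCode": "tt0000001"}\n',
]
for code in iter_title_codes(sample):
    print(code)
```

To run this over the real file, one option is to decompress it with `zstd -dc movie_data.ndjson.zst` and pass `sys.stdin` to the function in place of the sample list.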

Movie title codes and other metadata can be downloaded from here: https://datasets.imdbws.com/
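The files on that page are gzip-compressed, tab-separated tables. The sketch below assumes the `title.basics.tsv.gz` file and its `tconst` column (the title code), as listed on the dataset page. The function name `iter_tconst` is hypothetical, and the in-memory data is a made-up stand-in for the downloaded file.

```python
import csv
import gzip
import io
from typing import BinaryIO, Iterator


def iter_tconst(gz_stream: BinaryIO) -> Iterator[str]:
    """Yield the tconst (title code) column from a gzipped TSV stream."""
    with gzip.open(gz_stream, mode="rt", encoding="utf-8", newline="") as text:
        # QUOTE_NONE: treat quote characters in titles as ordinary text.
        reader = csv.DictReader(text, delimiter="\t", quoting=csv.QUOTE_NONE)
        for row in reader:
            yield row["tconst"]


# Made-up two-row table standing in for the downloaded file.
fake_file = io.BytesIO(
    gzip.compress(b"tconst\ttitleType\ntt0000001\tshort\ntt0117731\tmovie\n")
)
for title_code in iter_tconst(fake_file):
    print(title_code)
```

For a downloaded file, `open("title.basics.tsv.gz", "rb")` can be passed in place of the in-memory stream.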

This should get you started.

2020-08-19: Added ratings metadata

2020-08-19: Added reviews metadata

2020-08-19: Added title keywords metadata

To do:

Add additional logging

Add better error handling (for requests)
