Using 48,613 average movie ratings from 12 platforms for which we have ratings for 100 or more movies, we estimate the correlation between ratings across platforms. The median correlation between average ratings of two platforms was .37.

soodoku/mixed_signals

Mixed Signals: Movie Quality Assessments Across Platforms

Guardians

Is the difference between ratings across different platforms for Guardians of the Galaxy the norm? And are the differences in ratings systematic, or are the ratings largely unrelated?

Data from 1950--2020

To shed light on the question, we scraped the list of American films on Wikipedia produced between 1950 and 2020. (See here, for example.) We then scraped the Google Knowledge Panel using APIfy for each of the movies. (The scripts for scraping the data are posted here.)

Analyses

In all, we have 16,319 movies and movie ratings from 19 platforms. For seven platforms, however, we have ratings for fewer than 100 movies. We restrict our initial analysis to the 48,613 (average) ratings from the 12 platforms for which we have ratings for 100 or more movies.

Number of Movies Per Platform

We estimate the correlation between the average ratings on each pair of platforms. (The Pearson correlation matrix is shown below.) The median correlation between the average ratings of two platforms was .37. (The median Spearman correlation was .35.)
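As a rough illustration of the computation described above, here is a minimal sketch using only the Python standard library. The data layout (`{platform: {title: rating}}`), the function names, the `min_overlap` threshold, and the toy ratings are all assumptions made for the example; they are not taken from the repository's scripts. Each pair of platforms is compared only on the movies that both have rated.

```python
from itertools import combinations
from math import sqrt
from statistics import mean, median


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (neither may be constant)."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def ranks(values):
    """1-based ranks; tied values share the mean of the positions they occupy."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    out = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            out[order[k]] = avg
        i = j + 1
    return out


def spearman(xs, ys):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(ranks(xs), ranks(ys))


def median_pairwise_correlation(ratings, corr=pearson, min_overlap=3):
    """Median of corr over all platform pairs, using movies rated on both."""
    values = []
    for a, b in combinations(sorted(ratings), 2):
        shared = sorted(set(ratings[a]) & set(ratings[b]))
        if len(shared) < min_overlap:
            continue  # too few shared movies to estimate a correlation
        xs = [ratings[a][t] for t in shared]
        ys = [ratings[b][t] for t in shared]
        values.append(corr(xs, ys))
    return median(values)


# Hypothetical toy data, not from the scraped dataset.
toy = {
    "imdb": {"A": 7.5, "B": 6.0, "C": 8.1, "D": 5.2},
    "rotten_tomatoes": {"A": 80, "B": 45, "C": 92, "D": 30},
    "google": {"A": 85, "B": 90, "C": 70, "D": 88},
}
median_pearson = median_pairwise_correlation(toy, corr=pearson)
median_spearman = median_pairwise_correlation(toy, corr=spearman)
```

Pairwise deletion is one of several reasonable choices here; restricting to movies rated on all platforms would shrink the sample considerably when coverage differs as much as it does across these platforms.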

Correlation Plot

Largest Differences

Movies with the largest difference between Rotten Tomatoes and IMDb Rating.

| title | rotten_tomatoes_rating | IMDb_rating |
| --- | --- | --- |
| After Last Season | 92 | 1.6 |
| The Catered Affair | 17 | 7.5 |
| Sparkle | 10 | 6.9 |
| Only the Strong | 8 | 6.7 |
| The Life of David Gale | 19 | 7.6 |
| The Vanishing of Sidney Hall | 11 | 6.9 |
| I'll Never Forget You | 14 | 7.1 |
| The Professor | 10 | 6.7 |
| Gang of Roses | 75 | 2.4 |
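Because Rotten Tomatoes scores run from 0 to 100 and IMDb ratings from 0 to 10, the two have to be put on a common scale before they can be compared. The sketch below shows one plausible way to do that: divide each rating by its scale maximum and sort by the absolute gap. This is an assumption for illustration and not necessarily how the lists in this section were produced (the authors may, for instance, have standardized the ratings instead). The function name `largest_gaps` is hypothetical; the three rows are taken from the table above.

```python
def largest_gaps(rows, top_n=3):
    """Return the titles with the largest absolute gap between two ratings.

    rows: iterable of (title, rotten_tomatoes 0-100, imdb 0-10).
    Both ratings are divided by their scale maximum so they lie in [0, 1].
    """
    ranked = sorted(
        rows,
        key=lambda r: abs(r[1] / 100 - r[2] / 10),
        reverse=True,  # biggest disagreement first
    )
    return [title for title, _, _ in ranked[:top_n]]


rows = [
    ("After Last Season", 92, 1.6),
    ("The Catered Affair", 17, 7.5),
    ("Gang of Roses", 75, 2.4),
]
top = largest_gaps(rows, top_n=1)
```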

Movies with the largest difference between Rotten Tomatoes and Google Rating.

| title | rotten_tomatoes_rating | p_google_likes |
| --- | --- | --- |
| After Last Season | 92 | 6 |
| Faithful | 7 | 95 |
| Supercross | 5 | 92 |
| Material Girls | 4 | 91 |
| Only the Strong | 8 | 94 |
| Big Mommas: Like Father, Like Son | 5 | 91 |
| The Covenant | 4 | 90 |
| McHale's Navy | 3 | 89 |
| Equalizer 2000 | 11 | 96 |
| Grind | 8 | 93 |

Movie Ratings Over Time

To see how movies have been rated over time, we plotted the yearly average of a movie-quality index (see below). To build the index, we rescaled the ratings from the three platforms for which we have the most data (IMDb, Rotten Tomatoes, and Google) to lie between 0 and 1 and averaged them. The figure shows that the index declined steadily between 1950 and 2000 and has ticked up modestly since then.
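The index construction can be sketched as follows. The text does not say how the ratings were rescaled or how missing ratings were handled, so this example assumes min-max scaling within each platform and averages whichever of the platforms rated a given movie. The field names (`year`, `imdb`, `rt`, `google`) and the toy values are hypothetical.

```python
from collections import defaultdict
from statistics import mean


def quality_index(movies, platforms):
    """Min-max rescale each platform to [0, 1], then average per movie.

    movies: list of dicts with one key per platform (None or absent if missing).
    Assumes each platform has at least two distinct ratings (else hi == lo).
    """
    bounds = {}
    for p in platforms:
        seen = [m[p] for m in movies if m.get(p) is not None]
        bounds[p] = (min(seen), max(seen))
    index = []
    for m in movies:
        parts = [
            (m[p] - lo) / (hi - lo)
            for p, (lo, hi) in bounds.items()
            if m.get(p) is not None
        ]
        index.append(mean(parts) if parts else None)
    return index


def yearly_mean(movies, index):
    """Average the index over all movies released in the same year."""
    by_year = defaultdict(list)
    for m, score in zip(movies, index):
        if score is not None:
            by_year[m["year"]].append(score)
    return {year: mean(scores) for year, scores in sorted(by_year.items())}


# Hypothetical toy data.
movies = [
    {"year": 1950, "imdb": 8.0, "rt": 90, "google": 80},
    {"year": 1950, "imdb": 6.0, "rt": 50, "google": 90},
    {"year": 2000, "imdb": 4.0, "rt": 10, "google": 70},
]
index = quality_index(movies, ["imdb", "rt", "google"])
trend = yearly_mean(movies, index)
```

Dividing each rating by its scale maximum is an equally defensible rescaling; the two choices differ when a platform's observed ratings do not span its full scale.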

Rating Over Time

But if you separate the platform ratings, you get three different trend lines. Google ratings are largely flat over time, with a modest decline since 2010. Rotten Tomatoes ratings follow a very different trend, with a sharp decline between 1950 and 2000 and a sharp uptick since 2000. IMDb ratings follow yet another trend: they decline until the 1980s and slope gently upward after that.

Rating Over Time by Platform

Correlation Between Aggregate Movie Ratings By Time, Maturity Ratings, Genre

The differing over-time trend lines across platforms are disconcerting. To understand the issue further, we plot correlations between platform ratings by decade (see below). The correlation between Rotten Tomatoes and IMDb ratings is relatively stable over time. But the other two correlations are much more volatile. The correlation between Rotten Tomatoes and Google changes sizably, from .09 in the 1950s to .36 in the 1960s. The correlation between IMDb and Google ratings swings even more, from .22 in the 1950s to .65 in the 2020s. Overall, the correlation between IMDb and Google ratings appears to become stronger over time.
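A by-decade correlation of this kind can be sketched as below: bucket movies by release decade, then compute a Pearson correlation within each bucket. The row layout `(year, rating_a, rating_b)`, the function names, the `min_n` cutoff, and the toy numbers are assumptions for illustration only.

```python
from collections import defaultdict
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (neither may be constant)."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def correlation_by_decade(rows, min_n=3):
    """rows: iterable of (year, rating_a, rating_b) for movies rated on both platforms."""
    groups = defaultdict(list)
    for year, a, b in rows:
        groups[year // 10 * 10].append((a, b))  # 1957 -> 1950
    out = {}
    for decade, pairs in sorted(groups.items()):
        if len(pairs) >= min_n:  # skip decades with too few movies
            xs, ys = zip(*pairs)
            out[decade] = pearson(xs, ys)
    return out


# Hypothetical toy data: perfect agreement in the 1950s, perfect disagreement in the 1960s.
rows = [
    (1951, 1, 2), (1955, 2, 4), (1959, 3, 6),
    (1961, 1, 3), (1964, 2, 2), (1968, 3, 1),
]
by_decade = correlation_by_decade(rows)
```

The same grouping applies to the maturity-rating and genre breakdowns if the decade key is swapped for the relevant category. Note that a bucket labeled 2020s contains only 2020 releases here, since the data end in 2020, so that estimate rests on a single year of movies.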

Correlation Over Time

If we check the correlation across platforms by maturity rating, we see some variation between Rotten Tomatoes and Google, but overall it doesn't seem that maturity ratings have much explanatory power.

Correlation by Maturity Rating

Next, we investigate how the correlation between ratings across platforms varies by genre. We limit ourselves to the 10 most popular genres. We see fairly sizable variation in the correlation between IMDb and Google, which ranges from .29 to .71. The range of correlations between Rotten Tomatoes and Google is also very wide: .04 to .48.

Correlation by Genre

PCA

For IMDb, Google, and Rotten Tomatoes, here is the summary of the principal components.

| | PC1 | PC2 | PC3 |
| --- | --- | --- | --- |
| Standard deviation | 1.3951 | 0.9045 | 0.48538 |
| Proportion of Variance | 0.6487 | 0.2727 | 0.07853 |
| Cumulative Proportion | 0.6487 | 0.9215 | 1.00000 |
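The three rows of the summary are linked by simple arithmetic: each component's variance is its standard deviation squared, and the proportions are those variances divided by their total. The short check below recomputes the second and third rows from the first. The variances sum to roughly 3, which is what one would expect if the three ratings were standardized before the PCA; that is an inference from the numbers, not something the text states.

```python
from itertools import accumulate

# Standard deviations of PC1..PC3, copied from the summary above.
sdev = [1.3951, 0.9045, 0.48538]

variances = [s ** 2 for s in sdev]              # eigenvalues
total = sum(variances)                          # close to 3 for three standardized inputs
proportion = [v / total for v in variances]     # "Proportion of Variance" row
cumulative = list(accumulate(proportion))       # "Cumulative Proportion" row

for name, p, c in zip(["PC1", "PC2", "PC3"], proportion, cumulative):
    print(f"{name}: proportion={p:.4f} cumulative={c:.4f}")
```

Small differences in the last printed digit relative to the table are expected, because the standard deviations in the table are themselves rounded.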

Biplot

Top 100

We estimated the correlation between ratings for the 100 movies that earned the most (unadjusted for inflation) at the domestic box office. (Presumably, ratings vary less among the top 100 than across the entire sample, which means the correlation is estimated over a narrow range and may be attenuated as a result. It is also likely that many people rated these movies, so the average ratings are likely precise.)

The correlation between Metacritic and Rotten Tomatoes is just .48, and the correlation between IMDb and Google ratings is just .33. Surprisingly, the highest correlation, .74, is between Rotten Tomatoes and IMDb ratings.

| | imdb | rotten | metacritic | google |
| --- | --- | --- | --- | --- |
| imdb | 1.00 | 0.74 | 0.36 | 0.33 |
| rotten | 0.74 | 1.00 | 0.48 | 0.23 |
| metacritic | 0.36 | 0.48 | 1.00 | 0.18 |
| google | 0.33 | 0.23 | 0.18 | 1.00 |

License

Released under CC BY 2.0.
