SlideShare a Scribd company logo
MOVIE
RECOMMENDATION
SYSTEM
BANA7047-002 FINAL PROJECT
• Group 2
• Jagruti Joshi
• Priya Kumari
• Pooja Sahare
1
Part 1
• Background
• Data
• Preliminary Analysis
2
Background
Top Streaming
Services
Need for a
recommendation system 13,000+ titles ~8 seconds
Netflix’s total content library Average human attention span
Data used in a
recommendation system
Impact of a
recommendation system
Watch
Data
Search
Data
Ratings
Data
Increased
Revenues
3
Data
• Source: https://grouplens.org/datasets/movielens/
• recommended for education and development
• Small: 100,000 ratings and 3,600 tag applications applied to 9,000 movies by
600 users. Last updated 9/2018.
•Movies
•movieId
•title
•genres
•Ratings
•userId
•movieId
•rating
•timestamp
•Tags
•userId
•movieId
•tag
•timestamp
•Links
•movieId
•imdbId
•tmdbId
9,742 100,836 1,587,923 9,742
4
Preliminary Analysis
 Data
• Movies
 Insights
• No of movies
released is
increasing every
year peaking at
733 in 2006.
• Sharp decline is
observed in
recent years
(could be due to
not including
recent releases in
the data)
5
Preliminary Analysis
 Data
• Movies
 Insights
• Drama, Comedy,
Thriller, Action
and Romance are
the top 5 genres
• The top 5 genres
together contain
~60% of the
movies
6
Preliminary Analysis
 Data
• Movies
 Insights
• Drama and
Comedy have
consistently
stayed the #1 and
#2 genres over
the years
• Similar
distribution of
genres over the
years
7
Preliminary Analysis
 Data
• Movies
• Ratings
 Insights
• Median > Mean
• Left skewed
distribution
• Do most users
rate most movies
on the higher end
of the 0-5 scale?
8
Preliminary Analysis
 Data
• Movies
• Ratings
 Insights
• Most users rate
less than 1000
movies
• Most users rate
movies on the
higher end of the
0-5 scale
• Most movies
receive less than
100 ratings
• Most movies are
rated on the
higher end of the
0-5 scale
9
Preliminary Analysis
 Data
• Movies
• Ratings
 Insights
• Average Ratings
for all genres is
between 3 and 4
• Among the top 5
genres, Drama
and Romance
movies have
higher ratings
compared to the
remaining three
#1 #2 #4#3#5
10
Preliminary Analysis
 Data
• Movies
• Tags
 Insights
• Tags can be useful
to create sub-
genres and drill
deeper into a
specific genre e.g.
Sci-Fi movies are
a huge sub-genre
within Action
movies
11
Part 2
• Content Based Filtering
• User Based Collaborative Filtering
• Item Based Collaborative Filtering
• Singular Value Decomposition
12
Model 1: Content Based Filtering (CBF) Approach
• Genre-based approach
• Without factoring release year, algorithm
recommends very old movies
• Term Frequency (TF) and Inverse Document
Frequency (IDF) used to determine the relative
importance of genres
• Vector Space Model
• Each movie represented by a vector of its
attributes
• For similar movies,
• Angle between their vectors is small
• Cosine of angle between their vectors is
large
13
1 movie
Movie Genre,
Release Year
CBF Algorithm
TF/IDF,
Cosine Similarity Score
n movies
Similar in Genres,
Closer in Release Years
Sort Results
Highest to Lowest
Cosine Similarity Scores
Filter Results
Select Top 20
Movies
Recommend Results
Display Movies in User’s
Watch Next List
Model 1: Content Based Filtering (CBF) Results
14
1. Rampage (2018)
2. Solo: A Star Wars Story (2018)
3. Ant-Man and the Wasp (2018)
4. Deadpool 2 (2018)
5. Sorry to Bother You (2018)
6. Pacific Rim: Uprising (2018)
7. A Wrinkle in Time (2018)
8. Jupiter Ascending (2015)
9. Avengers: Age of Ultron (2015)
10.Ant-Man (2015)
11.Power/Rangers (2015)
12.Turbo Kid (2015)
13.Hardcore Henry (2015)
14.Iron Man (2008)
15.Journey to the Center of the Earth (2008)
16.Mutant Chronicles (2008)
17.Outlander (2008)
18.Doctor Strange (2016)
19.Independence Day: Resurgence (2016)
20.Star Trek Beyond (2016)
Avengers: Infinity War - Part I (2018) Toy Story (1995) Insidious: Chapter 3 (2015)
1. Gordy (1995)
2. Reckless (1995)
3. Ninja Scroll (Jûbei ninpûchô) (1995)
4. Tale of Despereaux, The (2008)
5. Wild, The (2006)
6. Asterix and the Vikings (Astérix etlesVikings)
(2006)
7. Monsters, Inc. (2001)
8. The Good Dinosaur (2015)
9. Toy Story 2 (1999)
10.Shrek the Third (2007)
11.Moana (2016)
12.Adventures of Rocky and Bullwinkle,The-2000
13.Emperor's New Groove, The (2000)
14.Turbo (2013)
15.Antz (1998)
16.Jumanji (1995)
17.Indian in the Cupboard, The (1995)
18.Shrek (2001)
19.TMNT (Teenage Mutant Ninja Turtles)(2007)
20.Three Wishes (1995)
1. The Gallows (2015)
2. Frankenstein (2015)
3. Maggie (2015)
4. Body (2015)
5. Massu Engira Maasilamani (2015)
6. Into the Grizzly Maze (2015)
7. Return to Sender (2015)
8. Careful What You Wish For (2015)
9. Spotlight (2015)
10. Mojave (2015)
11. Knock Knock (2015)
12. Zipper (2015)
13. The Stanford Prison Experiment (2015)
14. Partisan (2015)
15. Bridge of Spies (2015)
16. The Perfect Guy (2015)
17. Silent Hill (2006)
18. Nightmare on Elm Street, A (2010)
19. Insidious (2010)
20. Paperhouse (1988)
Model 2: User-based Collaborative Filtering (UBCF)
Approach & Results
15
• Find look alike users based on similarity
• Recommend movies which user’s look-alike
has chosen in past.
• Very effective due to creation of user profiles
• Very time and resource consuming algorithm
as computations are made for every user pair.
Thus, we only take 20% of original data
• Results
User 1
• Avengers
• Age of Ultron
• Civil War
• Infinity War
• Iron Man
• Iron Man 2
• Iron Man 3
• Endgame
Not Watched
Watched
Watched
Recommend
Similar
Sample Data For Model 20% of Original Data
Model Train Data 80% of Sample Data
Model Test Data 20% of Sample Data
Root Mean Square Error 24167
Model 3: Item-based Collaborative Filtering (IBCF)
Approach & Results
16
• Like UBCF, but instead of finding user's look-
alike, we find a movie's look-alike.
• Recommend alike movies to user who has
rated this movie.
• Far less time and resource consuming than
UBCF but we’ve used the same 20% subset of
original data for model comparison
• Results
Watched
User 1 Avengers
• Age of Ultron
• Civil War
• Infinity War
• Endgame
Similar
Recommend
Sample Data For Model 20% of Original Data
Model Train Data 80% of Sample Data
Model Test Data 20% of Sample Data
Root Mean Square Error 29123
• Basic essence of SVD is to decomposes a
matrix of any shape into a product of 3
matrices with notable mathematical
properties: X = U S VT
• Decomposition of ratings matrix results in an
ordered matrix of a user feature matrix and
an item feature matrix which encapsulate the
variance associated with every direction of
the matrix
• Larger variances indicate less redundancy
and less correlation and hold features of data
• A representative subset of user rating
directions or principal components to
recommend movies is utilized
• Overall SVD aims to find the smallest
condensed subset of features by discarding
features imparting noise
17
Model 4: Singular Value Decomposition (SVD)
Approach
Movie
User
Sci-Fi
FemaleMale
Wonder
Woman
Captain
Marvel
Drama
Avengers
Endgame
Iron Man
Captain
America
Thelma &
Louise
Legally
Blonde
The
Shawshank
Redemption
Fight Club
Model 4: Singular Value Decomposition (SVD)
Results
Top rated movies by user ID 400
18
Recommended movies for user ID 400
Model Comparison & Recommendations
Model Proportion of Data RMSE
UBCF 20% 24167
IBCF 20% 29123
SVD 20% 0.91
19
• Movie recommendations are very subjective and vary from one user to another
• Each model has a different approach and its own set of pros and cons
• Weighing all the pros and cons, we would recommend SVD as it is a good mix of both collaborative filtering
methods
References
• Slide 2: Background
• https://www.comparitech.com/blog/vpn-privacy/netflix-statistics-facts-
figures/
• Slide 3: Data
• https://grouplens.org/about/what-is-grouplens/
• https://movielens.org/info/about
20
References
• Slides 11,13,15: Collaborative Filtering, UBCF and IBCF
• https://github.com/khanhnamle1994/movielens/blob/master/Content_Base
d_and_Collaborative_Filtering_Models.ipynb
• https://www.comparitech.com/blog/vpn-privacy/netflix-statistics-facts-
figures/
• Slide 17: SVD
• http://www.cs.carleton.edu/cs_comps/0607/recommend/recommender/svd.
html
• https://alyssaq.github.io/2015/20150426-simple-movie-recommender-using-
svd/
• https://www.dataminingapps.com/2020/02/singular-value-decomposition-in-
recommender-systems/
21

More Related Content

What's hot

Movie Recommendation engine
Movie Recommendation engineMovie Recommendation engine
Movie Recommendation engine
Jayesh Lahori
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
Carlos Castillo (ChaTo)
 
Recent advances in deep recommender systems
Recent advances in deep recommender systemsRecent advances in deep recommender systems
Recent advances in deep recommender systems
NAVER Engineering
 
Recommender system introduction
Recommender system   introductionRecommender system   introduction
Recommender system introductionLiang Xiang
 
Collaborative Filtering using KNN
Collaborative Filtering using KNNCollaborative Filtering using KNN
Collaborative Filtering using KNN
Şeyda Hatipoğlu
 
Building a Recommendation Engine - An example of a product recommendation engine
Building a Recommendation Engine - An example of a product recommendation engineBuilding a Recommendation Engine - An example of a product recommendation engine
Building a Recommendation Engine - An example of a product recommendation engineNYC Predictive Analytics
 
Recommendation System Explained
Recommendation System ExplainedRecommendation System Explained
Recommendation System Explained
Crossing Minds
 
Recommendation System
Recommendation SystemRecommendation System
Recommendation System
Anamta Sayyed
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender SystemsLior Rokach
 
Deep Learning for Recommender Systems
Deep Learning for Recommender SystemsDeep Learning for Recommender Systems
Deep Learning for Recommender Systems
Justin Basilico
 
Recommendation system
Recommendation systemRecommendation system
Recommendation system
Akshat Thakar
 
Movie recommendation system using collaborative filtering system
Movie recommendation system using collaborative filtering system Movie recommendation system using collaborative filtering system
Movie recommendation system using collaborative filtering system
Mauryasuraj98
 
Past, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspectivePast, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspective
Xavier Amatriain
 
Overview of recommender system
Overview of recommender systemOverview of recommender system
Overview of recommender system
Stanley Wang
 
Recommender system algorithm and architecture
Recommender system algorithm and architectureRecommender system algorithm and architecture
Recommender system algorithm and architectureLiang Xiang
 
Shallow and Deep Latent Models for Recommender System
Shallow and Deep Latent Models for Recommender SystemShallow and Deep Latent Models for Recommender System
Shallow and Deep Latent Models for Recommender System
Anoop Deoras
 
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Xavier Amatriain
 
Recommendation Engine Project Presentation
Recommendation Engine Project PresentationRecommendation Engine Project Presentation
Recommendation Engine Project Presentation
19Divya
 
Recommender systems
Recommender systemsRecommender systems
Recommender systems
Tamer Rezk
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
Girish Khanzode
 

What's hot (20)

Movie Recommendation engine
Movie Recommendation engineMovie Recommendation engine
Movie Recommendation engine
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
 
Recent advances in deep recommender systems
Recent advances in deep recommender systemsRecent advances in deep recommender systems
Recent advances in deep recommender systems
 
Recommender system introduction
Recommender system   introductionRecommender system   introduction
Recommender system introduction
 
Collaborative Filtering using KNN
Collaborative Filtering using KNNCollaborative Filtering using KNN
Collaborative Filtering using KNN
 
Building a Recommendation Engine - An example of a product recommendation engine
Building a Recommendation Engine - An example of a product recommendation engineBuilding a Recommendation Engine - An example of a product recommendation engine
Building a Recommendation Engine - An example of a product recommendation engine
 
Recommendation System Explained
Recommendation System ExplainedRecommendation System Explained
Recommendation System Explained
 
Recommendation System
Recommendation SystemRecommendation System
Recommendation System
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
 
Deep Learning for Recommender Systems
Deep Learning for Recommender SystemsDeep Learning for Recommender Systems
Deep Learning for Recommender Systems
 
Recommendation system
Recommendation systemRecommendation system
Recommendation system
 
Movie recommendation system using collaborative filtering system
Movie recommendation system using collaborative filtering system Movie recommendation system using collaborative filtering system
Movie recommendation system using collaborative filtering system
 
Past, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspectivePast, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspective
 
Overview of recommender system
Overview of recommender systemOverview of recommender system
Overview of recommender system
 
Recommender system algorithm and architecture
Recommender system algorithm and architectureRecommender system algorithm and architecture
Recommender system algorithm and architecture
 
Shallow and Deep Latent Models for Recommender System
Shallow and Deep Latent Models for Recommender SystemShallow and Deep Latent Models for Recommender System
Shallow and Deep Latent Models for Recommender System
 
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
 
Recommendation Engine Project Presentation
Recommendation Engine Project PresentationRecommendation Engine Project Presentation
Recommendation Engine Project Presentation
 
Recommender systems
Recommender systemsRecommender systems
Recommender systems
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
 

Similar to Movie Recommendation System - MovieLens Dataset

movieRecommendation_FinalReport
movieRecommendation_FinalReportmovieRecommendation_FinalReport
movieRecommendation_FinalReportSohini Sarkar
 
Community detection recommender system
Community detection   recommender systemCommunity detection   recommender system
Community detection recommender system
RupaDutta3
 
OMRES-ProgressPresentation1.pptx
OMRES-ProgressPresentation1.pptxOMRES-ProgressPresentation1.pptx
OMRES-ProgressPresentation1.pptx
045HridikGulatiT17
 
Graph Gurus Episode 2: Building a Movie Recommendation Engine
Graph Gurus Episode 2: Building a Movie Recommendation EngineGraph Gurus Episode 2: Building a Movie Recommendation Engine
Graph Gurus Episode 2: Building a Movie Recommendation Engine
TigerGraph
 
End-to-end machine learning project in Arabic
End-to-end machine learning project in ArabicEnd-to-end machine learning project in Arabic
End-to-end machine learning project in Arabic
AMR koura
 
movie recommender system using vectorization and SVD tech
movie recommender system using vectorization and SVD techmovie recommender system using vectorization and SVD tech
movie recommender system using vectorization and SVD tech
UddeshBhagat
 
Recommender Systems, Matrices and Graphs
Recommender Systems, Matrices and GraphsRecommender Systems, Matrices and Graphs
Recommender Systems, Matrices and Graphs
Roelof Pieters
 
Software Suite for Movie Market Analysis
Software Suite for Movie Market AnalysisSoftware Suite for Movie Market Analysis
Software Suite for Movie Market Analysis
dariospin93
 
Online social network based object recommendation system
Online social network based object recommendation systemOnline social network based object recommendation system
Online social network based object recommendation systemSriram Patil
 
Movie Recommendation System_final.pptx
Movie Recommendation System_final.pptxMovie Recommendation System_final.pptx
Movie Recommendation System_final.pptx
Sridharkadiri2
 
Crowd-Based Personalized Natural Language Explanations for Recommendations
Crowd-Based Personalized Natural Language Explanations for Recommendations Crowd-Based Personalized Natural Language Explanations for Recommendations
Crowd-Based Personalized Natural Language Explanations for Recommendations
Shuo Chang
 
Presentation (1).pptx
Presentation (1).pptxPresentation (1).pptx
Presentation (1).pptx
AkshatVerma308939
 
Entity Recommendations Using Hierarchical Knowledge Bases
Entity Recommendations Using Hierarchical Knowledge BasesEntity Recommendations Using Hierarchical Knowledge Bases
Entity Recommendations Using Hierarchical Knowledge Bases
Artificial Intelligence Institute at UofSC
 
Lessons learnt at building recommendation services at industry scale
Lessons learnt at building recommendation services at industry scaleLessons learnt at building recommendation services at industry scale
Lessons learnt at building recommendation services at industry scale
Domonkos Tikk
 
Challenge@rule ml2015 rule based recommender systems for the Web of Data
Challenge@rule ml2015 rule based recommender systems for the Web of DataChallenge@rule ml2015 rule based recommender systems for the Web of Data
Challenge@rule ml2015 rule based recommender systems for the Web of Data
RuleML
 
Movie recommender system using the user's psychological profile
Movie recommender system using the user's psychological profileMovie recommender system using the user's psychological profile
Movie recommender system using the user's psychological profile
University Politehnica Bucharest
 
Content Recommendation through Semantic Annotation of User Reviews and Linked...
Content Recommendation through Semantic Annotation of User Reviews and Linked...Content Recommendation through Semantic Annotation of User Reviews and Linked...
Content Recommendation through Semantic Annotation of User Reviews and Linked...
Iacopo Vagliano
 
Big Data, Analytics, and Content Recommendations on AWS
Big Data, Analytics, and Content Recommendations on AWSBig Data, Analytics, and Content Recommendations on AWS
Big Data, Analytics, and Content Recommendations on AWS
Amazon Web Services
 
Movie_Recommendation.pdf
Movie_Recommendation.pdfMovie_Recommendation.pdf
Movie_Recommendation.pdf
MrShaikh12
 
Introduction to recommender systems
Introduction to recommender systemsIntroduction to recommender systems
Introduction to recommender systems
Rami Alsalman
 

Similar to Movie Recommendation System - MovieLens Dataset (20)

movieRecommendation_FinalReport
movieRecommendation_FinalReportmovieRecommendation_FinalReport
movieRecommendation_FinalReport
 
Community detection recommender system
Community detection   recommender systemCommunity detection   recommender system
Community detection recommender system
 
OMRES-ProgressPresentation1.pptx
OMRES-ProgressPresentation1.pptxOMRES-ProgressPresentation1.pptx
OMRES-ProgressPresentation1.pptx
 
Graph Gurus Episode 2: Building a Movie Recommendation Engine
Graph Gurus Episode 2: Building a Movie Recommendation EngineGraph Gurus Episode 2: Building a Movie Recommendation Engine
Graph Gurus Episode 2: Building a Movie Recommendation Engine
 
End-to-end machine learning project in Arabic
End-to-end machine learning project in ArabicEnd-to-end machine learning project in Arabic
End-to-end machine learning project in Arabic
 
movie recommender system using vectorization and SVD tech
movie recommender system using vectorization and SVD techmovie recommender system using vectorization and SVD tech
movie recommender system using vectorization and SVD tech
 
Recommender Systems, Matrices and Graphs
Recommender Systems, Matrices and GraphsRecommender Systems, Matrices and Graphs
Recommender Systems, Matrices and Graphs
 
Software Suite for Movie Market Analysis
Software Suite for Movie Market AnalysisSoftware Suite for Movie Market Analysis
Software Suite for Movie Market Analysis
 
Online social network based object recommendation system
Online social network based object recommendation systemOnline social network based object recommendation system
Online social network based object recommendation system
 
Movie Recommendation System_final.pptx
Movie Recommendation System_final.pptxMovie Recommendation System_final.pptx
Movie Recommendation System_final.pptx
 
Crowd-Based Personalized Natural Language Explanations for Recommendations
Crowd-Based Personalized Natural Language Explanations for Recommendations Crowd-Based Personalized Natural Language Explanations for Recommendations
Crowd-Based Personalized Natural Language Explanations for Recommendations
 
Presentation (1).pptx
Presentation (1).pptxPresentation (1).pptx
Presentation (1).pptx
 
Entity Recommendations Using Hierarchical Knowledge Bases
Entity Recommendations Using Hierarchical Knowledge BasesEntity Recommendations Using Hierarchical Knowledge Bases
Entity Recommendations Using Hierarchical Knowledge Bases
 
Lessons learnt at building recommendation services at industry scale
Lessons learnt at building recommendation services at industry scaleLessons learnt at building recommendation services at industry scale
Lessons learnt at building recommendation services at industry scale
 
Challenge@rule ml2015 rule based recommender systems for the Web of Data
Challenge@rule ml2015 rule based recommender systems for the Web of DataChallenge@rule ml2015 rule based recommender systems for the Web of Data
Challenge@rule ml2015 rule based recommender systems for the Web of Data
 
Movie recommender system using the user's psychological profile
Movie recommender system using the user's psychological profileMovie recommender system using the user's psychological profile
Movie recommender system using the user's psychological profile
 
Content Recommendation through Semantic Annotation of User Reviews and Linked...
Content Recommendation through Semantic Annotation of User Reviews and Linked...Content Recommendation through Semantic Annotation of User Reviews and Linked...
Content Recommendation through Semantic Annotation of User Reviews and Linked...
 
Big Data, Analytics, and Content Recommendations on AWS
Big Data, Analytics, and Content Recommendations on AWSBig Data, Analytics, and Content Recommendations on AWS
Big Data, Analytics, and Content Recommendations on AWS
 
Movie_Recommendation.pdf
Movie_Recommendation.pdfMovie_Recommendation.pdf
Movie_Recommendation.pdf
 
Introduction to recommender systems
Introduction to recommender systemsIntroduction to recommender systems
Introduction to recommender systems
 

Recently uploaded

Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
jerlynmaetalle
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
v3tuleee
 
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTESAdjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
Subhajit Sahu
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
ahzuo
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
axoqas
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
NABLAS株式会社
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
slg6lamcq
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
jerlynmaetalle
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
John Andrews
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
TravisMalana
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
manishkhaire30
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
ahzuo
 
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfEnhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
GetInData
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
balafet
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
AbhimanyuSinha9
 
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
mzpolocfi
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
roli9797
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
rwarrenll
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
haila53
 

Recently uploaded (20)

Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
 
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理一比一原版(UofS毕业证书)萨省大学毕业证如何办理
一比一原版(UofS毕业证书)萨省大学毕业证如何办理
 
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTESAdjusting OpenMP PageRank : SHORT REPORT / NOTES
Adjusting OpenMP PageRank : SHORT REPORT / NOTES
 
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
一比一原版(UIUC毕业证)伊利诺伊大学|厄巴纳-香槟分校毕业证如何办理
 
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
做(mqu毕业证书)麦考瑞大学毕业证硕士文凭证书学费发票原版一模一样
 
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
【社内勉強会資料_Octo: An Open-Source Generalist Robot Policy】
 
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
一比一原版(Adelaide毕业证书)阿德莱德大学毕业证如何办理
 
The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...The affect of service quality and online reviews on customer loyalty in the E...
The affect of service quality and online reviews on customer loyalty in the E...
 
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
Chatty Kathy - UNC Bootcamp Final Project Presentation - Final Version - 5.23...
 
Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)Malana- Gimlet Market Analysis (Portfolio 2)
Malana- Gimlet Market Analysis (Portfolio 2)
 
Learn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queriesLearn SQL from basic queries to Advance queries
Learn SQL from basic queries to Advance queries
 
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
一比一原版(CBU毕业证)卡普顿大学毕业证如何办理
 
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdfEnhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
Enhanced Enterprise Intelligence with your personal AI Data Copilot.pdf
 
Machine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptxMachine learning and optimization techniques for electrical drives.pptx
Machine learning and optimization techniques for electrical drives.pptx
 
Criminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdfCriminal IP - Threat Hunting Webinar.pdf
Criminal IP - Threat Hunting Webinar.pdf
 
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...Best best suvichar in gujarati english meaning of this sentence as Silk road ...
Best best suvichar in gujarati english meaning of this sentence as Silk road ...
 
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
一比一原版(Dalhousie毕业证书)达尔豪斯大学毕业证如何办理
 
Analysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performanceAnalysis insight about a Flyball dog competition team's performance
Analysis insight about a Flyball dog competition team's performance
 
My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.My burning issue is homelessness K.C.M.O.
My burning issue is homelessness K.C.M.O.
 
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdfCh03-Managing the Object-Oriented Information Systems Project a.pdf
Ch03-Managing the Object-Oriented Information Systems Project a.pdf
 

Movie Recommendation System - MovieLens Dataset

  • 1. MOVIE RECOMMENDATION SYSTEM BANA7047-002 FINAL PROJECT • Group 2 • Jagruti Joshi • Priya Kumari • Pooja Sahare 1
  • 2. Part 1 • Background • Data • Preliminary Analysis 2
  • 3. Background Top Streaming Services Need for a recommendation system 13,000+ titles ~8 seconds Netflix’s total content library Average human attention span Data used in a recommendation system Impact of a recommendation system Watch Data Search Data Ratings Data Increased Revenues 3
  • 4. Data • Source: https://grouplens.org/datasets/movielens/ • recommended for education and development • Small: 100,000 ratings and 3,600 tag applications applied to 9,000 movies by 600 users. Last updated 9/2018. •Movies •movieId •title •genres •Ratings •userId •movieId •rating •timestamp •Tags •userId •movieId •tag •timestamp •Links •movieId •imdbId •tmdbId 9,742 100,836 1,587,923 9,742 4
  • 5. Preliminary Analysis  Data • Movies  Insights • No of movies released is increasing every year peaking at 733 in 2006. • Sharp decline is observed in recent years (could be due to not including recent releases in the data) 5
  • 6. Preliminary Analysis  Data • Movies  Insights • Drama, Comedy, Thriller, Action and Romance are the top 5 genres • The top 5 genres together contain ~60% of the movies 6
  • 7. Preliminary Analysis  Data • Movies  Insights • Drama and Comedy have consistently stayed the #1 and #2 genres over the years • Similar distribution of genres over the years 7
  • 8. Preliminary Analysis  Data • Movies • Ratings  Insights • Median > Mean • Left skewed distribution • Do most users rate most movies on the higher end of the 0-5 scale? 8
  • 9. Preliminary Analysis  Data • Movies • Ratings  Insights • Most users rate less than 1000 movies • Most users rate movies on the higher end of the 0-5 scale • Most movies receive less than 100 ratings • Most movies are rated on the higher end of the 0-5 scale 9
  • 10. Preliminary Analysis  Data • Movies • Ratings  Insights • Average Ratings for all genres is between 3 and 4 • Among the top 5 genres, Drama and Romance movies have higher ratings compared to the remaining three #1 #2 #4#3#5 10
  • 11. Preliminary Analysis  Data • Movies • Tags  Insights • Tags can be useful to create sub- genres and drill deeper into a specific genre e.g. Sci-Fi movies are a huge sub-genre within Action movies 11
  • 12. Part 2 • Content Based Filtering • User Based Collaborative Filtering • Item Based Collaborative Filtering • Singular Value Decomposition 12
  • 13. Model 1: Content Based Filtering (CBF) Approach • Genre-based approach • Without factoring release year, algorithm recommends very old movies • Term Frequency (TF) and Inverse Document Frequency (IDF) used to determine the relative importance of genres • Vector Space Model • Each movie represented by a vector of its attributes • For similar movies, • Angle between their vectors is small • Cosine of angle between their vectors is large 13 1 movie Movie Genre, Release Year CBF Algorithm TF/IDF, Cosine Similarity Score n movies Similar in Genres, Closer in Release Years Sort Results Highest to Lowest Cosine Similarity Scores Filter Results Select Top 20 Movies Recommend Results Display Movies in User’s Watch Next List
  • 14. Model 1: Content Based Filtering (CBF) Results 14 1. Rampage (2018) 2. Solo: A Star Wars Story (2018) 3. Ant-Man and the Wasp (2018) 4. Deadpool 2 (2018) 5. Sorry to Bother You (2018) 6. Pacific Rim: Uprising (2018) 7. A Wrinkle in Time (2018) 8. Jupiter Ascending (2015) 9. Avengers: Age of Ultron (2015) 10.Ant-Man (2015) 11.Power/Rangers (2015) 12.Turbo Kid (2015) 13.Hardcore Henry (2015) 14.Iron Man (2008) 15.Journey to the Center of the Earth (2008) 16.Mutant Chronicles (2008) 17.Outlander (2008) 18.Doctor Strange (2016) 19.Independence Day: Resurgence (2016) 20.Star Trek Beyond (2016) Avengers: Infinity War - Part I (2018) Toy Story (1995) Insidious: Chapter 3 (2015) 1. Gordy (1995) 2. Reckless (1995) 3. Ninja Scroll (Jûbei ninpûchô) (1995) 4. Tale of Despereaux, The (2008) 5. Wild, The (2006) 6. Asterix and the Vikings (Astérix etlesVikings) (2006) 7. Monsters, Inc. (2001) 8. The Good Dinosaur (2015) 9. Toy Story 2 (1999) 10.Shrek the Third (2007) 11.Moana (2016) 12.Adventures of Rocky and Bullwinkle,The-2000 13.Emperor's New Groove, The (2000) 14.Turbo (2013) 15.Antz (1998) 16.Jumanji (1995) 17.Indian in the Cupboard, The (1995) 18.Shrek (2001) 19.TMNT (Teenage Mutant Ninja Turtles)(2007) 20.Three Wishes (1995) 1. The Gallows (2015) 2. Frankenstein (2015) 3. Maggie (2015) 4. Body (2015) 5. Massu Engira Maasilamani (2015) 6. Into the Grizzly Maze (2015) 7. Return to Sender (2015) 8. Careful What You Wish For (2015) 9. Spotlight (2015) 10. Mojave (2015) 11. Knock Knock (2015) 12. Zipper (2015) 13. The Stanford Prison Experiment (2015) 14. Partisan (2015) 15. Bridge of Spies (2015) 16. The Perfect Guy (2015) 17. Silent Hill (2006) 18. Nightmare on Elm Street, A (2010) 19. Insidious (2010) 20. Paperhouse (1988)
  • 15. Model 2: User-based Collaborative Filtering (UBCF) Approach & Results 15 • Find look alike users based on similarity • Recommend movies which user’s look-alike has chosen in past. • Very effective due to creation of user profiles • Very time and resource consuming algorithm as computations are made for every user pair. Thus, we only take 20% of original data • Results User 1 • Avengers • Age of Ultron • Civil War • Infinity War • Iron Man • Iron Man 2 • Iron Man 3 • Endgame Not Watched Watched Watched Recommend Similar Sample Data For Model 20% of Original Data Model Train Data 80% of Sample Data Model Test Data 20% of Sample Data Root Mean Square Error 24167
  • 16. Model 3: Item-based Collaborative Filtering (IBCF) Approach & Results 16 • Like UBCF, but instead of finding user's look- alike, we find a movie's look-alike. • Recommend alike movies to user who has rated this movie. • Far less time and resource consuming than UBCF but we’ve used the same 20% subset of original data for model comparison • Results Watched User 1 Avengers • Age of Ultron • Civil War • Infinity War • Endgame Similar Recommend Sample Data For Model 20% of Original Data Model Train Data 80% of Sample Data Model Test Data 20% of Sample Data Root Mean Square Error 29123
  • 17. • Basic essence of SVD is to decomposes a matrix of any shape into a product of 3 matrices with notable mathematical properties: X = U S VT • Decomposition of ratings matrix results in an ordered matrix of a user feature matrix and an item feature matrix which encapsulate the variance associated with every direction of the matrix • Larger variances indicate less redundancy and less correlation and hold features of data • A representative subset of user rating directions or principal components to recommend movies is utilized • Overall SVD aims to find the smallest condensed subset of features by discarding features imparting noise 17 Model 4: Singular Value Decomposition (SVD) Approach Movie User Sci-Fi FemaleMale Wonder Woman Captain Marvel Drama Avengers Endgame Iron Man Captain America Thelma & Louise Legally Blonde The Shawshank Redemption Fight Club
  • 18. Model 4: Singular Value Decomposition (SVD) Results Top rated movies by user ID 400 18 Recommended movies for user ID 400
  • 19. Model Comparison & Recommendations Model Proportion of Data RMSE UBCF 20% 24167 IBCF 20% 29123 SVD 20% 0.91 19 • Movie recommendations are very subjective and vary from one user to another • Each model has a different approach and its own set of pros and cons • Weighing all the pros and cons, we would recommend SVD as it is a good mix of both collaborative filtering methods
  • 20. References • Slide 2: Background • https://www.comparitech.com/blog/vpn-privacy/netflix-statistics-facts- figures/ • Slide 3: Data • https://grouplens.org/about/what-is-grouplens/ • https://movielens.org/info/about 20
  • 21. References • Slides 11,13,15: Collaborative Filtering, UBCF and IBCF • https://github.com/khanhnamle1994/movielens/blob/master/Content_Base d_and_Collaborative_Filtering_Models.ipynb • https://www.comparitech.com/blog/vpn-privacy/netflix-statistics-facts- figures/ • Slide 17: SVD • http://www.cs.carleton.edu/cs_comps/0607/recommend/recommender/svd. html • https://alyssaq.github.io/2015/20150426-simple-movie-recommender-using- svd/ • https://www.dataminingapps.com/2020/02/singular-value-decomposition-in- recommender-systems/ 21

Editor's Notes

  1. Background information of the project. The objective you want to achieve.
  2. discussion of data source and nature of the variables involved in the analysis GrouplLens - a research lab in the Department of Computer Science and Engineering at the University of Minnesota, Twin Cities specializing in recommender systems MovieLens - a research site run by GroupLens Research, a unique research vehicle for dozens of undergraduates and graduate students researching various aspects of personalization and filtering technologies
  3. Exploratory analysis of the data set, summary, plots, and maybe some kind of linear regression fit to check the feasibility of the problem as well as get a better idea of how this data looks.
  4. Exploratory analysis of the data set, summary, plots, and maybe some kind of linear regression fit to check the feasibility of the problem as well as get a better idea of how this data looks.
  5. Exploratory analysis of the data set, summary, plots, and maybe some kind of linear regression fit to check the feasibility of the problem as well as get a better idea of how this data looks.
  6. Exploratory analysis of the data set, summary, plots, and maybe some kind of linear regression fit to check the feasibility of the problem as well as get a better idea of how this data looks.
  7. Exploratory analysis of the data set, summary, plots, and maybe some kind of linear regression fit to check the feasibility of the problem as well as get a better idea of how this data looks.
  8. Exploratory analysis of the data set, summary, plots, and maybe some kind of linear regression fit to check the feasibility of the problem as well as get a better idea of how this data looks.
  9. Exploratory analysis of the data set, summary, plots, and maybe some kind of linear regression fit to check the feasibility of the problem as well as get a better idea of how this data looks.
  10. Ratings are associated with both users and movies While creating the best representation of our original features, we remove the unnecessary noise By retaining features the features which have larger variances
  11. Different genres in input -> Different genres in output as well Holistic view