A Hybrid Recommender System:
User Profiling from Keywords and
Ratings
Ana Stanescu, Swapnil Nagar, Doina Caragea
2013 IEEE/WIC/ACM International Conferences on Web
Intelligence (WI) and Intelligent Agent Technology (IAT)
Outline
 Introduction
 Related Work
 Approaches
 Experimental Setup
 Results
 Conclusion
Introduction(1/3)
 Recommendation systems[3]
 Content-Based
 Recommends items similar to those the user preferred in the past.
 Data scarcity problem.
 Cannot identify new and different items.
 Collaborative Filtering
 Based on the user-user similarity.
 A new item cannot be recommended.
 Hybrid
• [3] M. Balabanovic and Y. Shoham. Fab: content-based, collaborative recommendation.
Communications of the ACM, 40, 1997.
Introduction(2/3)
 We propose a hybrid system that mitigates the
data sparsity problem and reduces the noise from
user-generated content.
 We adapt for movies the Weighted Tag
Recommender (WTR) approach from [14].
 Liang et al. [14] addressed the problem of recommending
books on Amazon and built their system exclusively from
tag information.
• [14] H. Liang, Y. Xu, Y. Li, R. Nayak, and G. Shaw. A hybrid recommender systems
based on weighted tags. 10th SIAM International Conference on Data Mining, 2010.
Introduction(3/3)
 Weighted Tag-Rating Recommender (WTRR).
 Weighted Keyword-Rating Recommender
(WKRR).
 Both our keyword and tag representations of users
can help alleviate the noise and semantic
ambiguity problems inherent in the information
contributed by users of social networks.
Related Work(1/3)
 Tagging is a type of labeling, whose purpose is to
assist users in the process of finding content on
the web. [18]
 Tags are free annotations, and there are no
constraints on assigning tags.
 A hybrid system proposed by Liang et al. [14]
addresses these problems by using weighted tags.
• [14] H. Liang, Y. Xu, Y. Li, R. Nayak, and G. Shaw. A hybrid recommender systems based
on weighted tags. 10th SIAM International Conference on Data Mining, 2010.
• [18] A. Said, B. Kille, E. W. De Luca, and S. Albayrak. Personalizing tags: a folksonomy-
like approach for recommending movies. In Proceedings of the 2nd International Workshop
on Information Heterogeneity and Fusion in Recommender Systems, HetRec ’11, 2011.
Related Work (2/3)
 For domains where both tags and ratings are
available, a recommender system should exploit
all the information.
 Systems that leverage ratings explicitly provided by
the users [5] are known to perform well.
 Ratings can also be noisy [2].
• [5] R. M. Bell, Y. Koren, and C. Volinsky. The Bellkor 2008 solution to the Netflix prize.
2008.
• [2] X. Amatriain, J. Pujol, and N. Oliver. I like it... i like it not: Evaluating user ratings noise
in recommender systems. In User Modeling, Adaptation, and Personalization, Lecture
Notes in Computer Science. 2009.
Related Work (3/3)
 The system proposed by [6] is an ensemble of
various recommenders primarily used for mining
and aggregating the information from various
sources.
 In [12], the authors propose learning multiple
models which can incorporate different types of
inputs to predict the preferences of diverse users.
• [6] E. Bothos, K. Christidis, D. Apostolou, and G. Mentzas. Information market based
recommender systems fusion. In Proceedings of the 2nd International Workshop on
Information Heterogeneity and Fusion in Recommender Systems, 2011.
• [12] C. Jones, J. Ghosh, and A. Sharma. Learning multiple models for exploiting predictive
heterogeneity in recommender systems. 2011.
Approaches – WTRR(1/5)
 Weighted Tag-Rating Recommender(WTRR)
 The book recommender system proposed in [14] is
built from tag information only.
 Tags may not always capture the true preference of
the user.
 We incorporate the actual ratings.
• [14] H. Liang, Y. Xu, Y. Li, R. Nayak, and G. Shaw. A hybrid recommender systems based
on weighted tags. 10th SIAM International Conference on Data Mining, 2010.
Approaches – WTRR(2/5)
 Tag Relevance
 Finding the meaning of each tag for each user individually
 Tag Relatedness Metric
• Sum of the ratings assigned to movie m_i by all users who used tag t_x.
• Sum of all ratings from the users who tagged m_i.
• The relatedness metric measures how similar tag t_y is to a given tag t_x.
• The set of movies tagged with t_x by u_i.
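The slide's formula is not reproduced in this text, but the first two annotations describe two rating sums for a movie and a tag. The sketch below shows one possible reading, in which the two sums form a ratio. The ratio form, the restriction to users who applied the tag to that same movie, and the names `tag_rating_share`, `ratings`, and `tag_assignments` are all assumptions made for illustration, not the authors' definition.

```python
def tag_rating_share(movie, tag, ratings, tag_assignments):
    """Share of a movie's rating mass coming from users who applied `tag` to it.

    ratings: dict mapping (user, movie) -> rating
    tag_assignments: dict mapping (user, movie) -> set of tags
    Assumption: the two sums on the slide are combined as a ratio.
    """
    # Users who tagged this movie at all.
    taggers = [u for (u, m) in tag_assignments if m == movie]
    # All ratings of the movie from users who tagged it.
    total = sum(ratings.get((u, movie), 0.0) for u in taggers)
    # Ratings of the movie from the users who applied `tag` to it.
    with_tag = sum(
        ratings.get((u, movie), 0.0)
        for u in taggers
        if tag in tag_assignments[(u, movie)]
    )
    return with_tag / total if total else 0.0


# Toy data: u3 rated the movie but never tagged it, so u3 is ignored.
ratings = {("u1", "m1"): 4.0, ("u2", "m1"): 2.0, ("u3", "m1"): 5.0}
tag_assignments = {("u1", "m1"): {"noir"}, ("u2", "m1"): {"slow"}}
share = tag_rating_share("m1", "noir", ratings, tag_assignments)
```

Returning 0.0 when nobody tagged the movie is a convenience choice for the sketch; a real system might prefer to treat that case as missing.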
Approaches – WTRR(3/5)
 User Profile
 To leverage the advantages of hybrid systems,
users' topic preferences and movie preferences are
combined.
 Every user is represented by a profile, encoded
using a vector of weights:
• u_i^T : user u_i’s topic preferences (values denoting how much u_i is interested in each tag).
• u_i^M : user u_i’s movie preferences.
Approaches – WTRR(4/5)
 Weight of each tag for a user
 Total relevance weight of ty for ui
• Sum of the ratings assigned to movie m_j by all users who used t_x.
• Sum of all ratings assigned to movie m_j by all users who tagged it.
Approaches – WTRR(5/5)
 Inverse user frequency of tag ty
 The tag representation of each user
(values of the topic preference vector u_i^T for each user u_i)
• |U_{t_y}| is the number of users who used t_y.
• e is Euler’s number.
Approaches – WKRR(1/4)
 Weighted Keyword-Rating Recommender (WKRR).
 Our algorithm dynamically creates a user profile
from IMDB movie keywords and explicit user
ratings.
 As in WTRR, we profile users by their preferences.
• u_i^K : user u_i’s keyword topic preferences.
• u_i^R : user u_i’s rating-based movie preferences.
Approaches – WKRR(2/4)
 Movie Description Based on Weighted Keywords
 movie keyword relevance metric
Approaches – WKRR(3/4)
 The Representation of Keywords
 degree of connection between keywords
 representation of keyword kx
Approaches – WKRR(4/4)
 User Profile Generation From Keywords
 Weight of a keyword to a user
 Total relevance weight of a keyword for a user
Approaches – Neighborhood Formation(1/2)
 In order to predict a user’s rating for an unseen
movie, we first set out to find the community of
users sharing similar taste.
 For each user u, identify an ordered list of the k most
similar users such that sim(u, u_1) is the maximum,
sim(u, u_2) is the second highest, and so on.
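The neighborhood step can be sketched as a top-k selection over a similarity function. This is a minimal illustration only: the function name `top_k_neighbors`, the toy similarity table, and the use of a heap-based selection are assumptions, and `sim` stands in for whatever user-user similarity is in use.

```python
import heapq


def top_k_neighbors(user, candidates, sim, k):
    """Return the k users most similar to `user`, most similar first."""
    others = [v for v in candidates if v != user]
    # nlargest returns results sorted from highest to lowest key.
    return heapq.nlargest(k, others, key=lambda v: sim(user, v))


# Hypothetical similarity values for a single target user "u".
toy_sim = {("u", "a"): 0.2, ("u", "b"): 0.9, ("u", "c"): 0.5}
neighbors = top_k_neighbors(
    "u", ["u", "a", "b", "c"], lambda x, y: toy_sim[(x, y)], k=2
)
```

Here `neighbors` lists "b" first and "c" second, matching the ordering described on the slide.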
Approaches – Neighborhood Formation(2/2)
 The similarity between two users
 In this paper, ω = 0.9.
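The similarity formula itself is not reproduced in this text. Since each user profile has a topic part and a movie part, one plausible form is a convex combination of two per-part similarities weighted by ω. The sketch below assumes that form, assumes cosine similarity for each part, and assumes ω weights the topic part; none of these choices is confirmed by the slides, and the function names are hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity of two equal-length weight vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def combined_similarity(topic_u, topic_v, movie_u, movie_v, omega=0.9):
    """Assumed form: omega * topic similarity + (1 - omega) * movie similarity."""
    return omega * cosine(topic_u, topic_v) + (1.0 - omega) * cosine(movie_u, movie_v)


# Identical topic vectors, orthogonal movie vectors.
s = combined_similarity([1.0, 0.0], [1.0, 0.0], [1.0, 0.0], [0.0, 1.0])
```

With ω = 0.9 the topic part dominates under this assumed form, so two users with identical topic profiles score close to 0.9 even when their rated movies do not overlap.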
Approaches – Rating Prediction Formula(1/2)
 Traditional Top N algorithms choose the Top N
most similar neighbors to predict the missing
value.
 Set of users similar to u:
Approaches – Rating Prediction Formula(2/2)
 To calculate the missing ratings we used a popular
user-based prediction formula described in [11].
• [11] J. L. Herlocker, J. A. Konstan, L. G. Terveen, and J. T. Riedl. Evaluating collaborative
filtering recommender systems. ACM Transactions on Information Systems, 2004.
• r̄_u : the average of the ratings given by user u.
• w_{uv} : the similarity value between user u and user v.
• σ_u : the standard deviation of ratings given by user u.
• N(u) : set of most similar users to user u.
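The symbols listed above (user mean, similarity weight, standard deviation, neighbor set) match the z-score-normalized user-based predictor commonly used in collaborative filtering. The slide's exact formula is not reproduced in this text, so the sketch below assumes that standard form. The function name, the data layout, and the use of the population standard deviation are illustrative assumptions.

```python
from statistics import mean, pstdev


def predict_rating(u, movie, ratings, neighbors, weights):
    """Assumed form: mean_u + std_u * sum(w_uv * z_v) / sum(|w_uv|).

    ratings: dict user -> dict movie -> rating
    weights: dict (u, v) -> similarity value
    """
    vals_u = list(ratings[u].values())
    mean_u, std_u = mean(vals_u), pstdev(vals_u)
    num = den = 0.0
    for v in neighbors:
        if movie not in ratings[v]:
            continue  # neighbor has no rating for this movie
        vals_v = list(ratings[v].values())
        std_v = pstdev(vals_v)
        if std_v == 0:
            continue  # z-score undefined for constant raters
        w = weights[(u, v)]
        num += w * (ratings[v][movie] - mean(vals_v)) / std_v
        den += abs(w)
    # Fall back to the user's own mean when no neighbor contributes.
    return mean_u if den == 0 else mean_u + std_u * num / den


ratings = {
    "u": {"a": 2.0, "b": 4.0},
    "v": {"a": 1.0, "b": 3.0, "c": 5.0},
}
p = predict_rating("u", "c", ratings, ["v"], {("u", "v"): 0.5})
```

Normalizing by each neighbor's mean and spread keeps a habitually generous or harsh rater from skewing the prediction.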
Experimental Setup(1/3)
 Dataset
 hetrec2011-movielens-2k, dated May 2011 [7]
 Based on the original MovieLens10M dataset, published
by the GroupLens research group.
• [7] I. Cantador, P. Brusilovsky, and T. Kuflik. 2nd workshop on information heterogeneity and
fusion in recommender systems (HetRec 2011). In Proceedings of the 5th ACM conference on
Recommender systems, 2011.
• http://www.grouplens.org
Experimental Setup(2/3)
 Evaluation Metrics
 Predictive accuracy metrics
 Root Mean Squared Error (RMSE)
 Mean Absolute Error (MAE)
• N : the total number of ratings from all users.
• p_{u,m} : the predicted rating for user u on movie m.
• r_{u,m} : the actual rating for movie m assigned by the user u.
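Both metrics follow their standard definitions: RMSE is the square root of the mean squared difference between predicted and actual ratings, and MAE is the mean absolute difference, each averaged over the N rated (user, movie) pairs. A minimal sketch, with the pair-list layout and function names chosen for illustration:

```python
import math


def rmse(pairs):
    """Root mean squared error over (predicted, actual) rating pairs."""
    return math.sqrt(sum((p - r) ** 2 for p, r in pairs) / len(pairs))


def mae(pairs):
    """Mean absolute error over (predicted, actual) rating pairs."""
    return sum(abs(p - r) for p, r in pairs) / len(pairs)


# Three hypothetical (predicted, actual) pairs.
pairs = [(3.0, 4.0), (5.0, 5.0), (2.0, 4.0)]
rmse_value = rmse(pairs)
mae_value = mae(pairs)
```

Because RMSE squares each error before averaging, it penalizes large misses more heavily than MAE does, which is why the two are usually reported together.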
Experimental Setup(3/3)
 Experiments
 We trained our algorithm on the training set and then
predicted the ratings in the test set.
 We kept 80% of users for training, while 20% of
users were set aside for testing.
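The slides do not say how the users were assigned to the two groups. The sketch below assumes a seeded random shuffle of user ids followed by an 80/20 cut; the function name `split_users`, the seed, and the id format are hypothetical.

```python
import random


def split_users(users, train_fraction=0.8, seed=0):
    """Split user ids into (train, test) lists after a seeded shuffle."""
    # Sort first so the result does not depend on the input order.
    shuffled = sorted(users)
    random.Random(seed).shuffle(shuffled)
    cut = int(round(train_fraction * len(shuffled)))
    return shuffled[:cut], shuffled[cut:]


train_users, test_users = split_users([f"u{i}" for i in range(10)])
```

Fixing the seed makes the split reproducible across runs, which matters when comparing several recommenders on the same held-out users.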
Results(1/3)
 Comparison of WTRR, WKRR, and a purely
collaborative (PC) approach
Results(2/3)
 Comparison of the WKRR results with those of the
state-of-the-art approaches reported in [6] and [12].
• [6] E. Bothos, K. Christidis, D. Apostolou, and G. Mentzas. Information market based
recommender systems fusion. In Proceedings of the 2nd International Workshop on
Information Heterogeneity and Fusion in Recommender Systems, 2011.
• [12] C. Jones, J. Ghosh, and A. Sharma. Learning multiple models for exploiting predictive
heterogeneity in recommender systems. 2011.
Results(3/3)
Conclusion
 We propose a novel hybrid recommendation
technique.
 WTRR and WKRR use tags and keywords,
respectively.
 The results of our experiments show that the
performance of WKRR exceeds that of the other approaches.
 WTRR is better than WKRR when only the subset of
data with both tags and keywords is used.

A Hybrid Recommender System: User Profiling from Keywords and Ratings

  • 1. A Hybrid Recommender System: User Profiling from Keywords and Ratings. Ana Stanescu, Swapnil Nagar, Doina Caragea. 2013 IEEE/WIC/ACM International Conferences on Web Intelligence (WI) and Intelligent Agent Technology (IAT).
  • 2. Outline
      - Introduction
      - Related Work
      - Approaches
      - Experimental Setup
      - Results
      - Conclusion
  • 3. Introduction (1/3)
      - Recommendation systems [3]
          - Content-based
              - Recommends items similar to what the user preferred in the past.
              - Data scarcity problem.
              - Cannot identify new and different items.
          - Collaborative filtering
              - Based on user-user similarity.
              - A new item cannot be recommended.
          - Hybrid
      - [3] M. Balabanovic and Y. Shoham. Fab: content-based, collaborative recommendation. Communications of the ACM, 40, 1997.
  • 4. Introduction (2/3)
      - We propose a hybrid system that mitigates the data sparsity problem and reduces the noise from user-generated content.
      - We adapt for movies the Weighted Tag Recommender (WTR) approach from [14].
      - Liang et al. [14] addressed the problem of recommending books on Amazon and built their system exclusively from tag information.
      - [14] H. Liang, Y. Xu, Y. Li, R. Nayak, and G. Shaw. A hybrid recommender systems based on weighted tags. 10th SIAM International Conference on Data Mining, 2010.
  • 5. Introduction (3/3)
      - Weighted Tag-Rating Recommender (WTRR).
      - Weighted Keyword-Rating Recommender (WKRR).
      - Both our keyword and tag representations of users can help alleviate the noise and semantic ambiguity problems inherent in the information contributed by users of social networks.
  • 6. Related Work (1/3)
      - Tagging is a type of labeling whose purpose is to assist users in finding content on the web [18].
      - Tags are free annotations, and there are no constraints on assigning tags.
      - A hybrid system proposed by Liang et al. [14] addresses these problems by using weighted tags.
      - [14] H. Liang, Y. Xu, Y. Li, R. Nayak, and G. Shaw. A hybrid recommender systems based on weighted tags. 10th SIAM International Conference on Data Mining, 2010.
      - [18] A. Said, B. Kille, E. W. De Luca, and S. Albayrak. Personalizing tags: a folksonomy-like approach for recommending movies. In Proceedings of the 2nd International Workshop on Information Heterogeneity and Fusion in Recommender Systems, HetRec '11, 2011.
  • 7. Related Work (2/3)
      - For domains where both tags and ratings are available, a recommender system should exploit all the information.
      - Systems that leverage ratings explicitly provided by the users [5] are known to perform well.
      - Ratings can also be noisy [2].
      - [5] R. M. Bell, Y. Koren, and C. Volinsky. The Bellkor 2008 solution to the Netflix prize. 2008.
      - [2] X. Amatriain, J. Pujol, and N. Oliver. I like it... i like it not: Evaluating user ratings noise in recommender systems. In User Modeling, Adaptation, and Personalization, Lecture Notes in Computer Science. 2009.
  • 8. Related Work (3/3)
      - The system proposed in [6] is an ensemble of various recommenders, primarily used for mining and aggregating information from various sources.
      - In [12], the authors propose learning multiple models that can incorporate different types of inputs to predict the preferences of diverse users.
      - [6] E. Bothos, K. Christidis, D. Apostolou, and G. Mentzas. Information market based recommender systems fusion. In Proceedings of the 2nd International Workshop on Information Heterogeneity and Fusion in Recommender Systems, 2011.
      - [12] C. Jones, J. Ghosh, and A. Sharma. Learning multiple models for exploiting predictive heterogeneity in recommender systems. 2011.
  • 9. Approaches – WTRR (1/5)
      - Weighted Tag-Rating Recommender (WTRR).
      - The book recommender system proposed in [14] is built from tag information only.
      - Tags may not always capture the true preference of the user.
      - We incorporate the actual ratings.
      - [14] H. Liang, Y. Xu, Y. Li, R. Nayak, and G. Shaw. A hybrid recommender systems based on weighted tags. 10th SIAM International Conference on Data Mining, 2010.
  • 10. Approaches – WTRR (2/5)
      - Tag relevance: finding the meaning of each tag for each user individually.
      - Tag relatedness metric.
      - Annotated terms of the formulas:
          - The summation of ratings assigned to movie m_i by all the users who used tag t_x.
          - The summation of all the ratings from the users who tagged m_i.
          - A measure of how similar tag t_y is to a given tag t_x.
          - The set of movies tagged with t_x by u_i.
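The two summations annotated on the tag relevance slide can be sketched in code. The formula itself is not in the transcript, so this sketch assumes the relevance weight of tag t_x for movie m_i is the first summation divided by the second. The function name `tag_relevance`, the input layout, and the toy data are hypothetical.

```python
from collections import defaultdict


def tag_relevance(tag_assignments, ratings):
    """Return {(movie, tag): weight}.

    ASSUMPTION: weight = (sum of ratings of `movie` by users who applied `tag`)
                       / (sum of ratings of `movie` by all users who tagged it).
    tag_assignments: iterable of (user, movie, tag) triples.
    ratings: dict mapping (user, movie) to a numeric rating.
    """
    taggers_by_movie = defaultdict(set)
    taggers_by_movie_tag = defaultdict(set)
    for user, movie, tag in tag_assignments:
        # Only users who also rated the movie can contribute a rating.
        if (user, movie) in ratings:
            taggers_by_movie[movie].add(user)
            taggers_by_movie_tag[(movie, tag)].add(user)

    weights = {}
    for (movie, tag), users in taggers_by_movie_tag.items():
        numerator = sum(ratings[(u, movie)] for u in users)
        denominator = sum(ratings[(u, movie)] for u in taggers_by_movie[movie])
        weights[(movie, tag)] = numerator / denominator if denominator else 0.0
    return weights


# Toy data (hypothetical): two users tag m1 as "funny", one as "dark".
assignments = [("u1", "m1", "funny"), ("u2", "m1", "funny"), ("u3", "m1", "dark")]
toy_ratings = {("u1", "m1"): 4.0, ("u2", "m1"): 5.0, ("u3", "m1"): 1.0}
weights = tag_relevance(assignments, toy_ratings)
```

Under this assumption, a tag used mostly by users who rated the movie highly receives a weight close to 1, which agrees with the editor's note that a higher weight means the tag more likely represents the topic of the movie.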
  • 11. Approaches – WTRR (3/5)
      - User profile
      - To leverage the advantages of hybrid systems, users' topic preferences and movie preferences are combined.
      - Every user is represented by a profile, encoded as a vector of weights:
          - u_i^T: user u_i's topic preferences (values denoting how much u_i is interested in each tag).
          - u_i^M: user u_i's movie preferences.
  • 12. Approaches – WTRR (4/5)
      - Weight of each tag for a user.
      - Total relevance weight of t_y for u_i.
      - Annotated terms of the formulas:
          - The summation of ratings assigned to movie m_j by all the users who used t_x.
          - The summation of all ratings assigned to movie m_j by all the users who tagged it.
  • 13. Approaches – WTRR (5/5)
      - Inverse user frequency of tag t_y, where |U_{t_y}| is the number of users that used t_y and e is Euler's number.
      - The tag representation of each user (the values of the topic preference vector u_i^T for each user u_i).
  • 14. Approaches – WKRR (1/4)
      - Weighted Keyword-Rating Recommender (WKRR).
      - Our algorithm dynamically creates a user profile from IMDb movie keywords and explicit user ratings.
      - As in WTRR, we profile users by their preferences:
          - u_i^K: user u_i's keyword topic preferences.
          - u_i^R: user u_i's rating-based movie preferences.
  • 15. Approaches – WKRR (2/4)
      - Movie description based on weighted keywords.
      - Movie keyword relevance metric.
  • 16. Approaches – WKRR (3/4)
      - The representation of keywords.
      - Degree of connection between keywords.
      - Representation of keyword k_x.
  • 17. Approaches – WKRR (4/4)
      - User profile generation from keywords.
      - Weight of a keyword to a user.
      - Total relevance weight of a keyword for a user.
  • 18. Approaches – Neighborhood Formation (1/2)
      - To predict a user's rating for an unseen movie, we first find the community of users sharing similar taste.
      - For each user u, identify an ordered list of the k most similar users, such that sim(u, u_1) is the maximum, sim(u, u_2) is the second highest, and so on.
  • 19. Approaches – Neighborhood Formation (2/2)
      - The similarity between two users.
      - In this paper, ω = 0.9.
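Neighborhood formation can be sketched as follows. The similarity formula is not in the transcript, so the sketch assumes that ω weights a cosine similarity over the topic-preference vectors and (1 − ω) weights a cosine similarity over the movie-preference vectors. Both the cosine measure and the side that receives ω are assumptions. The names `cosine`, `combined_similarity`, `top_k_neighbors`, and the profile layout are hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity of two sparse vectors stored as dicts."""
    dot = sum(weight * b[key] for key, weight in a.items() if key in b)
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def combined_similarity(p, q, omega=0.9):
    """ASSUMPTION: convex combination of topic and movie similarity."""
    return (omega * cosine(p["topics"], q["topics"])
            + (1 - omega) * cosine(p["movies"], q["movies"]))


def top_k_neighbors(user, profiles, k, omega=0.9):
    """Return the k users most similar to `user`, most similar first."""
    scored = [
        (combined_similarity(profiles[user], profiles[other], omega), other)
        for other in profiles
        if other != user
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [(other, sim) for sim, other in scored[:k]]


# Toy profiles (hypothetical): b shares a's topic, c only shares a movie.
profiles = {
    "a": {"topics": {"funny": 1.0}, "movies": {"m1": 5.0}},
    "b": {"topics": {"funny": 2.0}, "movies": {"m1": 4.0}},
    "c": {"topics": {"dark": 1.0}, "movies": {"m1": 3.0}},
}
neighbors = top_k_neighbors("a", profiles, k=2)
```

With ω = 0.9, user c, who shares only a rated movie with a, scores far lower than user b, who shares a topic preference, so the ordered list places b before c.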
  • 20. Approaches – Rating Prediction Formula (1/2)
      - Traditional Top-N algorithms choose the N most similar neighbors to predict the missing value.
      - Set of users similar to u.
  • 21. Approaches – Rating Prediction Formula (2/2)
      - To calculate the missing ratings, we used a popular user-based prediction formula described in [11], where:
          - r̄_u: the average of the ratings given by user u.
          - w_uv: the similarity value between user u and user v.
          - σ_u: the standard deviation of the ratings given by user u.
          - N(u): the set of users most similar to user u.
      - [11] J. L. Herlocker, J. A. Konstan, L. G. Terveen, and J. T. Riedl. Evaluating collaborative filtering recommender systems. ACM Transactions on Information Systems, 2004.
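The symbols listed for the prediction formula (user mean, similarity weight, standard deviation, neighbor set) match the z-score-normalized user-based predictor, so the sketch below assumes that form: the target user's mean plus σ_u times the similarity-weighted average of the neighbors' z-scored ratings. The exact formula is not in the transcript. The function name `predict_rating`, the use of the population standard deviation, and the fallback to the user's mean are assumptions.

```python
from statistics import mean, pstdev


def predict_rating(u, movie, ratings, neighbors):
    """Predict user u's rating for `movie`.

    ratings: {user: {movie: rating}}.
    neighbors: list of (v, w_uv) pairs, i.e. N(u) with similarity weights.
    ASSUMPTION: z-score normalization with population standard deviation.
    """
    r_u = mean(ratings[u].values())
    sigma_u = pstdev(ratings[u].values())
    numerator = 0.0
    denominator = 0.0
    for v, w_uv in neighbors:
        if movie not in ratings[v]:
            continue  # this neighbor has not rated the movie
        sigma_v = pstdev(ratings[v].values())
        if sigma_v == 0:
            continue  # z-score undefined when all of v's ratings are equal
        r_v = mean(ratings[v].values())
        numerator += w_uv * (ratings[v][movie] - r_v) / sigma_v
        denominator += abs(w_uv)
    if denominator == 0:
        return r_u  # no usable neighbor: fall back to u's average rating
    return r_u + sigma_u * numerator / denominator


# Toy data (hypothetical): v rated m3 well above v's own average.
toy = {
    "u": {"m1": 2.0, "m2": 4.0},
    "v": {"m1": 1.0, "m2": 3.0, "m3": 5.0},
}
prediction = predict_rating("u", "m3", toy, [("v", 0.8)])
```

Normalizing by each neighbor's mean and standard deviation compensates for users who rate on systematically different scales, which is the usual motivation for this variant.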
  • 22. Experimental Setup (1/3)
      - Dataset: hetrec2011-movielens-2k, dated May 2011 [7].
      - Based on the original MovieLens 10M dataset, published by the GroupLens research group (http://www.grouplens.org).
      - [7] I. Cantador, P. Brusilovsky, and T. Kuflik. 2nd workshop on information heterogeneity and fusion in recommender systems (HetRec 2011). In Proceedings of the 5th ACM Conference on Recommender Systems, 2011.
  • 23. Experimental Setup (2/3)
      - Evaluation metrics: predictive accuracy metrics.
          - Root Mean Squared Error (RMSE)
          - Mean Absolute Error (MAE)
      - Notation:
          - N: the total number of ratings from all users.
          - p_{u,m}: the predicted rating for user u on movie m.
          - r_{u,m}: the actual rating for movie m assigned by user u.
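Both metrics follow their standard definitions over the N (predicted, actual) rating pairs: RMSE is the square root of the mean squared difference between p_{u,m} and r_{u,m}, and MAE is the mean absolute difference. A minimal sketch, with hypothetical function names and toy values:

```python
import math


def rmse(pairs):
    """Root Mean Squared Error over (predicted, actual) rating pairs."""
    if not pairs:
        raise ValueError("need at least one (predicted, actual) pair")
    return math.sqrt(sum((p - r) ** 2 for p, r in pairs) / len(pairs))


def mae(pairs):
    """Mean Absolute Error over (predicted, actual) rating pairs."""
    if not pairs:
        raise ValueError("need at least one (predicted, actual) pair")
    return sum(abs(p - r) for p, r in pairs) / len(pairs)


# Toy values (hypothetical): errors of -1, 0 and -2 rating points.
pairs = [(4.0, 5.0), (3.0, 3.0), (2.0, 4.0)]
toy_rmse = rmse(pairs)
toy_mae = mae(pairs)
```

Because RMSE squares each error before averaging, it penalizes large misses more heavily than MAE, so RMSE is never smaller than MAE on the same pairs.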
  • 24. Experimental Setup (3/3)
      - Experiments
      - We trained our algorithm on the training set and then predicted the ratings in the test set.
      - We kept 80% of users for training, while 20% of users were set aside for testing.
  • 25. Results (1/3)
      - Comparison of WTRR, WKRR, and the purely collaborative (PC) approach.
  • 26. Results (2/3)
      - Comparison of the WKRR results with the results of the state-of-the-art approaches reported in [6] and [12].
      - [6] E. Bothos, K. Christidis, D. Apostolou, and G. Mentzas. Information market based recommender systems fusion. In Proceedings of the 2nd International Workshop on Information Heterogeneity and Fusion in Recommender Systems, 2011.
      - [12] C. Jones, J. Ghosh, and A. Sharma. Learning multiple models for exploiting predictive heterogeneity in recommender systems. 2011.
  • 28. Conclusion
      - We propose a novel hybrid recommendation technique.
      - WTRR and WKRR use tags and keywords, respectively.
      - The results of our experiments show that the performance of WKRR exceeds that of the other approaches.
      - WTRR is better than WKRR when only the subset of data with both tags and keywords is used.

Editor's Notes

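  1. The higher the value of the weight, the more likely it is that the tag represents the topic of the movie (the relationship between a tag and a movie is expressed by the magnitude of the weight), combined with ratings.
  2. The relationship between tags and users.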