Recommender Systems
Vivek Murugesan
10th Jun 2017
Agenda
● Recommendations…
● Advantages
● Recommender systems
● Anatomy of recommender systems
● Types of recommender systems or algorithms
● Requirements for scaling
● Learning to rank model
● Where to start…?
● Questions
Recommendations...
● It is estimated that close to 30% of Amazon’s revenue comes from the way
they integrated recommendations
● About 75% of Netflix’s business is driven through recommendations
● Posts, groups, people and jobs recommended through LinkedIn
● Posts and friends suggested on Facebook
● Google search results, autocomplete suggestions, Google News, etc.
● Stories recommended through Quora
● ...
Advantages...
● Increase customer engagement
● Increase customer satisfaction by delivering
relevant content
● Increase sales with cross-sell options
● Drive more traffic
[Figure labels: Bought, Similar, Recommend]
Recommender systems
● Attempts to predict the preference of a given user for an item
● Based on the prediction, recommends a set of items to the user
● Uses:
○ Data: user history, social profile, etc.
○ Algorithms: collaborative filtering, content-based filtering, etc.
● Delivers the recommendations and lets the users interact with them
● Attempts to learn more about the users
Anatomy of recommender systems
[Diagram labels: Item inventory (ITEMS); User details (USERS); Build; Algorithms and models; Business objective; Recommendations; Deliver; USERS; Feedback loop]
User preference data
● Data that the recommender system relies on to learn about the users
● The same data is used to personalize recommendations for the users
● Explicit
○ Rating
○ Voting
○ Opinion
● Implicit
○ Click
○ Purchase
○ Follow
● Binary
○ Like/Dislike
○ Click/Ignore
● Rating
○ Scale
● Unary
○ Purchase
○ Views
● Normalize rating
○ Handling scales
○ User biases
● Unary data
○ Special processing
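The "Normalize rating" point can be illustrated with mean-centering, one common way to remove per-user bias (some users rate everything high, others low). This is a minimal sketch under assumptions: the rating matrix is hypothetical, NaN marks an item the user has not rated, and the `mean_center` helper is illustrative rather than anything from the slides.

```python
import numpy as np

# Hypothetical ratings: rows are users, columns are items, NaN = not rated.
ratings = np.array([
    [5.0, 4.0, np.nan, 1.0],
    [3.0, np.nan, 2.0, 1.0],
    [np.nan, 5.0, 5.0, 4.0],
])

def mean_center(r):
    """Subtract each user's mean observed rating from that user's ratings."""
    # nanmean ignores the missing entries when computing each row's mean.
    user_means = np.nanmean(r, axis=1, keepdims=True)
    return r - user_means, user_means

centered, user_means = mean_center(ratings)
```

After centering, a positive value means "liked more than this user's average" regardless of how generous the user is; missing ratings stay missing.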
Types of recommender systems or algorithms
● Non-personalized
○ Popular and trending products
○ Based on simple summary statistics
○ Handles the cold-start scenario in some cases
● Semi-personalized
○ Association rule mining
○ Market basket recommendation
○ Ephemeral (contextual)
● Personalized
○ Persistent (long-term interests)
○ Content based
○ Collaborative filtering
■ User-user similarity
■ Item-item similarity
■ Latent attributes through SVD
● Advanced
○ Hybrid (contextual + interest and content + collaborative)
○ Learning to rank
Content-based filtering
1. Associate each item with certain keywords or attributes
2. Build a vector of keyword preferences for the user, inferred from
their item preferences
3. Use TF-IDF or a similar mechanism to accumulate preferences on
keywords across items
4. Score each item by the cosine similarity (i.e. the dot product of the
two vectors divided by the product of their lengths) of its keyword
vector with the user’s keyword vector
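The four steps can be sketched end to end on a toy catalogue. Everything here is an assumption made for illustration: the item names, the keywords, the use of a plain IDF weight with a term frequency of 1 per keyword, and building the user profile as the sum of the liked items' vectors.

```python
import math
import numpy as np

# Step 1: hypothetical catalogue mapping each item to its keywords.
items = {
    "item_a": ["space", "robot", "adventure"],
    "item_b": ["space", "romance"],
    "item_c": ["cooking", "romance"],
    "item_d": ["robot", "space", "war"],
}
vocab = sorted({k for kws in items.values() for k in kws})
index = {k: i for i, k in enumerate(vocab)}
n_items = len(items)

# Document frequency: in how many items each keyword appears.
df = {k: sum(k in kws for kws in items.values()) for k in vocab}

def tfidf_vector(keywords):
    """TF-IDF vector with tf = 1 per keyword and idf = log(N / df)."""
    vec = np.zeros(len(vocab))
    for k in keywords:
        vec[index[k]] += math.log(n_items / df[k])
    return vec

item_vecs = {name: tfidf_vector(kws) for name, kws in items.items()}

def cosine(u, v):
    """Dot product divided by the product of the vector lengths."""
    denom = np.linalg.norm(u) * np.linalg.norm(v)
    return float(u @ v / denom) if denom else 0.0

# Steps 2-3: accumulate the keyword weights of the items the user liked.
liked = ["item_a"]
profile = sum(item_vecs[name] for name in liked)

# Step 4: score every unseen item against the user's keyword profile.
scores = {name: cosine(profile, vec)
          for name, vec in item_vecs.items() if name not in liked}
ranked = sorted(scores, key=scores.get, reverse=True)
```

In this toy data the user who liked `item_a` is matched most strongly to `item_d`, which shares the rarer keyword "robot", while `item_c` shares no keywords and scores zero.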
Association rule mining
● Attempts to identify rules like “people who bought X also bought Y”
● Based on the items bought together in a transaction or a time window
● Rules of the form X → Y are discovered/mined from the data
● X is called the antecedent and Y is called the consequent
● Metrics associated with the rules (N counts transactions, T is the set of all transactions):
○ Support = N(X ∪ Y) / |T| ( P(X AND Y): the fraction of transactions in which X and Y are bought
together)
○ Confidence = Support(X ∪ Y) / Support(X) ( P(Y|X): the fraction of buyers of X who also
bought Y)
○ Lift = Support(X ∪ Y) / [Support(X) * Support(Y)] ( P(X AND Y) / [P(X) * P(Y)] )
● Some popular products tend to appear in the consequent of many rules, simply because they are bought often
● Lift helps correct for this, because it divides by Support(Y), i.e. by how often Y is bought anyway
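The three metrics can be computed directly from their definitions. A minimal sketch, assuming a hypothetical list of five transactions; the helper names `support`, `confidence` and `lift` are illustrative and no rule-mining library is used.

```python
# Hypothetical transactions: each is the set of items bought together.
transactions = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
    {"bread", "milk", "jam"},
]

def support(itemset):
    """Fraction of transactions that contain every item in itemset."""
    itemset = set(itemset)
    return sum(itemset <= t for t in transactions) / len(transactions)

def confidence(x, y):
    """P(Y | X): of the transactions containing X, the share that also contain Y."""
    return support(set(x) | set(y)) / support(x)

def lift(x, y):
    """Confidence of X -> Y relative to how often Y is bought anyway."""
    return confidence(x, y) / support(y)

rule_support = support({"bread", "milk"})
rule_confidence = confidence({"bread"}, {"milk"})
rule_lift = lift({"bread"}, {"milk"})
```

Here the rule bread → milk has a fairly high confidence, yet its lift is below 1: milk is so common that buying bread does not make it any more likely, which is the popular-item effect that lift is meant to expose.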
Collaborative filtering
● Unlike content-based filtering, collaborative filtering doesn’t assume the
presence of attributes or keywords about the items
● Makes use of the User x Item matrix computed from the interaction history
● Generates recommendations based entirely on this matrix
● Doesn’t rely on any additional details about the items or users (like
demographics)
User-user similarity
1. Build a neighbourhood for each user from the User x Item matrix (using
the correlation or cosine similarity of each user with the others)
2. Use the likes/interactions of the top k most similar users to build a candidate
set of items to recommend to the target user a
3. Score each item in the candidate set based on the preference of each neighbour u
for it, weighted by u's similarity to user a
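These three steps can be sketched with cosine similarity over the rows of a small matrix. The matrix values, the function names, and the choice of a similarity-weighted sum as the scoring rule are assumptions for illustration; a zero entry stands for "no interaction".

```python
import numpy as np

# Hypothetical User x Item matrix (rows = users, columns = items, 0 = no interaction).
R = np.array([
    [5, 4, 0, 0, 1],
    [4, 5, 0, 3, 0],
    [1, 0, 5, 4, 0],
    [0, 1, 4, 5, 0],
], dtype=float)

def cosine_sim_matrix(M):
    """Pairwise cosine similarity between the rows of M."""
    norms = np.linalg.norm(M, axis=1, keepdims=True)
    norms[norms == 0] = 1.0  # avoid dividing by zero for empty rows
    U = M / norms
    return U @ U.T

def recommend_user_based(R, a, k=2, n=3):
    """Recommend up to n unseen items to user a from the top k similar users."""
    sims = cosine_sim_matrix(R)[a].copy()
    sims[a] = -np.inf                       # a user is not their own neighbour
    neighbours = np.argsort(sims)[::-1][:k]  # step 1: top k neighbourhood
    # Steps 2-3: similarity-weighted sum of the neighbours' preferences.
    scores = sims[neighbours] @ R[neighbours]
    scores[R[a] > 0] = -np.inf              # skip items user a already has
    order = np.argsort(scores)[::-1]
    return [int(i) for i in order[:n] if np.isfinite(scores[i])]

recs = recommend_user_based(R, a=0, k=2, n=2)
```

For user 0 the closest neighbour is user 1, so the item that user 1 rated and user 0 has not seen (item 3) comes out on top.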
Item-item similarity
1. Build an item neighbourhood based on the preferences expressed by different
users (i.e. based on the rating vectors of two different items)
2. Use the set of items that the user has already expressed preferences on to
generate candidate items to recommend to them
3. Score each item in the candidate set based on its similarity to the items that
the user has liked (expressed a preference for) earlier
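The item-item variant follows the same pattern but compares columns instead of rows. A minimal sketch under the same assumptions as before: a hypothetical matrix, illustrative function names, and a preference-weighted sum of similarities as the scoring rule.

```python
import numpy as np

# Hypothetical User x Item matrix (rows = users, columns = items, 0 = no interaction).
R = np.array([
    [5, 4, 0, 0, 1],
    [4, 5, 0, 3, 0],
    [1, 0, 5, 4, 0],
    [0, 1, 4, 5, 0],
], dtype=float)

def item_similarity(R):
    """Step 1: cosine similarity between item rating vectors (columns of R)."""
    norms = np.linalg.norm(R, axis=0, keepdims=True)
    norms[norms == 0] = 1.0     # avoid dividing by zero for unrated items
    N = R / norms
    S = N.T @ N
    np.fill_diagonal(S, 0.0)    # an item should not vote for itself
    return S

def recommend_item_based(R, u, n=2):
    """Steps 2-3: score unseen items by similarity to the items user u rated."""
    S = item_similarity(R)
    scores = S @ R[u]            # sum over rated items of similarity * preference
    scores[R[u] > 0] = -np.inf   # skip items the user already has
    order = np.argsort(scores)[::-1]
    return [int(i) for i in order[:n] if np.isfinite(scores[i])]

recs = recommend_item_based(R, u=0, n=2)
```

The item similarity matrix depends only on the columns, so it can be precomputed once and reused for every user, which is one reason this variant is often preferred when users far outnumber items.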
Requirements for scaling
● Volume of item base:- Depending on the item inventory, the number of items
can be huge
● Volume of user base:- Like the item base, the user base can also be very large
● Delivery channel/mode:- Depending on the chosen mode of delivery, the focus
needs to be on either turnaround time or response time
● User feedback/interactions:- Interactions are an opportunity to learn more
about the users, but handling the stream of interaction data may pose some
challenges
● Incremental update of models/algorithms:- New interactions mean the models
need to be updated in near real time. The ability to update incrementally is
critical to avoid large amounts of recomputation.
Scaling...
With n items and m users, calculating the
similarities of a given user with all other
m-1 users takes O(mn) time.
Performing this across all users takes
O(m²n) time, as there are m(m-1) ordered
pairs (mP2), or m(m-1)/2 unordered pairs
(mC2), of users.
Similarly, calculating similarities across
all items takes O(n²m) time.
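The counting behind these estimates can be written out explicitly. The figures in the last line are hypothetical sizes chosen only to show the order of magnitude.

```latex
% Each user-user similarity compares two length-n rating vectors: O(n) work.
\text{user pairs} = \binom{m}{2} = \frac{m(m-1)}{2}
\quad\Longrightarrow\quad
\frac{m(m-1)}{2}\cdot O(n) = O(m^{2} n)

% Each item-item similarity compares two length-m vectors: O(m) work.
\text{item pairs} = \binom{n}{2} = \frac{n(n-1)}{2}
\quad\Longrightarrow\quad
\frac{n(n-1)}{2}\cdot O(m) = O(n^{2} m)

% Hypothetical sizes: m = 10^{6} users, n = 10^{4} items.
\frac{m(m-1)}{2}\cdot n \approx 5\times 10^{11}\cdot 10^{4} = 5\times 10^{15}
\text{ elementary operations}
```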
But we need only the top k neighbours for each
user or item.
When n >> k, computing every pairwise similarity
results in a lot of unnecessary computation. Is
there a way to avoid this…?
The item similarity API from Spark ML provides
an answer to this.
Clustering:- Let’s say we are trying to
compute the user similarities among m
users, where m is a really large number. If
the users are first grouped into clusters,
each user's neighbourhood can then be
computed only within that user's cluster.
Dimensionality reduction:- A User x Item matrix
of dimension m x n can be reduced to m x k by
keeping the top k principal components of the
matrix. Similarity computation on this reduced
matrix is faster than on the original matrix.
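The reduction can be sketched with a plain SVD in NumPy. This is an illustration under assumptions: the matrix is small, dense and randomly generated, the helper names are made up, and columns are centred first so that the retained directions correspond to principal components. A production system would typically use a sparse or truncated solver instead.

```python
import numpy as np

rng = np.random.default_rng(0)
m, n, k = 50, 200, 5          # hypothetical sizes: users, items, components kept
R = rng.random((m, n))        # stand-in for a real User x Item matrix

def reduce_users(R, k):
    """Project each user onto the top k principal components (m x n -> m x k)."""
    Rc = R - R.mean(axis=0)                       # centre each item column
    U, s, Vt = np.linalg.svd(Rc, full_matrices=False)
    return U[:, :k] * s[:k]                       # component scores per user

def cosine_sim_matrix(M):
    """Pairwise cosine similarity between the rows of M."""
    norms = np.linalg.norm(M, axis=1, keepdims=True)
    norms[norms == 0] = 1.0
    U = M / norms
    return U @ U.T

Z = reduce_users(R, k)        # m x k instead of m x n
sims = cosine_sim_matrix(Z)   # each comparison now costs O(k) rather than O(n)
```

The similarities computed on `Z` approximate those on the full matrix; how many components to keep is a trade-off between speed and fidelity.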
Learning to rank
● Several algorithms and models
○ Can generate predicted ratings of items for each user
○ Can also generate a rank-ordered list of
recommendations
○ By consuming user history
● Various indicators of items like trend,
seasonality, etc.
● User’s preference, context, etc.
● Business promotional objectives
● Item catalog/inventory coverage
● Eventual rank ordering by combining all of
these
○ To satisfy user’s preference
○ And business objectives
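The idea of producing one final ordering from several signals can be illustrated with a weighted blend. Note the assumptions: the candidates, signal names and weights below are all hypothetical, and the weights are set by hand. A learning-to-rank model would instead learn how to combine such signals from interaction data.

```python
# Hypothetical candidates with per-signal scores scaled to [0, 1].
candidates = {
    "item_a": {"predicted_pref": 0.9, "trend": 0.2, "promo": 0.0},
    "item_b": {"predicted_pref": 0.6, "trend": 0.9, "promo": 1.0},
    "item_c": {"predicted_pref": 0.7, "trend": 0.5, "promo": 0.0},
}

# Hand-set weights (an assumption): user preference counts most,
# trend and business promotion act as smaller nudges.
weights = {"predicted_pref": 0.6, "trend": 0.2, "promo": 0.2}

def blended_score(signals, weights):
    """Weighted sum of the signals named in weights."""
    return sum(weights[name] * signals[name] for name in weights)

ranking = sorted(
    candidates,
    key=lambda item: blended_score(candidates[item], weights),
    reverse=True,
)
```

With these weights the promoted, trending `item_b` overtakes `item_a` even though the user's predicted preference for `item_a` is higher, which shows how business objectives can reshape the final order.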
Where to start…?
● Open source datasets
○ MovieLens
○ Million Song
● Open source frameworks and APIs
○ Spark ML
○ MovieLens
● References
○ Recommender systems survey
○ Applying SVD/PCA on recommender systems
○ Learning to rank model
○ Coursera:- Recommender system
● Github
○ https://github.com/vivekmurugesan/recommender-systems
Questions…?
Email: vivek.murugesan@gmail.com
LinkedIn: https://www.linkedin.com/in/vivek-murugesan/

More Related Content

What's hot

Udacity webinar on Recommendation Systems
Udacity webinar on Recommendation SystemsUdacity webinar on Recommendation Systems
Udacity webinar on Recommendation Systems
Axel de Romblay
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
Carlos Castillo (ChaTo)
 
Deep Learning for Recommender Systems
Deep Learning for Recommender SystemsDeep Learning for Recommender Systems
Deep Learning for Recommender Systems
Justin Basilico
 
LinkedIn talk at Netflix ML Platform meetup Sep 2019
LinkedIn talk at Netflix ML Platform meetup Sep 2019LinkedIn talk at Netflix ML Platform meetup Sep 2019
LinkedIn talk at Netflix ML Platform meetup Sep 2019
Faisal Siddiqi
 
Learning to Rank for Recommender Systems - ACM RecSys 2013 tutorial
Learning to Rank for Recommender Systems -  ACM RecSys 2013 tutorialLearning to Rank for Recommender Systems -  ACM RecSys 2013 tutorial
Learning to Rank for Recommender Systems - ACM RecSys 2013 tutorial
Alexandros Karatzoglou
 
Item Based Collaborative Filtering Recommendation Algorithms
Item Based Collaborative Filtering Recommendation AlgorithmsItem Based Collaborative Filtering Recommendation Algorithms
Item Based Collaborative Filtering Recommendation Algorithms
nextlib
 
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Xavier Amatriain
 
Overview of recommender system
Overview of recommender systemOverview of recommender system
Overview of recommender system
Stanley Wang
 
GRU4Rec v2 - Recurrent Neural Networks with Top-k Gains for Session-based Rec...
GRU4Rec v2 - Recurrent Neural Networks with Top-k Gains for Session-based Rec...GRU4Rec v2 - Recurrent Neural Networks with Top-k Gains for Session-based Rec...
GRU4Rec v2 - Recurrent Neural Networks with Top-k Gains for Session-based Rec...
Balázs Hidasi
 
Missing values in recommender models
Missing values in recommender modelsMissing values in recommender models
Missing values in recommender models
Parmeshwar Khurd
 
Recommender Systems In Industry
Recommender Systems In IndustryRecommender Systems In Industry
Recommender Systems In Industry
Xavier Amatriain
 
Recent Trends in Personalization at Netflix
Recent Trends in Personalization at NetflixRecent Trends in Personalization at Netflix
Recent Trends in Personalization at Netflix
Förderverein Technische Fakultät
 
Recommendation Systems
Recommendation SystemsRecommendation Systems
Recommendation Systems
Robin Reni
 
Recommender system
Recommender systemRecommender system
Recommender system
Nilotpal Pramanik
 
Recommender Engines
Recommender EnginesRecommender Engines
Recommender Engines
Thomas Hess
 
Netflix talk at ML Platform meetup Sep 2019
Netflix talk at ML Platform meetup Sep 2019Netflix talk at ML Platform meetup Sep 2019
Netflix talk at ML Platform meetup Sep 2019
Faisal Siddiqi
 
Past, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspectivePast, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspective
Xavier Amatriain
 
Recommendation System Explained
Recommendation System ExplainedRecommendation System Explained
Recommendation System Explained
Crossing Minds
 
Recommender systems for E-commerce
Recommender systems for E-commerceRecommender systems for E-commerce
Recommender systems for E-commerce
Alexander Konduforov
 
Recommender systems: Content-based and collaborative filtering
Recommender systems: Content-based and collaborative filteringRecommender systems: Content-based and collaborative filtering
Recommender systems: Content-based and collaborative filtering
Viet-Trung TRAN
 

What's hot (20)

Udacity webinar on Recommendation Systems
Udacity webinar on Recommendation SystemsUdacity webinar on Recommendation Systems
Udacity webinar on Recommendation Systems
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
 
Deep Learning for Recommender Systems
Deep Learning for Recommender SystemsDeep Learning for Recommender Systems
Deep Learning for Recommender Systems
 
LinkedIn talk at Netflix ML Platform meetup Sep 2019
LinkedIn talk at Netflix ML Platform meetup Sep 2019LinkedIn talk at Netflix ML Platform meetup Sep 2019
LinkedIn talk at Netflix ML Platform meetup Sep 2019
 
Learning to Rank for Recommender Systems - ACM RecSys 2013 tutorial
Learning to Rank for Recommender Systems -  ACM RecSys 2013 tutorialLearning to Rank for Recommender Systems -  ACM RecSys 2013 tutorial
Learning to Rank for Recommender Systems - ACM RecSys 2013 tutorial
 
Item Based Collaborative Filtering Recommendation Algorithms
Item Based Collaborative Filtering Recommendation AlgorithmsItem Based Collaborative Filtering Recommendation Algorithms
Item Based Collaborative Filtering Recommendation Algorithms
 
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
 
Overview of recommender system
Overview of recommender systemOverview of recommender system
Overview of recommender system
 
GRU4Rec v2 - Recurrent Neural Networks with Top-k Gains for Session-based Rec...
GRU4Rec v2 - Recurrent Neural Networks with Top-k Gains for Session-based Rec...GRU4Rec v2 - Recurrent Neural Networks with Top-k Gains for Session-based Rec...
GRU4Rec v2 - Recurrent Neural Networks with Top-k Gains for Session-based Rec...
 
Missing values in recommender models
Missing values in recommender modelsMissing values in recommender models
Missing values in recommender models
 
Recommender Systems In Industry
Recommender Systems In IndustryRecommender Systems In Industry
Recommender Systems In Industry
 
Recent Trends in Personalization at Netflix
Recent Trends in Personalization at NetflixRecent Trends in Personalization at Netflix
Recent Trends in Personalization at Netflix
 
Recommendation Systems
Recommendation SystemsRecommendation Systems
Recommendation Systems
 
Recommender system
Recommender systemRecommender system
Recommender system
 
Recommender Engines
Recommender EnginesRecommender Engines
Recommender Engines
 
Netflix talk at ML Platform meetup Sep 2019
Netflix talk at ML Platform meetup Sep 2019Netflix talk at ML Platform meetup Sep 2019
Netflix talk at ML Platform meetup Sep 2019
 
Past, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspectivePast, present, and future of Recommender Systems: an industry perspective
Past, present, and future of Recommender Systems: an industry perspective
 
Recommendation System Explained
Recommendation System ExplainedRecommendation System Explained
Recommendation System Explained
 
Recommender systems for E-commerce
Recommender systems for E-commerceRecommender systems for E-commerce
Recommender systems for E-commerce
 
Recommender systems: Content-based and collaborative filtering
Recommender systems: Content-based and collaborative filteringRecommender systems: Content-based and collaborative filtering
Recommender systems: Content-based and collaborative filtering
 

Similar to Recommender systems

Recommendation engines
Recommendation enginesRecommendation engines
Recommendation engines
Georgian Micsa
 
Recommender System _Module 1_Introduction to Recommender System.pptx
Recommender System _Module 1_Introduction to Recommender System.pptxRecommender System _Module 1_Introduction to Recommender System.pptx
Recommender System _Module 1_Introduction to Recommender System.pptx
Satyam Sharma
 
Introduction to Recommendation Systems (Vietnam Web Submit)
Introduction to Recommendation Systems (Vietnam Web Submit)Introduction to Recommendation Systems (Vietnam Web Submit)
Introduction to Recommendation Systems (Vietnam Web Submit)
Trieu Nguyen
 
Introduction to Recommendation Systems
Introduction to Recommendation SystemsIntroduction to Recommendation Systems
Introduction to Recommendation Systems
Zia Babar
 
IntroductionRecommenderSystems_Petroni.pdf
IntroductionRecommenderSystems_Petroni.pdfIntroductionRecommenderSystems_Petroni.pdf
IntroductionRecommenderSystems_Petroni.pdf
AlphaIssaghaDiallo
 
B1802021823
B1802021823B1802021823
B1802021823
IOSR Journals
 
Collaborative Filtering Recommendation System
Collaborative Filtering Recommendation SystemCollaborative Filtering Recommendation System
Collaborative Filtering Recommendation System
Milind Gokhale
 
Architecting AI Solutions in Azure for Business
Architecting AI Solutions in Azure for BusinessArchitecting AI Solutions in Azure for Business
Architecting AI Solutions in Azure for Business
Ivo Andreev
 
Recommandation systems -
Recommandation systems - Recommandation systems -
Recommandation systems -
Yousef Fadila
 
Filtering content bbased crs
Filtering content bbased crsFiltering content bbased crs
Filtering content bbased crs
Aravindharamanan S
 
Recommender.system.presentation.pjug.01.21.2014
Recommender.system.presentation.pjug.01.21.2014Recommender.system.presentation.pjug.01.21.2014
Recommender.system.presentation.pjug.01.21.2014
rpbrehm
 
Digital Trails Dave King 1 5 10 Part 2 D3
Digital Trails   Dave King   1 5 10   Part 2   D3Digital Trails   Dave King   1 5 10   Part 2   D3
Digital Trails Dave King 1 5 10 Part 2 D3
Dave King
 
Aaa ped-21-Recommender Systems: Content-based Filtering
Aaa ped-21-Recommender Systems: Content-based FilteringAaa ped-21-Recommender Systems: Content-based Filtering
Aaa ped-21-Recommender Systems: Content-based Filtering
AminaRepo
 
Aaa ped-19-Recommender Systems: Neighborhood-based Filtering
Aaa ped-19-Recommender Systems: Neighborhood-based FilteringAaa ped-19-Recommender Systems: Neighborhood-based Filtering
Aaa ped-19-Recommender Systems: Neighborhood-based Filtering
AminaRepo
 
case based recommendation approach for market basket data
case based recommendation approach for market basket datacase based recommendation approach for market basket data
case based recommendation approach for market basket data
mniranjanmurthy
 
Lecture Notes on Recommender System Introduction
Lecture Notes on Recommender System IntroductionLecture Notes on Recommender System Introduction
Lecture Notes on Recommender System Introduction
PerumalPitchandi
 
Real-Time Recommendations with Hopsworks and OpenSearch - MLOps World 2022
Real-Time Recommendations  with Hopsworks and OpenSearch - MLOps World 2022Real-Time Recommendations  with Hopsworks and OpenSearch - MLOps World 2022
Real-Time Recommendations with Hopsworks and OpenSearch - MLOps World 2022
Jim Dowling
 
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.comHABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com
HABIB FIGA GUYE
 
Recommenders Systems
Recommenders SystemsRecommenders Systems
Recommenders Systems
Tariq Hassan
 
WORD
WORDWORD
WORD
butest
 

Similar to Recommender systems (20)

Recommendation engines
Recommendation enginesRecommendation engines
Recommendation engines
 
Recommender System _Module 1_Introduction to Recommender System.pptx
Recommender System _Module 1_Introduction to Recommender System.pptxRecommender System _Module 1_Introduction to Recommender System.pptx
Recommender System _Module 1_Introduction to Recommender System.pptx
 
Introduction to Recommendation Systems (Vietnam Web Submit)
Introduction to Recommendation Systems (Vietnam Web Submit)Introduction to Recommendation Systems (Vietnam Web Submit)
Introduction to Recommendation Systems (Vietnam Web Submit)
 
Introduction to Recommendation Systems
Introduction to Recommendation SystemsIntroduction to Recommendation Systems
Introduction to Recommendation Systems
 
IntroductionRecommenderSystems_Petroni.pdf
IntroductionRecommenderSystems_Petroni.pdfIntroductionRecommenderSystems_Petroni.pdf
IntroductionRecommenderSystems_Petroni.pdf
 
B1802021823
B1802021823B1802021823
B1802021823
 
Collaborative Filtering Recommendation System
Collaborative Filtering Recommendation SystemCollaborative Filtering Recommendation System
Collaborative Filtering Recommendation System
 
Architecting AI Solutions in Azure for Business
Architecting AI Solutions in Azure for BusinessArchitecting AI Solutions in Azure for Business
Architecting AI Solutions in Azure for Business
 
Recommandation systems -
Recommandation systems - Recommandation systems -
Recommandation systems -
 
Filtering content bbased crs
Filtering content bbased crsFiltering content bbased crs
Filtering content bbased crs
 
Recommender.system.presentation.pjug.01.21.2014
Recommender.system.presentation.pjug.01.21.2014Recommender.system.presentation.pjug.01.21.2014
Recommender.system.presentation.pjug.01.21.2014
 
Digital Trails Dave King 1 5 10 Part 2 D3
Digital Trails   Dave King   1 5 10   Part 2   D3Digital Trails   Dave King   1 5 10   Part 2   D3
Digital Trails Dave King 1 5 10 Part 2 D3
 
Aaa ped-21-Recommender Systems: Content-based Filtering
Aaa ped-21-Recommender Systems: Content-based FilteringAaa ped-21-Recommender Systems: Content-based Filtering
Aaa ped-21-Recommender Systems: Content-based Filtering
 
Aaa ped-19-Recommender Systems: Neighborhood-based Filtering
Aaa ped-19-Recommender Systems: Neighborhood-based FilteringAaa ped-19-Recommender Systems: Neighborhood-based Filtering
Aaa ped-19-Recommender Systems: Neighborhood-based Filtering
 
case based recommendation approach for market basket data
case based recommendation approach for market basket datacase based recommendation approach for market basket data
case based recommendation approach for market basket data
 
Lecture Notes on Recommender System Introduction
Lecture Notes on Recommender System IntroductionLecture Notes on Recommender System Introduction
Lecture Notes on Recommender System Introduction
 
Real-Time Recommendations with Hopsworks and OpenSearch - MLOps World 2022
Real-Time Recommendations  with Hopsworks and OpenSearch - MLOps World 2022Real-Time Recommendations  with Hopsworks and OpenSearch - MLOps World 2022
Real-Time Recommendations with Hopsworks and OpenSearch - MLOps World 2022
 
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.comHABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com
HABIB FIGA GUYE {BULE HORA UNIVERSITY}(habibifiga@gmail.com
 
Recommenders Systems
Recommenders SystemsRecommenders Systems
Recommenders Systems
 
WORD
WORDWORD
WORD
 

Recently uploaded

一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
74nqk8xf
 
一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
aqzctr7x
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
jerlynmaetalle
 
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Aggregage
 
Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdfUdemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
Fernanda Palhano
 
Challenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more importantChallenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more important
Sm321
 
University of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma TranscriptUniversity of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma Transcript
soxrziqu
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
nuttdpt
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
sameer shah
 
State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023
kuntobimo2016
 
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
v7oacc3l
 
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
74nqk8xf
 
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
bopyb
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
Roger Valdez
 
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
Social Samosa
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
Timothy Spann
 
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataPredictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Kiwi Creative
 
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
g4dpvqap0
 
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
zsjl4mimo
 
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
Timothy Spann
 

Recently uploaded (20)

一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
一比一原版(牛布毕业证书)牛津布鲁克斯大学毕业证如何办理
 
一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理一比一原版(UO毕业证)渥太华大学毕业证如何办理
一比一原版(UO毕业证)渥太华大学毕业证如何办理
 
Influence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business PlanInfluence of Marketing Strategy and Market Competition on Business Plan
Influence of Marketing Strategy and Market Competition on Business Plan
 
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
Beyond the Basics of A/B Tests: Highly Innovative Experimentation Tactics You...
 
Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdfUdemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
Udemy_2024_Global_Learning_Skills_Trends_Report (1).pdf
 
Challenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more importantChallenges of Nation Building-1.pptx with more important
Challenges of Nation Building-1.pptx with more important
 
University of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma TranscriptUniversity of New South Wales degree offer diploma Transcript
University of New South Wales degree offer diploma Transcript
 
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
一比一原版(UCSB文凭证书)圣芭芭拉分校毕业证如何办理
 
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
STATATHON: Unleashing the Power of Statistics in a 48-Hour Knowledge Extravag...
 
State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023State of Artificial intelligence Report 2023
State of Artificial intelligence Report 2023
 
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
在线办理(英国UCA毕业证书)创意艺术大学毕业证在读证明一模一样
 
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
一比一原版(Chester毕业证书)切斯特大学毕业证如何办理
 
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
一比一原版(GWU,GW文凭证书)乔治·华盛顿大学毕业证如何办理
 
Everything you wanted to know about LIHTC
Everything you wanted to know about LIHTCEverything you wanted to know about LIHTC
Everything you wanted to know about LIHTC
 
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
4th Modern Marketing Reckoner by MMA Global India & Group M: 60+ experts on W...
 
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
06-04-2024 - NYC Tech Week - Discussion on Vector Databases, Unstructured Dat...
 
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging DataPredictably Improve Your B2B Tech Company's Performance by Leveraging Data
Predictably Improve Your B2B Tech Company's Performance by Leveraging Data
 
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
一比一原版(Glasgow毕业证书)格拉斯哥大学毕业证如何办理
 
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
一比一原版(Harvard毕业证书)哈佛大学毕业证如何办理
 
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
 

Recommender systems

  • 2. Agenda ● Recommendations… ● Advantages ● Recommender systems ● Anatomy of recommender systems ● Type of recommender systems or algorithms ● Requirements for scaling ● Learning to rank model ● Where to start…? ● Questions
  • 3. Recommendations... ● It is estimated that close to 30% of Amazon’s revenue comes from the way they integrated recommendations ● About 75% of Netflix’s business is driven through recommendations ● Posts, groups, people and jobs recommended through Linkedin ● Posts, friends suggested in Facebook ● Google search results, autocomplete suggestions, google news, etc,. ● Stories recommended through Quora ● ...
  • 4. Advantages... ● Increase customer engagement ● Increase customer satisfaction by delivering relevant contents ● Increase sales with cross sell options ● Drive more traffic Bought Similar Recommend
  • 5. Recommender systems ● Attempts to predict the preference of a given user for an item ● Based on the prediction recommends set of items for the user ● By using, ○ Data: User history, Social profile, etc,. ○ Algorithms: Like Collaborative filtering, content based filtering etc,. ● Delivers the recommendations and let the users interact with them ● Attempts to learn more about the users
  • 6. Anatomy of recommender systems [Diagram labels: Item inventory; User details; Users; Items; Algorithms and models; Business objective; Recommendations; Build; Deliver; Feedback loop]
  • 7. User preference data ● Data that the recommender system relies on to learn about its users ● The same data is used to personalize recommendations for those users ● Explicit: ○ Rating ○ Voting ○ Opinion ● Implicit: ○ Click ○ Purchase ○ Follow ● Binary ○ Like/Dislike ○ Click/Ignore ● Rating ○ Scale ● Unary ○ Purchase ○ Views ● Normalize ratings ○ Handling scales ○ User biases ● Unary data ○ Special processing
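The "user biases" point on this slide can be illustrated with mean-centering, one common way to normalize ratings. This is only a sketch: the function name `mean_center` and the sample users are hypothetical, and the slides do not say which normalization the author used.

```python
def mean_center(ratings):
    """Subtract each user's own average rating from that user's ratings.

    `ratings` maps user -> {item: rating}. After centering, a harsh rater
    and a generous rater with the same relative tastes look alike.
    """
    centered = {}
    for user, items in ratings.items():
        mean = sum(items.values()) / len(items)
        centered[user] = {item: r - mean for item, r in items.items()}
    return centered


# Hypothetical data: both users rank a < b < c, but on different levels.
sample = {
    "harsh": {"a": 1, "b": 2, "c": 3},
    "generous": {"a": 3, "b": 4, "c": 5},
}
centered = mean_center(sample)
```

After centering, both users carry the same offsets (-1, 0, +1), so a similarity measure computed on these values compares tastes rather than rating habits.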
  • 8. Types of recommender systems or algorithms ● Non-personalized ○ Popular and trending products ○ Based on simple summary statistics ○ Can handle the cold-start scenario in some cases ● Semi-personalized ○ Association rule mining ○ Market basket recommendation ○ Ephemeral (contextual) ● Personalized ○ Persistent (long-term interests) ○ Content based ○ Collaborative filtering ■ User-user similarity ■ Item-item similarity ■ Latent attributes through SVD ● Advanced ○ Hybrid (contextual + interest and content + collaborative) ○ Learning to rank
  • 9. Content based filtering 1. Associate each item with certain keywords or attributes 2. Build a vector of keyword preferences for each user, inferred from the user's item preferences 3. Use TF-IDF or a similar mechanism to accumulate preferences on keywords across items 4. Score each item by the cosine similarity of its keyword vector with the user’s keyword vector (i.e. the dot product of the two vectors divided by the product of their lengths)
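The four steps above can be sketched in plain Python. This is a minimal illustration, not the author's implementation: the catalogue `ITEM_KEYWORDS`, the function names, and the choice to build the user profile by summing the TF-IDF vectors of liked items are all assumptions.

```python
import math
from collections import Counter

# Hypothetical item -> keyword catalogue (illustrative data only).
ITEM_KEYWORDS = {
    "m1": ["space", "action", "robots"],
    "m2": ["space", "drama"],
    "m3": ["romance", "drama"],
    "m4": ["action", "robots"],
}


def idf_weights(item_keywords):
    """Inverse document frequency of every keyword across the catalogue."""
    n_items = len(item_keywords)
    doc_freq = Counter(k for kws in item_keywords.values() for k in set(kws))
    return {k: math.log(n_items / df) for k, df in doc_freq.items()}


def item_vector(keywords, idf):
    """TF-IDF vector of one item, stored as a sparse dict."""
    tf = Counter(keywords)
    return {k: tf[k] * idf[k] for k in tf}


def cosine(a, b):
    """Cosine similarity of two sparse vectors: dot product over norms."""
    dot = sum(w * b.get(k, 0.0) for k, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def recommend_content_based(liked_items, item_keywords):
    """Rank unseen items by similarity to the user's keyword profile."""
    idf = idf_weights(item_keywords)
    vectors = {i: item_vector(kws, idf) for i, kws in item_keywords.items()}
    # Assumed profile construction: sum the vectors of the liked items.
    profile = Counter()
    for item in liked_items:
        profile.update(vectors[item])
    scores = {
        i: cosine(profile, v) for i, v in vectors.items() if i not in liked_items
    }
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


ranked = recommend_content_based(["m1"], ITEM_KEYWORDS)
```

For a user who liked only `m1`, the item sharing two keywords (`m4`) ranks above the item sharing one (`m2`), and the item with no overlap (`m3`) scores zero.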
  • 10. Association rule mining ● Attempts to identify rules like “people who bought X also bought Y” ● Based on the items bought together in a transaction or a time window ● Rules of the form X → Y are discovered/mined from the data ● X is called the antecedent and Y is called the consequent ● Metrics associated with the rules: ○ Support(X ∪ Y) = N(X ∪ Y) / |T|, where N(X ∪ Y) is the number of transactions containing both X and Y and |T| is the total number of transactions ( P(X AND Y): fraction of transactions in which X and Y are bought together ) ○ Confidence = Support(X ∪ Y) / Support(X) ( P(Y|X): fraction of buyers of X who also bought Y ) ○ Lift = Support(X ∪ Y) / [Support(X) * Support(Y)] ( P(X AND Y) / [P(X) * P(Y)] ) ● Some popular products tend to appear as the consequent of almost every rule, simply because most buyers purchase them ● Lift helps with this issue: it also divides by Support(Y), so a rule whose consequent is merely popular gets a lift close to 1
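The three metrics can be computed directly from their definitions. The sketch below uses a hypothetical list of baskets (`TRANSACTIONS`) and a hypothetical helper `rule_metrics`; it evaluates a single rule X → Y for single items and does not perform the rule mining itself.

```python
# Hypothetical transactions: each set is one basket.
TRANSACTIONS = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
    {"bread", "milk"},
]


def support(itemset, transactions):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = set(itemset)
    return sum(1 for t in transactions if itemset <= t) / len(transactions)


def rule_metrics(x, y, transactions):
    """Support, confidence and lift of the rule x -> y.

    Assumes both x and y occur in at least one transaction,
    otherwise the divisions below would fail.
    """
    s_xy = support({x, y}, transactions)
    s_x = support({x}, transactions)
    s_y = support({y}, transactions)
    return {
        "support": s_xy,
        "confidence": s_xy / s_x,
        "lift": s_xy / (s_x * s_y),
    }


popular = rule_metrics("bread", "milk", TRANSACTIONS)
specific = rule_metrics("butter", "bread", TRANSACTIONS)
```

In this toy data, bread → milk has confidence 0.75 but a lift below 1 (0.9375), because milk is in most baskets anyway. Butter → bread has a lift of 1.25, so buying butter does raise the chance of buying bread.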
  • 11. Collaborative filtering ● Unlike content based filtering, collaborative filtering doesn’t assume the presence of attributes or keywords about the items ● Makes use of the User x Item matrix computed from the interaction history ● Generates recommendations based entirely on this matrix ● Doesn’t rely on any additional details about the items or users (like demographics)
  • 12. User-user similarity 1. Build a neighbourhood for each user from the User x Item matrix (using the correlation or cosine similarity of each user with the others) 2. For a target user a, use the likes/interactions of a's top k most similar users to build a candidate set of items to recommend 3. Score each item in the candidate set based on the preference of each neighbour u for that item and u's similarity with user a
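A minimal sketch of these three steps, assuming cosine similarity and a similarity-weighted average as the scoring rule (one common choice, which the slides do not specify). The `RATINGS` data and all function names are hypothetical.

```python
import math

# Hypothetical User x Item data as nested dicts: user -> {item: rating}.
RATINGS = {
    "alice": {"i1": 5, "i2": 4, "i3": 1},
    "bob": {"i1": 5, "i2": 5, "i4": 4},
    "carol": {"i1": 1, "i3": 5, "i5": 4},
    "dave": {"i2": 4, "i4": 5},
}


def cosine(a, b):
    """Cosine similarity of two sparse rating vectors."""
    dot = sum(a[i] * b[i] for i in set(a) & set(b))
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def top_k_neighbours(user, ratings, k):
    """Step 1: the k users most similar to `user`, as (similarity, name)."""
    sims = [
        (cosine(ratings[user], other_ratings), other)
        for other, other_ratings in ratings.items()
        if other != user
    ]
    return sorted(sims, reverse=True)[:k]


def recommend_user_based(user, ratings, k=2):
    """Steps 2 and 3: gather unseen items from neighbours and score them."""
    seen = set(ratings[user])
    weighted_sum, weight_total = {}, {}
    for sim, neighbour in top_k_neighbours(user, ratings, k):
        for item, rating in ratings[neighbour].items():
            if item in seen:
                continue
            weighted_sum[item] = weighted_sum.get(item, 0.0) + sim * rating
            weight_total[item] = weight_total.get(item, 0.0) + sim
    scores = {
        item: weighted_sum[item] / weight_total[item]
        for item in weighted_sum
        if weight_total[item] > 0
    }
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


ranked = recommend_user_based("alice", RATINGS, k=3)
```

The first step compares the target user with every other user, which is the cost the scaling slides discuss.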
  • 13. Item-item similarity 1. Build an item neighbourhood based on the preferences expressed by different users (i.e. compare the rating vectors of each pair of items) 2. Use the set of items on which the user has already expressed preferences to generate candidate items to recommend 3. Score each item in the candidate set by its similarity to the items that the user has liked (expressed a preference for) earlier
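The item-based variant can be sketched the same way. The data and names are hypothetical, and the scoring rule (sum of similarity times the user's own rating over the items the user has rated) is an assumption, since the slides leave the exact formula open.

```python
import math

# Hypothetical User x Item data as nested dicts: user -> {item: rating}.
RATINGS = {
    "alice": {"i1": 5, "i2": 4, "i3": 1},
    "bob": {"i1": 5, "i2": 5, "i4": 4},
    "carol": {"i1": 1, "i3": 5, "i5": 4},
    "dave": {"i2": 4, "i4": 5},
}


def item_vectors(ratings):
    """Transpose user -> {item: rating} into item -> {user: rating}."""
    vectors = {}
    for user, items in ratings.items():
        for item, rating in items.items():
            vectors.setdefault(item, {})[user] = rating
    return vectors


def cosine(a, b):
    """Cosine similarity of two sparse rating vectors."""
    dot = sum(a[u] * b[u] for u in set(a) & set(b))
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def recommend_item_based(user, ratings):
    """Score each unseen item by its similarity to the user's rated items."""
    vectors = item_vectors(ratings)
    rated = ratings[user]
    scores = {}
    for candidate, cand_vec in vectors.items():
        if candidate in rated:
            continue
        # Assumed scoring rule: similarity weighted by the user's own rating.
        scores[candidate] = sum(
            cosine(cand_vec, vectors[item]) * rating
            for item, rating in rated.items()
        )
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


ranked = recommend_item_based("alice", RATINGS)
```

Here `i4` ranks first for `alice` because it is rated by the same users as `i1` and `i2`, both of which she rated highly.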
  • 14. Requirements for scaling ● Volume of item base: depending on the item inventory, the number of items can be huge ● Volume of user base: like the item base, the number of users can also be very large ● Delivery channel/mode: depending on the chosen mode of delivery, the focus is on either turnaround time or response time ● User feedback/interactions: these are an opportunity to learn more about the users, but handling the streaming interaction data may pose some challenges ● Incremental update of models/algorithms: new interactions mean models need to be updated in near real time. The ability to update incrementally is critical to avoid recomputing everything from scratch.
  • 15. Scaling... With n items and m users, calculating the similarities of a given user with all other m-1 users takes O(mn) time. Doing this across all users takes O(m²n) time, as there are mC2 = m(m-1)/2 pairs of users to compare. Similarly, calculating similarities across all items takes O(n²m) time. But we need only the top k neighbours for each user or item. When k is much smaller than m or n, this results in a lot of unnecessary computation. Is there a way to avoid it? The item similarity API from Spark ML provides an answer to this. Clustering: let’s say we are trying to compute the user similarities among m users, where m is a really large number. The users can be clustered first, and each user's neighbourhood can then be computed only within that user's cluster. Dimensionality reduction: a User x Item matrix of dimension m x n can be reduced to m x k by generating the top k principal components of the matrix. Similarity computation on this reduced matrix will be faster than on the original matrix.
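The saving from clustering can be made concrete by counting pairwise comparisons. This is a back-of-the-envelope sketch with hypothetical numbers (one million users, 100 equally sized clusters); it ignores the cost of the clustering step and the per-pair cost of O(n).

```python
from math import comb


def all_pairs(m):
    """Number of user pairs when every user is compared with every other."""
    return comb(m, 2)


def pairs_within_clusters(cluster_sizes):
    """Number of user pairs when comparisons stay inside each cluster."""
    return sum(comb(size, 2) for size in cluster_sizes)


m = 1_000_000                    # hypothetical user count
clusters = [10_000] * 100        # hypothetical: 100 equal clusters

full = all_pairs(m)
clustered = pairs_within_clusters(clusters)
reduction = full / clustered
```

With these numbers the comparisons drop from 499,999,500,000 to 4,999,500,000, roughly a hundredfold reduction. The trade-off is that similar users who land in different clusters are never compared.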
  • 16. Learning to rank ● Several algorithms and models: ○ Can generate predicted ratings of items per user ○ Can also generate a rank-ordered list of recommendations ○ By consuming user history ● Various indicators of items like trend, seasonality, etc. ● User’s preference, context, etc. ● Business promotional objectives ● Item catalog/inventory coverage ● The eventual rank ordering combines all of these ○ To satisfy the user’s preferences ○ And the business objectives
  • 17. Where to start…? ● Open source datasets ○ MovieLens ○ Million Song Dataset ● Open source framework and API ○ Spark ML ○ MovieLens ● References ○ Recommender systems survey ○ Applying SVD/PCA on recommender systems ○ Learning to rank model ○ Coursera: Recommender system ● GitHub ○ https://github.com/vivekmurugesan/recommender-systems