SlideShare a Scribd company logo
1 of 38
Session 1 - Introduction to
Recommendation Systems
Zia Babar
Waterloo Data Science & Data Engineering Meetup
January 2018
Introduction
About Mary
Mary has 12 years experience in data science of which 7 years in academics and
5 year in industry. She has worked at various startups at which she has had
roles like NLP-developer and data scientist. Currently she is a senior data
engineer at TalkIQ.
Zia has 19 years of professional industry experience, with the most recent 8
years being in technical leadership roles, where he led various engineering
teams pertaining to the design, development and deployment of enterprise
applications with a particular focus on incorporating machine learning
practices and cognitive services into software applications. Presently Zia is
finishing up his PhD at the University of Toronto with particular research
interests on designing enterprise cognitive systems keeping in mind
organizational adoption challenges.
About Zia
“This meetup is about sharing knowledge & learning about machine learning,
data engineering, data analytics, parallel algorithms, and distributed
systems. It provides an opportunity for data engineers, data scientists,
software engineers, statisticians, and innovators to get together and make
connections. Our events … are always intended to provide new ideas, concepts
and tools to help you solve your hardest problems.”
Waterloo Data Science & Data
Engineering Meetup
1. Introducing Recommendation Systems
a. Recommendation systems and their approaches
b. Advantages and disadvantages of each
c. Exploring data sets
2. Building Recommendation Models
a. Building models for main approaches
b. Training and testing models
3. Productizing Recommendation Systems
a. Architecting a complete solution
b. Deploying recommendation systems
Recommendation Systems Series
Recommendation Systems
What are Recommendation Systems?
● Recommender systems (aka Recommenders) are used to help to
predict the preference that a user would have for an item.
● They analyze huge volumes of data, such as product information
and user information, to provide relevant suggestions by
leveraging different probabilistic and machine learning
approaches.
What are Recommendation Systems?
Source: Gorakala, Suresh Kumar. Building Recommendation Engines. Packt Publishing Ltd, 2016.
Examples
Source: https://mapr.com/blog/inside-look-at-components-of-recommendation-engine/
Recommendation Systems Approaches
Collaborative Filtering Recommendation
Systems
● Collaborative filtering approaches build a model from a user's
past behaviour as well as similar decisions made by other users.
● This model is then used to predict items that the user may have
an interest in.
● Memory based
○ User-based collaborative filtering
○ Item-based collaborative filtering
● Model Based
○ Clustering
○ Association
○ …
Collaborative Filtering
Advantages
● Recommenders utilizing this approach are easy to implement
and do not come with a high-computation cost.
● Neither the content information of the products nor the users'
profile information is required for building recommendations
● New items can be recommended to users giving a surprise
factor to the users.
● Poor performance when data (such as user ratings) is sparse.
● High computing cost for calculating the similarity values for all
the users is very high if the data is very large.
● The system suffers with performance if they are sparse, which is
very common in the real world where users will rate only a few
items from a large catalog.
● Rapidly changing user profiles or user inputs require
re-computation of the similarity values.
● Accuracy is low as user / product content information is not
considered during recommendation generation.
Disadvantages
User-based Collaborative Filtering
● Users will be provided with preferences based on their similarity
to other user groups.
● Recommend new items to a user based on how similar user
groups have indicated preference.
● Similarity of users can be determined through different methods,
such as clustering, euclidean distance, etc.
● The similarity between users is calculated based on the ratings
given by users.
User-based Collaborative Filtering
Source: http://www.salemmarafi.com/code/collaborative-filtering-r/
Item-based Collaborative Filtering
● Users will be provided with preferences based on the similarity
of other items.
● Recommend new items to a user based on the user’s preference
for similar items.
● Similarity of items can be determined through different methods,
such as cosine similarity.
Item-based Collaborative Filtering
Source: http://www.salemmarafi.com/code/collaborative-filtering-r/
● Model-Based Recommender Systems leverage supervised and
unsupervised machine learning approaches, and matrix
approaches (such as Matrix Factorization and single value
decomposition) to generate user preferences.
● Using available historical data, a model is built with weights
learned automatically and new predictions regarding the
products will be made using the learned weights.
● The final results are ranked in a specific order before making
recommendations.
Model-Based Recommender Systems
● Probabilistic
○ Use prior probabilities from available data to generate a ranked list of
recommendations through calculating the probability of liking/disliking of a
product for each user. E.g. algorithm is Naïve Bayes.
● Mathematical
○ Apply mathematical approaches on simple matrices based on the ratings or
interaction information of users on products to predict the missing ratings for
the users. E.g. approaches Matrix Factorization, Single Value Decomposition.
● Machine Learning
○ Applied for collaborative, content based, context aware, and hybrid
recommender systems.
○ Approaches such as Logistic Regression, KNN classification, Decision Trees,
SVM, Clustering etc.
Model-Based Recommender Systems
Personalized Recommendation Systems
Personalized Recommendation Systems
● Personalized recommendation systems provides
recommendations to users by taking into account information
about the user and is customized to the user preferences.
● Two primary approaches,
○ Content-based recommendation systems utilize discrete characteristics of
items to recommend additional items with similar properties.
○ Context-aware recommender systems utilize certain contextual factors,
such as time, location, and the purchasing purpose, that identify the context
in which recommendations are provided.
Advantages
● Recommendations are generated using the user preferences
alone rather than the user community.
● Can be employed in real time as the model doesn't need to load
all the data for processing or generating recommendations.
● High accuracy compared to collaborative approaches as product
content is utilized rather than just rating information.
● Easily handling of the “cold-start” problem.
○ the recommender cannot draw any inferences for users or items about which
sufficient or relevant information doesn’t yet exist.
Disadvantages
● Generated recommendations are narrowly defined for a user
and the user would missed out on recommendations based on
communal preferences.
● New products that are not related to existing user preferences
will not be shown to the user.
● Trending information would be missed by the user.
● Content-based recommenders produce more personalized
recommendations for a user by leveraging content information
of items.
● Recommendations are not just based on the similarity of items
or users, but also considers individual preferences and/or item
contents.
● Particularly important when dealing with the “cold-start”
problem.
Content-Based Recommendation
Systems
Content-Based Recommendation
Systems
Source: Gorakala, Suresh Kumar. Building Recommendation Engines. Packt Publishing Ltd, 2016.
Context-Aware Recommendation
Systems
● Context-aware recommendation systems also consider the
context when providing personalized user recommendations.
● Context refers to the present state of the user and can include
attributes such as location, weather, time, date, mood etc.
● Two approaches are used here,
○ Pre-filtering: When context information is applied to user profile and product
content before the recommendation is generated.
○ Post-filtering: When the context information is applied to a set of
recommendations to obtain a final recommendation.
Context-Aware Recommendation
Systems
Source: Gorakala, Suresh Kumar. Building Recommendation Engines. Packt Publishing Ltd, 2016.
Other Approaches
Hybrid Recommender Systems
● Hybrid recommender systems combine both collaborative
filtering and content-based approaches.
Hybrid Recommender Systems
● Some methods used to building such hybrid systems include,
○ Weighted method
○ Mixed method
○ Cascade method
○ Feature combination method
Advantages
● Cold-start problem and data sparsity can be handled.
● Much more robust and scalable than any of the individual
models.
● Combination of methods leads to an improvement in accuracy.
Further Readings
● Ricci, Francesco, Lior Rokach, and Bracha Shapira. "Introduction to recommender systems handbook." In
Recommender systems handbook, pp. 1-35. springer US, 2011.
● F. Maxwell Harper and Joseph A. Konstan. 2015. The MovieLens Datasets: History and Context. ACM
Transactions on Interactive Intelligent Systems (TiiS) 5, 4, Article 19,
DOI=http://dx.doi.org/10.1145/2827872
● Gorakala, Suresh Kumar. Building Recommendation Engines. Packt Publishing Ltd, 2016.
● Suresh Kumar. Building Recommendation Engines. Packt Publishing Ltd, 2016
● Recommender Systems An Introduction by Dietmar Jannach
● Mining Massive Datasets by Jure Leskovec, Anand Rajaraman, Jeff Ullman
(www.mmds.org)https://cambridgespark.com/content/tutorials/implementing-your-own-recommender-sy
stems-in-Python/index.html
● Coursera course on Recommender Systems, by University of Washington
● Coursera course on Recommender Systems, by University of Minnesota
References
Environment and Setup
Environment
● Language and Libraries
○ Python
○ Pandas
○ Numpy
○ Machine learning library like scikit-learn
● Environment
○ Azure Notebook (https://notebooks.azure.com/)
○ Google Colab (https://colab.research.google.com/notebooks)
○ GitHub Repo (https://github.com/ziababar/recommender)
● Dataset used
○ MovieLens Data: http://files.grouplens.org/datasets/movielens/ml-100k.zip
Thank You!

More Related Content

What's hot

IRJET- Hybrid Book Recommendation System
IRJET- Hybrid Book Recommendation SystemIRJET- Hybrid Book Recommendation System
IRJET- Hybrid Book Recommendation SystemIRJET Journal
 
Movie Recommender System Using Artificial Intelligence
Movie Recommender System Using Artificial Intelligence Movie Recommender System Using Artificial Intelligence
Movie Recommender System Using Artificial Intelligence Shrutika Oswal
 
Recommendation system
Recommendation systemRecommendation system
Recommendation systemAkshat Thakar
 
Recommendation techniques
Recommendation techniques Recommendation techniques
Recommendation techniques sun9413
 
An introduction to Recommender Systems
An introduction to Recommender SystemsAn introduction to Recommender Systems
An introduction to Recommender SystemsDavid Zibriczky
 
Boston ML - Architecting Recommender Systems
Boston ML - Architecting Recommender SystemsBoston ML - Architecting Recommender Systems
Boston ML - Architecting Recommender SystemsJames Kirk
 
Recommender systems for E-commerce
Recommender systems for E-commerceRecommender systems for E-commerce
Recommender systems for E-commerceAlexander Konduforov
 
Social Recommender Systems Tutorial - WWW 2011
Social Recommender Systems Tutorial - WWW 2011Social Recommender Systems Tutorial - WWW 2011
Social Recommender Systems Tutorial - WWW 2011idoguy
 
Recommendation system based on adaptive ontological graphs and weighted ranking
Recommendation system based on adaptive ontological graphs and weighted rankingRecommendation system based on adaptive ontological graphs and weighted ranking
Recommendation system based on adaptive ontological graphs and weighted rankingvikramadityajakkula
 
Recommender systems: Content-based and collaborative filtering
Recommender systems: Content-based and collaborative filteringRecommender systems: Content-based and collaborative filtering
Recommender systems: Content-based and collaborative filteringViet-Trung TRAN
 
Recommender system introduction
Recommender system   introductionRecommender system   introduction
Recommender system introductionLiang Xiang
 
Retail products - machine learning recommendation engine
Retail products   - machine learning recommendation engineRetail products   - machine learning recommendation engine
Retail products - machine learning recommendation enginehkbhadraa
 
Recommendation system
Recommendation systemRecommendation system
Recommendation systemRishabh Mehta
 
FIND MY VENUE: Content & Review Based Location Recommendation System
FIND MY VENUE: Content & Review Based Location Recommendation SystemFIND MY VENUE: Content & Review Based Location Recommendation System
FIND MY VENUE: Content & Review Based Location Recommendation SystemIJTET Journal
 
Data Mining and Recommendation Systems
Data Mining and Recommendation SystemsData Mining and Recommendation Systems
Data Mining and Recommendation SystemsSalil Navgire
 

What's hot (20)

IRJET- Hybrid Book Recommendation System
IRJET- Hybrid Book Recommendation SystemIRJET- Hybrid Book Recommendation System
IRJET- Hybrid Book Recommendation System
 
B1802021823
B1802021823B1802021823
B1802021823
 
Movie Recommender System Using Artificial Intelligence
Movie Recommender System Using Artificial Intelligence Movie Recommender System Using Artificial Intelligence
Movie Recommender System Using Artificial Intelligence
 
Recommendation system
Recommendation systemRecommendation system
Recommendation system
 
WORD
WORDWORD
WORD
 
Recommendation techniques
Recommendation techniques Recommendation techniques
Recommendation techniques
 
An introduction to Recommender Systems
An introduction to Recommender SystemsAn introduction to Recommender Systems
An introduction to Recommender Systems
 
Boston ML - Architecting Recommender Systems
Boston ML - Architecting Recommender SystemsBoston ML - Architecting Recommender Systems
Boston ML - Architecting Recommender Systems
 
Recommender systems for E-commerce
Recommender systems for E-commerceRecommender systems for E-commerce
Recommender systems for E-commerce
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
 
Social Recommender Systems Tutorial - WWW 2011
Social Recommender Systems Tutorial - WWW 2011Social Recommender Systems Tutorial - WWW 2011
Social Recommender Systems Tutorial - WWW 2011
 
Recommendation system based on adaptive ontological graphs and weighted ranking
Recommendation system based on adaptive ontological graphs and weighted rankingRecommendation system based on adaptive ontological graphs and weighted ranking
Recommendation system based on adaptive ontological graphs and weighted ranking
 
Recommender systems: Content-based and collaborative filtering
Recommender systems: Content-based and collaborative filteringRecommender systems: Content-based and collaborative filtering
Recommender systems: Content-based and collaborative filtering
 
Recommender system introduction
Recommender system   introductionRecommender system   introduction
Recommender system introduction
 
Retail products - machine learning recommendation engine
Retail products   - machine learning recommendation engineRetail products   - machine learning recommendation engine
Retail products - machine learning recommendation engine
 
Collaborative filtering
Collaborative filteringCollaborative filtering
Collaborative filtering
 
Recommendation system
Recommendation systemRecommendation system
Recommendation system
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
 
FIND MY VENUE: Content & Review Based Location Recommendation System
FIND MY VENUE: Content & Review Based Location Recommendation SystemFIND MY VENUE: Content & Review Based Location Recommendation System
FIND MY VENUE: Content & Review Based Location Recommendation System
 
Data Mining and Recommendation Systems
Data Mining and Recommendation SystemsData Mining and Recommendation Systems
Data Mining and Recommendation Systems
 

Similar to Introduction to Recommendation Systems

Recommender System _Module 1_Introduction to Recommender System.pptx
Recommender System _Module 1_Introduction to Recommender System.pptxRecommender System _Module 1_Introduction to Recommender System.pptx
Recommender System _Module 1_Introduction to Recommender System.pptxSatyam Sharma
 
SIMILARITY MEASURES FOR RECOMMENDER SYSTEMS: A COMPARATIVE STUDY
SIMILARITY MEASURES FOR RECOMMENDER SYSTEMS: A COMPARATIVE STUDYSIMILARITY MEASURES FOR RECOMMENDER SYSTEMS: A COMPARATIVE STUDY
SIMILARITY MEASURES FOR RECOMMENDER SYSTEMS: A COMPARATIVE STUDYJournal For Research
 
Recommendation engines
Recommendation enginesRecommendation engines
Recommendation enginesGeorgian Micsa
 
IRJET- Analysis on Existing Methodologies of User Service Rating Prediction S...
IRJET- Analysis on Existing Methodologies of User Service Rating Prediction S...IRJET- Analysis on Existing Methodologies of User Service Rating Prediction S...
IRJET- Analysis on Existing Methodologies of User Service Rating Prediction S...IRJET Journal
 
A Survey of Recommender System Techniques and the E-commerce Domain.pptx
A Survey of Recommender System Techniques and the E-commerce Domain.pptxA Survey of Recommender System Techniques and the E-commerce Domain.pptx
A Survey of Recommender System Techniques and the E-commerce Domain.pptxmansivekaria09
 
Mixed Recommendation Algorithm Based on Content, Demographic and Collaborativ...
Mixed Recommendation Algorithm Based on Content, Demographic and Collaborativ...Mixed Recommendation Algorithm Based on Content, Demographic and Collaborativ...
Mixed Recommendation Algorithm Based on Content, Demographic and Collaborativ...IRJET Journal
 
MOVIE RECOMMENDATION SYSTEM
MOVIE RECOMMENDATION SYSTEMMOVIE RECOMMENDATION SYSTEM
MOVIE RECOMMENDATION SYSTEMIRJET Journal
 
Teacher training material
Teacher training materialTeacher training material
Teacher training materialVikram Parmar
 
A.hybrid.recommendation.approach.for.a.tourism.system
A.hybrid.recommendation.approach.for.a.tourism.systemA.hybrid.recommendation.approach.for.a.tourism.system
A.hybrid.recommendation.approach.for.a.tourism.systembenny ribeiro
 
Analysis on Recommended System for Web Information Retrieval Using HMM
Analysis on Recommended System for Web Information Retrieval Using HMMAnalysis on Recommended System for Web Information Retrieval Using HMM
Analysis on Recommended System for Web Information Retrieval Using HMMIJERA Editor
 
A Novel Jewellery Recommendation System using Machine Learning and Natural La...
A Novel Jewellery Recommendation System using Machine Learning and Natural La...A Novel Jewellery Recommendation System using Machine Learning and Natural La...
A Novel Jewellery Recommendation System using Machine Learning and Natural La...IRJET Journal
 
Agent technology for e commerce-recommendation systems
Agent technology for e commerce-recommendation systemsAgent technology for e commerce-recommendation systems
Agent technology for e commerce-recommendation systemsAravindharamanan S
 
Recommender.system.presentation.pjug.01.21.2014
Recommender.system.presentation.pjug.01.21.2014Recommender.system.presentation.pjug.01.21.2014
Recommender.system.presentation.pjug.01.21.2014rpbrehm
 
Personalized recommendation for cold start users
Personalized recommendation for cold start usersPersonalized recommendation for cold start users
Personalized recommendation for cold start usersIRJET Journal
 
Different Location based Approaches in Recommendation Systems
Different Location based Approaches in Recommendation SystemsDifferent Location based Approaches in Recommendation Systems
Different Location based Approaches in Recommendation SystemsIRJET Journal
 
Impersonal Recommendation system on top of Hadoop
Impersonal Recommendation system on top of HadoopImpersonal Recommendation system on top of Hadoop
Impersonal Recommendation system on top of HadoopKostiantyn Kudriavtsev
 
Tutorial on Bias in Rec Sys @ UMAP2020
Tutorial on Bias in Rec Sys @ UMAP2020Tutorial on Bias in Rec Sys @ UMAP2020
Tutorial on Bias in Rec Sys @ UMAP2020Mirko Marras
 
A LOCATION-BASED RECOMMENDER SYSTEM FRAMEWORK TO IMPROVE ACCURACY IN USERBASE...
A LOCATION-BASED RECOMMENDER SYSTEM FRAMEWORK TO IMPROVE ACCURACY IN USERBASE...A LOCATION-BASED RECOMMENDER SYSTEM FRAMEWORK TO IMPROVE ACCURACY IN USERBASE...
A LOCATION-BASED RECOMMENDER SYSTEM FRAMEWORK TO IMPROVE ACCURACY IN USERBASE...ijcsa
 

Similar to Introduction to Recommendation Systems (20)

Recommender System _Module 1_Introduction to Recommender System.pptx
Recommender System _Module 1_Introduction to Recommender System.pptxRecommender System _Module 1_Introduction to Recommender System.pptx
Recommender System _Module 1_Introduction to Recommender System.pptx
 
SIMILARITY MEASURES FOR RECOMMENDER SYSTEMS: A COMPARATIVE STUDY
SIMILARITY MEASURES FOR RECOMMENDER SYSTEMS: A COMPARATIVE STUDYSIMILARITY MEASURES FOR RECOMMENDER SYSTEMS: A COMPARATIVE STUDY
SIMILARITY MEASURES FOR RECOMMENDER SYSTEMS: A COMPARATIVE STUDY
 
Recommendation engines
Recommendation enginesRecommendation engines
Recommendation engines
 
IRJET- Analysis on Existing Methodologies of User Service Rating Prediction S...
IRJET- Analysis on Existing Methodologies of User Service Rating Prediction S...IRJET- Analysis on Existing Methodologies of User Service Rating Prediction S...
IRJET- Analysis on Existing Methodologies of User Service Rating Prediction S...
 
A Survey of Recommender System Techniques and the E-commerce Domain.pptx
A Survey of Recommender System Techniques and the E-commerce Domain.pptxA Survey of Recommender System Techniques and the E-commerce Domain.pptx
A Survey of Recommender System Techniques and the E-commerce Domain.pptx
 
Mixed Recommendation Algorithm Based on Content, Demographic and Collaborativ...
Mixed Recommendation Algorithm Based on Content, Demographic and Collaborativ...Mixed Recommendation Algorithm Based on Content, Demographic and Collaborativ...
Mixed Recommendation Algorithm Based on Content, Demographic and Collaborativ...
 
MOVIE RECOMMENDATION SYSTEM
MOVIE RECOMMENDATION SYSTEMMOVIE RECOMMENDATION SYSTEM
MOVIE RECOMMENDATION SYSTEM
 
Teacher training material
Teacher training materialTeacher training material
Teacher training material
 
A.hybrid.recommendation.approach.for.a.tourism.system
A.hybrid.recommendation.approach.for.a.tourism.systemA.hybrid.recommendation.approach.for.a.tourism.system
A.hybrid.recommendation.approach.for.a.tourism.system
 
Analysis on Recommended System for Web Information Retrieval Using HMM
Analysis on Recommended System for Web Information Retrieval Using HMMAnalysis on Recommended System for Web Information Retrieval Using HMM
Analysis on Recommended System for Web Information Retrieval Using HMM
 
A Novel Jewellery Recommendation System using Machine Learning and Natural La...
A Novel Jewellery Recommendation System using Machine Learning and Natural La...A Novel Jewellery Recommendation System using Machine Learning and Natural La...
A Novel Jewellery Recommendation System using Machine Learning and Natural La...
 
Agent technology for e commerce-recommendation systems
Agent technology for e commerce-recommendation systemsAgent technology for e commerce-recommendation systems
Agent technology for e commerce-recommendation systems
 
Recommender.system.presentation.pjug.01.21.2014
Recommender.system.presentation.pjug.01.21.2014Recommender.system.presentation.pjug.01.21.2014
Recommender.system.presentation.pjug.01.21.2014
 
Personalized recommendation for cold start users
Personalized recommendation for cold start usersPersonalized recommendation for cold start users
Personalized recommendation for cold start users
 
Different Location based Approaches in Recommendation Systems
Different Location based Approaches in Recommendation SystemsDifferent Location based Approaches in Recommendation Systems
Different Location based Approaches in Recommendation Systems
 
Impersonal Recommendation system on top of Hadoop
Impersonal Recommendation system on top of HadoopImpersonal Recommendation system on top of Hadoop
Impersonal Recommendation system on top of Hadoop
 
Tutorial on Bias in Rec Sys @ UMAP2020
Tutorial on Bias in Rec Sys @ UMAP2020Tutorial on Bias in Rec Sys @ UMAP2020
Tutorial on Bias in Rec Sys @ UMAP2020
 
A LOCATION-BASED RECOMMENDER SYSTEM FRAMEWORK TO IMPROVE ACCURACY IN USERBASE...
A LOCATION-BASED RECOMMENDER SYSTEM FRAMEWORK TO IMPROVE ACCURACY IN USERBASE...A LOCATION-BASED RECOMMENDER SYSTEM FRAMEWORK TO IMPROVE ACCURACY IN USERBASE...
A LOCATION-BASED RECOMMENDER SYSTEM FRAMEWORK TO IMPROVE ACCURACY IN USERBASE...
 
At4102337341
At4102337341At4102337341
At4102337341
 
lms final ppt.pptx
lms final ppt.pptxlms final ppt.pptx
lms final ppt.pptx
 

More from Zia Babar

Ways to Extract Variable Insights when Data is Scarse
Ways to Extract Variable Insights when Data is ScarseWays to Extract Variable Insights when Data is Scarse
Ways to Extract Variable Insights when Data is ScarseZia Babar
 
Key Influencers in Social Networks
Key Influencers in Social NetworksKey Influencers in Social Networks
Key Influencers in Social NetworksZia Babar
 
Lykaio Wang - Data Visualization in Web
Lykaio Wang - Data Visualization in WebLykaio Wang - Data Visualization in Web
Lykaio Wang - Data Visualization in WebZia Babar
 
Daria Voronova - The Art of Telling a Story
Daria Voronova - The Art of Telling a StoryDaria Voronova - The Art of Telling a Story
Daria Voronova - The Art of Telling a StoryZia Babar
 
Waterloo Data Science and Data Engineering Meetup - 2018-08-29
Waterloo Data Science and Data Engineering Meetup - 2018-08-29Waterloo Data Science and Data Engineering Meetup - 2018-08-29
Waterloo Data Science and Data Engineering Meetup - 2018-08-29Zia Babar
 
Cloud Data Storage and Database
Cloud Data Storage and DatabaseCloud Data Storage and Database
Cloud Data Storage and DatabaseZia Babar
 

More from Zia Babar (6)

Ways to Extract Variable Insights when Data is Scarse
Ways to Extract Variable Insights when Data is ScarseWays to Extract Variable Insights when Data is Scarse
Ways to Extract Variable Insights when Data is Scarse
 
Key Influencers in Social Networks
Key Influencers in Social NetworksKey Influencers in Social Networks
Key Influencers in Social Networks
 
Lykaio Wang - Data Visualization in Web
Lykaio Wang - Data Visualization in WebLykaio Wang - Data Visualization in Web
Lykaio Wang - Data Visualization in Web
 
Daria Voronova - The Art of Telling a Story
Daria Voronova - The Art of Telling a StoryDaria Voronova - The Art of Telling a Story
Daria Voronova - The Art of Telling a Story
 
Waterloo Data Science and Data Engineering Meetup - 2018-08-29
Waterloo Data Science and Data Engineering Meetup - 2018-08-29Waterloo Data Science and Data Engineering Meetup - 2018-08-29
Waterloo Data Science and Data Engineering Meetup - 2018-08-29
 
Cloud Data Storage and Database
Cloud Data Storage and DatabaseCloud Data Storage and Database
Cloud Data Storage and Database
 

Recently uploaded

Jual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
Jual Cytotec Asli Obat Aborsi No. 1 Paling ManjurJual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
Jual Cytotec Asli Obat Aborsi No. 1 Paling Manjurptikerjasaptiker
 
Harnessing the Power of GenAI for BI and Reporting.pptx
Harnessing the Power of GenAI for BI and Reporting.pptxHarnessing the Power of GenAI for BI and Reporting.pptx
Harnessing the Power of GenAI for BI and Reporting.pptxParas Gupta
 
Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1ranjankumarbehera14
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Researchmichael115558
 
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制vexqp
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Klinik kandungan
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteedamy56318795
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Valters Lauzums
 
Gartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxGartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxchadhar227
 
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制vexqp
 
Digital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham WareDigital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham WareGraham Ware
 
Capstone in Interprofessional Informatic // IMPACT OF COVID 19 ON EDUCATION
Capstone in Interprofessional Informatic  // IMPACT OF COVID 19 ON EDUCATIONCapstone in Interprofessional Informatic  // IMPACT OF COVID 19 ON EDUCATION
Capstone in Interprofessional Informatic // IMPACT OF COVID 19 ON EDUCATIONLakpaYanziSherpa
 
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样wsppdmt
 
The-boAt-Story-Navigating-the-Waves-of-Innovation.pptx
The-boAt-Story-Navigating-the-Waves-of-Innovation.pptxThe-boAt-Story-Navigating-the-Waves-of-Innovation.pptx
The-boAt-Story-Navigating-the-Waves-of-Innovation.pptxVivek487417
 
Ranking and Scoring Exercises for Research
Ranking and Scoring Exercises for ResearchRanking and Scoring Exercises for Research
Ranking and Scoring Exercises for ResearchRajesh Mondal
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...gajnagarg
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNKTimothy Spann
 
怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制
怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制
怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制vexqp
 
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...Health
 
Switzerland Constitution 2002.pdf.........
Switzerland Constitution 2002.pdf.........Switzerland Constitution 2002.pdf.........
Switzerland Constitution 2002.pdf.........EfruzAsilolu
 

Recently uploaded (20)

Jual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
Jual Cytotec Asli Obat Aborsi No. 1 Paling ManjurJual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
Jual Cytotec Asli Obat Aborsi No. 1 Paling Manjur
 
Harnessing the Power of GenAI for BI and Reporting.pptx
Harnessing the Power of GenAI for BI and Reporting.pptxHarnessing the Power of GenAI for BI and Reporting.pptx
Harnessing the Power of GenAI for BI and Reporting.pptx
 
Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1Lecture_2_Deep_Learning_Overview-newone1
Lecture_2_Deep_Learning_Overview-newone1
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
怎样办理圣路易斯大学毕业证(SLU毕业证书)成绩单学校原版复制
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Gartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptxGartner's Data Analytics Maturity Model.pptx
Gartner's Data Analytics Maturity Model.pptx
 
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
怎样办理伦敦大学城市学院毕业证(CITY毕业证书)成绩单学校原版复制
 
Digital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham WareDigital Transformation Playbook by Graham Ware
Digital Transformation Playbook by Graham Ware
 
Capstone in Interprofessional Informatic // IMPACT OF COVID 19 ON EDUCATION
Capstone in Interprofessional Informatic  // IMPACT OF COVID 19 ON EDUCATIONCapstone in Interprofessional Informatic  // IMPACT OF COVID 19 ON EDUCATION
Capstone in Interprofessional Informatic // IMPACT OF COVID 19 ON EDUCATION
 
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
一比一原版(UCD毕业证书)加州大学戴维斯分校毕业证成绩单原件一模一样
 
The-boAt-Story-Navigating-the-Waves-of-Innovation.pptx
The-boAt-Story-Navigating-the-Waves-of-Innovation.pptxThe-boAt-Story-Navigating-the-Waves-of-Innovation.pptx
The-boAt-Story-Navigating-the-Waves-of-Innovation.pptx
 
Ranking and Scoring Exercises for Research
Ranking and Scoring Exercises for ResearchRanking and Scoring Exercises for Research
Ranking and Scoring Exercises for Research
 
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
Top profile Call Girls In Chandrapur [ 7014168258 ] Call Me For Genuine Model...
 
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24  Building Real-Time Pipelines With FLaNKDATA SUMMIT 24  Building Real-Time Pipelines With FLaNK
DATA SUMMIT 24 Building Real-Time Pipelines With FLaNK
 
怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制
怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制
怎样办理纽约州立大学宾汉姆顿分校毕业证(SUNY-Bin毕业证书)成绩单学校原版复制
 
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
+97470301568>>weed for sale in qatar ,weed for sale in dubai,weed for sale in...
 
Switzerland Constitution 2002.pdf.........
Switzerland Constitution 2002.pdf.........Switzerland Constitution 2002.pdf.........
Switzerland Constitution 2002.pdf.........
 

Introduction to Recommendation Systems

  • 1. Session 1 - Introduction to Recommendation Systems Zia Babar Waterloo Data Science & Data Engineering Meetup January 2018
  • 3. About Mary Mary has 12 years experience in data science of which 7 years in academics and 5 year in industry. She has worked at various startups at which she has had roles like NLP-developer and data scientist. Currently she is a senior data engineer at TalkIQ.
  • 4. Zia has 19 years of professional industry experience, with the most recent 8 years being in technical leadership roles, where he led various engineering teams pertaining to the design, development and deployment of enterprise applications with a particular focus on incorporating machine learning practices and cognitive services into software applications. Presently Zia is finishing up his PhD at the University of Toronto with particular research interests on designing enterprise cognitive systems keeping in mind organizational adoption challenges. About Zia
  • 5. “This meetup is about sharing knowledge & learning about machine learning, data engineering, data analytics, parallel algorithms, and distributed systems. It provides an opportunity for data engineers, data scientists, software engineers, statisticians, and innovators to get together and make connections. Our events … are always intended to provide new ideas, concepts and tools to help you solve your hardest problems.” Waterloo Data Science & Data Engineering Meetup
  • 6. 1. Introducing Recommendation Systems a. Recommendation systems and their approaches b. Advantages and disadvantages of each c. Exploring data sets 2. Building Recommendation Models a. Building models for main approaches b. Training and testing models 3. Productizing Recommendation Systems a. Architecting a complete solution b. Deploying recommendation systems Recommendation Systems Series
  • 8. What are Recommendation Systems? ● Recommender systems (aka Recommenders) are used to help to predict the preference that a user would have for an item. ● They analyze huge volumes of data, such as product information and user information, to provide relevant suggestions by leveraging different probabilistic and machine learning approaches.
  • 9. What are Recommendation Systems? Source: Gorakala, Suresh Kumar. Building Recommendation Engines. Packt Publishing Ltd, 2016.
  • 13. ● Collaborative filtering approaches build a model from a user's past behaviour as well as similar decisions made by other users. ● This model is then used to predict items that the user may have an interest in. ● Memory based ○ User-based collaborative filtering ○ Item-based collaborative filtering ● Model Based ○ Clustering ○ Association ○ … Collaborative Filtering
  • 14. Advantages ● Recommenders utilizing this approach are easy to implement and do not come with a high-computation cost. ● Neither the content information of the products nor the users' profile information is required for building recommendations ● New items can be recommended to users giving a surprise factor to the users.
  • 15. ● Poor performance when data (such as user ratings) is sparse. ● High computing cost for calculating the similarity values for all the users is very high if the data is very large. ● The system suffers with performance if they are sparse, which is very common in the real world where users will rate only a few items from a large catalog. ● Rapidly changing user profiles or user inputs require re-computation of the similarity values. ● Accuracy is low as user / product content information is not considered during recommendation generation. Disadvantages
  • 16. User-based Collaborative Filtering ● Users will be provided with preferences based on their similarity to other user groups. ● Recommend new items to a user based on how similar user groups have indicated preference. ● Similarity of users can be determined through different methods, such as clustering, euclidean distance, etc. ● The similarity between users is calculated based on the ratings given by users.
  • 17. User-based Collaborative Filtering Source: http://www.salemmarafi.com/code/collaborative-filtering-r/
  • 18. Item-based Collaborative Filtering ● Users will be provided with preferences based on the similarity of other items. ● Recommend new items to a user based on the user’s preference for similar items. ● Similarity of items can be determined through different methods, such as cosine similarity.
  • 19. Item-based Collaborative Filtering Source: http://www.salemmarafi.com/code/collaborative-filtering-r/
  • 20. ● Model-Based Recommender Systems leverage supervised and unsupervised machine learning approaches, and matrix approaches (such as Matrix Factorization and single value decomposition) to generate user preferences. ● Using available historical data, a model is built with weights learned automatically and new predictions regarding the products will be made using the learned weights. ● The final results are ranked in a specific order before making recommendations. Model-Based Recommender Systems
  • 21. ● Probabilistic ○ Use prior probabilities from available data to generate a ranked list of recommendations through calculating the probability of liking/disliking of a product for each user. E.g. algorithm is Naïve Bayes. ● Mathematical ○ Apply mathematical approaches on simple matrices based on the ratings or interaction information of users on products to predict the missing ratings for the users. E.g. approaches Matrix Factorization, Single Value Decomposition. ● Machine Learning ○ Applied for collaborative, content based, context aware, and hybrid recommender systems. ○ Approaches such as Logistic Regression, KNN classification, Decision Trees, SVM, Clustering etc. Model-Based Recommender Systems
  • 23. Personalized Recommendation Systems ● Personalized recommendation systems provides recommendations to users by taking into account information about the user and is customized to the user preferences. ● Two primary approaches, ○ Content-based recommendation systems utilize discrete characteristics of items to recommend additional items with similar properties. ○ Context-aware recommender systems utilize certain contextual factors, such as time, location, and the purchasing purpose, that identify the context in which recommendations are provided.
  • 24. Advantages ● Recommendations are generated using the user preferences alone rather than the user community. ● Can be employed in real time as the model doesn't need to load all the data for processing or generating recommendations. ● High accuracy compared to collaborative approaches as product content is utilized rather than just rating information. ● Easily handling of the “cold-start” problem. ○ the recommender cannot draw any inferences for users or items about which sufficient or relevant information doesn’t yet exist.
  • 25. Disadvantages ● Generated recommendations are narrowly defined for a user and the user would missed out on recommendations based on communal preferences. ● New products that are not related to existing user preferences will not be shown to the user. ● Trending information would be missed by the user.
  • 26. ● Content-based recommenders produce more personalized recommendations for a user by leveraging content information of items. ● Recommendations are not just based on the similarity of items or users, but also considers individual preferences and/or item contents. ● Particularly important when dealing with the “cold-start” problem. Content-Based Recommendation Systems
  • 27. Content-Based Recommendation Systems Source: Gorakala, Suresh Kumar. Building Recommendation Engines. Packt Publishing Ltd, 2016.
  • 28. Context-Aware Recommendation Systems ● Context-aware recommendation systems also consider the context when providing personalized user recommendations. ● Context refers to the present state of the user and can include attributes such as location, weather, time, date, mood etc. ● Two approaches are used here, ○ Pre-filtering: When context information is applied to user profile and product content before the recommendation is generated. ○ Post-filtering: When the context information is applied to a set of recommendations to obtain a final recommendation.
  • 29. Context-Aware Recommendation Systems Source: Gorakala, Suresh Kumar. Building Recommendation Engines. Packt Publishing Ltd, 2016.
  • 31. Hybrid Recommender Systems ● Hybrid recommender systems combine both collaborative filtering and content-based approaches.
  • 32. Hybrid Recommender Systems ● Some methods used to building such hybrid systems include, ○ Weighted method ○ Mixed method ○ Cascade method ○ Feature combination method
  • 33. Advantages ● Cold-start problem and data sparsity can be handled. ● Much more robust and scalable than any of the individual models. ● Combination of methods leads to an improvement in accuracy.
  • 35. ● Ricci, Francesco, Lior Rokach, and Bracha Shapira. "Introduction to recommender systems handbook." In Recommender systems handbook, pp. 1-35. springer US, 2011. ● F. Maxwell Harper and Joseph A. Konstan. 2015. The MovieLens Datasets: History and Context. ACM Transactions on Interactive Intelligent Systems (TiiS) 5, 4, Article 19, DOI=http://dx.doi.org/10.1145/2827872 ● Gorakala, Suresh Kumar. Building Recommendation Engines. Packt Publishing Ltd, 2016. ● Suresh Kumar. Building Recommendation Engines. Packt Publishing Ltd, 2016 ● Recommender Systems An Introduction by Dietmar Jannach ● Mining Massive Datasets by Jure Leskovec, Anand Rajaraman, Jeff Ullman (www.mmds.org)https://cambridgespark.com/content/tutorials/implementing-your-own-recommender-sy stems-in-Python/index.html ● Coursera course on Recommender Systems, by University of Washington ● Coursera course on Recommender Systems, by University of Minnesota References
  • 37. Environment ● Language and Libraries ○ Python ○ Pandas ○ Numpy ○ Machine learning library like scikit-learn ● Environment ○ Azure Notebook (https://notebooks.azure.com/) ○ Google Colab (https://colab.research.google.com/notebooks) ○ GitHub Repo (https://github.com/ziababar/recommender) ● Dataset used ○ MovieLens Data: http://files.grouplens.org/datasets/movielens/ml-100k.zip