Building a Recommender System Using Collaborative Filtering (CF)

This document discusses building a recommender system using collaborative filtering. It begins with an introduction to collaborative filtering and common methods like memory-based and model-based collaborative filtering. It then explains the process for memory-based collaborative filtering including similarity calculation, determining peer groups, and making recommendations. Model-based collaborative filtering is introduced using matrix factorization to predict ratings. The document concludes with the steps to build a recommender system which includes understanding data, pre-processing, building the collaborative filtering model, training and testing the model, and evaluating accuracy.

Building a recommender system using Collaborative Filtering (CF)
Sarah Mestiri
Machine Learning Engineer

Introduction
“Customers expect to be treated as humans, not numbers.”
“State of the Connected Customer” report, Salesforce, 2016

Outline
- Experiencing personalization through collaborative filtering
- Common methods in CF
- Building a recommender system using CF

Experiencing personalization through collaborative filtering

Memory-based CF
Types
- User-based collaborative filtering: recommendations based on the ratings of users similar to the target user.
- Item-based collaborative filtering: recommendations based on the target user's own ratings of similar items.

Memory-based CF
Process explained
Prediction:
1- Similarity calculation between items (users).
2- "Peer group": top-k most similar items (users).

      i1   i2   i3   i4   i5
u1     3    5    1    1    ?
u2     1    0    0    2    1
u3     2    5    1    ?    ?
u4     0    1    1    ?    ?
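
The two steps above can be sketched on the slide's rating matrix. This is a minimal user-based illustration, not the author's implementation: it assumes that "?" means "not rated" (encoded as NaN), that a 0 in the table is a real rating, and that cosine similarity is computed only over items both users have rated. The helper names `cosine_on_corated` and `peer_group` are made up for this example.

```python
import numpy as np

# Ratings from the slide; "?" (unknown) is encoded as NaN.
# Assumption: a 0 in the table is an actual rating, not a missing value.
R = np.array([
    [3, 5, 1, 1, np.nan],       # u1
    [1, 0, 0, 2, 1],            # u2
    [2, 5, 1, np.nan, np.nan],  # u3
    [0, 1, 1, np.nan, np.nan],  # u4
], dtype=float)


def cosine_on_corated(a, b):
    """Cosine similarity restricted to the items both users have rated."""
    mask = ~np.isnan(a) & ~np.isnan(b)
    if not mask.any():
        return 0.0
    x, y = a[mask], b[mask]
    denom = np.linalg.norm(x) * np.linalg.norm(y)
    return float(x @ y / denom) if denom > 0 else 0.0


def peer_group(R, target, k):
    """Return (user index, similarity) for the k users most similar to `target`."""
    sims = [(cosine_on_corated(R[target], R[u]), u)
            for u in range(R.shape[0]) if u != target]
    sims.sort(reverse=True)
    return [(u, s) for s, u in sims[:k]]


# Peer group of u1 (row index 0) with k = 2.
peers = peer_group(R, target=0, k=2)
for user, sim in peers:
    print(f"u{user + 1}: similarity {sim:.3f}")
```

The same functions applied to the transposed matrix (`R.T`) would give an item-based variant, since items then play the role of rows.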

Memory-based CF
Implementation & challenges
Step 1: Choice of similarity measure
- Pearson (mean-centered ratings)
- Cosine
- Adjusted cosine
Challenge: what if one user tends to rate generously while another rates harshly?
Step 2: Determination of peer group
- Top-k most similar items (users)
Step 3: Recommendation
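
A hedged sketch of the three steps, using Pearson similarity so that a generous rater and a harsh rater become comparable. Several choices here are assumptions rather than anything stated on the slides: each user's mean is taken over all of that user's ratings, only positively correlated peers are kept, and the prediction is the target user's mean plus a similarity-weighted average of the peers' mean-centered ratings. The rating matrix is the one from the earlier slide, with "?" encoded as NaN.

```python
import numpy as np

R = np.array([
    [3, 5, 1, 1, np.nan],       # u1
    [1, 0, 0, 2, 1],            # u2
    [2, 5, 1, np.nan, np.nan],  # u3
    [0, 1, 1, np.nan, np.nan],  # u4
], dtype=float)


def pearson(a, b):
    """Step 1: Pearson similarity on co-rated items, centered on each user's mean."""
    mask = ~np.isnan(a) & ~np.isnan(b)
    if mask.sum() < 2:
        return 0.0
    x = a[mask] - np.nanmean(a)
    y = b[mask] - np.nanmean(b)
    denom = np.linalg.norm(x) * np.linalg.norm(y)
    return float(x @ y / denom) if denom > 0 else 0.0


def predict(R, user, item, k=2):
    """Steps 2 and 3: pick the top-k peers who rated `item`, then predict."""
    mu = np.nanmean(R, axis=1)  # per-user mean rating
    candidates = [(pearson(R[user], R[v]), v)
                  for v in range(R.shape[0])
                  if v != user and not np.isnan(R[v, item])]
    # Assumption: drop weakly useful peers by keeping positive correlations only.
    peers = sorted((c for c in candidates if c[0] > 0), reverse=True)[:k]
    if not peers:
        return float(mu[user])  # fall back to the user's own mean
    num = sum(s * (R[v, item] - mu[v]) for s, v in peers)
    den = sum(abs(s) for s, _ in peers)
    return float(mu[user] + num / den)


# Predict u3's rating of i4 (row index 2, column index 3).
print(round(predict(R, user=2, item=3), 3))
```

In this example u1 is the only positively correlated peer who rated i4. u1 gave it 1, which is 1.5 below u1's own mean of 2.5, so the prediction is u3's mean shifted down by the same 1.5 rather than u1's raw rating.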

Model-based CF
Introduction
Where do neighborhood methods fail?
Challenges:
- Computation
- Scalability
- Sparsity
- Accuracy

Model-based CF
What to recommend to Alice?

Ratings matrix:
          Matrix  Inception  Frozen  King-kong  Zootopia
Alice       -1       -1         1        1         ?
Patrick      1        0         0       -1         1
John         1       -1        -1        ?         ?
Sara         0        1         1        ?         ?

Model-based CF
Matrix factorization

User factors:
          Children  Action
Alice         1       -1
Patrick      -1        1
John         -1        1
Sara          1        0

X

Item factors:
          Matrix  Inception  Frozen  King-kong  Zootopia
Children    -1       -1         1        1         1
Action       1        1        -1       -1         0

Latent factors: Children, Action
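
A small sketch of how the two factor matrices from the slide combine into predicted scores. The names `U` (user factors) and `V` (item factors) follow the deck's later notation. The factor values on the slide are illustrative, so the product should be read as a rough score rather than an exact reconstruction of the observed ratings.

```python
import numpy as np

users = ["Alice", "Patrick", "John", "Sara"]
movies = ["Matrix", "Inception", "Frozen", "King-kong", "Zootopia"]

# User factors U (users x latent factors); columns: Children, Action.
U = np.array([
    [1, -1],   # Alice
    [-1, 1],   # Patrick
    [-1, 1],   # John
    [1, 0],    # Sara
], dtype=float)

# Item factors V (latent factors x movies); rows: Children, Action.
V = np.array([
    [-1, -1, 1, 1, 1],   # Children
    [1, 1, -1, -1, 0],   # Action
], dtype=float)

# Every predicted score is the dot product of a user row and a movie column.
scores = U @ V

alice_zootopia = scores[users.index("Alice"), movies.index("Zootopia")]
print(f"Predicted score for Alice on Zootopia: {alice_zootopia}")
```

Alice's score for Zootopia comes out positive because she loads positively on the Children factor and Zootopia does too, which suggests recommending it to her.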

Model-based CF
Ratings prediction
How can one determine the factor matrices U and V?
Goal: minimize the difference between predicted ratings and observed ratings.
Methods: Stochastic Gradient Descent (SGD, ~SVD) or Alternating Least Squares (ALS)
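
One way to learn U and V with SGD is sketched below, under stated assumptions: the loss is the squared error on observed ratings plus an L2 penalty, and the learning rate `lr`, penalty `reg`, number of factors and epoch count are arbitrary illustrative values. Here `V` is stored as items x factors, so a prediction is `U[u] @ V[i]`. The data is the Alice example with "?" encoded as NaN.

```python
import numpy as np


def factorize_sgd(R, n_factors=2, lr=0.02, reg=0.02, n_epochs=500, seed=0):
    """Learn U (users x factors) and V (items x factors) from observed entries of R."""
    rng = np.random.default_rng(seed)
    n_users, n_items = R.shape
    U = rng.normal(scale=0.1, size=(n_users, n_factors))
    V = rng.normal(scale=0.1, size=(n_items, n_factors))
    observed = [(u, i) for u in range(n_users) for i in range(n_items)
                if not np.isnan(R[u, i])]
    for _ in range(n_epochs):
        # Visit the observed ratings in a fresh random order each epoch.
        for idx in rng.permutation(len(observed)):
            u, i = observed[idx]
            err = R[u, i] - U[u] @ V[i]
            u_old = U[u].copy()
            # Gradient step on the regularized squared error of this one rating.
            U[u] += lr * (err * V[i] - reg * U[u])
            V[i] += lr * (err * u_old - reg * V[i])
    return U, V


def rmse_observed(R, U, V):
    """RMSE between R and U V^T, computed on observed entries only."""
    mask = ~np.isnan(R)
    return float(np.sqrt(np.mean((R - U @ V.T)[mask] ** 2)))


R = np.array([
    [-1, -1, 1, 1, np.nan],       # Alice
    [1, 0, 0, -1, 1],             # Patrick
    [1, -1, -1, np.nan, np.nan],  # John
    [0, 1, 1, np.nan, np.nan],    # Sara
], dtype=float)

U, V = factorize_sgd(R)
print("training RMSE:", round(rmse_observed(R, U, V), 3))
print("Alice / Zootopia score:", round(float(U[0] @ V[4]), 3))
```

ALS minimizes the same kind of objective but alternates between solving for U with V fixed and for V with U fixed, each step being a least-squares problem.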

Step 1: Understand your data
Visualizations (Pandas, Matplotlib, Seaborn)

Step 2: Pre-process your data
Cleanup, merge, etc. (Pandas, NumPy)
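
Steps 1 and 2 might look like the following with Pandas. The data frames, column names and values are hypothetical stand-ins, since the deck does not show its dataset; the sketch only illustrates summary statistics, cleanup, a merge, and a pivot into a user-item matrix.

```python
import pandas as pd

# Hypothetical toy data standing in for real ratings and item metadata.
ratings = pd.DataFrame({
    "user_id": [1, 1, 2, 2, 2, 3, 3],
    "item_id": [10, 20, 10, 30, 30, 20, 30],
    "rating": [4.0, 5.0, 3.0, 2.0, 2.0, None, 4.0],
})
items = pd.DataFrame({
    "item_id": [10, 20, 30],
    "title": ["Matrix", "Inception", "Frozen"],
})

# Step 1: understand the data (distribution of ratings, activity per user).
print(ratings["rating"].describe())
print(ratings.groupby("user_id")["rating"].count())

# Step 2: clean up (missing ratings, duplicate user-item pairs) and merge titles.
clean = (
    ratings.dropna(subset=["rating"])
    .drop_duplicates(subset=["user_id", "item_id"])
    .merge(items, on="item_id", how="left")
)

# User-item matrix; unrated pairs stay NaN.
user_item = clean.pivot_table(index="user_id", columns="title", values="rating")
print(user_item)
```

Plotting the rating distribution, for instance with `ratings["rating"].hist()` when Matplotlib is installed, would cover the visualization part of Step 1.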

Step 3: Build your ML model
Implement the CF method (KNN, SGD, ALS) yourself or use an implementation from an available library (scikit-learn, Surprise, Spark MLlib)

Step 4: Train your model
Predict ratings on the train set.

Step 5: Test your model
Predict ratings on the test set.

Step 6: Evaluate your model:
Measure accuracy using RMSE
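
Steps 4 to 6 can be sketched end to end with NumPy. Everything concrete here is an assumption made for illustration: the (user, item, rating) triples are invented, 20% of them are held out, and a per-user mean predictor stands in for the real CF model so the example stays short.

```python
import numpy as np


def rmse(y_true, y_pred):
    """Root mean squared error between observed and predicted ratings."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    return float(np.sqrt(np.mean((y_true - y_pred) ** 2)))


# Hypothetical (user, item, rating) triples.
ratings = np.array([
    (0, 0, 4), (0, 1, 5), (0, 2, 3), (1, 0, 2), (1, 1, 1),
    (1, 3, 2), (2, 1, 4), (2, 2, 4), (2, 3, 5), (3, 0, 3),
], dtype=float)

# Random split: hold out 20% of the ratings for testing.
rng = np.random.default_rng(42)
order = rng.permutation(len(ratings))
n_test = len(ratings) // 5
test, train = ratings[order[:n_test]], ratings[order[n_test:]]

# Stand-in model: each user's mean training rating,
# falling back to the global mean for users unseen in training.
global_mean = train[:, 2].mean()
user_mean = {u: train[train[:, 0] == u, 2].mean() for u in np.unique(train[:, 0])}


def predict(user):
    return user_mean.get(user, global_mean)


# Steps 4 and 5: predict on the train set and on the test set.
train_pred = [predict(u) for u in train[:, 0]]
test_pred = [predict(u) for u in test[:, 0]]

# Step 6: evaluate with RMSE.
train_rmse = rmse(train[:, 2], train_pred)
test_rmse = rmse(test[:, 2], test_pred)
print(f"train RMSE: {train_rmse:.3f}, test RMSE: {test_rmse:.3f}")
```

A test RMSE much higher than the train RMSE is the usual sign of overfitting, which is why both are computed.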

GitHub repo: https://github.com/SarahMestiri/RecommenderSystems

Thank you!
mestiri.sa@gmail.com
Sarahmestiri.com
@mestirisarah

Editor's Notes

Advantages of item-based CF: it leverages the user's own ratings, which gives more consistent recommendations. It is also more stable when ratings change, because new items are added less often than new users.
Similarity challenges: user bias and the number of common ratings between users. Peer group challenges: weakly or negatively correlated users, and the impact of the long tail.
Compute all possible rating predictions for the relevant user-item pairs (e.g. all items for a particular user) and then rank them. This is computationally expensive and can raise a MemoryError.
Exploit the fact that significant portions of the rows and columns of the data matrix are highly correlated. Data redundancy plus high correlation implies a low-rank matrix.
State the advantages of ALS
