SlideShare a Scribd company logo
1 of 19
Please turn off your webcam
If you are joining from a mobile phone
be sure to click on
Join via Device Audio
We are waiting for other participants to join
We will begin at 4:30 PM IST
Mihir Thakkar
Founder and Instructor
hello@codeheroku.com
Building a Movie
Recommendation
Engine
SESSION
OBJECTIVES
● Quick Recap
● Recommendation
Systems
● Revise Some Math
● Build it!
www.codeheroku.com Building a Movie Recommendation Engine
Machine Learning Pipeline
Data Extraction
and Cleaning
Build ML
Model
Build Software
Infrastructure
www.codeheroku.com Building a Movie Recommendation Engine
Supervised Machine Learning
House Size (Sq feet) Location Age (years) Prize (Lakh
Rs)
500 Mumbai 2 70
1500 Pune 3 100
2000 Banglore 4 60
1000 Mumbai 2 ?
3000 Pune 10 ?
Training Data
Test Data
Features
www.codeheroku.com Building a Movie Recommendation Engine
2. Netflix1. Amazon
What’s Common?
QUIZ
Who are the Users and Items for RE in the following platforms?
1. LinkedIn
2. Amazon
3. Netflix
4. Facebook
Users: Members; Items: Members
Users: Members; Items: Products (E.g. Books, Electronics)
Users: Members; Items: Movie
Users: Members; Items: Members
www.codeheroku.com Building a Movie Recommendation Engine
Implementing A Recommender System
1. Popularity / Rating Based System
www.codeheroku.com Building a Movie Recommendation Engine
Implementing A Recommender System
3. Collaborative Filtering2. Content Based
www.codeheroku.com Building a Movie Recommendation Engine
Similarity Between Content
Text A: London Paris London
Text B: Paris Paris London
www.codeheroku.com Building a Movie Recommendation Engine
Paris
London0 1 2 3 4
5
4
3
2
1
Text A: London Paris London (London: 2, Paris: 1)
Text B: Paris Paris London (London: 1, Paris: 2)
Distance Between Two Vectors
(1,2)
(2,1)
D
www.codeheroku.com Building a Movie Recommendation Engine
When To Use Angular Distance?
www.codeheroku.com Building a Movie Recommendation Engine
Quiz
In which of the following scenarios you are most likely to use Cosine
Similarity measure?
1. Determining gender based on shoe length, height, weight etc.
2. Comparing similarities between documents of uneven size
3. Predicting rainfall based on city location, temperature, humidity etc.
www.codeheroku.com Building a Movie Recommendation Engine
Quiz Given the similarity matrix below, which movie is most
similar to Movie 0 ?
www.codeheroku.com Building a Movie Recommendation Engine
Let’s Build It
http://codeheroku.com/static/workshop/datasets/movie_recommender.zip
http://www.codeheroku.com/static/workshop/hw/movie_recommendation/assignment.pdf
www.codeheroku.com Building a Movie Recommendation Engine
www.codeheroku.com Building a Movie Recommendation Engine
Thank you!
www.codeheroku.com Building a Movie Recommendation Engine
Alternative Links:
DataSet: https://drive.google.com/file/d/1sJ9N2T2zDQwvywHCC6RCO68olL97Mp4O/view?usp=sharing
Assignment: https://drive.google.com/file/d/1EKjBr0id9_HtzGJzrNs8yGZ5MWWExx-b/view?usp=sharing
www.codeheroku.com Building a Movie Recommendation Engine
Recommendation Systems
Users Items

More Related Content

Similar to Building a movie recommendation engine in Python using Scikit-Learn - Code Heroku

Web Technology and Standards Tutorial
Web Technology and Standards Tutorial Web Technology and Standards Tutorial
Web Technology and Standards Tutorial Jonathan Jeon
 
Minor Project Presentation website (1).pdf
Minor Project Presentation  website (1).pdfMinor Project Presentation  website (1).pdf
Minor Project Presentation website (1).pdfAmanKumar411419
 
Juarez Barbosa Junior - Microsoft - OSL19
Juarez Barbosa Junior - Microsoft - OSL19Juarez Barbosa Junior - Microsoft - OSL19
Juarez Barbosa Junior - Microsoft - OSL19marketingsyone
 
Presentation of group_k
Presentation of group_kPresentation of group_k
Presentation of group_ksarasara802
 
The Trinity in Exponential Technologies: Open Source, Blockchain and Microsof...
The Trinity in Exponential Technologies: Open Source, Blockchain and Microsof...The Trinity in Exponential Technologies: Open Source, Blockchain and Microsof...
The Trinity in Exponential Technologies: Open Source, Blockchain and Microsof...Juarez Junior
 
iLoveBolly Drupal Case Study I Love Bolly
iLoveBolly Drupal Case Study    I Love BollyiLoveBolly Drupal Case Study    I Love Bolly
iLoveBolly Drupal Case Study I Love BollyBlisstering Solutions
 
Class 8th Book key board Chapter 5 Excersie.pptx
Class 8th Book key board Chapter 5 Excersie.pptxClass 8th Book key board Chapter 5 Excersie.pptx
Class 8th Book key board Chapter 5 Excersie.pptxasgharali79356
 
Ultimate Guide to Live Streaming
Ultimate Guide to Live StreamingUltimate Guide to Live Streaming
Ultimate Guide to Live StreamingPaul Richards
 
Lentin joseph learning robotics using python design, simulate, program, an...
Lentin joseph   learning robotics using python  design, simulate, program, an...Lentin joseph   learning robotics using python  design, simulate, program, an...
Lentin joseph learning robotics using python design, simulate, program, an...Rajmeet Singh
 
Techfest design patterns
Techfest design patternsTechfest design patterns
Techfest design patternsRookieOne
 
Microsoft IoT & Data OpenHack Zürich
Microsoft IoT & Data OpenHack ZürichMicrosoft IoT & Data OpenHack Zürich
Microsoft IoT & Data OpenHack ZürichSascha Corti
 
Airstream extended
Airstream extendedAirstream extended
Airstream extendedJitin Pillai
 
Microservices for the Masses with Spring Boot and JHipster - Chicago JUG 2018
Microservices for the Masses with Spring Boot and JHipster - Chicago JUG 2018Microservices for the Masses with Spring Boot and JHipster - Chicago JUG 2018
Microservices for the Masses with Spring Boot and JHipster - Chicago JUG 2018Matt Raible
 
ANIn Pune July 2023 |Prompt Engineering and AI first SDLC by Abhijit Shah
ANIn Pune July 2023 |Prompt Engineering and AI first SDLC by Abhijit ShahANIn Pune July 2023 |Prompt Engineering and AI first SDLC by Abhijit Shah
ANIn Pune July 2023 |Prompt Engineering and AI first SDLC by Abhijit ShahAgileNetwork
 
201507_NeoHsu_Portfolio
201507_NeoHsu_Portfolio201507_NeoHsu_Portfolio
201507_NeoHsu_PortfolioNeo Hsu
 
Hexagonal Symfony - SymfonyCon Amsterdam 2019
Hexagonal Symfony - SymfonyCon Amsterdam 2019Hexagonal Symfony - SymfonyCon Amsterdam 2019
Hexagonal Symfony - SymfonyCon Amsterdam 2019Matthias Noback
 

Similar to Building a movie recommendation engine in Python using Scikit-Learn - Code Heroku (20)

Web Technology and Standards Tutorial
Web Technology and Standards Tutorial Web Technology and Standards Tutorial
Web Technology and Standards Tutorial
 
AW-Report
AW-ReportAW-Report
AW-Report
 
Minor Project Presentation website (1).pdf
Minor Project Presentation  website (1).pdfMinor Project Presentation  website (1).pdf
Minor Project Presentation website (1).pdf
 
Juarez Barbosa Junior - Microsoft - OSL19
Juarez Barbosa Junior - Microsoft - OSL19Juarez Barbosa Junior - Microsoft - OSL19
Juarez Barbosa Junior - Microsoft - OSL19
 
Presentation of group_k
Presentation of group_kPresentation of group_k
Presentation of group_k
 
The Trinity in Exponential Technologies: Open Source, Blockchain and Microsof...
The Trinity in Exponential Technologies: Open Source, Blockchain and Microsof...The Trinity in Exponential Technologies: Open Source, Blockchain and Microsof...
The Trinity in Exponential Technologies: Open Source, Blockchain and Microsof...
 
iLoveBolly Drupal Case Study I Love Bolly
iLoveBolly Drupal Case Study    I Love BollyiLoveBolly Drupal Case Study    I Love Bolly
iLoveBolly Drupal Case Study I Love Bolly
 
Class 8th Book key board Chapter 5 Excersie.pptx
Class 8th Book key board Chapter 5 Excersie.pptxClass 8th Book key board Chapter 5 Excersie.pptx
Class 8th Book key board Chapter 5 Excersie.pptx
 
Ultimate Guide to Live Streaming
Ultimate Guide to Live StreamingUltimate Guide to Live Streaming
Ultimate Guide to Live Streaming
 
Lentin joseph learning robotics using python design, simulate, program, an...
Lentin joseph   learning robotics using python  design, simulate, program, an...Lentin joseph   learning robotics using python  design, simulate, program, an...
Lentin joseph learning robotics using python design, simulate, program, an...
 
Techfest design patterns
Techfest design patternsTechfest design patterns
Techfest design patterns
 
Atc workshop
Atc workshopAtc workshop
Atc workshop
 
Maker Boot Camp
Maker Boot CampMaker Boot Camp
Maker Boot Camp
 
Microsoft IoT & Data OpenHack Zürich
Microsoft IoT & Data OpenHack ZürichMicrosoft IoT & Data OpenHack Zürich
Microsoft IoT & Data OpenHack Zürich
 
Adobemax2009na
Adobemax2009naAdobemax2009na
Adobemax2009na
 
Airstream extended
Airstream extendedAirstream extended
Airstream extended
 
Microservices for the Masses with Spring Boot and JHipster - Chicago JUG 2018
Microservices for the Masses with Spring Boot and JHipster - Chicago JUG 2018Microservices for the Masses with Spring Boot and JHipster - Chicago JUG 2018
Microservices for the Masses with Spring Boot and JHipster - Chicago JUG 2018
 
ANIn Pune July 2023 |Prompt Engineering and AI first SDLC by Abhijit Shah
ANIn Pune July 2023 |Prompt Engineering and AI first SDLC by Abhijit ShahANIn Pune July 2023 |Prompt Engineering and AI first SDLC by Abhijit Shah
ANIn Pune July 2023 |Prompt Engineering and AI first SDLC by Abhijit Shah
 
201507_NeoHsu_Portfolio
201507_NeoHsu_Portfolio201507_NeoHsu_Portfolio
201507_NeoHsu_Portfolio
 
Hexagonal Symfony - SymfonyCon Amsterdam 2019
Hexagonal Symfony - SymfonyCon Amsterdam 2019Hexagonal Symfony - SymfonyCon Amsterdam 2019
Hexagonal Symfony - SymfonyCon Amsterdam 2019
 

More from codeheroku

Reinforcement Learning with OpenAI Gym - Value Iteration Frozen Lake - Code H...
Reinforcement Learning with OpenAI Gym - Value Iteration Frozen Lake - Code H...Reinforcement Learning with OpenAI Gym - Value Iteration Frozen Lake - Code H...
Reinforcement Learning with OpenAI Gym - Value Iteration Frozen Lake - Code H...codeheroku
 
Introduction to Reinforcement Learning - Code Heroku
Introduction to Reinforcement Learning - Code HerokuIntroduction to Reinforcement Learning - Code Heroku
Introduction to Reinforcement Learning - Code Herokucodeheroku
 
Introduction to Unsupervised Learning - Code Heroku
Introduction to Unsupervised Learning - Code HerokuIntroduction to Unsupervised Learning - Code Heroku
Introduction to Unsupervised Learning - Code Herokucodeheroku
 
Building Web Apps with Python Part 2 - Code Heroku
Building Web Apps with Python Part 2 - Code HerokuBuilding Web Apps with Python Part 2 - Code Heroku
Building Web Apps with Python Part 2 - Code Herokucodeheroku
 
Building Web Apps with Python - Code Heroku
Building Web Apps with Python - Code HerokuBuilding Web Apps with Python - Code Heroku
Building Web Apps with Python - Code Herokucodeheroku
 
Introduction to Machine Learning - Code Heroku
Introduction to Machine Learning - Code HerokuIntroduction to Machine Learning - Code Heroku
Introduction to Machine Learning - Code Herokucodeheroku
 
Introduction to Data Visualization Part 2 - Code Heroku
Introduction to Data Visualization Part 2 - Code HerokuIntroduction to Data Visualization Part 2 - Code Heroku
Introduction to Data Visualization Part 2 - Code Herokucodeheroku
 
Introduction to Data Visualization - Code Heroku
Introduction to Data Visualization - Code HerokuIntroduction to Data Visualization - Code Heroku
Introduction to Data Visualization - Code Herokucodeheroku
 
Introduction to Computer Vision - Code Heroku
Introduction to Computer Vision - Code HerokuIntroduction to Computer Vision - Code Heroku
Introduction to Computer Vision - Code Herokucodeheroku
 
Introduction to JavaScript - Code Heroku
Introduction to JavaScript - Code HerokuIntroduction to JavaScript - Code Heroku
Introduction to JavaScript - Code Herokucodeheroku
 

More from codeheroku (10)

Reinforcement Learning with OpenAI Gym - Value Iteration Frozen Lake - Code H...
Reinforcement Learning with OpenAI Gym - Value Iteration Frozen Lake - Code H...Reinforcement Learning with OpenAI Gym - Value Iteration Frozen Lake - Code H...
Reinforcement Learning with OpenAI Gym - Value Iteration Frozen Lake - Code H...
 
Introduction to Reinforcement Learning - Code Heroku
Introduction to Reinforcement Learning - Code HerokuIntroduction to Reinforcement Learning - Code Heroku
Introduction to Reinforcement Learning - Code Heroku
 
Introduction to Unsupervised Learning - Code Heroku
Introduction to Unsupervised Learning - Code HerokuIntroduction to Unsupervised Learning - Code Heroku
Introduction to Unsupervised Learning - Code Heroku
 
Building Web Apps with Python Part 2 - Code Heroku
Building Web Apps with Python Part 2 - Code HerokuBuilding Web Apps with Python Part 2 - Code Heroku
Building Web Apps with Python Part 2 - Code Heroku
 
Building Web Apps with Python - Code Heroku
Building Web Apps with Python - Code HerokuBuilding Web Apps with Python - Code Heroku
Building Web Apps with Python - Code Heroku
 
Introduction to Machine Learning - Code Heroku
Introduction to Machine Learning - Code HerokuIntroduction to Machine Learning - Code Heroku
Introduction to Machine Learning - Code Heroku
 
Introduction to Data Visualization Part 2 - Code Heroku
Introduction to Data Visualization Part 2 - Code HerokuIntroduction to Data Visualization Part 2 - Code Heroku
Introduction to Data Visualization Part 2 - Code Heroku
 
Introduction to Data Visualization - Code Heroku
Introduction to Data Visualization - Code HerokuIntroduction to Data Visualization - Code Heroku
Introduction to Data Visualization - Code Heroku
 
Introduction to Computer Vision - Code Heroku
Introduction to Computer Vision - Code HerokuIntroduction to Computer Vision - Code Heroku
Introduction to Computer Vision - Code Heroku
 
Introduction to JavaScript - Code Heroku
Introduction to JavaScript - Code HerokuIntroduction to JavaScript - Code Heroku
Introduction to JavaScript - Code Heroku
 

Recently uploaded

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 

Recently uploaded (20)

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 

Building a movie recommendation engine in Python using Scikit-Learn - Code Heroku

  • 1. Please turn off your webcam If you are joining from a mobile phone be sure to click on Join via Device Audio We are waiting for other participants to join We will begin at 4:30 PM IST
  • 2. Mihir Thakkar Founder and Instructor hello@codeheroku.com Building a Movie Recommendation Engine
  • 3. SESSION OBJECTIVES ● Quick Recap ● Recommendation Systems ● Revise Some Math ● Build it!
  • 4. www.codeheroku.com Building a Movie Recommendation Engine Machine Learning Pipeline Data Extraction and Cleaning Build ML Model Build Software Infrastructure
  • 5. www.codeheroku.com Building a Movie Recommendation Engine Supervised Machine Learning House Size (Sq feet) Location Age (years) Prize (Lakh Rs) 500 Mumbai 2 70 1500 Pune 3 100 2000 Banglore 4 60 1000 Mumbai 2 ? 3000 Pune 10 ? Training Data Test Data Features
  • 6. www.codeheroku.com Building a Movie Recommendation Engine 2. Netflix1. Amazon What’s Common?
  • 7. QUIZ Who are the Users and Items for RE in the following platforms? 1. LinkedIn 2. Amazon 3. Netflix 4. Facebook Users: Members; Items: Members Users: Members; Items: Products (E.g. Books, Electronics) Users: Members; Items: Movie Users: Members; Items: Members
  • 8. www.codeheroku.com Building a Movie Recommendation Engine Implementing A Recommender System 1. Popularity / Rating Based System
  • 9. www.codeheroku.com Building a Movie Recommendation Engine Implementing A Recommender System 3. Collaborative Filtering2. Content Based
  • 10. www.codeheroku.com Building a Movie Recommendation Engine Similarity Between Content Text A: London Paris London Text B: Paris Paris London
  • 11. www.codeheroku.com Building a Movie Recommendation Engine Paris London0 1 2 3 4 5 4 3 2 1 Text A: London Paris London (London: 2, Paris: 1) Text B: Paris Paris London (London: 1, Paris: 2) Distance Between Two Vectors (1,2) (2,1) D
  • 12. www.codeheroku.com Building a Movie Recommendation Engine When To Use Angular Distance?
  • 13. www.codeheroku.com Building a Movie Recommendation Engine Quiz In which of the following scenarios you are most likely to use Cosine Similarity measure? 1. Determining gender based on shoe length, height, weight etc. 2. Comparing similarities between documents of uneven size 3. Predicting rainfall based on city location, temperature, humidity etc.
  • 14. www.codeheroku.com Building a Movie Recommendation Engine Quiz Given the similarity matrix below, which movie is most similar to Movie 0 ?
  • 15. www.codeheroku.com Building a Movie Recommendation Engine Let’s Build It http://codeheroku.com/static/workshop/datasets/movie_recommender.zip http://www.codeheroku.com/static/workshop/hw/movie_recommendation/assignment.pdf
  • 16. www.codeheroku.com Building a Movie Recommendation Engine
  • 17. www.codeheroku.com Building a Movie Recommendation Engine Thank you!
  • 18. www.codeheroku.com Building a Movie Recommendation Engine Alternative Links: DataSet: https://drive.google.com/file/d/1sJ9N2T2zDQwvywHCC6RCO68olL97Mp4O/view?usp=sharing Assignment: https://drive.google.com/file/d/1EKjBr0id9_HtzGJzrNs8yGZ5MWWExx-b/view?usp=sharing
  • 19. www.codeheroku.com Building a Movie Recommendation Engine Recommendation Systems Users Items