How Ranker Turned Pop Culture
Lists Into Personalized TV
Recommendations
DataCon LA 2020
OVERVIEW
Introduction
Dr. Vincent Seah
VP, Data Science
Ranker since July 2019
Ph.D. Mechanical Engineering
UCLA
Fullscreen Media (acq. by AT&T)
KPMG US
Inkiru (acq. by Walmart Labs)
Hoodiny Entertainment Group (acq. by PRISA)
linkedin.com/in/drnotsoevil
Ker-Yu Ong
Senior Data Scientist
Ranker since April 2020
M.Sc. Data Science
University of San Francisco
Deloitte San Francisco
Deloitte Singapore
linkedin.com/in/keryu-ong
OVERVIEW
● CEO, Clark Benson
● Media publisher turning engagement into IP
● Over 100 employees
● Headquartered in Los Angeles, with an office in NYC
● 40M monthly unique visitors worldwide
● More than 1B votes cast over last 10 years
● Fan-powered votable content with 10,000 lists covering everything from TV and movies to sports, food, and lifestyle
● Products built in-house:
○ Ranker Insights
○ Watchworthy App
○ Data Science Apps
WATCHWORTHY
Cross-Platform, Personalized Show Recommendations Based on 1B Data Points
ONBOARDING IN-APP
● Mobile app with an unparalleled ability to give users targeted, personalized TV recommendations
● Using pure, first-party voting data from the Ranker website
● Available on Android and iOS
DS APP: RecAlgo
CHALLENGE
List Selection
CHALLENGE
Voters and Biased Voting
CHALLENGE
Voter Sentiment versus Metadata
Actor-Based Recs
Voter sentiment casts a wider net of recs across genres and decades
Genre-Based Recs
"Rom-Com"
CHALLENGE
“If I like this older TV show, what new TV show should I watch?”
Breaking Bad (2008) Chernobyl (2019)
RELATED WORK
In Good Company
● A Fairness-aware Hybrid Recommender System
○ G. Farnadi, P. Kouki, S.K. Thompson, S. Srinivasan, L. Getoor [2018]
○ “...A fair recommender system should provide rankings to the protected group that are the same as the unprotected group…”
● Group Recommender Systems: A Virtual User Approach Based on Precedence Mining
○ V.R. Kagita, A.K. Pujari, V. Padmanabhan [2015]
○ “...introducing a virtual user that can more effectively represent a group...”
● Personalized Real-Time Movie Recommendation System: Practical Prototype and Evaluation
○ J. Zhang, Y. Wang, Z. Yuan, Q. Jin [2019]
○ “...virtual opinion leader is conceived to represent the whole cluster…”
● Innovations in Graph Representation Learning
○ A. Epasto and B. Perozzi [2019]
○ “...we developed Splitter, an unsupervised embedding method that allows the nodes in a graph to have multiple embeddings to better encode their participation in multiple communities…”
APPROACH
Recap
Recap: How do we recommend new shows when:
● A user’s input taste profile is dominated by older shows
● The rec algo training data is dominated by older shows
Reframed this as a class imbalance problem
Solvable via classification techniques
● Minority-class Upsampling (SMOTE)
● Majority-class Downsampling
● Data Augmentation
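For readers less familiar with the resampling techniques listed above, here is a minimal sketch in plain Python. It shows random oversampling and random downsampling on a toy labelled dataset. It is not SMOTE itself (which synthesises new minority points by interpolating between neighbours), and it is not Ranker's pipeline. The function names and the "old"/"new" labels are assumptions made for illustration.

```python
import random
from collections import Counter

def _group_by_class(rows, label_of):
    """Group rows into {label: [rows]} preserving input order."""
    by_class = {}
    for row in rows:
        by_class.setdefault(label_of(row), []).append(row)
    return by_class

def upsample_minority(rows, label_of, seed=0):
    """Randomly duplicate rows until every class matches the largest class."""
    rng = random.Random(seed)
    by_class = _group_by_class(rows, label_of)
    target = max(len(members) for members in by_class.values())
    balanced = []
    for members in by_class.values():
        balanced.extend(members)
        # Draw the missing rows with replacement from the same class.
        balanced.extend(rng.choices(members, k=target - len(members)))
    return balanced

def downsample_majority(rows, label_of, seed=0):
    """Randomly drop rows until every class matches the smallest class."""
    rng = random.Random(seed)
    by_class = _group_by_class(rows, label_of)
    target = min(len(members) for members in by_class.values())
    balanced = []
    for members in by_class.values():
        balanced.extend(rng.sample(members, target))
    return balanced

# Hypothetical toy data: 8 votes on older shows, 2 votes on new shows.
votes = [("old", i) for i in range(8)] + [("new", i) for i in range(2)]
label = lambda row: row[0]

up = upsample_minority(votes, label)
down = downsample_majority(votes, label)
print(Counter(label(r) for r in up))
print(Counter(label(r) for r in down))
```

Oversampling keeps all the original data but repeats minority rows, while downsampling discards majority rows; which trade-off is acceptable depends on how much data there is to spare.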
APPROACH
What is class balance and imbalance in the context of a rec algo?
● Classification: Balance/imbalance of the classes themselves
APPROACH
What is class balance and imbalance in the context of a rec algo?
● Recommendation: Balance/imbalance of the class relationships
EXPERIMENTS
Things We Tried
● Upsampling votes from bridge voters
● Downsampling votes from non-bridge voters
● Applying different thresholds for
○ Vote count
○ Vote type
○ Vote spread
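The experiments listed on this slide can be illustrated with a short sketch. The slides do not define "bridge voter" formally, so the code assumes it means a voter with at least one vote on a new show and at least one on an older show. The 2019 cutoff, the `min_votes` threshold, the function names, and the voters other than Keryu are all hypothetical.

```python
from collections import defaultdict

NEW_CUTOFF = 2019  # assumption: shows released in or after this year count as "new"

def find_bridge_voters(votes, new_cutoff=NEW_CUTOFF, min_votes=2):
    """Return users who voted on at least one new and one older show."""
    years_by_user = defaultdict(list)
    for user, year in votes:
        years_by_user[user].append(year)
    return {
        user
        for user, years in years_by_user.items()
        if len(years) >= min_votes                  # vote-count threshold
        and any(y >= new_cutoff for y in years)
        and any(y < new_cutoff for y in years)
    }

def resample(votes, bridge, up_factor=2, keep_every=2):
    """Copy each bridge vote up_factor times; keep every keep_every-th non-bridge vote."""
    out, seen = [], 0
    for user, year in votes:
        if user in bridge:
            out.append((user, year))
            # Extra copies are attributed to synthetic users such as "Keryu_2".
            for copy in range(2, up_factor + 1):
                out.append((f"{user}_{copy}", year))
        else:
            if seen % keep_every == 0:
                out.append((user, year))
            seen += 1
    return out

votes = [
    ("Keryu", 2019), ("Keryu", 2010), ("Keryu", 2005),
    ("Ana", 2004), ("Ana", 1999),   # hypothetical non-bridge voter
    ("Ben", 2020),                  # hypothetical non-bridge voter
]
bridge = find_bridge_voters(votes)
resampled = resample(votes, bridge)
print(bridge)
print(len(votes), "->", len(resampled))
```

The deterministic "keep every n-th vote" rule is only there to keep the example reproducible; a random sample would serve the same purpose.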
Challenge: Because we were preserving each user’s voting pattern, upsampling did not change the distribution of bridge voters’ votes
EXPERIMENTS
Challenge, Illustrated
Original
User Show Year
Keryu 2019
Keryu 2010
Keryu 2005
Upsampled
User Show Year
Keryu 2019
Keryu 2010
Keryu 2005
Keryu_2 2019
Keryu_2 2010
Keryu_2 2005
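The arithmetic behind this challenge is easy to check. The sketch below uses the three votes from the slide and treats 2019 as the "new" cutoff (an assumption). Copying the whole voting pattern to a synthetic user doubles the rows but leaves the share of new-show votes unchanged.

```python
from fractions import Fraction

def new_share(votes, new_cutoff=2019):
    """Fraction of votes cast on shows released in or after new_cutoff."""
    new = sum(1 for _, year in votes if year >= new_cutoff)
    return Fraction(new, len(votes))

original = [("Keryu", 2019), ("Keryu", 2010), ("Keryu", 2005)]
# Whole-pattern upsampling: every vote is copied to a synthetic user.
upsampled = original + [("Keryu_2", year) for _, year in original]

print(new_share(original))   # 1/3
print(new_share(upsampled))  # 1/3
```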
EXPERIMENTS
Foray into “Splitting”: Upsampling Bridge Votes
Original
User Show Year
Keryu 2019
Keryu 2010
Keryu 2005
Upsampled
User Show Year
Keryu 2019
Keryu 2010
Keryu 2005
Keryu_21 2019
Keryu_21 2010
Keryu_22 2019
Keryu_22 2005
● What about individual models for each bridge vote?
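A minimal sketch of the splitting idea on this slide, assuming 2019 as the "new" cutoff: each older vote is paired with the voter's new votes under its own synthetic user. The `_21`, `_22` suffixes follow the slide's labels; the function names are hypothetical. Unlike whole-pattern duplication, this does shift the vote distribution toward new shows.

```python
from fractions import Fraction

def new_share(votes, new_cutoff=2019):
    """Fraction of votes cast on shows released in or after new_cutoff."""
    new = sum(1 for _, year in votes if year >= new_cutoff)
    return Fraction(new, len(votes))

def split_bridge_voter(user, years, new_cutoff=2019):
    """Create one synthetic user per older vote, each also carrying the new votes."""
    new = [y for y in years if y >= new_cutoff]
    old = [y for y in years if y < new_cutoff]
    rows = []
    for i, old_year in enumerate(old, start=1):
        virtual = f"{user}_2{i}"          # Keryu_21, Keryu_22, ...
        rows.extend((virtual, y) for y in new)
        rows.append((virtual, old_year))
    return rows

original = [("Keryu", 2019), ("Keryu", 2010), ("Keryu", 2005)]
split_rows = split_bridge_voter("Keryu", [year for _, year in original])
upsampled = original + split_rows

print(split_rows)
print(new_share(original), "->", new_share(upsampled))  # 1/3 -> 3/7
```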
SPLIT SAMPLING
Foray into “Splitting” - Multiple Models
Original
User Show Year
Keryu 2019
Keryu 2010
Keryu 2005
m_0
User Show Year
Keryu 2019
Keryu 2010
Keryu 2005
m_1
User Show Year
Keryu 2019
Keryu 2010
m_2
User Show Year
Keryu 2019
Keryu 2005
SPLIT SAMPLING
Methodology
1. Bin shows into release-year decades
2. Split bridge voters’ votes by bridge decade:
   a. 1990s to new
   b. 2000s to new
   c. 2010s to new, etc.
3. Build an overall model and individual decade-specific models
4. Ensemble to get the maximum number of new shows per recommendation stream
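Steps 1 and 2 above can be sketched as follows. This is an illustrative reading of the methodology, not the production code. It assumes 2019 as the "new" cutoff and builds one training bucket per decade, each holding a bridge voter's older votes from that decade plus the same voter's new votes. All names and the voters other than Keryu are hypothetical.

```python
from collections import defaultdict

NEW_CUTOFF = 2019  # assumption: shows released in or after this year count as "new"

def decade(year):
    """Bin a release year into its decade, e.g. 1995 -> 1990."""
    return (year // 10) * 10

def split_by_bridge_decade(votes, new_cutoff=NEW_CUTOFF):
    """Return {decade: rows} with older votes from that decade plus new votes."""
    years_by_user = defaultdict(list)
    for user, year in votes:
        years_by_user[user].append(year)

    buckets = defaultdict(list)
    for user, years in years_by_user.items():
        new = [y for y in years if y >= new_cutoff]
        old = [y for y in years if y < new_cutoff]
        if not new or not old:
            continue  # not a bridge voter: nothing to split
        for d in sorted({decade(y) for y in old}):
            buckets[d].extend((user, y) for y in old if decade(y) == d)
            buckets[d].extend((user, y) for y in new)
    return dict(buckets)

votes = [
    ("Keryu", 2019), ("Keryu", 2010), ("Keryu", 2005),
    ("Ana", 1995), ("Ana", 2020),   # hypothetical bridge voter
    ("Ben", 2001),                  # hypothetical non-bridge voter
]
buckets = split_by_bridge_decade(votes)
for d in sorted(buckets):
    print(f"{d}s to new:", buckets[d])
```

Under this reading, each bucket would train one decade-specific model, while the overall model (step 3) would be trained on the full, unsplit vote set.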
SPLIT SAMPLING
Scoring Pipeline
Split, Score, Merge
[Figure: an input list with shows from 1990, 2010, and 2019 is split into three streams (1990/2010/2019, 2010/2019, and 1990/2019), scored by the overall model, the 1990s model, and the 2000s model, and the scored lists are merged. The scored lists from the figure follow in slide order.]

show, year, worthy
Q, 1995, 95
T, 1998, 89
X, 2019, 87
D, 2001, 75
..

show, year, worthy
M, 2009, 90
P, 2000, 89
C, 2019, 85
S, 2020, 78
..

show, year, worthy
A, 1995, 90
B, 2000, 89
E, 2011, 84
F, 2013, 81
C, 2019, 79
D, 2019, 78
G, 2012, 77
X, 2019, 77
J, 2003, 60
S, 2020, 57
..

show, year, worthy
A, 1995, 90
B, 2000, 89
X, 2019, 87
E, 2011, 84
C, 2019, 85
F, 2013, 81
C, 2019, 79
D, 2019, 78
S, 2020, 78
G, 2012, 77
X, 2019, 77
J, 2003, 60
S, 2020, 57
..

show, year, worthy
A, 1995, 90
B, 2000, 89
X, 2019, 87
C, 2019, 85
E, 2011, 84
F, 2013, 81
D, 2019, 79
S, 2020, 78
G, 2012, 77
J, 2003, 60
..
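One plausible reading of the merge step in the figure above is sketched here. The rule is an assumption inferred from the sample lists, not a confirmed description of the pipeline: start from the overall model's list, take only the new shows (2019 or later) from each decade-specific list, keep the highest "worthy" score per show, and sort. The sketch does not try to reproduce every number in the figure.

```python
NEW_CUTOFF = 2019  # assumption: shows released in or after this year count as "new"

def merge_streams(overall, decade_streams, new_cutoff=NEW_CUTOFF):
    """Merge scored lists of (show, year, worthy), keeping the best score per show."""
    best = {show: (year, score) for show, year, score in overall}
    for stream in decade_streams:
        for show, year, score in stream:
            if year < new_cutoff:
                continue  # decade models only contribute new shows
            if show not in best or score > best[show][1]:
                best[show] = (year, score)
    merged = [(show, year, score) for show, (year, score) in best.items()]
    merged.sort(key=lambda row: -row[2])  # highest score first; stable on ties
    return merged

# Sample rows taken from the figure's scored lists.
overall = [
    ("A", 1995, 90), ("B", 2000, 89), ("E", 2011, 84), ("F", 2013, 81),
    ("C", 2019, 79), ("D", 2019, 78), ("G", 2012, 77), ("X", 2019, 77),
    ("J", 2003, 60), ("S", 2020, 57),
]
stream_a = [("Q", 1995, 95), ("T", 1998, 89), ("X", 2019, 87), ("D", 2001, 75)]
stream_b = [("M", 2009, 90), ("P", 2000, 89), ("C", 2019, 85), ("S", 2020, 78)]

merged = merge_streams(overall, [stream_a, stream_b])
for row in merged:
    print(row)
```

With this rule the new shows X, C, and S move up the merged list because the decade-specific lists score them higher than the overall list does.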
EXAMPLE
Grey's Anatomy
Law & Order: Special Victims Unit
Stranger Things
The Big Bang Theory
The Closer
The Crown
This Is Us
black-ish
Bob's Burgers
Breaking Bad
Family Guy
Fresh Off the Boat
Rick and Morty
Riverdale
The Vampire Diaries
[Figure panels: original; with split sampling]
WHAT’S NEXT
01 Extending to cross-category, niche genres
02 Group watching and recommendations
THANK YOU

More Related Content

Similar to How Ranker Turned Pop Culture Lists Into Personalized TV Recommendations

Steve Smith - Opening Remarks
Steve Smith - Opening RemarksSteve Smith - Opening Remarks
Steve Smith - Opening RemarksMediaPost
 
CV_ Xuan Zhu
CV_ Xuan ZhuCV_ Xuan Zhu
CV_ Xuan ZhuXuan Zhu
 
A Multi Generational, Multi Dimensional Approach To Training Using Web 2 0 Te...
A Multi Generational, Multi Dimensional Approach To Training Using Web 2 0 Te...A Multi Generational, Multi Dimensional Approach To Training Using Web 2 0 Te...
A Multi Generational, Multi Dimensional Approach To Training Using Web 2 0 Te...Kella Price
 
SearchLeeds 2018 - Jon Greenhalgh & Sam Wright - Edit & Branded3 - How to del...
SearchLeeds 2018 - Jon Greenhalgh & Sam Wright - Edit & Branded3 - How to del...SearchLeeds 2018 - Jon Greenhalgh & Sam Wright - Edit & Branded3 - How to del...
SearchLeeds 2018 - Jon Greenhalgh & Sam Wright - Edit & Branded3 - How to del...Branded3
 
Making Data Meaningful
Making Data MeaningfulMaking Data Meaningful
Making Data MeaningfulAmanda Makulec
 
YOUTUBE CONTENT CREATION AND ONLINE BUSINESS PROMOTION-UGC CARE LIST AUG 2022...
YOUTUBE CONTENT CREATION AND ONLINE BUSINESS PROMOTION-UGC CARE LIST AUG 2022...YOUTUBE CONTENT CREATION AND ONLINE BUSINESS PROMOTION-UGC CARE LIST AUG 2022...
YOUTUBE CONTENT CREATION AND ONLINE BUSINESS PROMOTION-UGC CARE LIST AUG 2022...PugalendhiR
 
Asia's Next Digital Decade
Asia's Next Digital DecadeAsia's Next Digital Decade
Asia's Next Digital DecadeAndrew Pickup
 
5G Wireless Technology Campaign by Slidesgo.pptx
5G Wireless Technology Campaign by Slidesgo.pptx5G Wireless Technology Campaign by Slidesgo.pptx
5G Wireless Technology Campaign by Slidesgo.pptxscorpionblade04
 
PAN Communications
PAN CommunicationsPAN Communications
PAN Communicationspancomm
 
Data Summer Conf 2018, “Multi-touch Attribution: Key challenge around designi...
Data Summer Conf 2018, “Multi-touch Attribution: Key challenge around designi...Data Summer Conf 2018, “Multi-touch Attribution: Key challenge around designi...
Data Summer Conf 2018, “Multi-touch Attribution: Key challenge around designi...Provectus
 
#ATAGTR2019 Presentation "Re-imagining Quality Assurance in the age of Digiti...
#ATAGTR2019 Presentation "Re-imagining Quality Assurance in the age of Digiti...#ATAGTR2019 Presentation "Re-imagining Quality Assurance in the age of Digiti...
#ATAGTR2019 Presentation "Re-imagining Quality Assurance in the age of Digiti...Agile Testing Alliance
 
Social media strategy presentation-Cher
Social media strategy presentation-CherSocial media strategy presentation-Cher
Social media strategy presentation-CherCher Wang
 
1215 daa industry lunch yorchak sterne young
1215 daa industry lunch yorchak sterne young1215 daa industry lunch yorchak sterne young
1215 daa industry lunch yorchak sterne youngRising Media, Inc.
 
Yankee Group Q&A with Emily Green, CEO
Yankee Group Q&A with Emily Green, CEOYankee Group Q&A with Emily Green, CEO
Yankee Group Q&A with Emily Green, CEOYankee Group
 
Automation wars. Made by Kateryna Vasylenko and Oleksandr Yatsenko
Automation wars. Made by Kateryna Vasylenko and Oleksandr YatsenkoAutomation wars. Made by Kateryna Vasylenko and Oleksandr Yatsenko
Automation wars. Made by Kateryna Vasylenko and Oleksandr YatsenkoKate Vasylenko
 
Customised air conditioning_mitp
Customised air conditioning_mitpCustomised air conditioning_mitp
Customised air conditioning_mitpJamesVersluis
 
Integrated Content Strategy Portfolio (Example)
Integrated Content Strategy Portfolio (Example)Integrated Content Strategy Portfolio (Example)
Integrated Content Strategy Portfolio (Example)David R. Iannone Jr.
 

Similar to How Ranker Turned Pop Culture Lists Into Personalized TV Recommendations (20)

Steve Smith - Opening Remarks
Steve Smith - Opening RemarksSteve Smith - Opening Remarks
Steve Smith - Opening Remarks
 
CV_ Xuan Zhu
CV_ Xuan ZhuCV_ Xuan Zhu
CV_ Xuan Zhu
 
A Multi Generational, Multi Dimensional Approach To Training Using Web 2 0 Te...
A Multi Generational, Multi Dimensional Approach To Training Using Web 2 0 Te...A Multi Generational, Multi Dimensional Approach To Training Using Web 2 0 Te...
A Multi Generational, Multi Dimensional Approach To Training Using Web 2 0 Te...
 
SearchLeeds 2018 - Jon Greenhalgh & Sam Wright - Edit & Branded3 - How to del...
SearchLeeds 2018 - Jon Greenhalgh & Sam Wright - Edit & Branded3 - How to del...SearchLeeds 2018 - Jon Greenhalgh & Sam Wright - Edit & Branded3 - How to del...
SearchLeeds 2018 - Jon Greenhalgh & Sam Wright - Edit & Branded3 - How to del...
 
Making Data Meaningful
Making Data MeaningfulMaking Data Meaningful
Making Data Meaningful
 
Media-Economics-Final-Presentation-Team-4
Media-Economics-Final-Presentation-Team-4Media-Economics-Final-Presentation-Team-4
Media-Economics-Final-Presentation-Team-4
 
YOUTUBE CONTENT CREATION AND ONLINE BUSINESS PROMOTION-UGC CARE LIST AUG 2022...
YOUTUBE CONTENT CREATION AND ONLINE BUSINESS PROMOTION-UGC CARE LIST AUG 2022...YOUTUBE CONTENT CREATION AND ONLINE BUSINESS PROMOTION-UGC CARE LIST AUG 2022...
YOUTUBE CONTENT CREATION AND ONLINE BUSINESS PROMOTION-UGC CARE LIST AUG 2022...
 
Asia's Next Digital Decade
Asia's Next Digital DecadeAsia's Next Digital Decade
Asia's Next Digital Decade
 
5G Wireless Technology Campaign by Slidesgo.pptx
5G Wireless Technology Campaign by Slidesgo.pptx5G Wireless Technology Campaign by Slidesgo.pptx
5G Wireless Technology Campaign by Slidesgo.pptx
 
PAN Communications
PAN CommunicationsPAN Communications
PAN Communications
 
Data Summer Conf 2018, “Multi-touch Attribution: Key challenge around designi...
Data Summer Conf 2018, “Multi-touch Attribution: Key challenge around designi...Data Summer Conf 2018, “Multi-touch Attribution: Key challenge around designi...
Data Summer Conf 2018, “Multi-touch Attribution: Key challenge around designi...
 
#ATAGTR2019 Presentation "Re-imagining Quality Assurance in the age of Digiti...
#ATAGTR2019 Presentation "Re-imagining Quality Assurance in the age of Digiti...#ATAGTR2019 Presentation "Re-imagining Quality Assurance in the age of Digiti...
#ATAGTR2019 Presentation "Re-imagining Quality Assurance in the age of Digiti...
 
Social media strategy presentation-Cher
Social media strategy presentation-CherSocial media strategy presentation-Cher
Social media strategy presentation-Cher
 
Leading Regional Transformation Change Through Innovation - Michael Thissen
Leading Regional Transformation Change Through Innovation - Michael ThissenLeading Regional Transformation Change Through Innovation - Michael Thissen
Leading Regional Transformation Change Through Innovation - Michael Thissen
 
Digital Marketing Demystified
Digital Marketing DemystifiedDigital Marketing Demystified
Digital Marketing Demystified
 
1215 daa industry lunch yorchak sterne young
1215 daa industry lunch yorchak sterne young1215 daa industry lunch yorchak sterne young
1215 daa industry lunch yorchak sterne young
 
Yankee Group Q&A with Emily Green, CEO
Yankee Group Q&A with Emily Green, CEOYankee Group Q&A with Emily Green, CEO
Yankee Group Q&A with Emily Green, CEO
 
Automation wars. Made by Kateryna Vasylenko and Oleksandr Yatsenko
Automation wars. Made by Kateryna Vasylenko and Oleksandr YatsenkoAutomation wars. Made by Kateryna Vasylenko and Oleksandr Yatsenko
Automation wars. Made by Kateryna Vasylenko and Oleksandr Yatsenko
 
Customised air conditioning_mitp
Customised air conditioning_mitpCustomised air conditioning_mitp
Customised air conditioning_mitp
 
Integrated Content Strategy Portfolio (Example)
Integrated Content Strategy Portfolio (Example)Integrated Content Strategy Portfolio (Example)
Integrated Content Strategy Portfolio (Example)
 

More from Data Con LA

Data Con LA 2022 Keynotes
Data Con LA 2022 KeynotesData Con LA 2022 Keynotes
Data Con LA 2022 KeynotesData Con LA
 
Data Con LA 2022 Keynotes
Data Con LA 2022 KeynotesData Con LA 2022 Keynotes
Data Con LA 2022 KeynotesData Con LA
 
Data Con LA 2022 Keynote
Data Con LA 2022 KeynoteData Con LA 2022 Keynote
Data Con LA 2022 KeynoteData Con LA
 
Data Con LA 2022 - Startup Showcase
Data Con LA 2022 - Startup ShowcaseData Con LA 2022 - Startup Showcase
Data Con LA 2022 - Startup ShowcaseData Con LA
 
Data Con LA 2022 Keynote
Data Con LA 2022 KeynoteData Con LA 2022 Keynote
Data Con LA 2022 KeynoteData Con LA
 
Data Con LA 2022 - Using Google trends data to build product recommendations
Data Con LA 2022 - Using Google trends data to build product recommendationsData Con LA 2022 - Using Google trends data to build product recommendations
Data Con LA 2022 - Using Google trends data to build product recommendationsData Con LA
 
Data Con LA 2022 - AI Ethics
Data Con LA 2022 - AI EthicsData Con LA 2022 - AI Ethics
Data Con LA 2022 - AI EthicsData Con LA
 
Data Con LA 2022 - Improving disaster response with machine learning
Data Con LA 2022 - Improving disaster response with machine learningData Con LA 2022 - Improving disaster response with machine learning
Data Con LA 2022 - Improving disaster response with machine learningData Con LA
 
Data Con LA 2022 - What's new with MongoDB 6.0 and Atlas
Data Con LA 2022 - What's new with MongoDB 6.0 and AtlasData Con LA 2022 - What's new with MongoDB 6.0 and Atlas
Data Con LA 2022 - What's new with MongoDB 6.0 and AtlasData Con LA
 
Data Con LA 2022 - Real world consumer segmentation
Data Con LA 2022 - Real world consumer segmentationData Con LA 2022 - Real world consumer segmentation
Data Con LA 2022 - Real world consumer segmentationData Con LA
 
Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo...
Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo...Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo...
Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo...Data Con LA
 
Data Con LA 2022 - Moving Data at Scale to AWS
Data Con LA 2022 - Moving Data at Scale to AWSData Con LA 2022 - Moving Data at Scale to AWS
Data Con LA 2022 - Moving Data at Scale to AWSData Con LA
 
Data Con LA 2022 - Collaborative Data Exploration using Conversational AI
Data Con LA 2022 - Collaborative Data Exploration using Conversational AIData Con LA 2022 - Collaborative Data Exploration using Conversational AI
Data Con LA 2022 - Collaborative Data Exploration using Conversational AIData Con LA
 
Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...
Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...
Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...Data Con LA
 
Data Con LA 2022 - Intro to Data Science
Data Con LA 2022 - Intro to Data ScienceData Con LA 2022 - Intro to Data Science
Data Con LA 2022 - Intro to Data ScienceData Con LA
 
Data Con LA 2022 - How are NFTs and DeFi Changing Entertainment
Data Con LA 2022 - How are NFTs and DeFi Changing EntertainmentData Con LA 2022 - How are NFTs and DeFi Changing Entertainment
Data Con LA 2022 - How are NFTs and DeFi Changing EntertainmentData Con LA
 
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...Data Con LA
 
Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...
Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...
Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...Data Con LA
 
Data Con LA 2022- Embedding medical journeys with machine learning to improve...
Data Con LA 2022- Embedding medical journeys with machine learning to improve...Data Con LA 2022- Embedding medical journeys with machine learning to improve...
Data Con LA 2022- Embedding medical journeys with machine learning to improve...Data Con LA
 
Data Con LA 2022 - Data Streaming with Kafka
Data Con LA 2022 - Data Streaming with KafkaData Con LA 2022 - Data Streaming with Kafka
Data Con LA 2022 - Data Streaming with KafkaData Con LA
 

More from Data Con LA (20)

Data Con LA 2022 Keynotes
Data Con LA 2022 KeynotesData Con LA 2022 Keynotes
Data Con LA 2022 Keynotes
 
Data Con LA 2022 Keynotes
Data Con LA 2022 KeynotesData Con LA 2022 Keynotes
Data Con LA 2022 Keynotes
 
Data Con LA 2022 Keynote
Data Con LA 2022 KeynoteData Con LA 2022 Keynote
Data Con LA 2022 Keynote
 
Data Con LA 2022 - Startup Showcase
Data Con LA 2022 - Startup ShowcaseData Con LA 2022 - Startup Showcase
Data Con LA 2022 - Startup Showcase
 
Data Con LA 2022 Keynote
Data Con LA 2022 KeynoteData Con LA 2022 Keynote
Data Con LA 2022 Keynote
 
Data Con LA 2022 - Using Google trends data to build product recommendations
Data Con LA 2022 - Using Google trends data to build product recommendationsData Con LA 2022 - Using Google trends data to build product recommendations
Data Con LA 2022 - Using Google trends data to build product recommendations
 
Data Con LA 2022 - AI Ethics
Data Con LA 2022 - AI EthicsData Con LA 2022 - AI Ethics
Data Con LA 2022 - AI Ethics
 
Data Con LA 2022 - Improving disaster response with machine learning
Data Con LA 2022 - Improving disaster response with machine learningData Con LA 2022 - Improving disaster response with machine learning
Data Con LA 2022 - Improving disaster response with machine learning
 
Data Con LA 2022 - What's new with MongoDB 6.0 and Atlas
Data Con LA 2022 - What's new with MongoDB 6.0 and AtlasData Con LA 2022 - What's new with MongoDB 6.0 and Atlas
Data Con LA 2022 - What's new with MongoDB 6.0 and Atlas
 
Data Con LA 2022 - Real world consumer segmentation
Data Con LA 2022 - Real world consumer segmentationData Con LA 2022 - Real world consumer segmentation
Data Con LA 2022 - Real world consumer segmentation
 
Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo...
Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo...Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo...
Data Con LA 2022 - Modernizing Analytics & AI for today's needs: Intuit Turbo...
 
Data Con LA 2022 - Moving Data at Scale to AWS
Data Con LA 2022 - Moving Data at Scale to AWSData Con LA 2022 - Moving Data at Scale to AWS
Data Con LA 2022 - Moving Data at Scale to AWS
 
Data Con LA 2022 - Collaborative Data Exploration using Conversational AI
Data Con LA 2022 - Collaborative Data Exploration using Conversational AIData Con LA 2022 - Collaborative Data Exploration using Conversational AI
Data Con LA 2022 - Collaborative Data Exploration using Conversational AI
 
Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...
Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...
Data Con LA 2022 - Why Database Modernization Makes Your Data Decisions More ...
 
Data Con LA 2022 - Intro to Data Science
Data Con LA 2022 - Intro to Data ScienceData Con LA 2022 - Intro to Data Science
Data Con LA 2022 - Intro to Data Science
 
Data Con LA 2022 - How are NFTs and DeFi Changing Entertainment
Data Con LA 2022 - How are NFTs and DeFi Changing EntertainmentData Con LA 2022 - How are NFTs and DeFi Changing Entertainment
Data Con LA 2022 - How are NFTs and DeFi Changing Entertainment
 
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
Data Con LA 2022 - Why Data Quality vigilance requires an End-to-End, Automat...
 
Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...
Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...
Data Con LA 2022-Perfect Viral Ad prediction of Superbowl 2022 using Tease, T...
 
Data Con LA 2022- Embedding medical journeys with machine learning to improve...
Data Con LA 2022- Embedding medical journeys with machine learning to improve...Data Con LA 2022- Embedding medical journeys with machine learning to improve...
Data Con LA 2022- Embedding medical journeys with machine learning to improve...
 
Data Con LA 2022 - Data Streaming with Kafka
Data Con LA 2022 - Data Streaming with KafkaData Con LA 2022 - Data Streaming with Kafka
Data Con LA 2022 - Data Streaming with Kafka
 

Recently uploaded

modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxaleedritatuxx
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...Boston Institute of Analytics
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectBoston Institute of Analytics
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Seán Kennedy
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...GQ Research
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)jennyeacort
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Boston Institute of Analytics
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Boston Institute of Analytics
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDRafezzaman
 
Defining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryDefining Constituents, Data Vizzes and Telling a Data Story
Defining Constituents, Data Vizzes and Telling a Data StoryJeremy Anderson
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhThiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhYasamin16
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degreeyuu sss
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesTimothy Spann
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Cantervoginip
 

Recently uploaded (20)

modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis Project
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 

How Ranker Turned Pop Culture Lists Into Personalized TV Recommendations

  • 1. How Ranker Turned Pop Culture Lists Into Personalized TV Recommendations DataCon LA 2020
  • 2. OVERVIEW Introduction Dr. Vincent Seah VP, Data Science Ranker since July 2019 Ph.D. Mechanical Engineering UCLA Fullscreen Media (acq. by AT&T) KPMG US Inkiru (acq. by Walmart Labs) Hoodiny Entertainment Group (acq. by PRISA) linkedin.com/in/drnotsoevil Ker-Yu Ong Senior Data Scientist Ranker since April 2020 M.Sc. Data Science University of San Francisco Deloitte San Francisco Deloitte Singapore linkedin.com/in/keryu-ong
  • 3. OVERVIEW ● CEO, Clark Benson ● Media publisher turning engagement into IP ● Over 100 employees ● Headquartered in Los Angeles, with an office in NYC ● 40M monthly unique visitors worldwide ● More than 1B votes cast over last 10 years ● Fan-powered votable content with 10,000 lists covering everything from TV, Movies to Sports, Food and Lifestyle ● Products built in-house: ○ Ranker Insights ○ Watchworthy App ○ Data Science Apps
  • 4. WATCHWORTHY Cross-Platform, Personalized Show Recommendations Based on 1B Data Points ONBOARDING IN-APP ● Mobile app with an unparalleled ability to give users targeted, personalized TV recommendations ● Using pure, first party voting data from Ranker website ● Available on Android and iOS
  • 8. CHALLENGE Voters Sentiment versus Metadata Actor-Based Recs Voter sentiment casts a wider net of recs across genres and decades Genre-Based Recs "Rom-Com"
  • 9. CHALLENGE “If I like this older TV show, what new TV show should I watch?” Breaking Bad (2008) Chernobyl (2019)
  • 10. RELATED WORK In Good Company ● A Fairness-aware Hybrid Recommender System ○ G. Farnadi, P. Kouki, S.K. Thompson, S. Srinivasan, L. Getoor [2018] ○ “...A fair recommender system should provide rankings to the protected group that are the same as the unprotected group…” ● Group Recommender Systems: A Virtual User Approach Based on Precedence Mining ○ V.R. Kagita, A.K. Pujari, V. Padmanabhan [2015] ○ “...introducing a virtual user that can more effectively represent a group…” ● Personalized Real-Time Movie Recommendation System: Practical Prototype and Evaluation ○ J. Zhang, Y. Wang, Z. Yuan, Q. Jin [2019] ○ “...virtual opinion leader is conceived to represent the whole cluster…” ● Innovations in Graph Representation Learning ○ A. Epasto and B. Perozzi [2019] ○ “...we developed Splitter, an unsupervised embedding method that allows the nodes in a graph to have multiple embeddings to better encode their participation in multiple communities…”
  • 11. APPROACH Recap: How do we recommend new shows when: ● A user’s input taste profile is dominated by older shows ● The rec algo training data is dominated by older shows Reframed this as a class imbalance problem, solvable via classification techniques: ● Minority-class Upsampling (SMOTE) ● Majority-class Downsampling ● Data Augmentation
  • 12. APPROACH What is class balance and imbalance in the context of a rec algo? ● Classification: Balance/imbalance of the classes themselves
  • 13. APPROACH What is class balance and imbalance in the context of a rec algo? ● Recommendation: Balance/imbalance of the class relationships
  • 14. EXPERIMENTS Things We Tried ● Upsampling votes from bridge voters ● Downsampling votes from non-bridge voters ● Applying different thresholds for ○ Vote count ○ Vote type ○ Vote spread Challenge: Because we were preserving each user’s voting pattern, upsampling did not change the distribution of bridge voters’ votes
  • 15. EXPERIMENTS Challenge, Illustrated Original (User, Show Year): Keryu, 2019; Keryu, 2010; Keryu, 2005. Upsampled (User, Show Year): Keryu, 2019; Keryu, 2010; Keryu, 2005; Keryu_2, 2019; Keryu_2, 2010; Keryu_2, 2005.
  • 16. EXPERIMENTS Foray into “Splitting”: Upsampling Bridge Votes Original (User, Show Year): Keryu, 2019; Keryu, 2010; Keryu, 2005. Upsampled (User, Show Year): Keryu, 2019; Keryu, 2010; Keryu, 2005; Keryu_21, 2019; Keryu_21, 2010; Keryu_22, 2019; Keryu_22, 2005.
  • 17. SPLIT SAMPLING Foray into “Splitting” - Multiple Models ● What about individual models for each bridge vote? Original (User, Show Year): Keryu, 2019; Keryu, 2010; Keryu, 2005. m_0: Keryu, 2019; Keryu, 2010; Keryu, 2005. m_1: Keryu, 2019; Keryu, 2010. m_2: Keryu, 2019; Keryu, 2005.
  • 18. SPLIT SAMPLING Methodology 1. Bin shows into release year decades 2. Split bridge voters’ votes by bridge decade: a. 1990s to new b. 2000s to new c. 2010s to new etc. 3. Build an overall model and individual decade-specific models 4. Ensemble to get maximum number of new shows per recommendation stream
  • 19. SPLIT SAMPLING Scoring Pipeline: Split, Score, Merge. Input List is split by decade range (1990 -, 2010 -, 2019 -) and scored by the Overall model, 1990s model and 2000s model. Scored lists (show, year, worthy): [Q, 1995, 95; T, 1998, 89; X, 2019, 87; D, 2001, 75; ..] [M, 2009, 90; P, 2000, 89; C, 2019, 85; S, 2020, 78; ..] [A, 1995, 90; B, 2000, 89; E, 2011, 84; F, 2013, 81; C, 2019, 79; D, 2019, 78; G, 2012, 77; X, 2019, 77; J, 2003, 60; S, 2020, 57; ..] Combined (show, year, worthy): [A, 1995, 90; B, 2000, 89; X, 2019, 87; E, 2011, 84; C, 2019, 85; F, 2013, 81; C, 2019, 79; D, 2019, 78; S, 2020, 78; G, 2012, 77; X, 2019, 77; J, 2003, 60; S, 2020, 57; ..] Merged (show, year, worthy): [A, 1995, 90; B, 2000, 89; X, 2019, 87; C, 2019, 85; E, 2011, 84; F, 2013, 81; D, 2019, 79; S, 2020, 78; G, 2012, 77; J, 2003, 60; ..]
  • 20. EXAMPLE Grey's Anatomy Law & Order: Special Victims Unit Stranger Things The Big Bang Theory The Closer The Crown This Is Us black-ish Bob's Burgers Breaking Bad Family Guy Fresh Off the Boat Rick and Morty Riverdale The Vampire Diaries
  • 21. EXAMPLE Grey's Anatomy Law & Order: Special Victims Unit Stranger Things The Big Bang Theory The Closer The Crown This Is Us black-ish Bob's Burgers Breaking Bad Family Guy Fresh Off the Boat Rick and Morty Riverdale The Vampire Diaries
  • 22. EXAMPLE Grey's Anatomy Law & Order: Special Victims Unit Stranger Things The Big Bang Theory The Closer The Crown This Is Us black-ish Bob's Burgers Breaking Bad Family Guy Fresh Off the Boat Rick and Morty Riverdale The Vampire Diaries with split sampling original
  • 23. WHAT’S NEXT 01 Extending to cross category, niche genres 02 Group watching and recommendations
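
The split-sampling methodology (slide 18) and the Split, Score, Merge pipeline (slide 19) can be sketched roughly as below. This is a minimal illustration, not Ranker's implementation. The slides do not define several details, so the following are assumptions: a show counts as "new" if released in or after `NEW_SHOW_YEAR`; a bridge voter is a user with at least one vote on a new show and at least one on an older show; votes are `(user, show, year)` tuples; and the merge rule (keep all overall recs, add only new shows from the decade models, keep the highest score per show) is inferred from the slide 19 example. Model training (step 3) is omitted, and the function names are hypothetical.

```python
from collections import defaultdict

NEW_SHOW_YEAR = 2019  # assumption: release year from which a show counts as "new"


def decade(year):
    """Bin a release year into its decade, e.g. 1998 -> 1990."""
    return (year // 10) * 10


def split_bridge_votes(votes, new_year=NEW_SHOW_YEAR):
    """Build one training set per bridge decade (steps 1 and 2).

    votes: iterable of (user, show, year) tuples.
    Returns {decade: [votes]}, where each list holds, per bridge voter,
    their votes from that one older decade plus all their new-show votes.
    """
    by_user = defaultdict(list)
    for vote in votes:
        by_user[vote[0]].append(vote)

    splits = defaultdict(list)
    for user_votes in by_user.values():
        new_votes = [v for v in user_votes if v[2] >= new_year]
        old_votes = [v for v in user_votes if v[2] < new_year]
        if not new_votes or not old_votes:
            continue  # not a bridge voter under the assumed definition
        old_by_decade = defaultdict(list)
        for v in old_votes:
            old_by_decade[decade(v[2])].append(v)
        for dec, dec_votes in old_by_decade.items():
            splits[dec].extend(dec_votes + new_votes)
    return dict(splits)


def merge_streams(overall, decade_streams, new_year=NEW_SHOW_YEAR):
    """Ensemble scored lists into one recommendation stream (step 4).

    overall and each decade stream are lists of (show, year, score).
    Assumed rule: keep every overall rec, take only new shows from the
    decade-specific models, keep the highest score per show, sort descending.
    """
    candidates = list(overall)
    for stream in decade_streams:
        candidates.extend(rec for rec in stream if rec[1] >= new_year)

    best = {}
    for show, year, score in candidates:
        if show not in best or score > best[show][2]:
            best[show] = (show, year, score)
    return sorted(best.values(), key=lambda rec: rec[2], reverse=True)
```

Under these assumptions, a voter with votes on shows from 2019, 2010 and 2005 contributes to two decade-specific training sets (2010s to new, 2000s to new), mirroring the m_1 and m_2 splits in slide 17. The merge step lets a new show that a decade model scores highly move up the final list, which is how the decade models surface more new shows per recommendation stream.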