Big Data as a Streaming Service 
Julie Knibbe 
Product Manager – Deezer 
@julieknibbe 
Manuel Moussalam 
R&D – Deezer
Product Manager 
Defines features that meet users' needs
Based on: 
• Market research 
• Product Data Analytics 
• User feedback
• Competitive Analysis 
• Creativity 
The Leanback Experience Team at Deezer 
• Product Manager 
• Project Manager 
• R&D Developers 
• Big Data developers 
• Web developers (front/back) 
• Mobile developers 
• QA
Deezer 
Active users 30M 
Countries 180+ 
Tracks in catalog 35M 
Artists in catalog 1M 
Music providers 1K+
The recommendation problem 
No one wants to hear music 
they don’t like
The recommendation problem 
No one wants to hear the 
same 200 tracks over and 
over again
The recommendation problem 
You need to hear a song between 1 and 7 times
before you like it
The recommendation problem 
Parameters and variables: 
• Mood 
• Tastes 
• Habits 
• Openness 
• Sociological profile 
• … 
Dimensions: 
• 35M tracks 
• 1M artists 
• 30M users
Building a user profile 
Onboarding users 
Monitoring user actions
Deezer – User qualification
User Profile
User Profile – Implicit / Explicit feedback 
Adaptation 
Add new information 
Forget old interests
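One possible way to "add new information" while "forgetting old interests" is to keep a weight per artist and let it decay exponentially over time. The sketch below is illustrative only: the `UserProfile` class, the 30-day half-life and the signal values are assumptions, not Deezer's actual implementation.

```python
import math
from collections import defaultdict


class UserProfile:
    """Hypothetical profile: one weight per artist, decayed over time."""

    def __init__(self, half_life_days=30.0):
        # Assumed half-life: after this many days an interest counts half.
        self.decay_rate = math.log(2) / half_life_days
        self.weights = defaultdict(float)
        self.last_update_day = 0.0

    def _decay_to(self, day):
        """Forget old interests: shrink every weight by the elapsed time."""
        elapsed = day - self.last_update_day
        if elapsed > 0:
            factor = math.exp(-self.decay_rate * elapsed)
            for artist in self.weights:
                self.weights[artist] *= factor
            self.last_update_day = day

    def add_feedback(self, day, artist, signal):
        """Add new information: explicit (like, ban) or implicit (play, skip)."""
        self._decay_to(day)
        self.weights[artist] += signal

    def top(self, n=3):
        return sorted(self.weights, key=self.weights.get, reverse=True)[:n]


profile = UserProfile(half_life_days=30.0)
profile.add_feedback(day=0, artist="Rolling Stones", signal=1.0)   # explicit like
profile.add_feedback(day=30, artist="The Who", signal=1.0)         # explicit like
print(profile.top(2))
```

With a 30-day half-life, the older like weighs half as much as the newer one by day 30, so recent interests rank first.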
Music Recommendation 
Given a listening profile for user X, what music should we 
recommend?
Recommendation system – adapting to user types
User types: Savants → Enthusiasts → Casuals → Indifferents
Riskier recommendations (Savants end) → Popular recommendations (Indifferents end)
Finding the right mix between novelty, familiarity and relevance
Recommendation system – adapting to user types 
Sources: 
http://alchemi.co.uk/archives/mus/groups_and_beha.html 
http://musicmachinery.com/2014/01/14/the-zero-button-music-player-2/
Use cases 
Playlist / Channel generation 
Discovery 
Personal Search 
…
Deezer features – Flow
Deezer features – Hear This
At Deezer 
Mixing collaborative filtering with semi-supervised 
approaches 
• Curation: Deezer Editors 
• Multi-layered graph structure of tracks & artists 
• Usage monitoring 
Based on Hadoop + Elasticsearch + Spark
Collaborative Filtering: Matching
Collaborative filtering:
"User X listened to the Rolling Stones. Users who listen
to the Rolling Stones usually also listen to the Who,
so let's suggest the Who to user X."
Popularized by the Netflix Prize
Collaborative Filtering 
Compute similarity between users, between items, or both
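The Rolling Stones / the Who example can be sketched as item-based collaborative filtering with cosine similarity over sets of listeners. This is a minimal illustration under stated assumptions: the `listens` data and the function names are made up, and the talk does not say which similarity measure is used in production.

```python
import math
from collections import defaultdict

# Hypothetical listening data: user -> set of artists they listened to.
listens = {
    "x": {"Rolling Stones"},
    "a": {"Rolling Stones", "The Who"},
    "b": {"Rolling Stones", "The Who", "Artist C"},
    "c": {"Artist C", "Artist D"},
}


def item_similarities(listens):
    """Cosine similarity between artists, based on shared listeners."""
    listeners = defaultdict(set)
    for user, artists in listens.items():
        for artist in artists:
            listeners[artist].add(user)
    sims = defaultdict(dict)
    for a in listeners:
        for b in listeners:
            if a == b:
                continue
            common = len(listeners[a] & listeners[b])
            if common:
                norm = math.sqrt(len(listeners[a]) * len(listeners[b]))
                sims[a][b] = common / norm
    return sims


def recommend(user, listens, n=3):
    """Score unseen artists by their similarity to what the user played."""
    sims = item_similarities(listens)
    scores = defaultdict(float)
    for artist in listens[user]:
        for other, sim in sims[artist].items():
            if other not in listens[user]:
                scores[other] += sim
    return sorted(scores, key=scores.get, reverse=True)[:n]


print(recommend("x", listens))  # the Who ranks first for user x
```

Swapping the roles of users and items in `item_similarities` gives the user-based variant of the same idea.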
Real data
Collaborative filtering: Exemplar based 
Association rules 
• Market basket analysis 
• Apriori algorithm
• …
But:
• Scalability issues
• Hub and island issues (Stromae example)
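To make the association-rule idea concrete, here is a small sketch that computes support and confidence for artist pairs, treating each user's listening history as a "basket". It covers pairs only and is not a full Apriori implementation; the baskets and thresholds are hypothetical.

```python
from collections import Counter
from itertools import combinations


def pair_rules(baskets, min_support=0.3, min_confidence=0.6):
    """Return rules (lhs, rhs, support, confidence) for pairs of items."""
    n = len(baskets)
    item_counts = Counter()
    pair_counts = Counter()
    for basket in baskets:
        items = sorted(set(basket))
        item_counts.update(items)
        pair_counts.update(combinations(items, 2))

    rules = []
    for (a, b), count in pair_counts.items():
        support = count / n  # share of baskets containing both items
        if support < min_support:
            continue
        for lhs, rhs in ((a, b), (b, a)):
            # confidence: share of baskets with lhs that also contain rhs
            confidence = count / item_counts[lhs]
            if confidence >= min_confidence:
                rules.append((lhs, rhs, support, confidence))
    return rules


# Hypothetical baskets: one set of artists per user.
baskets = [
    {"Rolling Stones", "The Who"},
    {"Rolling Stones", "The Who", "Artist C"},
    {"Rolling Stones"},
    {"Artist C", "Artist D"},
]
for lhs, rhs, support, confidence in pair_rules(baskets):
    print(f"{lhs} -> {rhs}: support={support:.2f}, confidence={confidence:.2f}")
```

Counting every pair is quadratic in basket size, which hints at the scalability issue, and a very popular artist tends to appear on the right-hand side of many rules, which is one way a hub can form.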
Collaborative filtering: Model based
Matrix Factorization
$A_{n \times m} \approx U_{n \times k} \, I_{k \times m}$
• U is a low-dimensional (rank k) model of the users
• I is the corresponding model of the items
Recommended items are the missing entries of A
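As a rough illustration of the factorization above, the sketch below fits U and I to the observed entries of a tiny matrix with stochastic gradient descent, then reads a missing entry off the product. The data, hyperparameters and function names are assumptions; production systems typically rely on a distributed solver, and the talk does not say which one is used.

```python
import random


def factorize(ratings, n_users, n_items, k=2, steps=2000, lr=0.02, reg=0.02, seed=0):
    """Fit A ~ U x I on observed entries only, with L2-regularized SGD.

    ratings maps (user, item) -> score. The item factors are stored one row
    per item, i.e. as the transpose of the k x m matrix I.
    """
    rng = random.Random(seed)
    U = [[rng.uniform(0, 0.5) for _ in range(k)] for _ in range(n_users)]
    I = [[rng.uniform(0, 0.5) for _ in range(k)] for _ in range(n_items)]
    for _ in range(steps):
        for (u, i), r in ratings.items():
            err = r - sum(U[u][f] * I[i][f] for f in range(k))
            for f in range(k):
                uf, vf = U[u][f], I[i][f]
                U[u][f] += lr * (err * vf - reg * uf)
                I[i][f] += lr * (err * uf - reg * vf)
    return U, I


def predict(U, I, u, i):
    return sum(a * b for a, b in zip(U[u], I[i]))


# Hypothetical scores: users 0-1 prefer items 0-1, users 2-3 prefer items 2-3.
ratings = {
    (0, 0): 5, (0, 1): 5, (0, 2): 1,
    (1, 0): 5, (1, 1): 5, (1, 3): 1,
    (2, 0): 1, (2, 2): 5, (2, 3): 5,
    (3, 1): 1, (3, 2): 5, (3, 3): 5,
}
U, I = factorize(ratings, n_users=4, n_items=4)
# Entry (0, 3) was never observed; its predicted score is the recommendation signal.
print(round(predict(U, I, 0, 3), 2), round(predict(U, I, 0, 0), 2))
```

Ranking a user's unobserved items by predicted score gives the recommendation list; k controls how much structure the model can capture.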
Collaborative Filtering: Limitations 
• Cold Start problem 
• Sparse user-item matrix (1% coverage) 
• Only based on social behaviors 
• Popularity bias ("the rich get richer")
Content-based filtering: Music items representation 
Content-based filtering: Limitations 
• Cold Start problem 
• Users with atypical tastes 
• Lack of novelty 
• Subjectivity not taken into account
Content Similarity 
Clustering tracks, artists, albums… 
Methods: 
• Matrix Factorization techniques 
• Spectral clustering 
• Musical features extraction 
• Louvain algorithm 
• …
Example: Multiple Spectral Clustering
Cleaning 
• Mislabeled data: Different sources tell different things 
about songs, artists, albums 
• No universally adopted music ontology 
• Subjectivity 
• Outlier detection: confronting several sources and 
models
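"Confronting several sources" could be as simple as a majority vote over the labels each source assigns to a track, flagging the dissenting sources for review. The sketch below assumes hypothetical source names and a single genre label per source; it is not a description of Deezer's cleaning pipeline.

```python
from collections import Counter


def reconcile(labels_by_source, min_agreement=0.6):
    """Majority vote over sources.

    Returns (label, agreement, dissenting_sources, needs_review).
    """
    votes = Counter(labels_by_source.values())
    label, count = votes.most_common(1)[0]
    agreement = count / len(labels_by_source)
    dissenting = sorted(s for s, l in labels_by_source.items() if l != label)
    return label, agreement, dissenting, agreement < min_agreement


# Hypothetical genre labels for one track from three sources.
track_labels = {
    "provider_a": "rock",
    "provider_b": "rock",
    "audio_model": "pop",
}
print(reconcile(track_labels))
```

Sources that frequently end up in the dissenting list are themselves candidates for closer inspection.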
Cleaning: Example
In real life… 
A/B Testing
Algorithms A/B Testing 
Algo A 
Algo B 
Observe results: 
• Daily Active Users 
• Streams / users 
• Satisfaction 
• … 
Deezer users
Algorithms A/B Testing: Example 
Test: Are new users (with no profile data) more satisfied
with chart items or with new ones?
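One common way to read the outcome of such a test is a two-proportion z-test on a binary metric. The sketch below assumes "satisfied" is a yes/no outcome per user (for example, the user came back the next day); both that definition and the counts are hypothetical.

```python
import math


def two_proportion_z_test(success_a, n_a, success_b, n_b):
    """Two-sided z-test for the difference between two proportions."""
    p_a = success_a / n_a
    p_b = success_b / n_b
    pooled = (success_a + success_b) / (n_a + n_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (p_b - p_a) / se
    # Two-sided p-value under the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


# Hypothetical counts: group A gets chart items, group B gets new releases.
z, p_value = two_proportion_z_test(success_a=400, n_a=1000, success_b=460, n_b=1000)
print(f"z = {z:.2f}, p = {p_value:.4f}")
```

A small p-value suggests the difference between the two groups is unlikely to be noise; whether it matters for the product is a separate question.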
Thanks!

GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
 
ODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User Group
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptxIOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
 

Deezer - Big data as a streaming service

  • 1. Big Data as a Streaming Service Big Data as a Streaming Service Julie Knibbe Product Manager – Deezer @julieknibbe Manuel Moussalam R&D – Deezer
  • 2. Big Data as a Streaming Service Product Manager Defines features that meet users' needs Based on: • Market research • Product Data Analytics • User feedback • Competitive Analysis • Creativity 
  • 3. Big Data as a Streaming Service The Leanback Experience Team at Deezer • Product Manager • Project Manager • R&D Developers • Big Data developers • Web developers (front/back) • Mobile developers • QA
  • 4. Big Data as a Streaming Service Deezer Active users 30M Countries 180+ Tracks in catalog 35M Artists in catalog 1M Music providers 1K+
  • 5. Big Data as a Streaming Service The recommendation problem No one wants to hear music they don’t like
  • 6. Big Data as a Streaming Service The recommendation problem No one wants to hear the same 200 tracks over and over again
  • 7. Big Data as a Streaming Service The recommendation problem You need to hear a song from 1 to 7 times to like it
  • 8. Big Data as a Streaming Service The recommendation problem Parameters and variables: • Mood • Tastes • Habits • Openness • Sociological profile • … Dimensions: • 35M tracks • 1M artists • 30M users
  • 9. Big Data as a Streaming Service Building a user profile Onboarding users Monitoring user actions
  • 10. Big Data as a Streaming Service Deezer – User qualification
  • 11. Big Data as a Streaming Service User Profile
  • 12. Big Data as a Streaming Service User Profile – Implicit / Explicit feedback Adaptation Add new information Forget old interests
  • 13. Big Data as a Streaming Service Music Recommendation Given a listening profile for user X, what music should we recommend?
  • 14. Recommendation system – adapting to user types Big Data as a Streaming Service Savants Enthusiasts Casuals Indifferents Riskier recommendations Popular recommendations Finding the right mix between novelty, familiarity and relevance
  • 15. Recommendation system – adapting to user types Big Data as a Streaming Service Sources: http://alchemi.co.uk/archives/mus/groups_and_beha.html http://musicmachinery.com/2014/01/14/the-zero-button-music-player-2/
  • 16. Big Data as a Streaming Service Use cases Playlist / Channel generation Discovery Personal Search …
  • 17. Big Data as a Streaming Service Deezer features – Flow
  • 18. Big Data as a Streaming Service Deezer features – Hear This
  • 19. Big Data as a Streaming Service At Deezer Mixing collaborative filtering with semi-supervised approaches • Curation: Deezer Editors • Multi-layered graph structure of tracks & artists • Usage monitoring Based on Hadoop + ElasticSearch + Spark
  • 20. Big Data as a Streaming Service Collaborative Filtering: Matching Collaborative Filtering: "User X listened to the Rolling Stones. Users listening to the Rolling Stones usually also listen to the Who, so let's suggest the Who to user X." Popularized by the Netflix Prize
  • 21. Big Data as a Streaming Service Collaborative Filtering Compute similarity between users, between items, or both
  • 22. Big Data as a Streaming Service Real data
  • 23. Big Data as a Streaming Service Collaborative filtering: Exemplar based Association rules • Market basket analysis • Apriori algorithm • … But: • Scalability issues • Hub and island issues (Stromae example)
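  • 24. Big Data as a Streaming Service Collaborative filtering: Model based Matrix Factorization $A_{n \times m} = U_{n \times k} \, I_{k \times m}$ • U is a low-dimensional model of users (n users, k factors) • I is a low-dimensional model of items (k factors, m items) Recommended items are drawn from the missing entries of A, estimated from the product of U and I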
  • 25. Big Data as a Streaming Service Collaborative Filtering: Limitations • Cold Start problem • Sparse user-item matrix (1% coverage) • Based only on social behaviors • Popularity bias ("The rich get richer")
  • 26. Content-based filtering: Music items representation Big Data as a Streaming Service
  • 27. Big Data as a Streaming Service Content-based filtering: Limitations • Cold Start problem • Users with atypical tastes • Lack of novelty • Subjectivity not taken into account
  • 28. Big Data as a Streaming Service Content Similarity Clustering tracks, artists, albums… Methods: • Matrix Factorization techniques • Spectral clustering • Musical features extraction • Louvain algorithm • …
  • 29. Big Data as a Streaming Service Example: Multiple Spectral Clustering
  • 30. Big Data as a Streaming Service Cleaning • Mislabeled data: different sources tell different things about songs, artists, albums • No universally adopted music ontology • Subjectivity • Outlier detection: cross-checking several sources and models
  • 31. Big Data as a Streaming Service Cleaning: Example
  • 32. Big Data as a Streaming Service In real life… A/B Testing
  • 33. Big Data as a Streaming Service Algorithms A/B Testing Algo A Algo B Observe results: • Daily Active Users • Streams / users • Satisfaction • … Deezer users
  • 34. Big Data as a Streaming Service Algorithms A/B Testing: Example Test: Are new users (with no profile data) more likely to be satisfied with chart items or with new ones?
  • 35. Big Data as a Streaming Service Thanks !
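
The model-based approach on slide 24 can be illustrated with a small sketch. This is not Deezer's implementation: it is a minimal matrix factorization trained by stochastic gradient descent, in plain Python, under stated assumptions. The affinity scores, the held-out entries, and the hyperparameters (k, steps, lr, reg) are all hypothetical values chosen for illustration.

```python
import random

def factorize(observed, n_users, n_items, k=2, steps=2000, lr=0.01, reg=0.02, seed=0):
    """Learn U (n_users x k) and I (n_items x k) so that U[u] . I[i] fits observed[(u, i)]."""
    rng = random.Random(seed)
    U = [[rng.uniform(0, 0.5) for _ in range(k)] for _ in range(n_users)]
    I = [[rng.uniform(0, 0.5) for _ in range(k)] for _ in range(n_items)]
    for _ in range(steps):
        for (u, i), score in observed.items():
            err = score - sum(U[u][f] * I[i][f] for f in range(k))
            for f in range(k):
                uf, itf = U[u][f], I[i][f]
                # Gradient step on squared error with L2 regularization.
                U[u][f] += lr * (err * itf - reg * uf)
                I[i][f] += lr * (err * uf - reg * itf)
    return U, I

def predict(U, I, u, i):
    """Estimated affinity of user u for item i (a missing entry of A)."""
    return sum(a * b for a, b in zip(U[u], I[i]))

# Hypothetical 4 users x 4 items affinity matrix with two taste groups.
full = [
    [5, 5, 1, 1],
    [5, 5, 1, 1],
    [1, 1, 5, 5],
    [1, 1, 5, 5],
]
held_out = {(0, 1), (0, 3), (2, 2)}  # entries treated as missing
observed = {
    (u, i): full[u][i]
    for u in range(4) for i in range(4)
    if (u, i) not in held_out
}

U, I = factorize(observed, n_users=4, n_items=4)
for u, i in sorted(held_out):
    print(f"user {u}, item {i}: predicted {predict(U, I, u, i):.2f}")
```

The missing entries with the highest predicted score are the candidates to recommend. At catalog scale (35M tracks, 30M users) a distributed solver would replace this loop; the sketch only shows the shape of the model.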

Editor's Notes

  1. User-based neighbourhood: find similar users and recommend their taste. Item-based neighbourhood: find similar items (association rules: items in the same playlists, etc.)
  2. User-based neighbourhood: find similar users and recommend their taste. Item-based neighbourhood: find similar items (association rules: items in the same playlists, etc.)
  3. User-based neighbourhood: find similar users and recommend their taste. Item-based neighbourhood: find similar items (association rules: items in the same playlists, etc.)
  4. User-based neighbourhood: find similar users and recommend their taste. Item-based neighbourhood: find similar items (association rules: items in the same playlists, etc.)
  5. The rich get richer
  6. Collect information to describe items – and work on similarity
  7. Collect information to describe items – and work on similarity
  8. Collect information to describe items – and work on similarity
  9. Artist / Artist matrix to find similar artists
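
The item-based neighbourhood and the artist/artist matrix mentioned in these notes can be sketched as follows. This is an illustrative example only, assuming binary "user listened to artist" data and cosine similarity; the listening histories and the function names `artist_similarity` and `recommend` are hypothetical and are not taken from the talk.

```python
from collections import defaultdict
from itertools import combinations
from math import sqrt

def artist_similarity(histories):
    """Build a sparse artist/artist cosine-similarity matrix from user histories."""
    listeners = defaultdict(int)  # artist -> number of users who listened
    together = defaultdict(int)   # (artist_a, artist_b) -> users who listened to both
    for artists in histories.values():
        unique = sorted(set(artists))
        for a in unique:
            listeners[a] += 1
        for a, b in combinations(unique, 2):
            together[(a, b)] += 1
    sim = defaultdict(dict)
    for (a, b), n in together.items():
        # Cosine similarity between the two binary listener vectors.
        score = n / sqrt(listeners[a] * listeners[b])
        sim[a][b] = score
        sim[b][a] = score
    return sim

def recommend(sim, listened, top_n=3):
    """Rank unheard artists by summed similarity to the artists already listened to."""
    scores = defaultdict(float)
    for a in listened:
        for b, s in sim.get(a, {}).items():
            if b not in listened:
                scores[b] += s
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [artist for artist, _ in ranked[:top_n]]

# Hypothetical listening histories.
histories = {
    "u1": ["The Rolling Stones", "The Who", "The Kinks"],
    "u2": ["The Rolling Stones", "The Who"],
    "u3": ["The Rolling Stones", "Stromae"],
    "u4": ["Stromae", "Daft Punk"],
}

sim = artist_similarity(histories)
suggestions = recommend(sim, {"The Rolling Stones"}, top_n=1)
print(suggestions)
```

Dividing by the listener counts is one common way to dampen the popularity bias noted above: an artist that co-occurs with everything is penalized relative to raw co-occurrence counts.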