40 million songs, albums and artists available - how nice? Streaming allows you to get a grasp at the biggest music collections in the world. The only thing is that you would need centuries to listen to all of it.
Getting access doesn’t mean knowing what to do with it. How are we making music discovery more & more efficient at Deezer?
Presented at the Machine Learning class at Chalmers, Gothenburg.
http://www.cse.chalmers.se/research/lab/courses.php?coid=9
Trying to connect their theoretical machine learning class with industry examples.
- Music streaming insights and numbers from Deezer perspective
- Working with B2B2C partnerships
- Deezer content strategy
- Music industry in numbers
- Future projections
Machine Learning and Big Data for Music Discovery at SpotifyChing-Wei Chen
Spotify is the world’s largest on-demand music streaming company, with over 100 million active users who generate around 2TB of interaction data every day. With over 30 million songs to choose from, discovery and personalization play an essential role in helping users discover the best music for them. In this talk, given at the newly opened Galvanize space in NYC in March 2017, we’ll explain how Spotify uses Latent Space Models and Deep Learning to power features such as Discover Weekly and Release Radar.
Spotify Discover Weekly: The machine learning behind your music recommendationsSophia Ciocca
In this presentation, I give an overview of the machine learning algorithms behind Spotify’s extraordinarily popular Discover Weekly playlist. I provide a brief introduction to what the playlist is, explain how music recommendation engines have evolved over time, then break down the three main algorithm types powering Spotify’s recommendations: (1) collaborative filtering, (2) Natural Language Processing (NLP), and (3) Raw audio analysis.
Video of the presentation can be found here: https://www.youtube.com/watch?v=PUtYNjInopA
Learn how to use Deezer and Spotify platform to highlight your music content. How to create Deezer and Spotify Apps?
What are the 5 points to take into account for a good label application?
Presented at the Machine Learning class at Chalmers, Gothenburg.
http://www.cse.chalmers.se/research/lab/courses.php?coid=9
Trying to connect their theoretical machine learning class with industry examples.
- Music streaming insights and numbers from Deezer perspective
- Working with B2B2C partnerships
- Deezer content strategy
- Music industry in numbers
- Future projections
Machine Learning and Big Data for Music Discovery at SpotifyChing-Wei Chen
Spotify is the world’s largest on-demand music streaming company, with over 100 million active users who generate around 2TB of interaction data every day. With over 30 million songs to choose from, discovery and personalization play an essential role in helping users discover the best music for them. In this talk, given at the newly opened Galvanize space in NYC in March 2017, we’ll explain how Spotify uses Latent Space Models and Deep Learning to power features such as Discover Weekly and Release Radar.
Spotify Discover Weekly: The machine learning behind your music recommendationsSophia Ciocca
In this presentation, I give an overview of the machine learning algorithms behind Spotify’s extraordinarily popular Discover Weekly playlist. I provide a brief introduction to what the playlist is, explain how music recommendation engines have evolved over time, then break down the three main algorithm types powering Spotify’s recommendations: (1) collaborative filtering, (2) Natural Language Processing (NLP), and (3) Raw audio analysis.
Video of the presentation can be found here: https://www.youtube.com/watch?v=PUtYNjInopA
Learn how to use Deezer and Spotify platform to highlight your music content. How to create Deezer and Spotify Apps?
What are the 5 points to take into account for a good label application?
Part of my guest lecture on Data Driven Business Models at Stockholm School of Entrepreneurship. I spoke about how Data is core to the Spotify business and it drives Spotify forward.
Building Data Pipelines for Music Recommendations at SpotifyVidhya Murali
In this talk, we will get into the architectural and functional details as to how we build scalable and robust data pipelines for music recommendations at Spotify. We will also discuss some of the challenges and an overview of work to address these challenges.
The current revolution in the music industry represents great opportunities and challenges for music recommendation systems. Recommendation systems are now central to music streaming platforms, which are rapidly increasing in listenership and becoming the top source of revenue for the music industry. It is increasingly more common for a music listener to simply access music than to purchase and own it in a personal collection. In this scenario, recommendation calls no longer for a one-shot recommendation for the purpose of a track or album purchase, but for a recommendation of a listening experience, comprising a very wide range of challenges, such as sequential recommendation, or conversational and contextual recommendations. Recommendation technologies now impact all actors in the rich and complex music industry ecosystem (listeners, labels, music makers and producers, concert halls, advertisers, etc.).
UPDATED VERSION (2011): http://www.slideshare.net/plamere/music-recommendation-and-discovery
As the world of online music grows, music 2.0 recommendation systems become an increasingly important way for music listeners to discover new music.
Commercial recommenders such as Last.fm and Pandora have enjoyed commercial and critical success. But how well do these systems really work? How good are the recommendations? How far into The Long Tail do these recommenders reach?
In this tutorial we look at the current stateof theart in music recommendation. We examine current commercial and research systems, focusing on the advantages and the disadvantages of the various recommendation strategies. We look at some of the challenges in building music recommenders and we explore some of the ways that MIR techniques can be used to improve future recommenders.
I share ideas for what Spotify (my favorite app) could be doing better as far as new product features and partnerships along with demographic data (see appendix). This deck is for fun and relates to personal opinions as a user along with real data. Enjoy it!
A Digital Marketing Strategy for Spotify Maura Hickey
This is a digital strategy for Spotify, delving into a situation analysis, customer analysis, competitor analysis, external factors, objectives, digital marketing channels strategy and tactics, measurement and control and strategy implementation.
The final slide deck for Spotify Chords, a product partnership proposal between Spotify and Riffstation. With Spotify's richly organized content library and Riffstation's automatic chord detection technology, these two can create a product for musicians and aspiring musicians to easily learn how to play their favorite songs.
14 April 2016
The value of playlists – for the record labels
Value, interaction, traction…as digital radio and streaming music radio-like services boom they have become a significant revenue stream for the music rights industry with playlists being bought and sold. What are the implications for the growing playlist industry?
Simon Rugg, National Accounts Manager, PIAS UK
Music discovery: What, why, who, when, where?Julie Knibbe
Recommending music is promising that you will make people like, feel or remember something when they’ll listen. What is pushing you to get adventurous and hit “play”?
Part of my guest lecture on Data Driven Business Models at Stockholm School of Entrepreneurship. I spoke about how Data is core to the Spotify business and it drives Spotify forward.
Building Data Pipelines for Music Recommendations at SpotifyVidhya Murali
In this talk, we will get into the architectural and functional details as to how we build scalable and robust data pipelines for music recommendations at Spotify. We will also discuss some of the challenges and an overview of work to address these challenges.
The current revolution in the music industry represents great opportunities and challenges for music recommendation systems. Recommendation systems are now central to music streaming platforms, which are rapidly increasing in listenership and becoming the top source of revenue for the music industry. It is increasingly more common for a music listener to simply access music than to purchase and own it in a personal collection. In this scenario, recommendation calls no longer for a one-shot recommendation for the purpose of a track or album purchase, but for a recommendation of a listening experience, comprising a very wide range of challenges, such as sequential recommendation, or conversational and contextual recommendations. Recommendation technologies now impact all actors in the rich and complex music industry ecosystem (listeners, labels, music makers and producers, concert halls, advertisers, etc.).
UPDATED VERSION (2011): http://www.slideshare.net/plamere/music-recommendation-and-discovery
As the world of online music grows, music 2.0 recommendation systems become an increasingly important way for music listeners to discover new music.
Commercial recommenders such as Last.fm and Pandora have enjoyed commercial and critical success. But how well do these systems really work? How good are the recommendations? How far into The Long Tail do these recommenders reach?
In this tutorial we look at the current stateof theart in music recommendation. We examine current commercial and research systems, focusing on the advantages and the disadvantages of the various recommendation strategies. We look at some of the challenges in building music recommenders and we explore some of the ways that MIR techniques can be used to improve future recommenders.
I share ideas for what Spotify (my favorite app) could be doing better as far as new product features and partnerships along with demographic data (see appendix). This deck is for fun and relates to personal opinions as a user along with real data. Enjoy it!
A Digital Marketing Strategy for Spotify Maura Hickey
This is a digital strategy for Spotify, delving into a situation analysis, customer analysis, competitor analysis, external factors, objectives, digital marketing channels strategy and tactics, measurement and control and strategy implementation.
The final slide deck for Spotify Chords, a product partnership proposal between Spotify and Riffstation. With Spotify's richly organized content library and Riffstation's automatic chord detection technology, these two can create a product for musicians and aspiring musicians to easily learn how to play their favorite songs.
14 April 2016
The value of playlists – for the record labels
Value, interaction, traction…as digital radio and streaming music radio-like services boom they have become a significant revenue stream for the music rights industry with playlists being bought and sold. What are the implications for the growing playlist industry?
Simon Rugg, National Accounts Manager, PIAS UK
Music discovery: What, why, who, when, where?Julie Knibbe
Recommending music is promising that you will make people like, feel or remember something when they’ll listen. What is pushing you to get adventurous and hit “play”?
Visit us at http://www.newtechnologytv.com/vidgo-streaming-tv/ VIDGO is an OTT television streaming service offering the most comprehensive content at very affordable prices. Find info about NEXT GENERATION TV and enjoy live local, sports, national, international and on-demand live television streamed instantly while you are at home or on the go.
Speedier internet connections, enhanced decoders and plugins, better equipped computer machines, all contribute to putting the fun back to streaming video online.
For the set goals achievement and successful A/B testing fulfillment, first of all, one should get acquainted with the peculiarities of such checking type and the algorithm of its performance.
(by QATestLab)
In-Stream Processing Service Blueprint, Reference architecture for real-time ...Grid Dynamics
What is it about? In-Stream Event Processing is a new approach for building near real time big data systems with rapidly growing user base and applications like clickstream analytics, preventive maintenance or fraud detection. Maturity of some open source projects enables building an enterprise grade In-Stream Processing service in-house. However the open source world comprises of many competing projects of different maturity, having different perspectives so the task to select effective and efficient projects is not straightforward. In the talk I’ll present a blueprint of an In-Stream Processing Service, enterprise grade reliable and scalable, cloud ready, build from 100% open source components.
Spark-Streaming-as-a-Service with Kafka and YARN: Spark Summit East talk by J...Spark Summit
Since April 2016, Spark-as-a-service has been available to researchers in Sweden from the Swedish ICT SICS Data Center at www.hops.site. Researchers work in an entirely UI-driven environment on a platform built with only open-source software.
Spark applications can be either deployed as jobs (batch or streaming) or written and run directly from Apache Zeppelin. Spark applications are run within a project on a YARN cluster with the novel property that Spark applications are metered and charged to projects. Projects are also securely isolated from each other and include support for project-specific Kafka topics. That is, Kafka topics are protected from access by users that are not members of the project. In this talk we will discuss the challenges in building multi-tenant Spark streaming applications on YARN that are metered and easy-to-debug. We show how we use the ELK stack (Elasticsearch, Logstash, and Kibana) for logging and debugging running Spark streaming applications, how we use Graphana and Graphite for monitoring Spark streaming applications, and how users can debug and optimize terminated Spark Streaming jobs using Dr Elephant. We will also discuss the experiences of our users (over 120 users as of Sept 2016): how they manage their Kafka topics and quotas, patterns for how users share topics between projects, and our novel solutions for helping researchers debug and optimize Spark applications.
To conclude, we will also give an overview on our course ID2223 on Large Scale Learning and Deep Learning, in which 60 students designed and ran SparkML applications on the platform.
Music and video streaming services such as Spotify, Deezer, Netflix, YouTube, or Amazon Instant Video account for a continuously increasing part of media consumption in Germany. Although traditional media formats, such as linear TV, CDs and DVDs, have been written off repeatedly, many consumers could not live without them.
Which changes in media usage patterns are actually observable? Why are consumers attracted to streaming services? Why do some consumers still prefer traditional media formats? What are the implications for telecommunications operators?
Finding answers to these questions is the objective of this study. The responses we present are surprising and should encourage decision-makers both at content and telecommunications providers to reflect upon their respective business strategies.
In order to address all aspects of these issues from a consumer behaviour perspective, this study used a mixed-methods approach combining quantitative and qualitative research methods. The first step involved a survey of a representative sample of more than 1,000 German consumers. The results of this survey were then reflected on and scrutinised in 28 in-depth interviews with consumers.
Presented at iOS Conf SG: http://iosconf.sg/
Most iPhone users don’t bother installing any apps per months. And worse, ~80% never use an app they’ve installed again. The future of mobile is clearly not app, but features. Features that make the iPhone ecosystem still a native experience, but as open and flexible as the web. Learn how you could prepare for that future.
Netflix JavaScript Talks - Scaling A/B Testing on Netflix.com with Node.jsChris Saint-Amant
At Netflix we run hundreds of A/B tests every year. Maintaining multivariate experiences quickly adds strain to any UI engineering team. In this talk, Alex Liu and Micah Ransdell explore the patterns we’ve built in Node.js to tame this beast - ultimately enabling quick feature development and rapid test iteration on our service used by over 50 million people around the world.
The purpose of this presentation is to understand how analytics is used in the Media and Entertainment Industry. Examples of Netflix, Spotify and BookMyShow have been considered to look at the same
Group members:
LIU Yuchen, RAN Zichuan, WU Wenjun, WANG Xiaotan, LI Chaoran, NI Ning
The business model, strategy and industry analysis of NetEase Cloud Music.
Music accounts for a significant chunk of interest among
various online activities. This is reflected by wide array of
alternatives offered in music related web/mobile apps, information portals, featuring millions of artists, songs and
events attracting user activity at similar scale. Availability of large scale structured and unstructured data has attracted similar level of attention by data science community. This paper attempts to offer current state-of-the-art
in music related analysis. Various approaches involving machine learning, information theory, social network analysis,
semantic web and linked open data are represented in the
form of taxonomy along with data sources and use cases
addressed by the research community.
Social Media Monitoring as a Tool to Assess Customer Satisfaction: The Case o...Bellakarina Solorzano
This study demonstrates how social media monitoring can be used as a tool for assessing customer satisfaction and other consumer sentiments. The researchers chose Spotify, a digital music service, as the focal company given its strong social media presence.
Following a methodology similar to the one employed by Collins, Hasan, and Ukkusuri (2013), the target constructs were assessed by measuring Twitter feeds.
This research was undertaken as the second phase of a three phase brand analysis project.
This study was presented at the 2016 Consumer Satisfaction, Dissatisfaction and Complaining Behavior Conference in the City of New Orleans.
A recommender system, or a recommendation system, is a subclass of information filtering system that seeks to predict the "rating" or "preference" a user would give to an item. They are primarily used in commercial applications.
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Jeffrey Haguewood
Sidekick Solutions uses Bonterra Impact Management (fka Social Solutions Apricot) and automation solutions to integrate data for business workflows.
We believe integration and automation are essential to user experience and the promise of efficient work through technology. Automation is the critical ingredient to realizing that full vision. We develop integration products and services for Bonterra Case Management software to support the deployment of automations for a variety of use cases.
This video focuses on the notifications, alerts, and approval requests using Slack for Bonterra Impact Management. The solutions covered in this webinar can also be deployed for Microsoft Teams.
Interested in deploying notification automations for Bonterra Impact Management? Contact us at sales@sidekicksolutionsllc.com to discuss next steps.
Epistemic Interaction - tuning interfaces to provide information for AI supportAlan Dix
Paper presented at SYNERGY workshop at AVI 2024, Genoa, Italy. 3rd June 2024
https://alandix.com/academic/papers/synergy2024-epistemic/
As machine learning integrates deeper into human-computer interactions, the concept of epistemic interaction emerges, aiming to refine these interactions to enhance system adaptability. This approach encourages minor, intentional adjustments in user behaviour to enrich the data available for system learning. This paper introduces epistemic interaction within the context of human-system communication, illustrating how deliberate interaction design can improve system understanding and adaptation. Through concrete examples, we demonstrate the potential of epistemic interaction to significantly advance human-computer interaction by leveraging intuitive human communication strategies to inform system design and functionality, offering a novel pathway for enriching user-system engagements.
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Ramesh Iyer
In today's fast-changing business world, Companies that adapt and embrace new ideas often need help to keep up with the competition. However, fostering a culture of innovation takes much work. It takes vision, leadership and willingness to take risks in the right proportion. Sachin Dev Duggal, co-founder of Builder.ai, has perfected the art of this balance, creating a company culture where creativity and growth are nurtured at each stage.
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualityInflectra
In this insightful webinar, Inflectra explores how artificial intelligence (AI) is transforming software development and testing. Discover how AI-powered tools are revolutionizing every stage of the software development lifecycle (SDLC), from design and prototyping to testing, deployment, and monitoring.
Learn about:
• The Future of Testing: How AI is shifting testing towards verification, analysis, and higher-level skills, while reducing repetitive tasks.
• Test Automation: How AI-powered test case generation, optimization, and self-healing tests are making testing more efficient and effective.
• Visual Testing: Explore the emerging capabilities of AI in visual testing and how it's set to revolutionize UI verification.
• Inflectra's AI Solutions: See demonstrations of Inflectra's cutting-edge AI tools like the ChatGPT plugin and Azure Open AI platform, designed to streamline your testing process.
Whether you're a developer, tester, or QA professional, this webinar will give you valuable insights into how AI is shaping the future of software delivery.
"Impact of front-end architecture on development cost", Viktor TurskyiFwdays
I have heard many times that architecture is not important for the front-end. Also, many times I have seen how developers implement features on the front-end just following the standard rules for a framework and think that this is enough to successfully launch the project, and then the project fails. How to prevent this and what approach to choose? I have launched dozens of complex projects and during the talk we will analyze which approaches have worked for me and which have not.
The Art of the Pitch: WordPress Relationships and SalesLaura Byrne
Clients don’t know what they don’t know. What web solutions are right for them? How does WordPress come into the picture? How do you make sure you understand scope and timeline? What do you do if sometime changes?
All these questions and more will be explored as we talk about matching clients’ needs with what your agency offers without pulling teeth or pulling your hair out. Practical tips, and strategies for successful relationship building that leads to closing the deal.
Neuro-symbolic is not enough, we need neuro-*semantic*Frank van Harmelen
Neuro-symbolic (NeSy) AI is on the rise. However, simply machine learning on just any symbolic structure is not sufficient to really harvest the gains of NeSy. These will only be gained when the symbolic structures have an actual semantics. I give an operational definition of semantics as “predictable inference”.
All of this illustrated with link prediction over knowledge graphs, but the argument is general.
JMeter webinar - integration with InfluxDB and GrafanaRTTS
Watch this recorded webinar about real-time monitoring of application performance. See how to integrate Apache JMeter, the open-source leader in performance testing, with InfluxDB, the open-source time-series database, and Grafana, the open-source analytics and visualization application.
In this webinar, we will review the benefits of leveraging InfluxDB and Grafana when executing load tests and demonstrate how these tools are used to visualize performance metrics.
Length: 30 minutes
Session Overview
-------------------------------------------
During this webinar, we will cover the following topics while demonstrating the integrations of JMeter, InfluxDB and Grafana:
- What out-of-the-box solutions are available for real-time monitoring JMeter tests?
- What are the benefits of integrating InfluxDB and Grafana into the load testing stack?
- Which features are provided by Grafana?
- Demonstration of InfluxDB and Grafana using a practice web application
To view the webinar recording, go to:
https://www.rttsweb.com/jmeter-integration-webinar
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...DanBrown980551
Do you want to learn how to model and simulate an electrical network from scratch in under an hour?
Then welcome to this PowSyBl workshop, hosted by Rte, the French Transmission System Operator (TSO)!
During the webinar, you will discover the PowSyBl ecosystem as well as handle and study an electrical network through an interactive Python notebook.
PowSyBl is an open source project hosted by LF Energy, which offers a comprehensive set of features for electrical grid modelling and simulation. Among other advanced features, PowSyBl provides:
- A fully editable and extendable library for grid component modelling;
- Visualization tools to display your network;
- Grid simulation tools, such as power flows, security analyses (with or without remedial actions) and sensitivity analyses;
The framework is mostly written in Java, with a Python binding so that Python developers can access PowSyBl functionalities as well.
What you will learn during the webinar:
- For beginners: discover PowSyBl's functionalities through a quick general presentation and the notebook, without needing any expert coding skills;
- For advanced developers: master the skills to efficiently apply PowSyBl functionalities to your real-world scenarios.
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Tobias Schneck
As AI technology is pushing into IT I was wondering myself, as an “infrastructure container kubernetes guy”, how get this fancy AI technology get managed from an infrastructure operational view? Is it possible to apply our lovely cloud native principals as well? What benefit’s both technologies could bring to each other?
Let me take this questions and provide you a short journey through existing deployment models and use cases for AI software. On practical examples, we discuss what cloud/on-premise strategy we may need for applying it to our own infrastructure to get it to work from an enterprise perspective. I want to give an overview about infrastructure requirements and technologies, what could be beneficial or limiting your AI use cases in an enterprise environment. An interactive Demo will give you some insides, what approaches I got already working for real.
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
Let's dive deeper into the world of ODC! Ricardo Alves (OutSystems) will join us to tell all about the new Data Fabric. After that, Sezen de Bruijn (OutSystems) will get into the details on how to best design a sturdy architecture within ODC.
1. Big Data as a Streaming Service
Big Data as a Streaming Service
Julie Knibbe
Product Manager – Deezer
@julieknibbe
Manuel Moussalam
R&D – Deezer
2. Big Data as a Streaming Service
Product Manager
Defines features that meet users needs
Based on:
• Market research
• Product Data Analytics
• Users feedback
• Competitive Analysis
• Creativity
3. Big Data as a Streaming Service
The Leanback Experience Team at Deezer
• Product Manager
• Project Manager
• R&D Developers
• Big Data developers
• Web developers (front/back)
• Mobile developers
• QA
4. Big Data as a Streaming Service
Deezer
Active users 30M
Countries 180+
Tracks in catalog 35M
Artists in catalog 1M
Music providers 1K+
5. Big Data as a Streaming Service
The recommendation problem
No one wants to hear music
they don’t like
6. Big Data as a Streaming Service
The recommendation problem
No one wants to hear the
same 200 tracks over and
over again
7. Big Data as a Streaming Service
The recommendation problem
You need to hear a song from
1 to 7 times to like it
8. Big Data as a Streaming Service
The recommendation problem
Parameters and variables:
• Mood
• Tastes
• Habits
• Openness
• Sociological profile
• …
Dimensions:
• 35M tracks
• 1M artists
• 30M users
9. Big Data as a Streaming Service
Building a user profile
Onboarding users
Monitoring user actions
10. Big Data as a Streaming Service
Deezer – User qualification
12. Big Data as a Streaming Service
User Profile – Implicit / Explicit feedback
Adaptation
Add new information
Forget old interests
13. Big Data as a Streaming Service
Music Recommendation
Given a listening profile for user X, what music should we
recommend?
14. Recommendation system – adapting to user types
Big Data as a Streaming Service
Savants
Enthusiasts
Casuals
Indifferents
Riskier
recommendations
Popular
recommendations
Finding the right mix between novelty, familiarity and relevance
15. Recommendation system – adapting to user types
Big Data as a Streaming Service
Sources:
http://alchemi.co.uk/archives/mus/groups_and_beha.html
http://musicmachinery.com/2014/01/14/the-zero-button-music-player-2/
16. Big Data as a Streaming Service
Use cases
Playlist / Channel generation
Discovery
Personal Search
…
17. Big Data as a Streaming Service
Deezer features – Flow
18. Big Data as a Streaming Service
Deezer features – Hear This
19. Big Data as a Streaming Service
At Deezer
Mixing collaborative filtering with semi-supervised
approaches
• Curation: Deezer Editors
• Multi-layered graph structure of tracks & artists
• Usage monitoring
Based on Hadoop + ElasticSearch + Spark
20. Big Data as a Streaming Service
Collaborative Filtering: Matching
Collaborative Filtering :
« User X listened to the Rolling Stones. Users listening
to the Rolling Stones usually also listen to the Who,
let's suggest the Who to user X. »
Popularized by the Netflix Prize
21. Big Data as a Streaming Service
Collaborative Filtering
Either compute similarity upon users or items.. or both
23. Big Data as a Streaming Service
Collaborative filtering: Exemplar based
Association rules
• Market basket analysis
• A priori Algorithm
• ..
But:
• Scalability issues
• Hubs and Island issues (Stromae example)
24. Big Data as a Streaming Service
Collaborative filtering: Model based
Matrix Factorization
A
n
m
= U
I
X
k
• U is low-dimensional model on users
• I on items
Recommended items are missing entries of A
25. Big Data as a Streaming Service
Collaborative Filtering: Limitations
• Cold Start problem
• Sparse user-item matrix (1% coverage)
• Only based on social behaviors
• Popularity bias (« The rich gets richer »)
27. Big Data as a Streaming Service
Content-based filtering: Limitations
• Cold Start problem
• Users with atypical tastes
• Lack of novelty
• Subjectivity not taken into account
28. Big Data as a Streaming Service
Content Similarity
Clustering tracks, artists, albums…
Methods:
• Matrix Factorization techniques
• Spectral clustering
• Musical features extraction
• Louvain algorithm
• …
29. Big Data as a Streaming Service
Example: Multiple Spectral Clustering
30. Big Data as a Streaming Service
Cleaning
• Mislabeled data: Different sources tell different things
about songs, artists, albums
• No universally adopted music ontology
• Subjectivity
• Outlier detection: confronting several sources and
models
31. Big Data as a Streaming Service
Cleaning: Example
32. Big Data as a Streaming Service
In real life…
A/B Testing
33. Big Data as a Streaming Service
Algorithms A/B Testing
Algo A
Algo B
Observe results:
• Daily Active Users
• Streams / users
• Satisfaction
• …
Deezer users
34. Big Data as a Streaming Service
Algorithms A/B Testing: Example
Test: Are new users (with no profile data) more likely to be
more satisfied with charts items or with new ones?
User based neighbourhood: find similar users and recommend their taste
Item based neighbourhood: find similar items (association rules item in same playlists, etc.)
User based neighbourhood: find similar users and recommend their taste
Item based neighbourhood: find similar items (association rules item in same playlists, etc.)
User based neighbourhood: find similar users and recommend their taste
Item based neighbourhood: find similar items (association rules item in same playlists, etc.)
User based neighbourhood: find similar users and recommend their taste
Item based neighbourhood: find similar items (association rules item in same playlists, etc.)
Rich gets richer
Collect information to describe items – and work on similarity
Collect information to describe items – and work on similarity
Collect information to describe items – and work on similarity