SlideShare a Scribd company logo
May 6th, 2014
Real-Time Personalization
with Ad Tech @Spotify
Kinshuk Mishra
kinshuk@spotify.com
@_kinshukmishra
1
About Me
• @Spotify since 2011
• Data and Backend engineering
• With the Monetization team
May 12, 2014
Spotify in numbers
Started in 2006, available in 56 markets
20+ million songs, 20,000 added daily
24+ million active users, 6+ million subscribers
1.5 billion playlists
May 12, 2014
Monetization at Spotify
• $1 billion paid to rights holders since launch
• 1 in every 4 Spotify users is a paying subscriber
• Ad revenue pays for free tier music streaming
• Build platform to power artist and music label promotions
What is our mission?
• Power the free tier with ads customers love.
– Delight users, brands and artists with standout
promotional experiences.
– Deliver the right message for every moment.
5
Delight users, brands and artists with standout
promotional experience
6
• Ad formats need to be
- Native
- Actionable
- Scalable
- Engaging
7
Spotlight
8
Album promotion
9
Sponsored Genre
10
Sponsored Genre
11
!(Right message for every moment)
Things we had to do
• Make ads context aware.
• Create measurable ad formats.
• Build tools to help brands understand their audience.
• Build flexible targeting infrastructure.
• Analyze user behavior to improve ad relevancy.
12
Why is real time interesting?
• True assessment of the context
- current view, last track, etc.
• Immediate feedback
- ad clicked, followed a playlist, etc.
• Increased ad opportunities
- hyperlocal, current mood, etc.
13
source : http://nativemobile.com/tag/real-time-bidding
How to serve right message for every moment ?
• Fix the WTFs for ads in the music context
- Realtime short window activity analysis
• Improve ad relevancy based on user behavior
- (Realtime + Batch) long window activity analysis
14
Some quick wins
• Fix the WTFs for ads in the music context
- Fix the “Not Safe For Kids” ads problem.
- Fix the relevancy of ads in your music session.
• Improve ad relevancy based on user behavior
- Fix ad relevancy based on user’s musical taste.
15
Targeting Architecture
16
Desired overall design
17
Our requirements
• Scalable log collection
• Capability to process logs in batch and
realtime mode and aggregate user activity
• Capability to store user profiles and
enable serve time lookups
• Ability to update and add new features to
existing user profiles
Bigdata @Spotify and choices
• 700 node hadoop cluster
• 400 GB service logs daily
• 4.5 TB user data daily
• 7500 hadoop jobs daily
• 64 TB data generated daily
• YARN Map-Reduce, Giraph,
Storm, Spark, etc. 
!
19
Overall design and tech choices
20
Source : http://tfosuccess.com/day-106-107-the-storm-arrived/
What is Storm?
• Real time stream processing
• Like Hadoop without HDFS
• Like Map/Reduce with many reducer steps
• Fault tolerant and guaranteed message processing
21
Storm @Spotify
• storm-0.8.0
• 22 node cluster
• 15+ topologies
• 200k tuples/second
• ads, recommendation,
analytics, monitoring, etc.
22
source: http://storm.incubator.apache.org/
Use-case
Use real-time session genre information to control 
ad serving in real-time
23
Realtime activity analysis
• Kafka -> Storm -> Memcached -> Targeting Backend
• Soft realtime
• Easy to scale
24
Getting data
25
Getting data across the globe
26
Genre Topology
27
May 12, 2014
Measuring effectiveness
• Build ad quality scores from lower-level metrics
• Positive response, avoidance, etc.
• A/B test hypotheses, evaluate against quality scores 
• Build quality score optimization into real-time system
May 6th, 2014
Want to join the band?
https://www.spotify.com/us/jobs/ or
https://twitter.com/Spotifyjobs
!
Kinshuk Mishra 
kinshuk@spotify.com @_kinshukmishra

More Related Content

Viewers also liked

Real-Time Personalization: Optimize Ad Spend with Personalized Targeting and ...
Real-Time Personalization: Optimize Ad Spend with Personalized Targeting and ...Real-Time Personalization: Optimize Ad Spend with Personalized Targeting and ...
Real-Time Personalization: Optimize Ad Spend with Personalized Targeting and ...
Marketo
 
Digital predictions 2015
Digital predictions 2015Digital predictions 2015
Digital predictions 2015
Marketa
 
Fail Safe, Fail Smart, Succeed
Fail Safe, Fail Smart, SucceedFail Safe, Fail Smart, Succeed
Fail Safe, Fail Smart, Succeed
Kevin Goldsmith
 
It's Getting Personal: The Rise of Hyper-Targeted User Experiences - Colin Eagan
It's Getting Personal: The Rise of Hyper-Targeted User Experiences - Colin EaganIt's Getting Personal: The Rise of Hyper-Targeted User Experiences - Colin Eagan
It's Getting Personal: The Rise of Hyper-Targeted User Experiences - Colin Eagan
UXPA International
 
BUAD 497 Strategic Management- Spotify
BUAD 497 Strategic Management- Spotify BUAD 497 Strategic Management- Spotify
BUAD 497 Strategic Management- Spotify Vincent Tsao
 
The Real World October 2015
The Real World October 2015The Real World October 2015
The Real World October 2015
Posterscope
 
Intro to Spotify Ads
Intro to Spotify AdsIntro to Spotify Ads
Intro to Spotify Ads
Spotify for Brands
 
Spotify's Music Recommendations Lambda Architecture
Spotify's Music Recommendations Lambda ArchitectureSpotify's Music Recommendations Lambda Architecture
Spotify's Music Recommendations Lambda Architecture
Esh Vckay
 
Business model of Spotify
Business model of SpotifyBusiness model of Spotify
Business model of SpotifyAnirban Ghosh
 
How AOL Advertising Uses NoSQL to Make Millions of Smart Targeting Decisions ...
How AOL Advertising Uses NoSQL to Make Millions of Smart Targeting Decisions ...How AOL Advertising Uses NoSQL to Make Millions of Smart Targeting Decisions ...
How AOL Advertising Uses NoSQL to Make Millions of Smart Targeting Decisions ...
DATAVERSITY
 
Hadoop Adventures At Spotify (Strata Conference + Hadoop World 2013)
Hadoop Adventures At Spotify (Strata Conference + Hadoop World 2013)Hadoop Adventures At Spotify (Strata Conference + Hadoop World 2013)
Hadoop Adventures At Spotify (Strata Conference + Hadoop World 2013)Adam Kawa
 
The Spotify Tribe
The Spotify TribeThe Spotify Tribe
The Spotify Tribe
Kevin Goldsmith
 
How Apache Drives Music Recommendations At Spotify
How Apache Drives Music Recommendations At SpotifyHow Apache Drives Music Recommendations At Spotify
How Apache Drives Music Recommendations At Spotify
Josh Baer
 
Spotify: behind the scenes
Spotify: behind the scenesSpotify: behind the scenes
Spotify: behind the scenes
Ricardo Vice Santos
 
Microservices at Spotify
Microservices at SpotifyMicroservices at Spotify
Microservices at Spotify
Kevin Goldsmith
 
A Spotify Presentation - Case studies
A Spotify Presentation - Case studiesA Spotify Presentation - Case studies
A Spotify Presentation - Case studies
Emily Wilkinson
 

Viewers also liked (18)

Real-Time Personalization: Optimize Ad Spend with Personalized Targeting and ...
Real-Time Personalization: Optimize Ad Spend with Personalized Targeting and ...Real-Time Personalization: Optimize Ad Spend with Personalized Targeting and ...
Real-Time Personalization: Optimize Ad Spend with Personalized Targeting and ...
 
Digital predictions 2015
Digital predictions 2015Digital predictions 2015
Digital predictions 2015
 
Fail Safe, Fail Smart, Succeed
Fail Safe, Fail Smart, SucceedFail Safe, Fail Smart, Succeed
Fail Safe, Fail Smart, Succeed
 
It's Getting Personal: The Rise of Hyper-Targeted User Experiences - Colin Eagan
It's Getting Personal: The Rise of Hyper-Targeted User Experiences - Colin EaganIt's Getting Personal: The Rise of Hyper-Targeted User Experiences - Colin Eagan
It's Getting Personal: The Rise of Hyper-Targeted User Experiences - Colin Eagan
 
BUAD 497 Strategic Management- Spotify
BUAD 497 Strategic Management- Spotify BUAD 497 Strategic Management- Spotify
BUAD 497 Strategic Management- Spotify
 
The Real World October 2015
The Real World October 2015The Real World October 2015
The Real World October 2015
 
Intro to Spotify Ads
Intro to Spotify AdsIntro to Spotify Ads
Intro to Spotify Ads
 
Spotify's Music Recommendations Lambda Architecture
Spotify's Music Recommendations Lambda ArchitectureSpotify's Music Recommendations Lambda Architecture
Spotify's Music Recommendations Lambda Architecture
 
Business model of Spotify
Business model of SpotifyBusiness model of Spotify
Business model of Spotify
 
How AOL Advertising Uses NoSQL to Make Millions of Smart Targeting Decisions ...
How AOL Advertising Uses NoSQL to Make Millions of Smart Targeting Decisions ...How AOL Advertising Uses NoSQL to Make Millions of Smart Targeting Decisions ...
How AOL Advertising Uses NoSQL to Make Millions of Smart Targeting Decisions ...
 
Hadoop Adventures At Spotify (Strata Conference + Hadoop World 2013)
Hadoop Adventures At Spotify (Strata Conference + Hadoop World 2013)Hadoop Adventures At Spotify (Strata Conference + Hadoop World 2013)
Hadoop Adventures At Spotify (Strata Conference + Hadoop World 2013)
 
The Spotify Tribe
The Spotify TribeThe Spotify Tribe
The Spotify Tribe
 
How Apache Drives Music Recommendations At Spotify
How Apache Drives Music Recommendations At SpotifyHow Apache Drives Music Recommendations At Spotify
How Apache Drives Music Recommendations At Spotify
 
Spotify
SpotifySpotify
Spotify
 
Spotify: behind the scenes
Spotify: behind the scenesSpotify: behind the scenes
Spotify: behind the scenes
 
Microservices at Spotify
Microservices at SpotifyMicroservices at Spotify
Microservices at Spotify
 
Definition of marketing strategy
Definition of marketing   strategyDefinition of marketing   strategy
Definition of marketing strategy
 
A Spotify Presentation - Case studies
A Spotify Presentation - Case studiesA Spotify Presentation - Case studies
A Spotify Presentation - Case studies
 

Similar to Spotify's Ad Targeting Infrastructure: Achieving Real-time Personalization for 24 million+ Users - Kinshuk Mishra

Boom Boxes - Android App Marketing Plan
Boom Boxes - Android App Marketing PlanBoom Boxes - Android App Marketing Plan
Boom Boxes - Android App Marketing Plan
Varshit Kumar
 
Vinylmint's Growth Strategy Story
Vinylmint's Growth Strategy StoryVinylmint's Growth Strategy Story
Vinylmint's Growth Strategy Story
Byron Morgan
 
Spotify: Dominating Music In Real Time
Spotify: Dominating Music In Real TimeSpotify: Dominating Music In Real Time
Spotify: Dominating Music In Real Time
LHBS
 
Data at Spotify
Data at SpotifyData at Spotify
Data at Spotify
Danielle Jabin
 
Making Mobile Work For You, SIPA 2012 Washington DC
Making Mobile Work For You, SIPA 2012 Washington DCMaking Mobile Work For You, SIPA 2012 Washington DC
Making Mobile Work For You, SIPA 2012 Washington DC
Mary Kay Jerige Lofurno
 
Dailyhunt Media Deck - June 19'
Dailyhunt Media Deck - June 19'Dailyhunt Media Deck - June 19'
Dailyhunt Media Deck - June 19'
TaranpalSinghGandhi
 
Opportunity with audio
Opportunity with audioOpportunity with audio
Opportunity with audio
Junda Ong
 
First Beat Media - Tehnologije na velikim projektima #tnt3
First Beat Media - Tehnologije na velikim projektima #tnt3First Beat Media - Tehnologije na velikim projektima #tnt3
First Beat Media - Tehnologije na velikim projektima #tnt3SICEF
 
Mobile Roadie - Profile Accelerator
Mobile Roadie - Profile AcceleratorMobile Roadie - Profile Accelerator
Mobile Roadie - Profile Accelerator
Mobile Roadie
 
Dmitry Bugaychenko - Smart.Data@ОК.ru. How to make the world a bit better usi...
Dmitry Bugaychenko - Smart.Data@ОК.ru. How to make the world a bit better usi...Dmitry Bugaychenko - Smart.Data@ОК.ru. How to make the world a bit better usi...
Dmitry Bugaychenko - Smart.Data@ОК.ru. How to make the world a bit better usi...
AIST
 
Opera Mediaworks — Mobile Marketing & Advertising Services — 2014 presentation
Opera Mediaworks — Mobile Marketing & Advertising Services — 2014 presentationOpera Mediaworks — Mobile Marketing & Advertising Services — 2014 presentation
Opera Mediaworks — Mobile Marketing & Advertising Services — 2014 presentation
Opera Mobile Store
 
Open Source as a Viable Business Model
Open Source as a Viable Business ModelOpen Source as a Viable Business Model
Open Source as a Viable Business ModelTien-Soon Law
 
Product Owner presentation for Spotify
Product Owner presentation for SpotifyProduct Owner presentation for Spotify
Product Owner presentation for Spotify
pdicorpo
 
PMI Project 3- Spotify
PMI Project 3- SpotifyPMI Project 3- Spotify
PMI Project 3- SpotifyTed Shaw
 
POP System 6 - An Introduction
POP System 6 - An IntroductionPOP System 6 - An Introduction
POP System 6 - An Introduction
Global Creative Group, Inc
 
Yeovil Stand out on Social Media 6 October 2015
Yeovil Stand out on Social Media 6 October 2015Yeovil Stand out on Social Media 6 October 2015
Yeovil Stand out on Social Media 6 October 2015
Get up to Speed
 
The Performance Content Framework
The Performance Content FrameworkThe Performance Content Framework
The Performance Content Framework
Performics EMEA
 
Print for Profit - Digital for Growth
Print for Profit - Digital for GrowthPrint for Profit - Digital for Growth
Print for Profit - Digital for Growth
Rhapsody
 
ATAK Interactive Marketing Success Stories
ATAK Interactive Marketing Success StoriesATAK Interactive Marketing Success Stories
ATAK Interactive Marketing Success Stories
ATAK Interactive, Inc
 

Similar to Spotify's Ad Targeting Infrastructure: Achieving Real-time Personalization for 24 million+ Users - Kinshuk Mishra (20)

Boom Boxes - Android App Marketing Plan
Boom Boxes - Android App Marketing PlanBoom Boxes - Android App Marketing Plan
Boom Boxes - Android App Marketing Plan
 
Vinylmint's Growth Strategy Story
Vinylmint's Growth Strategy StoryVinylmint's Growth Strategy Story
Vinylmint's Growth Strategy Story
 
Spotify: Dominating Music In Real Time
Spotify: Dominating Music In Real TimeSpotify: Dominating Music In Real Time
Spotify: Dominating Music In Real Time
 
Data at Spotify
Data at SpotifyData at Spotify
Data at Spotify
 
Making Mobile Work For You, SIPA 2012 Washington DC
Making Mobile Work For You, SIPA 2012 Washington DCMaking Mobile Work For You, SIPA 2012 Washington DC
Making Mobile Work For You, SIPA 2012 Washington DC
 
Dailyhunt Media Deck - June 19'
Dailyhunt Media Deck - June 19'Dailyhunt Media Deck - June 19'
Dailyhunt Media Deck - June 19'
 
Opportunity with audio
Opportunity with audioOpportunity with audio
Opportunity with audio
 
First Beat Media - Tehnologije na velikim projektima #tnt3
First Beat Media - Tehnologije na velikim projektima #tnt3First Beat Media - Tehnologije na velikim projektima #tnt3
First Beat Media - Tehnologije na velikim projektima #tnt3
 
Mobile Roadie - Profile Accelerator
Mobile Roadie - Profile AcceleratorMobile Roadie - Profile Accelerator
Mobile Roadie - Profile Accelerator
 
Dmitry Bugaychenko - Smart.Data@ОК.ru. How to make the world a bit better usi...
Dmitry Bugaychenko - Smart.Data@ОК.ru. How to make the world a bit better usi...Dmitry Bugaychenko - Smart.Data@ОК.ru. How to make the world a bit better usi...
Dmitry Bugaychenko - Smart.Data@ОК.ru. How to make the world a bit better usi...
 
Opera Mediaworks — Mobile Marketing & Advertising Services — 2014 presentation
Opera Mediaworks — Mobile Marketing & Advertising Services — 2014 presentationOpera Mediaworks — Mobile Marketing & Advertising Services — 2014 presentation
Opera Mediaworks — Mobile Marketing & Advertising Services — 2014 presentation
 
Music 4.5 Iast.fm
Music 4.5 Iast.fmMusic 4.5 Iast.fm
Music 4.5 Iast.fm
 
Open Source as a Viable Business Model
Open Source as a Viable Business ModelOpen Source as a Viable Business Model
Open Source as a Viable Business Model
 
Product Owner presentation for Spotify
Product Owner presentation for SpotifyProduct Owner presentation for Spotify
Product Owner presentation for Spotify
 
PMI Project 3- Spotify
PMI Project 3- SpotifyPMI Project 3- Spotify
PMI Project 3- Spotify
 
POP System 6 - An Introduction
POP System 6 - An IntroductionPOP System 6 - An Introduction
POP System 6 - An Introduction
 
Yeovil Stand out on Social Media 6 October 2015
Yeovil Stand out on Social Media 6 October 2015Yeovil Stand out on Social Media 6 October 2015
Yeovil Stand out on Social Media 6 October 2015
 
The Performance Content Framework
The Performance Content FrameworkThe Performance Content Framework
The Performance Content Framework
 
Print for Profit - Digital for Growth
Print for Profit - Digital for GrowthPrint for Profit - Digital for Growth
Print for Profit - Digital for Growth
 
ATAK Interactive Marketing Success Stories
ATAK Interactive Marketing Success StoriesATAK Interactive Marketing Success Stories
ATAK Interactive Marketing Success Stories
 

More from Hakka Labs

Always Valid Inference (Ramesh Johari, Stanford)
Always Valid Inference (Ramesh Johari, Stanford)Always Valid Inference (Ramesh Johari, Stanford)
Always Valid Inference (Ramesh Johari, Stanford)
Hakka Labs
 
DataEngConf SF16 - High cardinality time series search
DataEngConf SF16 - High cardinality time series searchDataEngConf SF16 - High cardinality time series search
DataEngConf SF16 - High cardinality time series search
Hakka Labs
 
DataEngConf SF16 - Data Asserts: Defensive Data Science
DataEngConf SF16 - Data Asserts: Defensive Data ScienceDataEngConf SF16 - Data Asserts: Defensive Data Science
DataEngConf SF16 - Data Asserts: Defensive Data Science
Hakka Labs
 
DatEngConf SF16 - Apache Kudu: Fast Analytics on Fast Data
DatEngConf SF16 - Apache Kudu: Fast Analytics on Fast DataDatEngConf SF16 - Apache Kudu: Fast Analytics on Fast Data
DatEngConf SF16 - Apache Kudu: Fast Analytics on Fast Data
Hakka Labs
 
DataEngConf SF16 - Recommendations at Instacart
DataEngConf SF16 - Recommendations at InstacartDataEngConf SF16 - Recommendations at Instacart
DataEngConf SF16 - Recommendations at Instacart
Hakka Labs
 
DataEngConf SF16 - Running simulations at scale
DataEngConf SF16 - Running simulations at scaleDataEngConf SF16 - Running simulations at scale
DataEngConf SF16 - Running simulations at scale
Hakka Labs
 
DataEngConf SF16 - Deriving Meaning from Wearable Sensor Data
DataEngConf SF16 - Deriving Meaning from Wearable Sensor DataDataEngConf SF16 - Deriving Meaning from Wearable Sensor Data
DataEngConf SF16 - Deriving Meaning from Wearable Sensor Data
Hakka Labs
 
DataEngConf SF16 - Collecting and Moving Data at Scale
DataEngConf SF16 - Collecting and Moving Data at Scale DataEngConf SF16 - Collecting and Moving Data at Scale
DataEngConf SF16 - Collecting and Moving Data at Scale
Hakka Labs
 
DataEngConf SF16 - BYOMQ: Why We [re]Built IronMQ
DataEngConf SF16 - BYOMQ: Why We [re]Built IronMQDataEngConf SF16 - BYOMQ: Why We [re]Built IronMQ
DataEngConf SF16 - BYOMQ: Why We [re]Built IronMQ
Hakka Labs
 
DataEngConf SF16 - Unifying Real Time and Historical Analytics with the Lambd...
DataEngConf SF16 - Unifying Real Time and Historical Analytics with the Lambd...DataEngConf SF16 - Unifying Real Time and Historical Analytics with the Lambd...
DataEngConf SF16 - Unifying Real Time and Historical Analytics with the Lambd...
Hakka Labs
 
DataEngConf SF16 - Three lessons learned from building a production machine l...
DataEngConf SF16 - Three lessons learned from building a production machine l...DataEngConf SF16 - Three lessons learned from building a production machine l...
DataEngConf SF16 - Three lessons learned from building a production machine l...
Hakka Labs
 
DataEngConf SF16 - Scalable and Reliable Logging at Pinterest
DataEngConf SF16 - Scalable and Reliable Logging at PinterestDataEngConf SF16 - Scalable and Reliable Logging at Pinterest
DataEngConf SF16 - Scalable and Reliable Logging at Pinterest
Hakka Labs
 
DataEngConf SF16 - Bridging the gap between data science and data engineering
DataEngConf SF16 - Bridging the gap between data science and data engineeringDataEngConf SF16 - Bridging the gap between data science and data engineering
DataEngConf SF16 - Bridging the gap between data science and data engineering
Hakka Labs
 
DataEngConf SF16 - Multi-temporal Data Structures
DataEngConf SF16 - Multi-temporal Data StructuresDataEngConf SF16 - Multi-temporal Data Structures
DataEngConf SF16 - Multi-temporal Data Structures
Hakka Labs
 
DataEngConf SF16 - Entity Resolution in Data Pipelines Using Spark
DataEngConf SF16 - Entity Resolution in Data Pipelines Using SparkDataEngConf SF16 - Entity Resolution in Data Pipelines Using Spark
DataEngConf SF16 - Entity Resolution in Data Pipelines Using Spark
Hakka Labs
 
DataEngConf SF16 - Beginning with Ourselves
DataEngConf SF16 - Beginning with OurselvesDataEngConf SF16 - Beginning with Ourselves
DataEngConf SF16 - Beginning with Ourselves
Hakka Labs
 
DataEngConf SF16 - Routing Billions of Analytics Events with High Deliverability
DataEngConf SF16 - Routing Billions of Analytics Events with High DeliverabilityDataEngConf SF16 - Routing Billions of Analytics Events with High Deliverability
DataEngConf SF16 - Routing Billions of Analytics Events with High Deliverability
Hakka Labs
 
DataEngConf SF16 - Tales from the other side - What a hiring manager wish you...
DataEngConf SF16 - Tales from the other side - What a hiring manager wish you...DataEngConf SF16 - Tales from the other side - What a hiring manager wish you...
DataEngConf SF16 - Tales from the other side - What a hiring manager wish you...
Hakka Labs
 
DataEngConf SF16 - Methods for Content Relevance at LinkedIn
DataEngConf SF16 - Methods for Content Relevance at LinkedInDataEngConf SF16 - Methods for Content Relevance at LinkedIn
DataEngConf SF16 - Methods for Content Relevance at LinkedIn
Hakka Labs
 
DataEngConf SF16 - Spark SQL Workshop
DataEngConf SF16 - Spark SQL WorkshopDataEngConf SF16 - Spark SQL Workshop
DataEngConf SF16 - Spark SQL Workshop
Hakka Labs
 

More from Hakka Labs (20)

Always Valid Inference (Ramesh Johari, Stanford)
Always Valid Inference (Ramesh Johari, Stanford)Always Valid Inference (Ramesh Johari, Stanford)
Always Valid Inference (Ramesh Johari, Stanford)
 
DataEngConf SF16 - High cardinality time series search
DataEngConf SF16 - High cardinality time series searchDataEngConf SF16 - High cardinality time series search
DataEngConf SF16 - High cardinality time series search
 
DataEngConf SF16 - Data Asserts: Defensive Data Science
DataEngConf SF16 - Data Asserts: Defensive Data ScienceDataEngConf SF16 - Data Asserts: Defensive Data Science
DataEngConf SF16 - Data Asserts: Defensive Data Science
 
DatEngConf SF16 - Apache Kudu: Fast Analytics on Fast Data
DatEngConf SF16 - Apache Kudu: Fast Analytics on Fast DataDatEngConf SF16 - Apache Kudu: Fast Analytics on Fast Data
DatEngConf SF16 - Apache Kudu: Fast Analytics on Fast Data
 
DataEngConf SF16 - Recommendations at Instacart
DataEngConf SF16 - Recommendations at InstacartDataEngConf SF16 - Recommendations at Instacart
DataEngConf SF16 - Recommendations at Instacart
 
DataEngConf SF16 - Running simulations at scale
DataEngConf SF16 - Running simulations at scaleDataEngConf SF16 - Running simulations at scale
DataEngConf SF16 - Running simulations at scale
 
DataEngConf SF16 - Deriving Meaning from Wearable Sensor Data
DataEngConf SF16 - Deriving Meaning from Wearable Sensor DataDataEngConf SF16 - Deriving Meaning from Wearable Sensor Data
DataEngConf SF16 - Deriving Meaning from Wearable Sensor Data
 
DataEngConf SF16 - Collecting and Moving Data at Scale
DataEngConf SF16 - Collecting and Moving Data at Scale DataEngConf SF16 - Collecting and Moving Data at Scale
DataEngConf SF16 - Collecting and Moving Data at Scale
 
DataEngConf SF16 - BYOMQ: Why We [re]Built IronMQ
DataEngConf SF16 - BYOMQ: Why We [re]Built IronMQDataEngConf SF16 - BYOMQ: Why We [re]Built IronMQ
DataEngConf SF16 - BYOMQ: Why We [re]Built IronMQ
 
DataEngConf SF16 - Unifying Real Time and Historical Analytics with the Lambd...
DataEngConf SF16 - Unifying Real Time and Historical Analytics with the Lambd...DataEngConf SF16 - Unifying Real Time and Historical Analytics with the Lambd...
DataEngConf SF16 - Unifying Real Time and Historical Analytics with the Lambd...
 
DataEngConf SF16 - Three lessons learned from building a production machine l...
DataEngConf SF16 - Three lessons learned from building a production machine l...DataEngConf SF16 - Three lessons learned from building a production machine l...
DataEngConf SF16 - Three lessons learned from building a production machine l...
 
DataEngConf SF16 - Scalable and Reliable Logging at Pinterest
DataEngConf SF16 - Scalable and Reliable Logging at PinterestDataEngConf SF16 - Scalable and Reliable Logging at Pinterest
DataEngConf SF16 - Scalable and Reliable Logging at Pinterest
 
DataEngConf SF16 - Bridging the gap between data science and data engineering
DataEngConf SF16 - Bridging the gap between data science and data engineeringDataEngConf SF16 - Bridging the gap between data science and data engineering
DataEngConf SF16 - Bridging the gap between data science and data engineering
 
DataEngConf SF16 - Multi-temporal Data Structures
DataEngConf SF16 - Multi-temporal Data StructuresDataEngConf SF16 - Multi-temporal Data Structures
DataEngConf SF16 - Multi-temporal Data Structures
 
DataEngConf SF16 - Entity Resolution in Data Pipelines Using Spark
DataEngConf SF16 - Entity Resolution in Data Pipelines Using SparkDataEngConf SF16 - Entity Resolution in Data Pipelines Using Spark
DataEngConf SF16 - Entity Resolution in Data Pipelines Using Spark
 
DataEngConf SF16 - Beginning with Ourselves
DataEngConf SF16 - Beginning with OurselvesDataEngConf SF16 - Beginning with Ourselves
DataEngConf SF16 - Beginning with Ourselves
 
DataEngConf SF16 - Routing Billions of Analytics Events with High Deliverability
DataEngConf SF16 - Routing Billions of Analytics Events with High DeliverabilityDataEngConf SF16 - Routing Billions of Analytics Events with High Deliverability
DataEngConf SF16 - Routing Billions of Analytics Events with High Deliverability
 
DataEngConf SF16 - Tales from the other side - What a hiring manager wish you...
DataEngConf SF16 - Tales from the other side - What a hiring manager wish you...DataEngConf SF16 - Tales from the other side - What a hiring manager wish you...
DataEngConf SF16 - Tales from the other side - What a hiring manager wish you...
 
DataEngConf SF16 - Methods for Content Relevance at LinkedIn
DataEngConf SF16 - Methods for Content Relevance at LinkedInDataEngConf SF16 - Methods for Content Relevance at LinkedIn
DataEngConf SF16 - Methods for Content Relevance at LinkedIn
 
DataEngConf SF16 - Spark SQL Workshop
DataEngConf SF16 - Spark SQL WorkshopDataEngConf SF16 - Spark SQL Workshop
DataEngConf SF16 - Spark SQL Workshop
 

Recently uploaded

Launch Your Streaming Platforms in Minutes
Launch Your Streaming Platforms in MinutesLaunch Your Streaming Platforms in Minutes
Launch Your Streaming Platforms in Minutes
Roshan Dwivedi
 
Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...
Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...
Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...
informapgpstrackings
 
Graphic Design Crash Course for beginners
Graphic Design Crash Course for beginnersGraphic Design Crash Course for beginners
Graphic Design Crash Course for beginners
e20449
 
Vitthal Shirke Microservices Resume Montevideo
Vitthal Shirke Microservices Resume MontevideoVitthal Shirke Microservices Resume Montevideo
Vitthal Shirke Microservices Resume Montevideo
Vitthal Shirke
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
May Marketo Masterclass, London MUG May 22 2024.pdf
May Marketo Masterclass, London MUG May 22 2024.pdfMay Marketo Masterclass, London MUG May 22 2024.pdf
May Marketo Masterclass, London MUG May 22 2024.pdf
Adele Miller
 
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Globus
 
Top Features to Include in Your Winzo Clone App for Business Growth (4).pptx
Top Features to Include in Your Winzo Clone App for Business Growth (4).pptxTop Features to Include in Your Winzo Clone App for Business Growth (4).pptx
Top Features to Include in Your Winzo Clone App for Business Growth (4).pptx
rickgrimesss22
 
Text-Summarization-of-Breaking-News-Using-Fine-tuning-BART-Model.pptx
Text-Summarization-of-Breaking-News-Using-Fine-tuning-BART-Model.pptxText-Summarization-of-Breaking-News-Using-Fine-tuning-BART-Model.pptx
Text-Summarization-of-Breaking-News-Using-Fine-tuning-BART-Model.pptx
ShamsuddeenMuhammadA
 
OpenFOAM solver for Helmholtz equation, helmholtzFoam / helmholtzBubbleFoam
OpenFOAM solver for Helmholtz equation, helmholtzFoam / helmholtzBubbleFoamOpenFOAM solver for Helmholtz equation, helmholtzFoam / helmholtzBubbleFoam
OpenFOAM solver for Helmholtz equation, helmholtzFoam / helmholtzBubbleFoam
takuyayamamoto1800
 
How to Position Your Globus Data Portal for Success Ten Good Practices
How to Position Your Globus Data Portal for Success Ten Good PracticesHow to Position Your Globus Data Portal for Success Ten Good Practices
How to Position Your Globus Data Portal for Success Ten Good Practices
Globus
 
GraphSummit Paris - The art of the possible with Graph Technology
GraphSummit Paris - The art of the possible with Graph TechnologyGraphSummit Paris - The art of the possible with Graph Technology
GraphSummit Paris - The art of the possible with Graph Technology
Neo4j
 
Developing Distributed High-performance Computing Capabilities of an Open Sci...
Developing Distributed High-performance Computing Capabilities of an Open Sci...Developing Distributed High-performance Computing Capabilities of an Open Sci...
Developing Distributed High-performance Computing Capabilities of an Open Sci...
Globus
 
Vitthal Shirke Java Microservices Resume.pdf
Vitthal Shirke Java Microservices Resume.pdfVitthal Shirke Java Microservices Resume.pdf
Vitthal Shirke Java Microservices Resume.pdf
Vitthal Shirke
 
Globus Connect Server Deep Dive - GlobusWorld 2024
Globus Connect Server Deep Dive - GlobusWorld 2024Globus Connect Server Deep Dive - GlobusWorld 2024
Globus Connect Server Deep Dive - GlobusWorld 2024
Globus
 
2024 RoOUG Security model for the cloud.pptx
2024 RoOUG Security model for the cloud.pptx2024 RoOUG Security model for the cloud.pptx
2024 RoOUG Security model for the cloud.pptx
Georgi Kodinov
 
Understanding Globus Data Transfers with NetSage
Understanding Globus Data Transfers with NetSageUnderstanding Globus Data Transfers with NetSage
Understanding Globus Data Transfers with NetSage
Globus
 
Utilocate provides Smarter, Better, Faster, Safer Locate Ticket Management
Utilocate provides Smarter, Better, Faster, Safer Locate Ticket ManagementUtilocate provides Smarter, Better, Faster, Safer Locate Ticket Management
Utilocate provides Smarter, Better, Faster, Safer Locate Ticket Management
Utilocate
 
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...
Globus
 
Providing Globus Services to Users of JASMIN for Environmental Data Analysis
Providing Globus Services to Users of JASMIN for Environmental Data AnalysisProviding Globus Services to Users of JASMIN for Environmental Data Analysis
Providing Globus Services to Users of JASMIN for Environmental Data Analysis
Globus
 

Recently uploaded (20)

Launch Your Streaming Platforms in Minutes
Launch Your Streaming Platforms in MinutesLaunch Your Streaming Platforms in Minutes
Launch Your Streaming Platforms in Minutes
 
Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...
Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...
Field Employee Tracking System| MiTrack App| Best Employee Tracking Solution|...
 
Graphic Design Crash Course for beginners
Graphic Design Crash Course for beginnersGraphic Design Crash Course for beginners
Graphic Design Crash Course for beginners
 
Vitthal Shirke Microservices Resume Montevideo
Vitthal Shirke Microservices Resume MontevideoVitthal Shirke Microservices Resume Montevideo
Vitthal Shirke Microservices Resume Montevideo
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
May Marketo Masterclass, London MUG May 22 2024.pdf
May Marketo Masterclass, London MUG May 22 2024.pdfMay Marketo Masterclass, London MUG May 22 2024.pdf
May Marketo Masterclass, London MUG May 22 2024.pdf
 
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
 
Top Features to Include in Your Winzo Clone App for Business Growth (4).pptx
Top Features to Include in Your Winzo Clone App for Business Growth (4).pptxTop Features to Include in Your Winzo Clone App for Business Growth (4).pptx
Top Features to Include in Your Winzo Clone App for Business Growth (4).pptx
 
Text-Summarization-of-Breaking-News-Using-Fine-tuning-BART-Model.pptx
Text-Summarization-of-Breaking-News-Using-Fine-tuning-BART-Model.pptxText-Summarization-of-Breaking-News-Using-Fine-tuning-BART-Model.pptx
Text-Summarization-of-Breaking-News-Using-Fine-tuning-BART-Model.pptx
 
OpenFOAM solver for Helmholtz equation, helmholtzFoam / helmholtzBubbleFoam
OpenFOAM solver for Helmholtz equation, helmholtzFoam / helmholtzBubbleFoamOpenFOAM solver for Helmholtz equation, helmholtzFoam / helmholtzBubbleFoam
OpenFOAM solver for Helmholtz equation, helmholtzFoam / helmholtzBubbleFoam
 
How to Position Your Globus Data Portal for Success Ten Good Practices
How to Position Your Globus Data Portal for Success Ten Good PracticesHow to Position Your Globus Data Portal for Success Ten Good Practices
How to Position Your Globus Data Portal for Success Ten Good Practices
 
GraphSummit Paris - The art of the possible with Graph Technology
GraphSummit Paris - The art of the possible with Graph TechnologyGraphSummit Paris - The art of the possible with Graph Technology
GraphSummit Paris - The art of the possible with Graph Technology
 
Developing Distributed High-performance Computing Capabilities of an Open Sci...
Developing Distributed High-performance Computing Capabilities of an Open Sci...Developing Distributed High-performance Computing Capabilities of an Open Sci...
Developing Distributed High-performance Computing Capabilities of an Open Sci...
 
Vitthal Shirke Java Microservices Resume.pdf
Vitthal Shirke Java Microservices Resume.pdfVitthal Shirke Java Microservices Resume.pdf
Vitthal Shirke Java Microservices Resume.pdf
 
Globus Connect Server Deep Dive - GlobusWorld 2024
Globus Connect Server Deep Dive - GlobusWorld 2024Globus Connect Server Deep Dive - GlobusWorld 2024
Globus Connect Server Deep Dive - GlobusWorld 2024
 
2024 RoOUG Security model for the cloud.pptx
2024 RoOUG Security model for the cloud.pptx2024 RoOUG Security model for the cloud.pptx
2024 RoOUG Security model for the cloud.pptx
 
Understanding Globus Data Transfers with NetSage
Understanding Globus Data Transfers with NetSageUnderstanding Globus Data Transfers with NetSage
Understanding Globus Data Transfers with NetSage
 
Utilocate provides Smarter, Better, Faster, Safer Locate Ticket Management
Utilocate provides Smarter, Better, Faster, Safer Locate Ticket ManagementUtilocate provides Smarter, Better, Faster, Safer Locate Ticket Management
Utilocate provides Smarter, Better, Faster, Safer Locate Ticket Management
 
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...
Climate Science Flows: Enabling Petabyte-Scale Climate Analysis with the Eart...
 
Providing Globus Services to Users of JASMIN for Environmental Data Analysis
Providing Globus Services to Users of JASMIN for Environmental Data AnalysisProviding Globus Services to Users of JASMIN for Environmental Data Analysis
Providing Globus Services to Users of JASMIN for Environmental Data Analysis
 

Spotify's Ad Targeting Infrastructure: Achieving Real-time Personalization for 24 million+ Users - Kinshuk Mishra

  • 1. May 6th, 2014 Real-Time Personalization with Ad Tech @Spotify Kinshuk Mishra kinshuk@spotify.com @_kinshukmishra 1
  • 2. About Me • @Spotify since 2011 • Data and Backend engineering • With the Monetization team
  • 3. May 12, 2014 Spotify in numbers Started in 2006, available in 56 markets 20+ million songs, 20,000 added daily 24+ million active users, 6+ million subscribers 1.5 billion playlists
  • 4. May 12, 2014 Monetization at Spotify • $1 billion paid to rights holders since launch • 1 in every 4 Spotify users is a paying subscriber • Ad revenue pays for free tier music streaming • Build platform to power artist and music label promotions
  • 5. What is our mission? • Power the free tier with ads customers love. – Delight users, brands and artists with standout promotional experiences. – Deliver the right message for every moment. 5
  • 6. Delight users, brands and artists with standout promotional experience 6 • Ad formats need to be - Native - Actionable - Scalable - Engaging
  • 11. 11 !(Right message for every moment)
  • 12. Things we had to do • Make ads context aware. • Create measurable ad formats. • Build tools to help brands understand their audience. • Build flexible targeting infrastructure. • Analyze user behavior to improve ad relevancy. 12
  • 13. Why is real time interesting? • True assessment of the context - current view, last track, etc. • Immediate feedback - ad clicked, followed a playlist, etc. • Increased ad opportunities - hyperlocal, current mood, etc. 13 source : http://nativemobile.com/tag/real-time-bidding
  • 14. How to serve right message for every moment ? • Fix the WTFs for ads in the music context - Realtime short window activity analysis • Improve ad relevancy based on user behavior - (Realtime + Batch) long window activity analysis 14
  • 15. Some quick wins • Fix the WTFs for ads in the music context - Fix the “Not Safe For Kids” ads problem. - Fix the relevancy of ads in your music session. • Improve ad relevancy based on user behavior - Fix ad relevancy based on user’s musical taste. 15
  • 18. Our requirements • Scalable log collection • Capability to process logs in batch and realtime mode and aggregate user activity • Capability to store user profiles and enable serve time lookups • Ability to update and add new features to existing user profiles
  • 19. Bigdata @Spotify and choices • 700 node hadoop cluster • 400 GB service logs daily • 4.5 TB user data daily • 7500 hadoop jobs daily • 64 TB data generated daily • YARN Map-Reduce, Giraph, Storm, Spark, etc. ! 19
  • 20. Overall design and tech choices 20
  • 21. Source : http://tfosuccess.com/day-106-107-the-storm-arrived/ What is Storm? • Real time stream processing • Like Hadoop without HDFS • Like Map/Reduce with many reducer steps • Fault tolerant and guaranteed message processing 21
  • 22. Storm @Spotify • storm-0.8.0 • 22 node cluster • 15+ topologies • 200k tuples/second • ads, recommendation, analytics, monitoring, etc. 22 source: http://storm.incubator.apache.org/
  • 23. Use-case Use real-time session genre information to control ad serving in real-time 23
  • 24. Realtime activity analysis • Kafka -> Storm -> Memcached -> Targeting Backend • Soft realtime • Easy to scale 24
  • 26. Getting data across the globe 26
  • 28. May 12, 2014 Measuring effectiveness • Build ad quality scores from lower-level metrics • Positive response, avoidance, etc. • A/B test hypotheses, evaluate against quality scores • Build quality score optimization into real-time system
  • 29. May 6th, 2014 Want to join the band? https://www.spotify.com/us/jobs/ or https://twitter.com/Spotifyjobs ! Kinshuk Mishra kinshuk@spotify.com @_kinshukmishra