Michelle Ufford is a Principal Architect at Netflix who leads their Data Engineering and Analytics team. Netflix has over 86 million members who watch over 125 million hours of content daily on over 1000 supported devices. The data team manages a 40 petabyte data warehouse with 4 petabyte daily reads and 300 terabyte daily writes, processing over 700 billion events. The data is used to predict content value, optimize the user experience, analyze news and PR, monitor global service delivery, and power experimentation.
2. Michelle Ufford
Highlights
● Principal Architect at Netflix
Data Engineering & Analytics
● Prev. Engineering Manager at GoDaddy
Data Platform
● Microsoft Data Platform MVP
● 10+ years building web-scale analytics &
data engineering infrastructure
● advises on Big Data topics
Microsoft, Hortonworks, Teradata, etc.
Gratuitous picture of my kids
13. Predicting Value for
Licensed Content.
● value / cost
● if efficient, license
Feature Engineering Predictive Models License Terms Content Efficiency
14. “ last year our original content overall
was some of our most efficient content.
”
15. “ We are building a studio in the cloud
and pioneering new approaches to movie
production, optimizing pitches, production
schedules, subtitling, and digital asset
management for our Original content. ”
33. data access
AWS
S3
Big Data Platform
Amazon
Redshift
data processing
fast storage data viz
METACA
T
data services
events data
operational data
elastic storage Apache Pig
43. Data Engineering & Analytics
MarketingProduct PlaybackContent Finance
105 talented engineers & analysts
data viz engineers
analytics engineers
data engineers
Big Data
Platform
analysts
50. Thank you
for attending!
Michelle Ufford
linkedin.com/in/mufford
@sqlfool
Data @
Netflix
@NetflixData
hadoopsie.com techblog.netflix.com
tinyurl.com/NetflixData
Editor's Notes
Abstract:
Netflix is the quintessential data-driven company. It’s 83 million members stream more than 125 million hours in over 190 countries every day and generate more than 700 billion events in the process. In this session, we’ll share how data is used to make informed decisions across the entire business — from content acquisition to content delivery, and everything in between. We’ll look at how Netflix successfully employs a scalable cloud-based data platform to support a constant deluge of data and a small army of data analysts, engineers, and scientists. We’ll discuss the advanced analytical capabilities that are enabled through modern data technologies. Lastly, we’ll explore some of the architectural & operational principals that enable Netflix to so effectively make use of its data.
Obligatory “why should you listen to me talk?” slide
Numbers as of Q3 2016
During CES 2016 this January, ‘flipped the switch’ making Netflix available in 130+ new countries. Netflix is presently available in over 190 countries worldwide.
What content should we license?
How much should we bid?
How should we value exclusivity?
How should we measure content performance?