SAIS2018 - Fact Store At Netflix Scale


Kedar Sadekar, Netflix
Nitin Sharma, Netflix
Fact Store - Netflix Recommendations
#DevSAIS11

Recommendations at Netflix
● Personalized homepage for each member
○ Goal: Quickly help members find content they’d like to watch
○ Risk: Members may lose interest and abandon the service
○ Challenge: Recommendations at scale

Experimentation Cycle @ Netflix
Design a new experiment to test out different ideas
[Diagram: cycle spanning the Offline Experiment and the Online System. Stages shown: Design Experiment, Collect Label Data, Offline Feature Generation, Model Training, Model Validation, Metrics, Model Testing, Online A/B Testing]

ML Feature Engineering - Architectural View
[Diagram: two regions, Online System and Offline Experiment. Boxes shown: Microservices, Online Feature Generation, Online Scoring, Offline Feature Generation, Shared Feature Encoders, Model Training, Deploy Models. Data labels: Facts, Features]

What is a Fact?
● Fact
○ Input data for feature encoders, used to construct a feature
○ Examples: a member's viewing history, a member's My List
● Historical version of a fact
○ Rewindable: the state of the world at that time
● Temporal
■ Facts are temporal, i.e. they change with time
■ Each online scoring service uses the latest value of a fact
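
The idea of a rewindable fact can be illustrated with a small sketch. Everything here (the `FactHistory` class, the `as_of` method, the toy encoder and the integer timestamps) is hypothetical and not taken from the talk; it only shows how keeping timestamped versions lets an offline job rebuild a feature from the value that was current at a given time.

```python
from bisect import bisect_right


class FactHistory:
    """Timestamped versions of one fact for one member (hypothetical structure)."""

    def __init__(self):
        self._times = []   # log times, kept in increasing order
        self._values = []  # value recorded at the matching log time

    def record(self, log_time, value):
        # Assumes versions are recorded in increasing time order.
        self._times.append(log_time)
        self._values.append(value)

    def as_of(self, t):
        """Return the latest value recorded at or before time t, or None."""
        i = bisect_right(self._times, t)
        return self._values[i - 1] if i else None


def my_list_size_encoder(my_list):
    # Toy feature encoder: turns the "my list" fact into a numeric feature.
    return len(my_list or [])


history = FactHistory()
history.record(100, ["title_a"])
history.record(200, ["title_a", "title_b"])

feature_then = my_list_size_encoder(history.as_of(150))  # rewound view
feature_now = my_list_size_encoder(history.as_of(250))   # latest view
```

The same encoder runs on both the rewound and the latest value, which is the role the shared feature encoders play between the online and offline paths.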

[Diagram: two side-by-side panels captioned Feature Logging and Fact Logging. Each panel shows Fact Microservices, Facts, Online Scoring, Predictor, Features and Recommendations, with a "Log these" callout]

Fact Logging - Pull Architecture
● Daily snapshots of key facts
● Storage
○ S3 & Parquet
● API to access the data
○ RDD & DataFrames
● Cons
○ Lacks temporal accuracy
○ Load on microservices
○ Missing experiment-specific facts
[Diagram: Pull. Labels: Capture Snapshots, Fact Microservices, Stratified Member Sets, Snapshots]
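
A minimal sketch of why a daily snapshot can lack temporal accuracy. The change log, the midnight snapshot time and the helper `value_at` are all assumptions made for illustration; the talk does not describe the snapshot schedule.

```python
from datetime import datetime

# Hypothetical change log for one member's "my list" fact.
changes = [
    (datetime(2018, 6, 1, 9, 0), ["title_a"]),
    (datetime(2018, 6, 1, 18, 0), ["title_a", "title_b"]),
]


def value_at(change_log, t):
    """Return the value in effect at time t (None if nothing was recorded yet)."""
    current = None
    for ts, value in change_log:
        if ts <= t:
            current = value
    return current


snapshot_time = datetime(2018, 6, 1, 0, 0)  # assumed daily snapshot time
scoring_time = datetime(2018, 6, 1, 20, 0)  # when online scoring actually ran

snapshot_value = value_at(changes, snapshot_time)  # what the pulled snapshot holds
served_value = value_at(changes, scoring_time)     # what online scoring used
mismatch = snapshot_value != served_value
```

Any fact that changes between the snapshot and the scoring request produces this kind of mismatch, so features rebuilt offline from the snapshot differ from the ones served online.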

Fact Logging - Push Architecture
● Compute engines themselves control what to log
● Stratification
● Temporal accuracy
[Diagram labels: Compute Services, Fact Logger, Fact Store, Fact Transformer, Fact Fetcher, ML Workflows (Feature Generation, Model Training)]
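
One way to picture push-style logging with stratification is the sketch below. The strata names, sampling rates, hash-based bucketing and the in-memory `sink` list are assumptions chosen for illustration; the talk does not specify how its stratification service decides which members to log.

```python
import hashlib
import time

# Hypothetical per-stratum sampling rates; the talk gives no real values.
SAMPLE_RATES = {"new_member": 0.5, "tenured_member": 0.1}


def in_sample(member_id, stratum):
    """Deterministically decide whether facts for this member are logged."""
    digest = hashlib.sha256(f"{stratum}:{member_id}".encode()).digest()
    bucket = int.from_bytes(digest[:8], "big") / 2**64  # uniform in [0, 1)
    return bucket < SAMPLE_RATES.get(stratum, 0.0)


def log_facts(sink, member_id, stratum, facts, now=None):
    """Called by the compute service at scoring time (push model)."""
    if not in_sample(member_id, stratum):
        return False
    log_time = now if now is not None else time.time()
    sink.append({"member_id": member_id, "log_time": log_time, "facts": facts})
    return True
```

Because the compute service logs at the moment it scores, the recorded `log_time` matches the facts it actually used, and hashing the member ID keeps the sampled set stable across requests.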

Fact Logger
● Library
● Facts
○ User related
○ Video related
○ Computation specific
● Serialization
● Stratification Service
● Fact Stream
● Storage
[Diagram labels: Precompute, Live Compute, Fact Logger, Stratification, Base Fact Tables]

Fact Logging - Scalability
● 5-10x increase in data through Kafka
● SLA impact; cost increase
● Compression: 70% decrease
[Diagram labels: Precompute, Live Compute, Fact Logger]
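
As a rough illustration of compressing fact payloads before they are published, the sketch below serializes a record to JSON and compresses it with zlib. The record shape, the JSON encoding and the choice of zlib are assumptions; the talk does not name the serialization format or codec, and the ratio obtained here says nothing about the 70% figure above.

```python
import json
import zlib


def encode_fact(record):
    """Serialize a fact record to JSON and compress it before publishing."""
    raw = json.dumps(record, sort_keys=True).encode("utf-8")
    return zlib.compress(raw, level=6)


def decode_fact(payload):
    """Inverse of encode_fact: decompress, then parse the JSON."""
    return json.loads(zlib.decompress(payload).decode("utf-8"))


# Hypothetical record with a repetitive structure, as fact payloads often have.
record = {
    "member_id": 122312,
    "view_history": [{"video_id": v, "event": "play"} for v in range(200)],
}

payload = encode_fact(record)
raw_size = len(json.dumps(record, sort_keys=True).encode("utf-8"))
ratio = len(payload) / raw_size  # fraction of the original size after compression
```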

Storage & Access
● Pipeline load
○ Repeated facts
● Aggressive or not
○ Loss threshold
● Spark job
○ Fact pointers
○ SLA
[Diagram labels: Precompute, Live Compute, Fact Logger, Conditional push, Fact Transformer, Deduplication, Fact Store]
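
A minimal sketch of deduplicating repeated facts by storing each distinct value once and keeping only a pointer per logged row. The fingerprinting scheme and the `DedupingWriter` class are hypothetical; the talk mentions fact pointers and a Spark job but not how they are implemented.

```python
import hashlib
import json


def fingerprint(value):
    """Content hash used as the pointer to a stored fact value."""
    return hashlib.sha256(json.dumps(value, sort_keys=True).encode()).hexdigest()


class DedupingWriter:
    def __init__(self):
        self.values = {}  # fingerprint -> stored value (written once)
        self.rows = []    # (member_id, log_time, fact_name, fingerprint)

    def push(self, member_id, log_time, fact_name, value):
        """Store the value only if it is new; always record a pointer row.

        Returns True when the value itself had to be written.
        """
        fp = fingerprint(value)
        is_new = fp not in self.values
        if is_new:
            self.values[fp] = value
        self.rows.append((member_id, log_time, fact_name, fp))
        return is_new


writer = DedupingWriter()
first = writer.push(122312, 100, "my_list", ["title_a"])
second = writer.push(122312, 200, "my_list", ["title_a"])  # unchanged fact
```

The second push adds a row but no new value, which is the saving that matters for slow-moving facts that are logged far more often than they change.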

API Lookback
Member ID    My List    View History    Thumbs
122312       Value      Pointer         Value
254637       Pointer    Pointer         Pointer
Member n     Pointer    Value           Pointer
[Diagram: per-fact tables My List, View History and Thumbs, each holding Values in Partition 1 through Partition m; time labels: log_time - x, log_time - y, log_time - z]
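
The table above mixes inline values and pointers. A reader could resolve a row along the lines of the sketch below; the cell encoding (`("value", data)` or `("pointer", (log_time, key))`), the table layout and the `resolve` helper are assumptions made for illustration, not the store's actual API.

```python
# Hypothetical value tables: fact name -> {log_time partition -> {key -> value}}.
value_tables = {
    "view_history": {
        100: {"m122312": ["title_a"]},
    },
}


def resolve(fact_name, cell, tables):
    """Return the fact value for a cell, following a pointer if needed."""
    kind, payload = cell
    if kind == "value":
        return payload
    log_time, key = payload  # pointer into an earlier partition
    return tables[fact_name][log_time][key]


row = {
    "member_id": 122312,
    "my_list": ("value", ["title_b"]),
    "view_history": ("pointer", (100, "m122312")),
}

resolved = {
    name: resolve(name, cell, value_tables)
    for name, cell in row.items()
    if name != "member_id"
}
```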

Storage & Access
● Query performance
○ Slow-moving facts
● Point query
○ Connector
● Query time reduction
○ Hours to minutes
[Diagram labels: Precompute, Live Compute, Fact Logger, Conditional push, Fact Transformer, Deduplication, Fact Store, Read API, Write, Read]

Performance: Storage
• Partitioning scheme
– Noisy neighbor
• Storage format
– Exploratory vs production
• Fast & Slow lane
– Lookback limit

Performance: Spark reads
• Bloom Filters
– Reduce scan
• Cache Access
– EVCache, Spectator
• MapPartitions vs UDF
– Eager vs Lazy
– SPARK-11438, SPARK-11469, SPARK-20586
[Diagram labels: Application, ML Library, Read API]
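
To show how a Bloom filter can reduce scans, here is a small pure-Python sketch that keeps one filter per partition and scans only the partitions that might contain a member. The filter size, hash construction and partition layout are assumptions; the talk does not say how its filters are built or where they are stored.

```python
import hashlib


class BloomFilter:
    def __init__(self, num_bits=1024, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, key):
        # Derive num_hashes bit positions from seeded SHA-256 digests.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{key}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, key):
        for pos in self._positions(key):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def might_contain(self, key):
        # False means definitely absent; True means possibly present.
        return all(
            self.bits[pos // 8] & (1 << (pos % 8)) for pos in self._positions(key)
        )


# Hypothetical layout: one filter per partition of member IDs.
partitions = {"p1": [122312, 254637], "p2": [999001]}
filters = {}
for name, member_ids in partitions.items():
    bf = BloomFilter()
    for member_id in member_ids:
        bf.add(member_id)
    filters[name] = bf


def partitions_to_scan(member_id):
    """Partitions whose filter cannot rule the member out."""
    return [name for name, bf in filters.items() if bf.might_contain(member_id)]
```

A Bloom filter never gives a false negative, so skipping a partition is always safe; the cost is an occasional false positive that triggers an unnecessary scan.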

Future Work
• Structured with schema evolution
– Best of both (POJO & Spark SQL), Iceberg
• Streaming vs Batch
– Multiple lanes, accountability, independent scale
• Duplication
– Storage vs Runtime cost
