Ensuring a consistently great Netflix experience while continuously pushing innovative technology updates is no easy feat.
We'll look at how Netflix turns log streams into real-time metrics that provide visibility into how devices are performing in the field, and share some of the lessons learned while optimizing Druid to handle our load.
How Netflix Uses Druid in Real-time to Ensure a High Quality Streaming Experience
1. How Netflix Uses Druid in Real-time to Ensure a High Quality Streaming Experience
September 2020
Ben Sykes, Sr Software Engineer, Netflix
2. Druid @ Netflix
How Netflix Uses Druid in Real-time to Ensure a High Quality Streaming Experience
Druid Summit II - Sep 2020
3. Talking Points
Background
Quality of Experience
Metrics Pipeline
Using Druid
Druid Cluster
Data Ingestion
Managing Performance
Trade-offs
Tools
4. Glossary
Measure
A value relating to an event. For example, the presence of an error, the size of a buffer, or a duration.
Metric
Meaning and value derived from one or more measures. For example, counts, rates, or percentiles of measured values.
Dimension, Tag
An attribute of a metric that can be used to group or summarize metrics by populations with shared properties.
Cardinality
The count of unique values of a given dimension.
Segment
An index file used by Druid to contain data for a given time block. A time block may be formed of multiple segments.
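To make the glossary terms concrete, here is a minimal Python sketch. The event records, field names (`device`, `country`, `error`), and helper functions are all hypothetical and chosen only for illustration; they are not taken from the Netflix pipeline. Each record carries one measure (`error`), the error rate is a metric derived from it, `device` and `country` are dimensions, and cardinality is the number of distinct values a dimension takes.

```python
from collections import defaultdict

# Hypothetical playback events (illustrative data only).
# "error" is a measure; "device" and "country" are dimensions.
EVENTS = [
    {"device": "tv", "country": "US", "error": 1},
    {"device": "tv", "country": "BR", "error": 0},
    {"device": "phone", "country": "US", "error": 0},
    {"device": "phone", "country": "US", "error": 0},
]


def cardinality(events, dimension):
    """Count the unique values of one dimension."""
    return len({event[dimension] for event in events})


def error_rate_by(events, dimension):
    """Derive a metric (error rate) from a measure, grouped by a dimension."""
    totals = defaultdict(int)
    errors = defaultdict(int)
    for event in events:
        key = event[dimension]
        totals[key] += 1
        errors[key] += event["error"]
    return {key: errors[key] / totals[key] for key in totals}


if __name__ == "__main__":
    print(cardinality(EVENTS, "device"))
    print(error_rate_by(EVENTS, "device"))
```

Grouping by a high-cardinality dimension produces many more groups to store and query, which is one reason cardinality matters when sizing a metrics store.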
16. Realtime Indexing
Per Kafka Topic: Many Partitions
Many Indexers per MiddleManager, Many Partitions per Indexer
Many Segments per Indexer
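The fan-out described on this slide (many partitions per topic, many partitions per indexer) can be sketched in Python as follows. This assumes a simple modulo (round-robin) assignment of partitions to indexers; the slide does not state how Druid actually assigns them, and the function name and parameters are hypothetical.

```python
def assign_partitions(num_partitions, num_indexers):
    """Spread Kafka partition ids across indexers, round-robin.

    Assumption for illustration: partition p goes to indexer p % num_indexers.
    This is not claimed to be Druid's actual assignment logic.
    """
    if num_partitions < 0 or num_indexers <= 0:
        raise ValueError("need num_partitions >= 0 and num_indexers > 0")
    assignment = {indexer: [] for indexer in range(num_indexers)}
    for partition in range(num_partitions):
        assignment[partition % num_indexers].append(partition)
    return assignment


if __name__ == "__main__":
    # Six partitions of one topic shared by three indexers.
    for indexer, partitions in assign_partitions(6, 3).items():
        print(indexer, partitions)
```

Under this assumption each indexer reads several partitions, and each indexer in turn writes its own segments for a time block, which is how a single time block ends up formed of multiple segments.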
27. Time for questions
Thank you!
Apache Druid is an independent project of The Apache Software Foundation. More information can be found at https://druid.apache.org.
Apache Druid, Druid, and the Druid logo are either registered trademarks or trademarks of The Apache Software Foundation in the United States and other countries.