Scaling Uber's Real-Time Infra for Trillion Events Daily

Scaling Uber’s Real-Time Infra for Trillion Events per Day
Ankur Bansal
Mingmin Chen
Hadoop Summit
June 14, 2017

About the Speakers

Ankur Bansal
● Sr. Software Engineer, Streaming Team @ Uber
○ Apache Kafka
● Staff Software Engineer @ eBay
○ Built & scaled eBay’s cloud using OpenStack
● Apache Kylin: Committer, Emeritus PMC

Mingmin Chen
● Sr. Software Engineer, Streaming Team @ Uber
○ Apache Kafka
● Distributed Data Systems @ Twitter
○ Apache Hive
○ Apache Spark
● Exadata @ Oracle

Agenda
● Use Cases & Current Scale
● How We Scaled The Infrastructure
○ Rest Proxy & Clients
○ Local Agent
○ uReplicator (MirrorMaker)
● Reliable Messaging & Tooling
○ At-Least-Once
○ Chaperone (Auditing)
○ Cluster Balancing
● Future Work

Real-time Driver-Rider Matching
[Figure: App Views and Vehicle Information -> KAFKA -> Stream Processing (Driver-Rider Match, ETA)]

A bunch more...
● Fraud Detection
● Share My ETA
● Driver & Rider Signups
● Etc.

Apache Kafka is Uber’s Data Hub

Kafka - Use Cases
● General Pub-Sub
● Stream Processing
○ AthenaX - Self-Serve Platform (Samza, Flink)
● Ingestion
○ HDFS, S3
● Database Changelog Transport
○ Schemaless, Cassandra, MySQL
● Logging

Scale
* obligatory show-off slide
● Trillion+ messages/day
● ~PBs data volume
● Tens of thousands of topics
(excluding replication)
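To put the headline number in per-second terms, here is a back-of-the-envelope conversion. The 1 PB/day figure is an assumption chosen only for illustration (the slide says "~PBs"), so the derived byte rate and message size are indicative at best.

```python
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400


def per_second(per_day: float) -> float:
    """Convert a daily total into an average per-second rate."""
    return per_day / SECONDS_PER_DAY


messages_per_day = 1e12  # "Trillion+" from the slide, taken as exactly 1e12
avg_messages_per_sec = per_second(messages_per_day)

# Assumption for illustration only: exactly 1 PB/day.
assumed_bytes_per_day = 1e15
avg_bytes_per_sec = per_second(assumed_bytes_per_day)
avg_message_bytes = assumed_bytes_per_day / messages_per_day

print(f"{avg_messages_per_sec / 1e6:.1f} million messages/s on average")
print(f"{avg_bytes_per_sec / 1e9:.1f} GB/s on average (assuming 1 PB/day)")
print(f"{avg_message_bytes:.0f} bytes per message on average (same assumption)")
```

These are daily averages; peak rates would be higher.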

Ecosystem @ Uber
[Figure: PRODUCERS -> Kafka -> CONSUMERS]
● Producers: Rider App, Driver App, API / Services, etc.
● Consumers: Real-time Analytics, Alerts, Dashboards; Samza / Flink; Applications; Data Science; Analytics; Reporting; Vertica / Hive; Ad-hoc Exploration; ELK; Debugging; Hadoop; Surge Mobile App
● Also shown: Cassandra, Schemaless, MySQL, AWS S3

Requirements
● Scale Horizontally
● API Latency (<5ms typically)
● Availability -> 99.99%
● Durability -> 99.99%; 100% -> Critical Customers
● Multi-DC Replication
● Multi-Language Support
○ Java, Go, Python, Node.js, C++
● Auditing
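To make the 99.99% targets concrete, the arithmetic below converts them into budgets. Reading durability as "fraction of messages delivered" is an assumption made for this sketch; the slides do not define the metric.

```python
MINUTES_PER_YEAR = 365 * 24 * 60  # 525,600


def downtime_budget_minutes(availability: float) -> float:
    """Minutes per year a service may be unavailable at the given availability."""
    return (1.0 - availability) * MINUTES_PER_YEAR


def loss_budget(messages: int, durability: float) -> float:
    """Messages that may be lost out of `messages` (assumed interpretation)."""
    return messages * (1.0 - durability)


print(f"99.99% availability: {downtime_budget_minutes(0.9999):.1f} min/year")
print(f"99.99% durability: up to {loss_budget(1_000_000, 0.9999):.0f} "
      "lost per million messages")
```

Under this reading, the 100% target for critical customers leaves no loss budget at all, which is what motivates the at-least-once pipeline later in the deck.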

Kafka Pipeline
[Figure: end-to-end pipeline diagram; labeled components include Local Agent and uReplicator]

Data Flow: Batching everywhere
[Figure: pipeline data flow with numbered steps 1-8]

Kafka Clusters
[Figure: pipeline diagram repeated; labels: Local Agent, uReplicator]

Kafka Clusters
● Running Kafka 0.10
● Use Case-based
○ Data
○ Logging
○ Database Changelogs
○ Highly Isolated & Reliable e.g. Surge
○ High Value Data (e.g. Signups)
● Fallback Secondary Clusters
● Global Aggregates

Kafka Rest Proxy
[Figure: pipeline diagram repeated; labels: Local Agent, uReplicator]

Why Kafka Rest Proxy?
● Simplified Client API
○ Multi-lang Support
● Decouple Clients from Kafka Brokers
○ Thin Clients = Operational Ease
○ Easier Kafka Upgrades
● Enhanced Reliability
○ Quota Management
○ Primary & Secondary Clusters

Kafka Rest Proxy: Internals
● Based on Confluent’s open-source Rest Proxy
● Performance enhancements
○ Simple HTTP servlets on Jetty instead of Jersey
○ Optimized for binary payloads
○ Performance increase from 7K* to 45K QPS/box
● Caching of topic metadata
● Reliability improvements*
○ Support for Fallback cluster
○ Support for multiple producers (SLA-based segregation)
● Plan to contribute back to community
*Based on benchmarking & analysis done in June 2015
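The fallback-cluster and metadata-caching bullets can be illustrated with a minimal in-memory sketch. Every name here (`FakeCluster`, `ProxySketch`, `ClusterUnavailable`, the loader callback) is hypothetical and invented for illustration; this is not the API of Uber's or Confluent's proxy.

```python
class ClusterUnavailable(Exception):
    """Raised by the fake cluster when it cannot accept writes."""


class FakeCluster:
    """In-memory stand-in for a Kafka cluster (illustration only)."""

    def __init__(self, name, healthy=True):
        self.name = name
        self.healthy = healthy
        self.log = []

    def append(self, topic, payload):
        if not self.healthy:
            raise ClusterUnavailable(self.name)
        self.log.append((topic, payload))


class ProxySketch:
    """Routes produce requests to a primary cluster, falling back to a secondary."""

    def __init__(self, primary, secondary, load_topic_metadata):
        self.primary = primary
        self.secondary = secondary
        self._load_topic_metadata = load_topic_metadata
        self._metadata_cache = {}

    def _metadata(self, topic):
        # Cache topic metadata so the lookup is paid once per topic.
        if topic not in self._metadata_cache:
            self._metadata_cache[topic] = self._load_topic_metadata(topic)
        return self._metadata_cache[topic]

    def produce(self, topic, payload):
        """Write to the primary; on failure write to the secondary.

        Returns the name of the cluster that accepted the message.
        """
        self._metadata(topic)
        try:
            self.primary.append(topic, payload)
            return self.primary.name
        except ClusterUnavailable:
            self.secondary.append(topic, payload)
            return self.secondary.name
```

A real proxy would also need cache invalidation and a path for merging secondary-cluster data back; both are omitted here.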

Rest Proxy: Performance (1 box)
[Figure: end-to-end latency (ms) plotted against message rate (K/second) at a single node]

Producer Libraries
● High Throughput
○ Non-blocking, async, batched
○ Back off when throttled
● ‘Topic Discovery’
○ Discovers the Kafka cluster a topic belongs to
○ Able to multiplex to different Kafka clusters
● Local Agent for Critical Data
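The "non-blocking, async, batched" and "back off when throttled" bullets can be sketched as follows. The class, the `Throttled` exception, and the `transport` callback are hypothetical names for this illustration, not Uber's client library; in a real client `flush` would run on a background thread.

```python
import time
from collections import deque
from itertools import islice


class Throttled(Exception):
    """Raised by the (hypothetical) transport when a quota is exceeded."""


class BatchingProducer:
    def __init__(self, transport, batch_size=3, base_backoff=0.1,
                 max_backoff=5.0, sleep=time.sleep):
        self._transport = transport      # callable that sends one batch
        self._batch_size = batch_size
        self._base_backoff = base_backoff
        self._max_backoff = max_backoff
        self._sleep = sleep              # injectable for testing
        self._queue = deque()

    def send(self, topic, payload):
        """Non-blocking: only enqueues; the caller never waits on the network."""
        self._queue.append((topic, payload))

    def flush(self):
        """Drain the queue in batches; returns the number of batches sent.

        Retries indefinitely while throttled, doubling the wait each time.
        """
        batches_sent = 0
        backoff = self._base_backoff
        while self._queue:
            batch = list(islice(self._queue, self._batch_size))
            try:
                self._transport(batch)
            except Throttled:
                # Exponential backoff, capped at max_backoff.
                self._sleep(backoff)
                backoff = min(backoff * 2, self._max_backoff)
                continue
            for _ in batch:
                self._queue.popleft()
            backoff = self._base_backoff
            batches_sent += 1
        return batches_sent
```

Messages leave the queue only after the transport accepts the batch, so a throttled batch is retried rather than dropped.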

Kafka Local Agent
[Figure: pipeline diagram repeated; labels: Local Agent, uReplicator]

Local Agent
● Producer side persistence
○ Local storage
● Isolates clients from downstream outages, backpressure
● Controlled backfill upon recovery
○ Avoids overwhelming a recovering cluster
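A minimal sketch of producer-side persistence with controlled backfill, assuming a JSON-lines spool file and a fixed per-tick forwarding cap. `LocalAgentSketch`, the file format, and the `forward` callback are all hypothetical; the slides do not describe the agent's actual storage or rate-limiting scheme.

```python
import json
import os


class LocalAgentSketch:
    def __init__(self, spool_path, forward, max_per_tick=2):
        self.spool_path = spool_path
        self.forward = forward            # callable; raises ConnectionError when down
        self.max_per_tick = max_per_tick  # cap on records forwarded per tick

    def accept(self, topic, payload):
        """Persist locally before returning, so clients are isolated from outages."""
        with open(self.spool_path, "a", encoding="utf-8") as spool:
            spool.write(json.dumps({"topic": topic, "payload": payload}) + "\n")
            spool.flush()
            os.fsync(spool.fileno())

    def pending(self):
        """Return all spooled records that have not been forwarded yet."""
        if not os.path.exists(self.spool_path):
            return []
        with open(self.spool_path, encoding="utf-8") as spool:
            return [json.loads(line) for line in spool if line.strip()]

    def tick(self):
        """Forward at most max_per_tick records; keep the rest spooled."""
        records = self.pending()
        sent = 0
        for record in records[: self.max_per_tick]:
            try:
                self.forward(record)
            except ConnectionError:
                break  # downstream still unavailable; retry on a later tick
            sent += 1
        # Rewrite the spool with what remains (a real agent would track an offset).
        with open(self.spool_path, "w", encoding="utf-8") as spool:
            for record in records[sent:]:
                spool.write(json.dumps(record) + "\n")
        return sent
```

The per-tick cap is what keeps the backlog from hitting a recovering cluster all at once.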

Local Agent in Action
[Figure placeholder in the original slides]

uReplicator
[Figure: pipeline diagram repeated; labels: Local Agent, uReplicator]

uReplicator
● In-house Intercluster Replication Solution
○ Apache Helix-based
○ Mirror all traffic between & within DCs
○ Lower rebalance latencies
● Running in Production ~2 Years
● Open Sourced: https://github.com/uber/uReplicator
● Uber Engineering Blog: https://eng.uber.com/ureplicator/
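One way a central controller can keep rebalances cheap is to reassign only the partitions of a departed worker and leave every other assignment untouched. The sketch below illustrates that idea under stated assumptions; `ControllerSketch` and its least-loaded placement rule are hypothetical and do not reproduce uReplicator's Helix-based implementation.

```python
class ControllerSketch:
    """Keeps an explicit partition-to-worker assignment (illustration only)."""

    def __init__(self, workers):
        self.assignment = {worker: set() for worker in workers}

    def add_partition(self, partition):
        """Place a partition on the least-loaded worker; ties break by name."""
        worker = min(self.assignment,
                     key=lambda w: (len(self.assignment[w]), w))
        self.assignment[worker].add(partition)
        return worker

    def remove_worker(self, worker):
        """Reassign only the departed worker's partitions.

        Returns a mapping of moved partition -> new worker.
        """
        orphans = sorted(self.assignment.pop(worker))
        return {partition: self.add_partition(partition) for partition in orphans}
```

For example, with workers `a`, `b`, `c` and six partitions, removing `b` moves only the two partitions that `b` owned.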

At-Least-Once
[Figure: numbered data flow, steps 1-8, across Application Process, Kafka Proxy Server, Regional Kafka, uReplicator, and Aggregate Kafka]
● Most of the infrastructure is tuned for high throughput
○ Batching at each stage
○ Ack before being persisted (ack’ed != committed)
● A single-node failure at any stage can lead to data loss
● Need a reliable pipeline for High Value Data e.g. Payments
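The difference between "ack'ed" and "committed" can be shown with a toy model of one pipeline stage. `Stage` and `send_at_least_once` are hypothetical names for this sketch; it models the failure mode, not Uber's actual implementation.

```python
class Stage:
    """One pipeline stage with an in-memory buffer and a durable store."""

    def __init__(self, ack_after_commit):
        self.ack_after_commit = ack_after_commit
        self.buffer = []     # lost on crash
        self.committed = []  # survives a crash

    def receive(self, msg):
        """Accept a message and return True as the ack."""
        if self.ack_after_commit:
            self.committed.append(msg)  # persist first, then ack
        else:
            self.buffer.append(msg)     # ack straight from memory
        return True

    def flush(self):
        """Persist everything currently buffered."""
        self.committed.extend(self.buffer)
        self.buffer.clear()

    def crash(self):
        """Simulate a node failure: unflushed data is gone."""
        self.buffer.clear()


def send_at_least_once(stage, msg, ack_arrives):
    """Resend until an ack reaches the sender.

    `ack_arrives` yields one bool per attempt: did the ack make it back?
    Returns the number of attempts made.
    """
    attempts = 0
    for arrived in ack_arrives:
        attempts += 1
        stage.receive(msg)
        if arrived:
            break
    return attempts
```

In throughput mode, a crash between ack and flush loses a message the sender believes was delivered. In ack-after-commit mode nothing acked is lost, but a lost ack causes a resend and therefore a duplicate, which is why the guarantee is at-least-once rather than exactly-once.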

At-least-once Kafka: Data Flow
[Figure: numbered data flow, steps 1-8, across Application Process, ProxyClient, Kafka Proxy Server, Regional Kafka, uReplicator, and Aggregate Kafka]

Chaperone - Track Counts
[Screenshot not included in the original slides]

Chaperone - Track Latency
[Screenshot not included in the original slides]

Chaperone - End to End Auditing
● In-house Auditing Solution for Kafka
● Running in Production for ~2 Years
○ Audit 20k+ topics for 99.99% completeness
● Open Sourced: https://github.com/uber/chaperone
● Uber Engineering Blog: https://eng.uber.com/chaperone/
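Count-based auditing can be sketched by bucketing message timestamps into fixed windows at two points in the pipeline and comparing the counts. The 10-minute window, the function names, and the report shape are assumptions for this illustration; the slides do not describe Chaperone's internals.

```python
from collections import Counter

WINDOW_SECONDS = 600  # assumed window size for this sketch


def window_start(timestamp):
    """Map an event timestamp (seconds) to the start of its window."""
    return int(timestamp // WINDOW_SECONDS) * WINDOW_SECONDS


def count_by_window(timestamps):
    """Count messages per window."""
    return Counter(window_start(ts) for ts in timestamps)


def audit(upstream_ts, downstream_ts, target=0.9999):
    """Compare per-window counts between two pipeline stages.

    Returns {window_start: (completeness_ratio, meets_target)}.
    A ratio above 1.0 indicates duplicates downstream.
    """
    upstream = count_by_window(upstream_ts)
    downstream = count_by_window(downstream_ts)
    report = {}
    for window, expected in sorted(upstream.items()):
        ratio = downstream.get(window, 0) / expected
        report[window] = (ratio, ratio >= target)
    return report
```

Bucketing by the timestamp carried in the message, rather than by arrival time, lets counts from different stages be compared window by window even when delivery is delayed.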

Cluster Balancing
● No Auto Rebalancing
● Manual Placement is Hard
● Auto Plan Generation
○ And execution!
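Automatic plan generation can be illustrated with a greedy balancer that moves replicas from the most-loaded broker to the least-loaded one. The output uses the JSON layout accepted by Kafka's partition reassignment tool (`version`, `partitions`, `replicas`). The balancing heuristic itself is an assumption for this sketch and is not Uber's algorithm; it balances replica counts only and ignores leadership, rack placement, and partition size.

```python
from collections import Counter


def generate_plan(assignment, brokers):
    """Greedy replica-count balancing.

    assignment: {(topic, partition): [broker ids]}
    brokers: list of all broker ids, including empty ones
    Returns a reassignment plan covering only the partitions that moved.
    """
    current = {tp: list(replicas) for tp, replicas in assignment.items()}
    load = Counter({broker: 0 for broker in brokers})
    for replicas in current.values():
        for broker in replicas:
            load[broker] += 1

    moved = set()
    while True:
        busiest = max(brokers, key=lambda b: load[b])
        idlest = min(brokers, key=lambda b: load[b])
        if load[busiest] - load[idlest] <= 1:
            break  # balanced to within one replica
        # Pick a partition on the busiest broker that the idlest does not hold.
        candidate = next(
            (tp for tp in sorted(current)
             if busiest in current[tp] and idlest not in current[tp]),
            None,
        )
        if candidate is None:
            break  # no legal move left
        replicas = current[candidate]
        replicas[replicas.index(busiest)] = idlest
        load[busiest] -= 1
        load[idlest] += 1
        moved.add(candidate)

    return {
        "version": 1,
        "partitions": [
            {"topic": topic, "partition": partition,
             "replicas": current[(topic, partition)]}
            for topic, partition in sorted(moved)
        ],
    }
```

Emitting only the moved partitions keeps the plan small, which matters when each move copies a full partition between brokers.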

Future Work
● Multi-zone Clusters
○ Durability during DC wide outages
● Chargebacks
● Efficiency Enhancements
○ Intelligent aggregates, automated topic GC, etc.
● uReplicator Enhancements
● Open Source
● And Much More...

More open-source projects at eng.uber.com
