Netflix is a large and ever-changing ecosystem made up of: hundreds of production changes every hour, thousands of micro services, tens of thousands of instances, millions of concurrent customers, billions of metrics every minute. And I'm the guy with the pager. This is an in-the-trenches look at what operating at Netflix scale in the cloud is really like. It covers how Netflix views the velocity of innovation, expected failures, high availability, engineer responsibility, and obsessing over the quality of the customer experience. It also explains why freedom and responsibility are key, trust is required, and chaos is your friend.
5. CORE team
Part of Operations Engineering
Responsible for:
Crisis management
Availability reporting
Reliability best practices
AWS relationship
Operations education
33. Netflix ecosystem
100s of microservices
1000s of daily production changes
10,000s of instances
100,000s of customer interactions per minute
1,000,000s of customers
1,000,000,000s of metrics
10,000,000,000 hours of streamed
34. Netflix ecosystem
100s of microservices
1000s of daily production changes
10,000s of instances
100,000s of customer interactions per minute
1,000,000s of customers
1,000,000,000s of metrics
10,000,000,000 hours of streamed
10s of operations engineers
35. Netflix ecosystem
100s of microservices
1000s of daily production changes
10,000s of instances
100,000s of customer interactions per minute
1,000,000s of customers
1,000,000,000s of metrics
10,000,000,000 hours of streamed
10s of operations engineers
No NOC
80. Speaker When? Where?
Running Spark and Presto on the Netflix Big Data
Platform
Daniel
Weeks
Thu @
11am
Palazzo
F
Splitting the Check on Compliance and Security:
Keeping Developers and Auditors Happy in the
Cloud
Jason Chan
Thu @
11am
Marcello
4501B
@
Visit the Netflix booth
Speakers there to answer questions