Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.

Data Science with Elastic MapReduce (EMR) at Netflix

2,295 views

Published on

Published in: Technology
  • Here's a link to the video of this session: http://www.youtube.com/watch?v=oGcZ7WVx6EI
       Reply 
    Are you sure you want to  Yes  No
    Your message goes here

Data Science with Elastic MapReduce (EMR) at Netflix

  1. 1. What is Netflix’s data warehouse?a) Cassandrab) Teradatac) Hived) S3
  2. 2. DSE Platform
  3. 3. DSE PlatformChukwa / Honu S3
  4. 4. Aegisthus
  5. 5. DSE PlatformChukwa / Honu Aegisthus S3
  6. 6. Sting
  7. 7. DSE Platform StingChukwa / Honu Aegisthus S3
  8. 8. What is Netflix’s data warehouse?a) Cassandrab) Teradatac) Hived) S3
  9. 9. DSE Platform StingChukwa / Honu Aegisthus S3
  10. 10. S3
  11. 11. S3
  12. 12. 99.999999999%
  13. 13. S3
  14. 14. High SLAS3 Query
  15. 15. HDFS ?
  16. 16. “Data Science as a Service”• Execution Service / Genie• Event Service• Metadata Service
  17. 17. High SLA Cluster Job High SLA S3 Query Cluster Job Query
  18. 18. High SLA S3Query Cluster Job Query
  19. 19. High SLA Cluster Job High SLA S3 Query Cluster Job Query
  20. 20. Super SLA Cluster Job Super SLAHigh SLA Cluster Job S3 High SLA Query Cluster Job Query
  21. 21. Super SLA Cluster JobHigh SLA Cluster Job High SLA S3 Query Cluster Job Query
  22. 22. Questions? http://jobs.netflix.comkurtbrown@netflix.com

×