BDT303 Data Science with Elastic MapReduce - AWS re:Invent 2012

Uploaded on Dec 05, 2012

In this talk, we dive into the Netflix Data Science & Engineering architecture: not just the what, but also the why. Key topics include the big data technologies we leverage (Cassandra, Hadoop, Pig + Python, and Hive), our use of Amazon S3 as our central data hub, our use of multiple persistent Amazon Elastic MapReduce (EMR) clusters, how we leverage the elasticity of AWS, our data-science-as-a-service approach, how we make our hybrid AWS / data center setup work well, and more.

Upload Details

Uploaded via SlideShare as Adobe PDF

Usage Rights

© All Rights Reserved

BDT303 Data Science with Elastic MapReduce - AWS re:Invent 2012: Presentation Transcript