The Enterprise Data Hub in the Cloud
Eli Collins, Chief Technologist

1

©2014 Cloudera, Inc. All rights reserved.
2

©2014 Cloudera, Inc. All rights reserved.
What we’re really talking about
Host

Customer

Vendor
3

Vendor

AWS
GCE
SoftLayer
…

Customer

Manage

T-Systems
Accentu...
Engineering Perspective
• Long-running

Long-running batch jobs
• Cluster stores the data and provides services (Impala,
S...
Product Thinking
• Many EDH environments

will be hybrid

Valid reasons for/against cloud deployments
• Private/public cap...
Portability is KEY
• Multiple deployment options

Cloud Connect: AWS, SoftLayer, Savvis, T-Systems, Verizon
• Integrated s...
Functionality is KEY too
• Enterprise Data Hub functionality & innovation

Impala, Search, Sentry, Spark, ..
• ISV ecosyst...
Our Reference Architecture

+

8

©2014 Cloudera, Inc. All rights reserved.
Cloudera Leveraging AWS
• Elastic Compute (EC2)
• Simple Storage Service (S3)
• Relational Database Service (RDS)
• Elasti...
Private VPC Subnet

10

©2014 Cloudera, Inc. All rights reserved.
Public VPC Subnet

11

©2014 Cloudera, Inc. All rights reserved.
Private and Public Subnets

12

©2014 Cloudera, Inc. All rights reserved.
Instance Types and Roles

13

©2014 Cloudera, Inc. All rights reserved.
What’s coming?
• Automated deployment

Joint reference architectures
• Extend this with your IT
•

• Self-service (via ser...
Taking Full Advantage of the Cloud
• Enhanced transient clusters

Grow/shrink, compute only instances, spot instances
• Im...
16

©2014 Cloudera, Inc. All rights reserved.
Upcoming SlideShare
Loading in...5
×

Cloudera Federal Forum 2014: Cloud Deployment for the Enterprise Data Hub

896

Published on

Chief Technologist, Office of the CTO at Cloudera, Eli Collins, shares information about the enterprise data hub in the cloud and Cloudera's relationship with AWS.

Published in: Technology, Business
0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total Views
896
On Slideshare
0
From Embeds
0
Number of Embeds
3
Actions
Shares
0
Downloads
0
Comments
0
Likes
1
Embeds 0
No embeds

No notes for slide

Cloudera Federal Forum 2014: Cloud Deployment for the Enterprise Data Hub

  1. 1. The Enterprise Data Hub in the Cloud Eli Collins, Chief Technologist 1 ©2014 Cloudera, Inc. All rights reserved.
  2. 2. 2 ©2014 Cloudera, Inc. All rights reserved.
  3. 3. What we’re really talking about Host Customer Vendor 3 Vendor AWS GCE SoftLayer … Customer Manage T-Systems Accenture .. EMR AltiScale .. ©2014 Cloudera, Inc. All rights reserved. Primarily dedicated physical on-prem infrastructure 2. Alternatives emerging 1.
  4. 4. Engineering Perspective • Long-running Long-running batch jobs • Cluster stores the data and provides services (Impala, Search, HBase, Accumulo, etc) • • Ephemeral Self-service, demos • Test/Dev, POC • Periodic batch • 4 ©2014 Cloudera, Inc. All rights reserved.
  5. 5. Product Thinking • Many EDH environments will be hybrid Valid reasons for/against cloud deployments • Private/public capabilities will converge • • Run Cloudera anywhere • 5 EDH works with multiple deployment models ©2014 Cloudera, Inc. All rights reserved.
  6. 6. Portability is KEY • Multiple deployment options Cloud Connect: AWS, SoftLayer, Savvis, T-Systems, Verizon • Integrated support offerings • Growing provider, SI, and MSP ecosystem • • Multiple pricing models Traditional • Usage-based • 6 ©2014 Cloudera, Inc. All rights reserved.
  7. 7. Functionality is KEY too • Enterprise Data Hub functionality & innovation Impala, Search, Sentry, Spark, .. • ISV ecosystem • • Management • 7 Cloudera Manager, NAVIGATOR, and BDR ©2014 Cloudera, Inc. All rights reserved.
  8. 8. Our Reference Architecture + 8 ©2014 Cloudera, Inc. All rights reserved.
  9. 9. Cloudera Leveraging AWS • Elastic Compute (EC2) • Simple Storage Service (S3) • Relational Database Service (RDS) • Elastic Block Store (EBS) • Direct Connect • Virtual Private Cloud (VPC) 9 ©2014 Cloudera, Inc. All rights reserved.
  10. 10. Private VPC Subnet 10 ©2014 Cloudera, Inc. All rights reserved.
  11. 11. Public VPC Subnet 11 ©2014 Cloudera, Inc. All rights reserved.
  12. 12. Private and Public Subnets 12 ©2014 Cloudera, Inc. All rights reserved.
  13. 13. Instance Types and Roles 13 ©2014 Cloudera, Inc. All rights reserved.
  14. 14. What’s coming? • Automated deployment Joint reference architectures • Extend this with your IT • • Self-service (via service providers) • More platforms and providers 14 ©2014 Cloudera, Inc. All rights reserved.
  15. 15. Taking Full Advantage of the Cloud • Enhanced transient clusters Grow/shrink, compute only instances, spot instances • Improved S3 and Swift support • • Hybrid environments Cross-DC operation • Centralized discovery & management • Bursting • 15 ©2014 Cloudera, Inc. All rights reserved.
  16. 16. 16 ©2014 Cloudera, Inc. All rights reserved.

×