• Like
  • Save

Cloudera Federal Forum 2014: The Building Blocks of the Enterprise Data Hub

  • 1,009 views
Uploaded on

Chief Technologist, Office of the CTO at Cloudera Eli Collins, shares the story of the enterprise data hub and how it relates to the enterprise data warehouse.

Chief Technologist, Office of the CTO at Cloudera Eli Collins, shares the story of the enterprise data hub and how it relates to the enterprise data warehouse.

More in: Technology
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Be the first to comment
No Downloads

Views

Total Views
1,009
On Slideshare
0
From Embeds
0
Number of Embeds
2

Actions

Shares
Downloads
0
Comments
0
Likes
2

Embeds 0

No embeds

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
    No notes for slide

Transcript

  • 1. Building Blocks of the Enterprise Data Hub Eli Collins, Chief Technologist 1 ©2014 Cloudera, Inc. All rights reserved.
  • 2. The Enterprise Data Warehouse Reporting Operational Store Metadata Operational Store Staging Mining Facts & Dimensions Summary Analysis Flat Files Data Sources 2 EDW ©2014 Cloudera, Inc. All rights reserved. Data Marts
  • 3. The Enterprise Data Hub MR, SQL, Spark, SAS, R, Search.. DB dumps Logs Images 3 1. 2. 3. 4. Binar y ©2014 Cloudera, Inc. All rights reserved. Inexpensive storage File system & KV storage Co-located compute Separate compute & storage
  • 4. So it’s Like a Data Warehouse? 4 ©2014 Cloudera, Inc. All rights reserved.
  • 5. An Analogy 5 ©2014 Cloudera, Inc. All rights reserved.
  • 6. What changed? • The need? • Convenience? 6 Cost? ©2014 Cloudera, Inc. All rights reserved.
  • 7. Take and share good photos 7
  • 8. Data Warehouse vs. Data Hub Enterprise Data Warehouse 8 Enterprise Data Hub ©2014 Cloudera, Inc. All Rights Reserved.
  • 9. An Operating System App 1 App 2 App 3 File System 9 ©2014 Cloudera, Inc. All Rights Reserved. Services Scheduler Management Lib
  • 10. The Enterprise Data Hub Search Engine Batch Stream Processing Processing Machine Learning Resource Management For Any Type of Data Elastic, Fault-tolerant, Self-healing, In-memory capabilities SQL 10 Streaming File System (NFS) ©2014 Cloudera, Inc. All Rights Reserved. Data Management Unified Scale-out Storage System Management Analytic MPP DBMS Metadata, Security, Audit, Lineage Online NoSQL DBMS
  • 11. 11 ©2014 Cloudera, Inc. All rights reserved.
  • 12. 12 ©2014 Cloudera, Inc. All rights reserved.
  • 13. 13 ©2014 Cloudera, Inc. All rights reserved.
  • 14. 14 ©2014 Cloudera, Inc. All rights reserved.
  • 15. Data Warehouse on the Data Hub Connectors MPP Query execution ODBC/J DBC Impala HBase/Accum ulo HDFS Storage & formats 15 LLAMa YARN Resource Management ©2014 Cloudera, Inc. All Rights Reserved. Hive Metastore Parquet, Avro, Text, etc UDFs MadLib Catalog
  • 16. 16 ©2014 Cloudera, Inc. All rights reserved.