TiE Big Data panel
Published in: Technology, Business
Transcript of "TiE Big Data panel"

  1. TiE SV Big Data Panel, Oct 13, 2011
  2. What did Google do? Dremel Evenflow Evenflow Dremel MySQL Sawzall Bigtable Gateway MapReduce / GFS Chubby ©2011 Cloudera, Inc. All Rights Reserved. Confidential. Reproduction or redistribution without written permission is prohibited.
  3. What did Google do? Store files. Dremel Evenflow Evenflow Dremel MySQL Sawzall Bigtable Gateway MapReduce / GFS Chubby
  4. What did Google do? Process data. Dremel Evenflow Evenflow Dremel MySQL Sawzall Bigtable Gateway MapReduce / GFS Chubby
  5. What did Google do? Ingest data. Dremel Evenflow Evenflow Dremel MySQL Sawzall Bigtable Gateway MapReduce / GFS Chubby
  6. What did Google do? Store records & tables. Dremel Evenflow Evenflow Dremel MySQL Sawzall Bigtable Gateway MapReduce / GFS Chubby
  7. What did Google do? High level domain specific language. Dremel Evenflow Evenflow Dremel MySQL Sawzall Bigtable Gateway MapReduce / GFS Chubby
  8. What did Google do? Chain together complex workloads. Dremel Evenflow Evenflow Dremel MySQL Sawzall Bigtable Gateway MapReduce / GFS Chubby
  9. What did Google do? Schedule them. Dremel Evenflow Evenflow Dremel MySQL Sawzall Bigtable Gateway MapReduce / GFS Chubby
  10. What did Google do? Columnar format + metadata. Dremel Evenflow Evenflow Dremel MySQL Sawzall Bigtable Gateway MapReduce / GFS Chubby
  11. What did Google do? End user queries. Dremel Evenflow Evenflow Dremel MySQL Sawzall Bigtable Gateway MapReduce / GFS Chubby
  12. What did Google do? Coordinate within system. Dremel Evenflow Evenflow Dremel MySQL Sawzall Bigtable Gateway MapReduce / GFS Chubby
  13. The pattern repeated. HiPal Databee Databee Hive Hive HBase Scribe ZooKeeper
  14. The pattern repeated. Oozie Oozie Hive Pig & Hive Data HBase Highway ZooKeeper
  15. The pattern repeated. Azkaban Azkaban Sqoop Pig Voldemort Kafka ZooKeeper
  16. The pattern repeated. Cloudera’s Distribution Including Apache Hadoop. Hue Hue Oozie Oozie Hive Sqoop Hive / Pig HBase Flume ZooKeeper
  17. Project summary
      Topic: Project(s)
      File storage: HDFS
      Record storage: HBase, Hypertable, Accumulo
      Metadata storage: Hive, HCatalog
      Batch data processing: MapReduce
      Streaming data processing: S4, Storm
      Graph processing: Giraph, X-RIME
      Query language: Hive
      Dataflow language: Pig
      Database integration: Sqoop
      Event data collection: Flume, Scribe
      Test & assembly: Bigtop
      Distributed lock: ZooKeeper
      Web access: Hue
      Workflow: Oozie, Azkaban
      File format: Avro, RCFile, Protocol Buffers, SequenceFile
  18. Anything is POSSIBLE with BIG DATA
  19. Celebrate Next Saturday