The DAP - Where YARN, HBase, Kafka and Spark go to Production

Published in: Technology

  1. The DAP – Where YARN, HBase, Kafka and Spark Go to Production. Hadoop Summit, June 30th, 2016. cask.co. Cask, CDAP, Cask Hydrator and Cask Tracker are trademarks or registered trademarks of Cask Data. Apache Spark, Spark, the Spark logo, Apache Hadoop, Hadoop and the Hadoop logo are trademarks or registered trademarks of the Apache Software Foundation. All other trademarks and registered trademarks are the property of their respective owners.
  2. About Me
  3. The Many Faces of Hadoop
     ● Developer: Advanced Programming; Focuses on App Logic
     ● Data Scientist: Basic Programming; Focuses on Data
     ● IT Pro / Ops: Configuration & Monitoring; Focuses on Operations
     ● LOB Manager: Analysis & Decision Making; Focuses on Insights
  4. Big Data Challenges
  5. Building a Big Data App
  6. Deploying and Operating a Big Data App
  7. Today’s Integration Solutions Are Siloed
     ● Data Integration
     ● App Integration
     ● Cloud Integration
     ● Governance
  8. Introducing the DAP
  9. Enter Cask
     ● Key Customers and Partners
     ● Named a Gartner Cool Vendor 2016
     ● Founded in 2011 by early Hadoop engineers from Facebook and Yahoo!
  10. Introducing the Cask Data App Platform
  11. CDAP Overview: Open Source, Integrated Framework for Building and Running Data Applications on Hadoop and Spark
  12. CDAP Consolidates Big Data App Lifecycle
     ● Provides a platform with framework-level correctness
     ● Dataset abstractions & self-service data
     ● One framework: Prototype to Production
     ● Unified approach across all paradigms
       ○ Metrics & Log collection
       ○ Lineage, Audit, Access Control
  13. CDAP Extensions
  14. CDAP Architecture
     ● Application Container Architecture
     ● Reusable Programming Abstractions
     ● Global User and Machine Metadata
  15. CDAP Application Structure
  16. CDAP Deployment Architecture
  17. Hadoop in the Enterprise – Simplified with CDAP
  18. Common Use Cases
  19. Summary
  20. Thank You!
