Cask provides the Cask Data Application Platform (CDAP) which provides an integrated platform for developers and organizations to build, deploy, and manage big data applications. CDAP hides the complexity of Hadoop, provides reusable components, and integrates with Cloudera's data platform. It allows both technical and non-technical users to easily develop applications for ingesting, processing, and analyzing large amounts of data. The document discusses CDAP capabilities and provides an example of how a marketing SaaS company used it to build a real-time customer analytics application.
5. CASK DATA APP PLATFORM
ABSTRACTION
Hide Complexity and
Enable Reuse
INTEGRATION
Provide Capabilities
over Features
TOOLS &
SERVICES
Support Applications
from Dev to Prod
Open Source, Integrated Platform for Developers and Organizations
to Build, Deploy, and Manage Data Applications
6. 6
Hadoop is the Distributed OS
CDAP is the Distributed App Framework
7. PROPRIETARY & CONFIDENTIAL
Cask is delivering the Developer Platform for
Big Data Applications
• Founded by early Hadoop engineers from Facebook and Yahoo!
• Focused on developers and enabling big data applications
We are the WebLogic of Big Data
SIMPLE ACCESS TO POWERFUL TECHNOLOGY
7
9. CDAP Capabilities
Datasets
Programs
+ Ingestion, Egress, Tools & User
Experience
• Standardized containers providing
consistency for diverse processing
paradigms
• Services for developers to enable richer
apps with less hassle; and production to
enable application and data
management
• Libraries to build reusable data access
patterns spanning multiple storage
technologies
Runtime
Services
9
10. PROPRIETARY & CONFIDENTIAL
CDAP Integration with Cloudera
Cask Data Application Platform (CDAP) – Cloudera Integration
Today
Cloudera Manager – CDAP CSD enables install, update,
monitor of CDAP within CM
Impala – CDAP adapter for Impala enables data transformation
into Impala optimized file formats with just a few simple
commands
Future:
Further Impala integration
Integration with Sentry
Integration with Navigator
Support for Spark Streaming
Support for Cloudera Search Deployment
Flexibility
Unlimited Storage
Security and Administration
Process Discover Model Serve
On-Premises
Appliances
Engineered Systems
Public Cloud
Private Cloud
Hybrid Cloud
Cloudera’s Enterprise Data Hub
Programs
Batch Programs Realtime Programs
CASK DATA APPLICATION PLATFORM (CDAP)
Event /Data
Ingestion
Tools and
User Experience
Datasets
Runtime Services
Egress
Adapters
Data Application
Examples
Anomaly
Detection
360o
Consumer
profile
Network
Analytics
Multi-log
Correlation
Analytics
11. • New role-based user interface with capability for user-defined dashboards
• Code-free data ingestion, exploration, and transformations from UI and Shell
• Pre-built, out of the box support for real-time and batch ETL pipelines
• Application templates and plugins to speed development and enable reuse
• Addition of OLAP Cube dataset
• Support for multi-tenancy with easy to configure and manage namespaces
• Enhanced metrics and workflow support
Integrated features and pre-built modules for new users to become instantly productive
Powerful capabilities and open source extensibility for advanced users to move fast
13. Cask on Cloudera Use case: Marketing SaaS Company
Challenges
• Technical: 15B real-time events / day with consistency
• Talent: Domain experts don’t know Hadoop; Hadoop
consultants didn’t know domain
• Budget: Specialized skills expensive
• Operational: Utilize established best practices
Goals
• Velocity: Real-time customer response
• Revenue: Increase ACV
• Competitive Advantage: Differentiate with scale and
data consistency
Solution
• CDAP delivered scale while maintaining data
consistency
• CDAP abstractions enabled domain experts to deliver
without learning Hadoop
• CDAP integrated into their existing development
process
Results
• Development to production in 3 months after 9 month
failed effort to write natively on Hadoop with
consultants
• Budget saved avoiding Hadoop consultants
• New service driving revenue with existing customers
14. 14
CDAP on Hadoop compared to Hadoop alone
Lines of code 82% reduction
Development time 86% reduction
Other advantages
• Reduced Cyclomatic complexity
• Improved Testability
• Code readability and maintenance
• Application deployment and maintenance
• Egress support for application data
• Simplified knowledge transfer
Actual Developer’s Experience
Top 5 SaaS Company
15. Who
Application ISVs
SaaS Providers
Opportunity
Build new applications and services on
CDAP
Value Propositions
• Lower TCO
• Better use of developer resources
• Faster time to market
• Enable new features or services
that require real-time ingestion,
processing with data integrity
Cloudera Partners
Opportunities to engage with Cask
Who
System Integrators
Consulting Partners
Opportunity
Incorporate CDAP as a development platform
within your big data practice
Value Propositions
• Broaden pool of big data developers to
include Java developers
• Build solutions beyond offline analytics to
increase business value
• Lower cost of delivery by leveraging
reuse and lower cost skill sets
• Increase service capacity by accelerating
time to market
Who
Infrastructure ISVs
Opportunity
Integration with CDAP (Datasets,
Programs, Templates, etc.)
Value Propositions
• Address potential gaps in your
customers’ ability to grow footprint
on your solution
• Easily extend integration
opportunities into the rest of the
ecosystem via CDAP
16. Download CDAP
100% Apache 2 Open Source
CDAP 3.0 with ZeroApp UI + Shell
http://cask.co/download
http://www.cloudera.com/content/
cloudera/en/downloads/cdap/
latest.html
Use Cases and Examples
http://cask.co/get-started
Cloudera Partners
Next steps to work with Cask
Live Technical Webinar
Cask hosts a 2-hour, invite-only live
technical webinar for customers and
partners to learn more about CDAP
Next Webinar will be on
Wednesday, June 3rd, 2015
To register, please e-mail
partners@cask.co
CDAP Certification
The Cask Certified Partner Program is
available to Cloudera Partners
at no cost
Two day, on-site technical training
session at Cask HQ in Palo Alto for
developers. Includes basic and advanced
CDAP courses and labs.
To register, please e-mail
partners@cask.co
If you have any questions…
Jonathan Gray, CEO
jon@cask.co
Tom Aliotti, SVP Field Ops
tom@cask.co
Yuri Bukhan, Cloudera
ybukhan@cloudera.com