IBM CDS Overview
2. ©2015 IBM Corporation
Why the Journey to Cloud-based Analytics?
MISSION
To provide the best experience for developers and enterprises with a comprehensive set of rich, integrated cloud data services covering content, data and analytics.
Fully managed 24x7 so you can focus on new development
Pay as you go with no big up-front capital investments
Instant provisioning saves weeks of data center setup
FASTER INNOVATION
BETTER IT ECONOMICS
LOWER RISK OF FAILURE
3. ©2015 IBM Corporation
IBM Cloud Data Services
Cloudant (NoSQL DBaaS)
• Global data distribution
• Massively scalable
• Eventually consistent data model
• Built for mobile, Systems of Engagement
dashDB (Analytic Data Warehouse)
• SQL interface
• Massively parallel
• ACID compliance
• Columnar, in-memory performance
• BLU augmented with Netezza (NZ) in-DB analytics
• Built for Systems of Insight
BigInsights on Cloud (Hadoop in the Cloud)
• Bare metal performance
• Built on reference architecture
• BigInsights enterprise features
Spark as a Service (Fully-managed Spark Service)
• Optimized for extremely fast and large scale data processing
• Spark SQL, Streaming, MLlib, GraphX
• Build and run apps benefiting from operational, maintenance and hardware excellence
DB2 on Cloud (Hosted Database in the Cloud)
• Power of DB2
• Fast Provisioning
• Flexible pricing
• No loss of DBA control
• Built for Systems of Record
4. ©2015 IBM Corporation
Watson Analytics: Analytics & Visualization Services
DataWorks: Data Refinery Services
BigInsights on Cloud
• Spark for in-memory Hadoop
• Built on IBM Open Platform
• Bare metal performance
• BigInsights enterprise features
Cloudant
• Database as a Service (DBaaS)
• Massively scalable for global data distribution
• Eventually consistent data model
• Built for mobile, Systems of Engagement
dashDB
• SQL interface
• ACID compliance
• Columnar, in-memory performance
• BLU augmented with Netezza in-DB analytics
• Built for Systems of Insight
• Native integration with Watson Analytics
DB2 on Cloud
• DB2 RDBMS provisioned on Bluemix
• SQL interface
• ACID compliance
• Fast provisioning
• Built for Systems of Record
Diagram axes: ANALYTICAL to TRANSACTIONAL; UNSTRUCTURED to STRUCTURED
Mixed workloads and data types are knit together with DataWorks for true hybrid services
IBM Cloud Data Services
5. ©2015 IBM Corporation
The IBM Cloud
- “Bare-metal” outperforms virtualized
- Dedicated hardware
- 40 data centers worldwide
6. ©2015 IBM Corporation
What is Bluemix?
Bluemix is an open-standard, cloud-based platform for building,
managing, and running applications of all types (web, mobile, big
data, new smart devices, and so on).
Go Live in Seconds
Zero to running in one click. Development plans deploy in seconds. Enterprise plans deploy in 1-2 days.
DevOps
Development, monitoring, deployment, and logging tools allow the developer to run the entire application.
APIs and Services
A catalog of IBM, third-party, and open source API services allows the developer to stitch an application together in minutes.
On-Prem Integration
Build hybrid environments. Connect to on-premises assets plus other public and private clouds.
Flexible Pricing
Sign up in minutes. Pay-as-you-go and subscription models offer choice and flexibility.
Layered Security
IBM secures the platform and infrastructure and provides you with the tools to secure your apps.
7. ©2015 IBM Corporation
Security and Compliance for All CDS Offerings
Operational
• Vulnerability Scanning
• Audit Log consolidation and analysis
• User Access management
• PSIRT – Security Incident Management
Governance & Compliance
• Legal, Regulations and Compliance
• Education, Training and Awareness
• Business Continuity and Disaster Recovery
Infrastructure
• Network Architecture and Design
• Intrusion Prevention
• Operating System security hardening
Development
• Secure Engineering Development practices: threat modeling, risk assessment, static and dynamic code analysis
• Secured Development Life Cycle
Functional
• Security Architecture and Design
• Access Control, Authentication and Authorization
• Data Protection
• Security Logging
SoftLayer: physical security compliance
8. Scales & remains available to 1 billion users across Asia, North America, Europe
Transactional Throughput: 300 million requests / day (3,500 / second)
Cluster Distribution: Global (mobile devices)
Media Types Ingested: Structured, semi-structured, unstructured (logs, audio)
Fully managed services in support of massive concurrent user growth
Transactional Throughput: 2 billion requests / day (20:1 read-to-write ratio)
Data Volume: 130 TB
Cluster Growth: From 6 to over 200 servers (in 12 months)
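The daily throughput figures above can be turned into average per-second rates with simple arithmetic. The sketch below is only an illustration: it assumes traffic is spread evenly over a 24-hour day (real workloads peak well above the average), and the function names are hypothetical, not part of any IBM tool.

```python
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400 seconds


def requests_per_second(requests_per_day: float) -> float:
    """Average request rate, assuming an even spread over the day."""
    return requests_per_day / SECONDS_PER_DAY


def split_by_read_write_ratio(total_rate: float, reads_per_write: float):
    """Split a total rate into (reads, writes) for a reads:1 ratio."""
    writes = total_rate / (reads_per_write + 1)
    reads = total_rate - writes
    return reads, writes


# Figures taken from the slide: 300 million and 2 billion requests per day.
first_rate = requests_per_second(300_000_000)
second_rate = requests_per_second(2_000_000_000)
reads, writes = split_by_read_write_ratio(second_rate, 20)

print(f"300 million/day is about {first_rate:,.0f} requests/second")
print(f"2 billion/day is about {second_rate:,.0f} requests/second")
print(f"At 20:1 that is about {reads:,.0f} reads and {writes:,.0f} writes per second")
```

The first result lands close to the rounded 3,500 / second quoted on the slide; the second set of numbers is derived here and does not appear in the original deck.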
9. ©2015 IBM Corporation
A large investment research and management firm needed a persistent data store to maintain and access financial analytical reports.
Cloudant’s schema-less architecture and horizontal scalability enable their users to have real-time access to reports and analytics generated by IBM PureData for Analytics.
Use Case
10. ©2015 IBM Corporation
Use Case
The Red 10 is a data-driven marketing analytics firm in the UK. With dashDB, they are able to provide real-time analytics and updates to give an accurate view to the audiences.
With dashDB, they can provide 1) a live view of the UK & Ireland markets, 2) new segmentation based on live contact views, 3) an instant view of all relevant information, and 4) the right message, at the right time, through the right medium.
This enables growth for less and increased conversion across the sales funnel for their clients.
11. ©2015 IBM Corporation
A global pharmaceutical company with more than 90 years of innovation and leadership in diabetes care is using IBM’s Enterprise Hadoop as a Service (EHaaS) offering.
They are using BigInsights on Cloud on sets of electronic medical records (EMR) data to analyze the relevance of a pharmacological treatment of obesity and obtain cost estimates to build an economic model for obesity treatments.
Pharmaceutical Use Case
12. ©2015 IBM Corporation
Common Spark use cases
1. Running large data processing batch jobs (e.g. nightly ETL from production systems, primary Hadoop use case)
2. Interactive querying of very large data sets (e.g. BI)
3. Complex analytics and data mining across various types of data
4. Building and deploying rich analytics models (e.g. risk metrics)
5. Implementing near-real-time stream event processing (e.g. fraud / security detection)
13. The Search Continues: SETI Institute enhances E.T. search with advanced analytic platform
Need
• Allen Telescope Array (ATA) has been recording and sifting data from the cosmos for SETI in an effort to explore, understand, and explain the origin and nature of life in the universe.
• Perform omni-data analysis on over a decade of radio telescope signals with a new analytic algorithm to look for narrow-band signals.
Benefits
• Tens of millions of ATA signal events have been recorded in binary files, which in turn are linked to hundreds of millions of records in a structured database that provides additional information about the signal event, such as the exact date-time of the signal, the target coordinates, and other details. The IBM Spark project is linking these two data sets, in their entirety, for the first time.
• IBM Spark: By analyzing the vast archives of ATA content, new algorithms are already being developed to isolate human radio frequency interference (RFI) from external signals that deserve further scrutiny.
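The linking step described above is, at its core, a keyed one-to-many join between signal events and their database records. The following is a minimal plain-Python sketch of that idea, not the SETI or IBM Spark implementation: the field names (`event_id`, `observed_at`, `target`), the sample values, and the `link_events` helper are all hypothetical, and a real job at this scale would run the join on a distributed engine such as Spark.

```python
from collections import defaultdict

# Hypothetical signal events, standing in for entries parsed from binary files.
signal_events = [
    {"event_id": "e1", "file": "e1.bin"},
    {"event_id": "e2", "file": "e2.bin"},
]

# Hypothetical structured-database rows; several rows may describe one event.
metadata_rows = [
    {"event_id": "e1", "observed_at": "2013-05-01T02:14:00Z", "target": "T-001"},
    {"event_id": "e1", "observed_at": "2013-05-01T02:15:00Z", "target": "T-001"},
    {"event_id": "e2", "observed_at": "2014-01-12T23:40:00Z", "target": "T-002"},
]


def link_events(events, rows):
    """Attach every metadata row to its signal event (one-to-many join)."""
    rows_by_event = defaultdict(list)
    for row in rows:
        rows_by_event[row["event_id"]].append(row)
    # Events without any matching rows get an empty list.
    return [
        {**event, "records": rows_by_event.get(event["event_id"], [])}
        for event in events
    ]


linked = link_events(signal_events, metadata_rows)
for event in linked:
    print(event["event_id"], event["file"], len(event["records"]), "record(s)")
```

Grouping the rows by key first keeps the join to a single pass over each data set, which is the same shape a distributed join would take after partitioning by the key.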
14. ©2015 IBM Corporation
Systems of Insight
Systems of Engagement (NoSQL, Mobile Apps, Social Media, IoT, others)
Systems of Record (DB2, Oracle, HDP, flat files, others; cloud-based or on-premise)
Continuous Synchronization
IBM & Third Party Integrations (Watson Analytics, Cognos, SPSS, SAS, Tableau, ESRI ArcGIS, Aginity, others)
Watson Analytics
Cloud data services for systems of engagement, insight, and record with self-service BI – helping you understand and engage with your customers better