The Future of Analytics in Action
IBM Cloud Data Services
©2015 IBM Corporation
Why the Journey to Cloud-based Analytics?
MISSION
To provide the best experience for developers and
enterprises with a comprehensive set of rich,
integrated cloud data services covering content, data
and analytics.
Fully managed 24x7 so you
can focus on new
development
Pay as you go with no
big up-front capital
investments
Instant provisioning saves
weeks of data center setup
FASTER
INNOVATION
BETTER IT
ECONOMICS
LOWER RISK
OF FAILURE
©2015 IBM Corporation
IBM Cloud Data Services
Cloudant dashDB
BigInsights on
Cloud
Spark as a
Service
DB2 on Cloud
NoSQL DBaaS
Analytic Data
Warehouse
Hadoop in the Cloud
Fully-managed
Spark Service
Hosted Database in
the Cloud
• Global data
distribution
• Massively scalable
• Eventually
consistent data
model
• Built for mobile,
Systems of
Engagement
• SQL interface
• Massively parallel
• ACID compliance
• Columnar, in-
memory
performance
• BLU augmented
with NZ in-DB
analytics
• Built for Systems of
Insight
• Bare metal
performance
• Build on reference
architecture
• BigInsights
enterprise features
• Optimized for
extremely fast and
large scale data
processing
• Spark SQL,
Streaming, MLlib,
GraphX
• Build and run apps
benefiting from
operational,
maintenance and
hardware
excellence
• Power of DB2
• Fast Provisioning
• Flexible pricing
• No loss of DBA
control
• Built for Systems of
Record
©2015 IBM Corporation
Watson Analytics
Analytics & Visualization
Services
DataWorks
Data Refinery
Services
BigInsights on Cloud
• Spark for in-memory Hadoop
• Built on IBM Open Platform
• Bare metal performance
• BigInsights enterprise features
Cloudant
• Database as a Service (DBaaS)
• Massively scalable for global data distribution
• Eventually consistent data model
• Built for mobile, Systems of Engagement
dashDB
• SQL interface
• ACID compliance
• Columnar, in-memory performance
• BLU augmented with Netezza in-DB analytics
• Built for Systems of Insight
• Native integration with Watson Analytics
DB2 on Cloud
• DB2 RDBMS provisioned
on Bluemix
• SQL interface
• ACID compliance
• Fast provisioning
• Built for Systems of Record
ANALYTICAL TRANSACTIONAL
UNSTRUCTURED
STRUCTURED
Mixed workloads and data types are knit together with DataWorks for true hybrid services
IBM Cloud Data Services
©2015 IBM Corporation
The IBM Cloud
- “Bare-metal” outperforms virtualized
- Dedicated hardware
- 40 data centers worldwide
©2015 IBM Corporation
What is Bluemix?
Bluemix is an open-standard, cloud-based platform for building,
managing, and running applications of all types (web, mobile, big
data, new smart devices, and so on).
Go Live in Seconds
Zero to running in one click.
Development plans deploy in
seconds. Enterprise plans
deploy in 1-2 days.
DevOps
Development, monitoring,
deployment, and logging tools
allow the developer to run the
entire application.
APIs and Services
A catalog of IBM, third party,
and open source API services
allow the developer to stitch an
application together in minutes.
On-Prem Integration
Build hybrid environments.
Connect to on-premise assets
plus other public and private
clouds.
Flexible Pricing
Sign up in minutes. Pay as
you go and subscription
models offer choice and
flexibility.
Layered Security
IBM secures the platform and
infrastructure and provides
you with the tools to secure
your apps.
7 © 2015 IBM Corporation
Security and Compliance for All CDS Offerings
 Vulnerability Scanning
 Audit Log consolidation and analysis
 Use Access management
 PSRIT – Security Incident Management
 Legal, Regulations and Compliance
 Education, training and Awareness
 Business Continuity and Disaster Recovery
 Network Architecture and Design
 Intrusion Prevention
 Operation System security hardening
 Secure Engineering Development practices: threat
modeling, risk assessment, static and dynamic
code analysis
 Secured development life Cycle
 Security Architecture and Design
 Access Control, Authentication and Authorization
 Data Protection
 Security Logging
Functional
Infrastructure
Development
Governance &
Compliance
Operational
SoftLayer: physical security compliance
Scales & remains available to
1 billion users across Asia,
North America, Europe
Transactional
Throughput
300 million requests / day
( 3,500 / second )
Cluster Distribution Global (mobile devices)
Media Types
Ingested
Structured, semi-structured,
unstructured (logs, audio)
fully managed services in
support of massive
concurrent user growth
Transactional
Throughput
2 billion requests / day
( 20 : 1 read-writes )
Data Volume 130 TB
Cluster
Growth
From 6 to over 200 servers
(in 12 months)
©2015 IBM Corporation
A Large Investment Research & Management Firm: needed a
persistent data store to maintain and access financial analytical reports
Cloudant’s schema-less architecture and horizontal scalability enables
their users users to have real-time access to reports and analytics
generated by IBM PureData for Analytics
Use Case
©2015 IBM Corporation
Use Case
The Red 10 is a data-driven marketing analytics firm in the UK. With dashDB, they are able to provide
real-time analytics and updates to give an accurate view to the audiences
With dashDB, they can provide 1) a live view of the UK & Ireland markets, 2) new segmentation based
on live contact views, 3) an instant view of all relevant information, and 4) the right message, at the right
time, through the right medium.
This enables growth for less and increased conversion across the sales funnel for their clients.
©2015 IBM Corporation
A global pharmaceutical company with more than 90 years of innovation and leadership in
diabetes care is using IBM’s Enterprise Hadoop as a Service (EHaaS) offering
They are using BigInsights on Cloud on sets of electronic medical records (EMR) data to
analyze the relevance of a pharmacological treatment of obesity and obtain costs estimates to
build an economic model for obesity treatments.
Pharmaceutical Use Case
©2015 IBM Corporation
Common Spark use cases
1. Running large data processing batch jobs (e.g. nightly ETL from
production systems, primary Hadoop use case)
2. Interactive querying of very large data sets (e.g. BI)
3. Complex analytics and data mining across various types of data
4. Building and deploying rich analytics models (e.g. risk metrics)
5. Implementing near-realtime stream event processing (e.g. fraud / security
detection)
The Search Continues:
SETI Institute enhances
E.T. search with advanced
analytic platform
Need
• Allen Telescope Array (ATA) has been recording and sifting
data from the cosmos for SETI in an effort to
explore, understand, and explain the origin and nature of
life in the universe.
• Perform omni-data analysis on over a decade of radio
telescope signal with new analytic algorithm to look for
narrow band signals
Benefits
• Tens of millions of ATA signal events have been recorded in
binary files, which in turn are linked to hundreds of millions
of records in a structured database that provides additional
information about the signal event, such as the exact date-
time of the signal, the target coordinates, and other details.
The IBM Spark project is linking these two data sets – in
their entirety – for the first time.
• IBM Spark : By analyzing the vast archives of ATA content,
new algorithms are already being developed to isolate
human radio frequency interference (RFI) from external
signals which deserve further scrutiny.
©2015 IBM Corporation
Systems of Insight
Systems of Engagement
(NoSQL, Mobile Apps, Social Media, IoT, others)
Systems of Record
(DB2, Oracle, HDP, flat files, others;
cloud-based or on-premise)
Continuous
Synchronization
IBM & Third Party Integrations
(Watson Analytics, Cognos, SPSS, SAS,
Tableau, ESRI ArcGIS, Aginity, others)
Watson
Analytics
Cloud data services for systems of engagement, insight, and record with self-service BI
– helping you understand and engage with your customers better
IBM CDS Overview

IBM CDS Overview

  • 1.
    The Future ofAnalytics in Action IBM Cloud Data Services
  • 2.
    ©2015 IBM Corporation Whythe Journey to Cloud-based Analytics? MISSION To provide the best experience for developers and enterprises with a comprehensive set of rich, integrated cloud data services covering content, data and analytics. Fully managed 24x7 so you can focus on new development Pay as you go with no big up-front capital investments Instant provisioning saves weeks of data center setup FASTER INNOVATION BETTER IT ECONOMICS LOWER RISK OF FAILURE
  • 3.
    ©2015 IBM Corporation IBMCloud Data Services Cloudant dashDB BigInsights on Cloud Spark as a Service DB2 on Cloud NoSQL DBaaS Analytic Data Warehouse Hadoop in the Cloud Fully-managed Spark Service Hosted Database in the Cloud • Global data distribution • Massively scalable • Eventually consistent data model • Built for mobile, Systems of Engagement • SQL interface • Massively parallel • ACID compliance • Columnar, in- memory performance • BLU augmented with NZ in-DB analytics • Built for Systems of Insight • Bare metal performance • Build on reference architecture • BigInsights enterprise features • Optimized for extremely fast and large scale data processing • Spark SQL, Streaming, MLlib, GraphX • Build and run apps benefiting from operational, maintenance and hardware excellence • Power of DB2 • Fast Provisioning • Flexible pricing • No loss of DBA control • Built for Systems of Record
  • 4.
    ©2015 IBM Corporation WatsonAnalytics Analytics & Visualization Services DataWorks Data Refinery Services BigInsights on Cloud • Spark for in-memory Hadoop • Built on IBM Open Platform • Bare metal performance • BigInsights enterprise features Cloudant • Database as a Service (DBaaS) • Massively scalable for global data distribution • Eventually consistent data model • Built for mobile, Systems of Engagement dashDB • SQL interface • ACID compliance • Columnar, in-memory performance • BLU augmented with Netezza in-DB analytics • Built for Systems of Insight • Native integration with Watson Analytics DB2 on Cloud • DB2 RDBMS provisioned on Bluemix • SQL interface • ACID compliance • Fast provisioning • Built for Systems of Record ANALYTICAL TRANSACTIONAL UNSTRUCTURED STRUCTURED Mixed workloads and data types are knit together with DataWorks for true hybrid services IBM Cloud Data Services
  • 5.
    ©2015 IBM Corporation TheIBM Cloud - “Bare-metal” outperforms virtualized - Dedicated hardware - 40 data centers worldwide
  • 6.
    ©2015 IBM Corporation Whatis Bluemix? Bluemix is an open-standard, cloud-based platform for building, managing, and running applications of all types (web, mobile, big data, new smart devices, and so on). Go Live in Seconds Zero to running in one click. Development plans deploy in seconds. Enterprise plans deploy in 1-2 days. DevOps Development, monitoring, deployment, and logging tools allow the developer to run the entire application. APIs and Services A catalog of IBM, third party, and open source API services allow the developer to stitch an application together in minutes. On-Prem Integration Build hybrid environments. Connect to on-premise assets plus other public and private clouds. Flexible Pricing Sign up in minutes. Pay as you go and subscription models offer choice and flexibility. Layered Security IBM secures the platform and infrastructure and provides you with the tools to secure your apps.
  • 7.
    7 © 2015IBM Corporation Security and Compliance for All CDS Offerings  Vulnerability Scanning  Audit Log consolidation and analysis  Use Access management  PSRIT – Security Incident Management  Legal, Regulations and Compliance  Education, training and Awareness  Business Continuity and Disaster Recovery  Network Architecture and Design  Intrusion Prevention  Operation System security hardening  Secure Engineering Development practices: threat modeling, risk assessment, static and dynamic code analysis  Secured development life Cycle  Security Architecture and Design  Access Control, Authentication and Authorization  Data Protection  Security Logging Functional Infrastructure Development Governance & Compliance Operational SoftLayer: physical security compliance
  • 8.
    Scales & remainsavailable to 1 billion users across Asia, North America, Europe Transactional Throughput 300 million requests / day ( 3,500 / second ) Cluster Distribution Global (mobile devices) Media Types Ingested Structured, semi-structured, unstructured (logs, audio) fully managed services in support of massive concurrent user growth Transactional Throughput 2 billion requests / day ( 20 : 1 read-writes ) Data Volume 130 TB Cluster Growth From 6 to over 200 servers (in 12 months)
  • 9.
    ©2015 IBM Corporation ALarge Investment Research & Management Firm: needed a persistent data store to maintain and access financial analytical reports Cloudant’s schema-less architecture and horizontal scalability enables their users users to have real-time access to reports and analytics generated by IBM PureData for Analytics Use Case
  • 10.
    ©2015 IBM Corporation UseCase The Red 10 is a data-driven marketing analytics firm in the UK. With dashDB, they are able to provide real-time analytics and updates to give an accurate view to the audiences With dashDB, they can provide 1) a live view of the UK & Ireland markets, 2) new segmentation based on live contact views, 3) an instant view of all relevant information, and 4) the right message, at the right time, through the right medium. This enables growth for less and increased conversion across the sales funnel for their clients.
  • 11.
    ©2015 IBM Corporation Aglobal pharmaceutical company with more than 90 years of innovation and leadership in diabetes care is using IBM’s Enterprise Hadoop as a Service (EHaaS) offering They are using BigInsights on Cloud on sets of electronic medical records (EMR) data to analyze the relevance of a pharmacological treatment of obesity and obtain costs estimates to build an economic model for obesity treatments. Pharmaceutical Use Case
  • 12.
    ©2015 IBM Corporation CommonSpark use cases 1. Running large data processing batch jobs (e.g. nightly ETL from production systems, primary Hadoop use case) 2. Interactive querying of very large data sets (e.g. BI) 3. Complex analytics and data mining across various types of data 4. Building and deploying rich analytics models (e.g. risk metrics) 5. Implementing near-realtime stream event processing (e.g. fraud / security detection)
  • 13.
    The Search Continues: SETIInstitute enhances E.T. search with advanced analytic platform Need • Allen Telescope Array (ATA) has been recording and sifting data from the cosmos for SETI in an effort to explore, understand, and explain the origin and nature of life in the universe. • Perform omni-data analysis on over a decade of radio telescope signal with new analytic algorithm to look for narrow band signals Benefits • Tens of millions of ATA signal events have been recorded in binary files, which in turn are linked to hundreds of millions of records in a structured database that provides additional information about the signal event, such as the exact date- time of the signal, the target coordinates, and other details. The IBM Spark project is linking these two data sets – in their entirety – for the first time. • IBM Spark : By analyzing the vast archives of ATA content, new algorithms are already being developed to isolate human radio frequency interference (RFI) from external signals which deserve further scrutiny.
  • 14.
    ©2015 IBM Corporation Systemsof Insight Systems of Engagement (NoSQL, Mobile Apps, Social Media, IoT, others) Systems of Record (DB2, Oracle, HDP, flat files, others; cloud-based or on-premise) Continuous Synchronization IBM & Third Party Integrations (Watson Analytics, Cognos, SPSS, SAS, Tableau, ESRI ArcGIS, Aginity, others) Watson Analytics Cloud data services for systems of engagement, insight, and record with self-service BI – helping you understand and engage with your customers better