SlideShare a Scribd company logo
1 of 27
Download to read offline
AI Scalability
for a Decade
Spark Summit
Europe 10/2019
Dave McDonnell
Manager, Business Development
IBM Storage Division
§ The Path to AI
§ Challenges
§ IA and AI
The Path to AI
AI is transforming every industry
Autonomous driving
Collision avoidance
Route optimization
Location-based advertising
Customer Experience
Stock forecasting
Buyer behavior
Clinical trials
Drug discovery
Genomics
Experimental sensor capture
Hypothesis modeling
Seismic analysis
exploration
Smart metering /
Usage forecasting
Market prediction
Fraud detection
Risk mitigation
Threat detection/assessment
Video surveillance
Social media monitoring
Traffic flow analysis
Manufacturing quality control
Supply chain optimization
Warranty analysis
Cognitive applications require cognitive
infrastructure
Scalability
Cost Management
Simplicity
Security
Governance
…
Infrastructure
Challenges
What is AI?
Laptop
R Programming
Python Programming
Concept
Understanding
Defining a Problem
Building Model
Proof of Concept with
GPU Technology
POC in the Cloud
POC On Premise
Does it work?
Can it do something
meaningful for me
today?
Which AI opportunities
are next?
Expanding
Growth in Cloud Usage
GPU Workstation
GPU Server
How to improve?
More applications
More GPU
workstations/servers
USB Drives. Disk
Array #1. Disk Array
#2. etc..
Challenges
Searching for
information
Sharing information
Performance
Scaling
Back-Up
Archiving
Security
Governance
Inefficient workload
management results in poor
server utilization rates and
throughput.
Replicated data
Difficult to scale
Lack of security with
open-source frameworks
and applications
introduces risk.
Multiple Spark teams each with
dedicated servers = wasted
capacity and high administrative
overhead.
Scaling Spark
is challenging
47%
Advanced data
management
50%
Data volume
and quality
44%
Skills gap
Top 3 challenges
for organizations
deploying AI workloads
The Problem
•Data needs to be labelled
•Its manual, too much data, time taking
•Data is scarce and expensive to aggerate
•Developing custom models quickly
•Data scientists are required
•Not all Data scientists are created equal
•Deploying custom solutions around these Models
•Need application developers with skills in OpenCV
•Leveraging GPUs
•Train and Infer faster
•Need to deploy models at the Edge – Nvidia TX2/Xavier/T4
•Keep up with CUDA drivers
AI and IA
80%
of data is either
inaccessible,
untrusted or
unanalyzed
81%
of users do not
understand the
data required
for AI
(information architecture)
12
No amount of AI algorithmic sophistication will overcome a
lack of data [architecture] … bad data is simply paralyzing…“
There is no
Without an
”
Let’s Help You Get There..
Software is your friend
A workflow view
AI and Big Data Analytics at Scale
© IBM Corporation 2019 14
Cloud Object Storage and Spectrum Discover and Spectrum Scale
• Ingests & indexes system metadata
via Action Agent SDK
• Extracts labels from images
• Adds as custom tags
• Global ingest of IOT data from
vehicles
• Geo-dispersed COS
• Searches for images with
labeled as having
‘Pothole’ feature
SORT & EXTRACTINGEST CURATE TRAIN
Stop sign
Data Scientist
• Enriches data catalog
with new tags derived
from analysis
via Action Agent SDK
• Trains a model
Pothole
Data Scientist
Tools like Power Vision AI
Is your data stranded?
Complex workflows can lead to data isolation
Replication
sprawl Time
to deliver
Synch
issues
Custody
chain
Performance
disparities
High labor
cost
Mgt.
Complication
Optimization
nightmares
?
Ingest Preparation Training Inference
The End-to-End Enterprise Data Pipeline
Machine Learning and Deep Learning do happen in a silo
EDGE INSIGHTS
Ingest Organize Analyze Prepare Train Inference
Streams/
NFS/S3
Data Input
Integrate new data
with existing
repositories
Correlate data from
data lake for newer
insights and views
Use select datasets to identify
patterns and train models for future
decision making
ETL, Tagging
BI, HPC
Sample Workloads
Data as the shared asset between various Analytics and AI stages
in an E2E enterprise data pipeline
Hadoop
HDFS
Data Lake
Cloud Based
Data Providers
IOT
Deep
Learning
ImpactHDFS
Transparency
Connector
/ESS
Enabling Agility, Straight-Thru Processing
for Improved Data Engineering ETL
Spark AI Grid with Watson ML Accelerator
Reduce cost and improve service levels
© IBM Corporation 2018 18
IBM Elastic Storage Server (ESS)
Integrated scale-out data management for file and object data
Optimal building block for high-performance, scalable,
reliable enterprise Spectrum Scale storage
• Faster data access with choice to scale-up or out
• Easy to deploy clusters with unified system GUI
• Simplified storage administration with IBM Spectrum Control integration
One solution for all your Spectrum Scale data needs
• Single repository of data with unified file and object support
• Anywhere access with multi-protocol support:
NFS 4.0, SMB, OpenStack Swift, Cinder, and Manila
• Ideal for Big Data Analytics with full Hadoop transparency
Ready for business critical data
• Disaster recovery with synchronous or asynchronous replication
• Ensure reliability and fast rebuild times using Spectrum Scale RAID’s
dispersed data and erasure code
• Five 99999s of availability
ESS 5U84
Storage
ESS 5U84
Storage
ESS 5U84
Storage
ESS 5U84
Storage
ESS 5U84
Storage
ESS 5U84
Storage
Power Systems LC922 – Delivering enhanced price-performance for Apache Spark
Reduce operating costs and deliver results faster compared to tested Intel Xeon systems
1. Results are based IBM Internal Measurements running four concurrent streams of 99 TPC-DS like queries against a 3TB dataset. Results valid as of 4/25/18 and conducted under laboratory condition with speculative execution controls to mitigate user-to-kernel and user-to-user side-channel attacks on both
systems, individual results can vary based on workload size, use of storage subsystems & other conditions
2. Hardware: 4 nodes IBM Power LC922 (2x20-core/2.7 GHz/512 GB memory) using 12 x 8TB HDD, 10 GbE two-port, RHEL 7.5 LE for Power9 and 4 nodes of Intel Xeon Gold 6140; 36 cores (2 x 18c chips) at 2.3 GHz; 512 GB memory, 12 x 8TB HDDs, 10Gbps NIC, Red Hat Enterprise Linux 7.5
3. Software: Apache Spark 2.3.0 located at http://spark.apache.org/downloads.html ; and open source Hadoop HDP 2.7.5
4. Pricing is based on Power LC922 http://www-03.ibm.com/systems/power/hardware/linux-lc.html and publicly available x86 pricing.
5. Apache®, Apache Spark®, and associated logos are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. No endorsement by The Apache Software Foundation is implied by the use of these marks.
Intel Xeon SP Gold
6140 server
22.6 QpH
Power LC922
29.5 QpH
Power LC922
$43,948
Intel Xeon SP Gold
6150 server
$53,548
18%
LOWER
Price2,3,4
30%
MORE
Performance1
Power LC922 Delivers
1.6x
Price-performance
+
Dedicated Resources
IBM Spectrum Conductor
A complete enterprise-grade solution for Data Engineering and Analytics
• Dynamic Sharing with guaranteed SLA
• Rapidly deploy & support multiple concurrent
instances and versions of Spark, Notebooks,
Anaconda, MongoDB & other services
• Proven at scale:
• 5K hosts, 150K cores, >1B tasks/day
• High performance workload and resource manager:
• 30-224% faster
IBM Spectrum Conductor / Spark + AI Summit / 2019-Apr-23 / © 2019 IBM Corporation 20
Secure Multitenant Shared Resources
Shared Resources
Dedicated Resource Silos
IBM Spectrum Conductor / Spark + AI Summit / 2019-Apr-23 / © 2019 IBM Corporation 21
Superior performance and scalability compared to
competing orchestration / resource management
solutions
30 to 224% faster than YARN
25 and 88% faster than Apache Mesos
Consistent and predictable delivering 77% relative
standard deviation (RSD)
YARN & Mesos are relatively unpredictable 777%
and 304% RSD respectively
Audited results
2607
327
899
574
1673
253
582
177
1660
202
478 458
0
500
1000
1500
2000
2500
3000
Case 1: Sync
interactive multi-user
Case 2:
Asynchonous batch
multi-user
Case 3: Mixed multi-
user
Case 4: Mixed multi-
tenant
Throughput of Spark SMB-2 benchmark workload on various
Resource Managers
(Jobs/hour - higher is better)
IBM Spectrum Conductor Apache YARN v2.7.3 Apache Mesos v1.0.1
Audited benchmark results https://stacresearch.com/news/2017/05/19/IBM170405
IBM Spectrum Conductor
Better performance and predictability
IBM PowerAI Vision: “Point-and-Click” AI for Images & Video
Label Image or
Video Data
Auto-Train AI
Model
Package & Deploy
AI Model
Analytics in Sports
23© Copyright IBM Corporation 2018
Leading the
Pack in
AI Infrastructure
IBM Systems Reference
Architecture for AI
IBM PowerAI
IBM Spectrum Computing
IBM Storage
IBM Accelerated
Compute Platform
IBM Power Servers
IBM Spectrum Computing
IBM Spectrum Scale & ESS
IBM Storage Solutions
for AI / ML / DL
IBM Spectrum Scale
IBM Cloud Object Storage
IBM Spectrum Discover
All Built upon Data Infrastructure with:
Wells Fargo:
Financial Risk Modeling
“Wells Fargo data scientists
build, enhance, & validate
hundreds of models each day,
speed is critical, along with
scalability, as they deal with
greater amounts of data &
more complicated models.
Academically, people talk
about fancy algorithms. But
in real life, how efficiently
the models run in
distributed environments is
critical. IBM is a very good
partner & we are very pleased
with their solution.”
Richard Liu, Quantitative
Analytics Manager, Wells Fargo,
IBM Think18
How Do You Get AI Scalability for a Decade ?
Plan for the Future
• Where is the biggest pain today?
• What does it look like in 3, 5, and 10 years?
• Think workflow
Software Is Your Friend
• Capabilities, Efficiency, Balance, Cost, Scalability
• Usability
Hybrid MultiCloud World
• Cloud or on premise? It’s both !
• Drive the linkage with Data Science and IT
(information architecture)
There is no
Without an
AI takes flight
“In any business, differentiation is everything. Data is the
source of differentiation. How we started on the path to
today’s Delta: we had to get the basics first. The foundation.
We now have the foundation, the data infrastructure. In order to
improve the processes we have. In the past, we had the data,
but we didn’t have the sourcing, or the data infrastructure to get
at the data. So we built this data ocean with billions of
data points and turned it into action for Delta to better
serve customers.”
“That’s a great point. You can’t do enterprise wide analytics and
AI (Artificial Intelligence) until you have the right “data basics”
first, i.e. you have to have the foundation information
architecture and data infrastructure in place.
Most companies have random acts of digital and AI all
over. But until you can pull them together, and re-imagine
how the work is to be done, you can’t scale any of it.”
Ed Bastian, CEO, Delta Airlines Ginni Rometty, CEO, IBM
“In 2010, we had 5,600 maintenance cancellations, at least one every day that year.
In 2018, we had just 55 maintenance cancellations.
That’s a 99% improvement.”
– Ed Bastian, CEO, Delta Airlines
IBM Keynote
Jan. 9, 2019

More Related Content

What's hot

Amazon EC2 Foundations - SRV319 - Anaheim AWS Summit
Amazon EC2 Foundations - SRV319 - Anaheim AWS SummitAmazon EC2 Foundations - SRV319 - Anaheim AWS Summit
Amazon EC2 Foundations - SRV319 - Anaheim AWS SummitAmazon Web Services
 
HPC on Azure for Reserach
HPC on Azure for ReserachHPC on Azure for Reserach
HPC on Azure for ReserachJürgen Ambrosi
 
Amazon EC2 Foundations - SRV319 - Atlanta AWS Summit
Amazon EC2 Foundations - SRV319 - Atlanta AWS SummitAmazon EC2 Foundations - SRV319 - Atlanta AWS Summit
Amazon EC2 Foundations - SRV319 - Atlanta AWS SummitAmazon Web Services
 
Part 1: Lambda Architectures: Simplified by Apache Kudu
Part 1: Lambda Architectures: Simplified by Apache KuduPart 1: Lambda Architectures: Simplified by Apache Kudu
Part 1: Lambda Architectures: Simplified by Apache KuduCloudera, Inc.
 
Amazon EC2 Foundations - SRV319 - Toronto AWS Summit
Amazon EC2 Foundations - SRV319 - Toronto AWS SummitAmazon EC2 Foundations - SRV319 - Toronto AWS Summit
Amazon EC2 Foundations - SRV319 - Toronto AWS SummitAmazon Web Services
 
High Performance Computing (HPC) on AWS 101
High Performance Computing (HPC) on AWS 101High Performance Computing (HPC) on AWS 101
High Performance Computing (HPC) on AWS 101Amazon Web Services
 
Apache kylin boost your SQLs on extremely large dataset
Apache kylin boost your SQLs on extremely large datasetApache kylin boost your SQLs on extremely large dataset
Apache kylin boost your SQLs on extremely large datasetChun'en Ni
 
Foundations of Amazon EC2 - SRV319 - Chicago AWS Summit
Foundations of Amazon EC2 - SRV319 - Chicago AWS SummitFoundations of Amazon EC2 - SRV319 - Chicago AWS Summit
Foundations of Amazon EC2 - SRV319 - Chicago AWS SummitAmazon Web Services
 
Announcing Amazon EC2 F1 Instances with Custom FPGAs
Announcing Amazon EC2 F1 Instances with Custom FPGAsAnnouncing Amazon EC2 F1 Instances with Custom FPGAs
Announcing Amazon EC2 F1 Instances with Custom FPGAsAmazon Web Services
 
Big Data and High Performance Computing Solutions in the AWS Cloud
Big Data and High Performance Computing Solutions in the AWS CloudBig Data and High Performance Computing Solutions in the AWS Cloud
Big Data and High Performance Computing Solutions in the AWS CloudAmazon Web Services
 
Hadoop in the Clouds, Virtualization and Virtual Machines
Hadoop in the Clouds, Virtualization and Virtual MachinesHadoop in the Clouds, Virtualization and Virtual Machines
Hadoop in the Clouds, Virtualization and Virtual MachinesDataWorks Summit
 
Why Networked FICON Storage Is Better Than Direct Attached Storage
Why Networked FICON Storage Is Better Than Direct Attached StorageWhy Networked FICON Storage Is Better Than Direct Attached Storage
Why Networked FICON Storage Is Better Than Direct Attached StorageHitachi Vantara
 
Empowering you with Democratized Data Access, Data Science and Machine Learning
Empowering you with Democratized Data Access, Data Science and Machine LearningEmpowering you with Democratized Data Access, Data Science and Machine Learning
Empowering you with Democratized Data Access, Data Science and Machine LearningDataWorks Summit
 
Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ...
 Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ... Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ...
Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ...Cloudera, Inc.
 
Challenges for running Hadoop on AWS - AdvancedAWS Meetup
Challenges for running Hadoop on AWS - AdvancedAWS MeetupChallenges for running Hadoop on AWS - AdvancedAWS Meetup
Challenges for running Hadoop on AWS - AdvancedAWS MeetupAndrei Savu
 
Build Big Data Enterprise solutions faster on Azure HDInsight
Build Big Data Enterprise solutions faster on Azure HDInsightBuild Big Data Enterprise solutions faster on Azure HDInsight
Build Big Data Enterprise solutions faster on Azure HDInsightDataWorks Summit
 

What's hot (20)

Amazon EC2 Foundations - SRV319 - Anaheim AWS Summit
Amazon EC2 Foundations - SRV319 - Anaheim AWS SummitAmazon EC2 Foundations - SRV319 - Anaheim AWS Summit
Amazon EC2 Foundations - SRV319 - Anaheim AWS Summit
 
Amazon EC2 Foundations
Amazon EC2 FoundationsAmazon EC2 Foundations
Amazon EC2 Foundations
 
HPC on Azure for Reserach
HPC on Azure for ReserachHPC on Azure for Reserach
HPC on Azure for Reserach
 
Amazon EC2 Foundations - SRV319 - Atlanta AWS Summit
Amazon EC2 Foundations - SRV319 - Atlanta AWS SummitAmazon EC2 Foundations - SRV319 - Atlanta AWS Summit
Amazon EC2 Foundations - SRV319 - Atlanta AWS Summit
 
Part 1: Lambda Architectures: Simplified by Apache Kudu
Part 1: Lambda Architectures: Simplified by Apache KuduPart 1: Lambda Architectures: Simplified by Apache Kudu
Part 1: Lambda Architectures: Simplified by Apache Kudu
 
Amazon EC2 Foundations - SRV319 - Toronto AWS Summit
Amazon EC2 Foundations - SRV319 - Toronto AWS SummitAmazon EC2 Foundations - SRV319 - Toronto AWS Summit
Amazon EC2 Foundations - SRV319 - Toronto AWS Summit
 
What Can HPC on AWS Do?
What Can HPC on AWS Do?What Can HPC on AWS Do?
What Can HPC on AWS Do?
 
High Performance Computing (HPC) on AWS 101
High Performance Computing (HPC) on AWS 101High Performance Computing (HPC) on AWS 101
High Performance Computing (HPC) on AWS 101
 
Apache kylin boost your SQLs on extremely large dataset
Apache kylin boost your SQLs on extremely large datasetApache kylin boost your SQLs on extremely large dataset
Apache kylin boost your SQLs on extremely large dataset
 
Foundations of Amazon EC2 - SRV319 - Chicago AWS Summit
Foundations of Amazon EC2 - SRV319 - Chicago AWS SummitFoundations of Amazon EC2 - SRV319 - Chicago AWS Summit
Foundations of Amazon EC2 - SRV319 - Chicago AWS Summit
 
Announcing Amazon EC2 F1 Instances with Custom FPGAs
Announcing Amazon EC2 F1 Instances with Custom FPGAsAnnouncing Amazon EC2 F1 Instances with Custom FPGAs
Announcing Amazon EC2 F1 Instances with Custom FPGAs
 
Amazon EC2 Foundations
Amazon EC2 FoundationsAmazon EC2 Foundations
Amazon EC2 Foundations
 
Sgi hadoop
Sgi hadoopSgi hadoop
Sgi hadoop
 
Big Data and High Performance Computing Solutions in the AWS Cloud
Big Data and High Performance Computing Solutions in the AWS CloudBig Data and High Performance Computing Solutions in the AWS Cloud
Big Data and High Performance Computing Solutions in the AWS Cloud
 
Hadoop in the Clouds, Virtualization and Virtual Machines
Hadoop in the Clouds, Virtualization and Virtual MachinesHadoop in the Clouds, Virtualization and Virtual Machines
Hadoop in the Clouds, Virtualization and Virtual Machines
 
Why Networked FICON Storage Is Better Than Direct Attached Storage
Why Networked FICON Storage Is Better Than Direct Attached StorageWhy Networked FICON Storage Is Better Than Direct Attached Storage
Why Networked FICON Storage Is Better Than Direct Attached Storage
 
Empowering you with Democratized Data Access, Data Science and Machine Learning
Empowering you with Democratized Data Access, Data Science and Machine LearningEmpowering you with Democratized Data Access, Data Science and Machine Learning
Empowering you with Democratized Data Access, Data Science and Machine Learning
 
Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ...
 Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ... Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ...
Gartner Data and Analytics Summit: Bringing Self-Service BI & SQL Analytics ...
 
Challenges for running Hadoop on AWS - AdvancedAWS Meetup
Challenges for running Hadoop on AWS - AdvancedAWS MeetupChallenges for running Hadoop on AWS - AdvancedAWS Meetup
Challenges for running Hadoop on AWS - AdvancedAWS Meetup
 
Build Big Data Enterprise solutions faster on Azure HDInsight
Build Big Data Enterprise solutions faster on Azure HDInsightBuild Big Data Enterprise solutions faster on Azure HDInsight
Build Big Data Enterprise solutions faster on Azure HDInsight
 

Similar to AI Scalability for the Next Decade

Breaking the Silos: Storage for Analytics & AI
Breaking the Silos: Storage for Analytics & AIBreaking the Silos: Storage for Analytics & AI
Breaking the Silos: Storage for Analytics & AIDataWorks Summit
 
Enabling a hardware accelerated deep learning data science experience for Apa...
Enabling a hardware accelerated deep learning data science experience for Apa...Enabling a hardware accelerated deep learning data science experience for Apa...
Enabling a hardware accelerated deep learning data science experience for Apa...Indrajit Poddar
 
Innovation with ai at scale on the edge vt sept 2019 v0
Innovation with ai at scale  on the edge vt sept 2019 v0Innovation with ai at scale  on the edge vt sept 2019 v0
Innovation with ai at scale on the edge vt sept 2019 v0Ganesan Narayanasamy
 
2016 August POWER Up Your Insights - IBM System Summit Mumbai
2016 August POWER Up Your Insights - IBM System Summit Mumbai2016 August POWER Up Your Insights - IBM System Summit Mumbai
2016 August POWER Up Your Insights - IBM System Summit MumbaiAnand Haridass
 
Trends towards the merge of HPC + Big Data systems
Trends towards the merge of HPC + Big Data systemsTrends towards the merge of HPC + Big Data systems
Trends towards the merge of HPC + Big Data systemsIgor José F. Freitas
 
How to Optimize Hortonworks Apache Spark ML Workloads on Modern Processors
How to Optimize Hortonworks Apache Spark ML Workloads on Modern Processors How to Optimize Hortonworks Apache Spark ML Workloads on Modern Processors
How to Optimize Hortonworks Apache Spark ML Workloads on Modern Processors DataWorks Summit/Hadoop Summit
 
Optimizing Hortonworks Apache Spark machine learning workloads for contempora...
Optimizing Hortonworks Apache Spark machine learning workloads for contempora...Optimizing Hortonworks Apache Spark machine learning workloads for contempora...
Optimizing Hortonworks Apache Spark machine learning workloads for contempora...Indrajit Poddar
 
Enabling a hardware accelerated deep learning data science experience for Apa...
Enabling a hardware accelerated deep learning data science experience for Apa...Enabling a hardware accelerated deep learning data science experience for Apa...
Enabling a hardware accelerated deep learning data science experience for Apa...DataWorks Summit
 
TechEvent Databricks on Azure
TechEvent Databricks on AzureTechEvent Databricks on Azure
TechEvent Databricks on AzureTrivadis
 
Ibm symp14 referentin_barbara koch_power_8 launch bk
Ibm symp14 referentin_barbara koch_power_8 launch bkIbm symp14 referentin_barbara koch_power_8 launch bk
Ibm symp14 referentin_barbara koch_power_8 launch bkIBM Switzerland
 
Ανδρέας Τσαγκάρης, 5th Digital Banking Forum
Ανδρέας Τσαγκάρης, 5th Digital Banking ForumΑνδρέας Τσαγκάρης, 5th Digital Banking Forum
Ανδρέας Τσαγκάρης, 5th Digital Banking ForumStarttech Ventures
 
Accelerate Machine Learning Software on Intel Architecture
Accelerate Machine Learning Software on Intel Architecture Accelerate Machine Learning Software on Intel Architecture
Accelerate Machine Learning Software on Intel Architecture Intel® Software
 
Excellent slides on the new z13s announced on 16th Feb 2016
Excellent slides on the new z13s announced on 16th Feb 2016Excellent slides on the new z13s announced on 16th Feb 2016
Excellent slides on the new z13s announced on 16th Feb 2016Luigi Tommaseo
 
Azure Data Explorer deep dive - review 04.2020
Azure Data Explorer deep dive - review 04.2020Azure Data Explorer deep dive - review 04.2020
Azure Data Explorer deep dive - review 04.2020Riccardo Zamana
 
OpenPOWER/POWER9 Webinar from MIT and IBM
OpenPOWER/POWER9 Webinar from MIT and IBM OpenPOWER/POWER9 Webinar from MIT and IBM
OpenPOWER/POWER9 Webinar from MIT and IBM Ganesan Narayanasamy
 
Architecting an Open Source AI Platform 2018 edition
Architecting an Open Source AI Platform   2018 editionArchitecting an Open Source AI Platform   2018 edition
Architecting an Open Source AI Platform 2018 editionDavid Talby
 
Spectrum Scale final
Spectrum Scale finalSpectrum Scale final
Spectrum Scale finalJoe Krotz
 
Azure Global Bootcamp 2018 Paris Keynote
Azure Global Bootcamp 2018 Paris KeynoteAzure Global Bootcamp 2018 Paris Keynote
Azure Global Bootcamp 2018 Paris KeynoteAlex Danvy
 
AWS re:Invent 2016 Day 1 Keynote re:Cap
AWS re:Invent 2016 Day 1 Keynote re:CapAWS re:Invent 2016 Day 1 Keynote re:Cap
AWS re:Invent 2016 Day 1 Keynote re:CapIan Massingham
 
AWS re:Invent 2016 Day 1 Keynote re:Cap
AWS re:Invent 2016 Day 1 Keynote re:CapAWS re:Invent 2016 Day 1 Keynote re:Cap
AWS re:Invent 2016 Day 1 Keynote re:CapAdrian Hornsby
 

Similar to AI Scalability for the Next Decade (20)

Breaking the Silos: Storage for Analytics & AI
Breaking the Silos: Storage for Analytics & AIBreaking the Silos: Storage for Analytics & AI
Breaking the Silos: Storage for Analytics & AI
 
Enabling a hardware accelerated deep learning data science experience for Apa...
Enabling a hardware accelerated deep learning data science experience for Apa...Enabling a hardware accelerated deep learning data science experience for Apa...
Enabling a hardware accelerated deep learning data science experience for Apa...
 
Innovation with ai at scale on the edge vt sept 2019 v0
Innovation with ai at scale  on the edge vt sept 2019 v0Innovation with ai at scale  on the edge vt sept 2019 v0
Innovation with ai at scale on the edge vt sept 2019 v0
 
2016 August POWER Up Your Insights - IBM System Summit Mumbai
2016 August POWER Up Your Insights - IBM System Summit Mumbai2016 August POWER Up Your Insights - IBM System Summit Mumbai
2016 August POWER Up Your Insights - IBM System Summit Mumbai
 
Trends towards the merge of HPC + Big Data systems
Trends towards the merge of HPC + Big Data systemsTrends towards the merge of HPC + Big Data systems
Trends towards the merge of HPC + Big Data systems
 
How to Optimize Hortonworks Apache Spark ML Workloads on Modern Processors
How to Optimize Hortonworks Apache Spark ML Workloads on Modern Processors How to Optimize Hortonworks Apache Spark ML Workloads on Modern Processors
How to Optimize Hortonworks Apache Spark ML Workloads on Modern Processors
 
Optimizing Hortonworks Apache Spark machine learning workloads for contempora...
Optimizing Hortonworks Apache Spark machine learning workloads for contempora...Optimizing Hortonworks Apache Spark machine learning workloads for contempora...
Optimizing Hortonworks Apache Spark machine learning workloads for contempora...
 
Enabling a hardware accelerated deep learning data science experience for Apa...
Enabling a hardware accelerated deep learning data science experience for Apa...Enabling a hardware accelerated deep learning data science experience for Apa...
Enabling a hardware accelerated deep learning data science experience for Apa...
 
TechEvent Databricks on Azure
TechEvent Databricks on AzureTechEvent Databricks on Azure
TechEvent Databricks on Azure
 
Ibm symp14 referentin_barbara koch_power_8 launch bk
Ibm symp14 referentin_barbara koch_power_8 launch bkIbm symp14 referentin_barbara koch_power_8 launch bk
Ibm symp14 referentin_barbara koch_power_8 launch bk
 
Ανδρέας Τσαγκάρης, 5th Digital Banking Forum
Ανδρέας Τσαγκάρης, 5th Digital Banking ForumΑνδρέας Τσαγκάρης, 5th Digital Banking Forum
Ανδρέας Τσαγκάρης, 5th Digital Banking Forum
 
Accelerate Machine Learning Software on Intel Architecture
Accelerate Machine Learning Software on Intel Architecture Accelerate Machine Learning Software on Intel Architecture
Accelerate Machine Learning Software on Intel Architecture
 
Excellent slides on the new z13s announced on 16th Feb 2016
Excellent slides on the new z13s announced on 16th Feb 2016Excellent slides on the new z13s announced on 16th Feb 2016
Excellent slides on the new z13s announced on 16th Feb 2016
 
Azure Data Explorer deep dive - review 04.2020
Azure Data Explorer deep dive - review 04.2020Azure Data Explorer deep dive - review 04.2020
Azure Data Explorer deep dive - review 04.2020
 
OpenPOWER/POWER9 Webinar from MIT and IBM
OpenPOWER/POWER9 Webinar from MIT and IBM OpenPOWER/POWER9 Webinar from MIT and IBM
OpenPOWER/POWER9 Webinar from MIT and IBM
 
Architecting an Open Source AI Platform 2018 edition
Architecting an Open Source AI Platform   2018 editionArchitecting an Open Source AI Platform   2018 edition
Architecting an Open Source AI Platform 2018 edition
 
Spectrum Scale final
Spectrum Scale finalSpectrum Scale final
Spectrum Scale final
 
Azure Global Bootcamp 2018 Paris Keynote
Azure Global Bootcamp 2018 Paris KeynoteAzure Global Bootcamp 2018 Paris Keynote
Azure Global Bootcamp 2018 Paris Keynote
 
AWS re:Invent 2016 Day 1 Keynote re:Cap
AWS re:Invent 2016 Day 1 Keynote re:CapAWS re:Invent 2016 Day 1 Keynote re:Cap
AWS re:Invent 2016 Day 1 Keynote re:Cap
 
AWS re:Invent 2016 Day 1 Keynote re:Cap
AWS re:Invent 2016 Day 1 Keynote re:CapAWS re:Invent 2016 Day 1 Keynote re:Cap
AWS re:Invent 2016 Day 1 Keynote re:Cap
 

More from Paula Koziol

Delivering Modern Data Protection for VMware Environments
Delivering Modern Data Protection for VMware EnvironmentsDelivering Modern Data Protection for VMware Environments
Delivering Modern Data Protection for VMware EnvironmentsPaula Koziol
 
IBM Storage for SAP HANA Deployments
IBM Storage for SAP HANA DeploymentsIBM Storage for SAP HANA Deployments
IBM Storage for SAP HANA DeploymentsPaula Koziol
 
IBM Power Systems at FIS InFocus 2019
IBM Power Systems at FIS InFocus 2019IBM Power Systems at FIS InFocus 2019
IBM Power Systems at FIS InFocus 2019Paula Koziol
 
IBM Storage at FIS InFocus 2019
IBM Storage at FIS InFocus 2019IBM Storage at FIS InFocus 2019
IBM Storage at FIS InFocus 2019Paula Koziol
 
Unlock Real Value from Back Up Data with IBM Spectrum Protect Plus
Unlock Real Value from Back Up Data with IBM Spectrum Protect PlusUnlock Real Value from Back Up Data with IBM Spectrum Protect Plus
Unlock Real Value from Back Up Data with IBM Spectrum Protect PlusPaula Koziol
 
A Winning Combination: IBM Storage and VMware
A Winning Combination: IBM Storage and VMwareA Winning Combination: IBM Storage and VMware
A Winning Combination: IBM Storage and VMwarePaula Koziol
 
Data Protection Modernization - Restore, Reuse, Reinvent
Data Protection Modernization - Restore, Reuse, ReinventData Protection Modernization - Restore, Reuse, Reinvent
Data Protection Modernization - Restore, Reuse, ReinventPaula Koziol
 
IBM Storage at Fiserv Forum 2018
IBM Storage at Fiserv Forum 2018IBM Storage at Fiserv Forum 2018
IBM Storage at Fiserv Forum 2018Paula Koziol
 
IBM Storage at FIS Connect 2018
IBM Storage at FIS Connect 2018 IBM Storage at FIS Connect 2018
IBM Storage at FIS Connect 2018 Paula Koziol
 
Addressing VMware Data Backup and Availability Challenges with IBM Spectrum P...
Addressing VMware Data Backup and Availability Challenges with IBM Spectrum P...Addressing VMware Data Backup and Availability Challenges with IBM Spectrum P...
Addressing VMware Data Backup and Availability Challenges with IBM Spectrum P...Paula Koziol
 
IBM & Veeam: Bridging the availability gap
IBM & Veeam: Bridging the availability gapIBM & Veeam: Bridging the availability gap
IBM & Veeam: Bridging the availability gapPaula Koziol
 
Transform to Cognitive Healthcare with IBM Software Defined Infrastructure an...
Transform to Cognitive Healthcare with IBM Software Defined Infrastructure an...Transform to Cognitive Healthcare with IBM Software Defined Infrastructure an...
Transform to Cognitive Healthcare with IBM Software Defined Infrastructure an...Paula Koziol
 
Accelerate Your Signature Banking Applications with IBM Storage Offerings
Accelerate Your Signature Banking Applications with IBM Storage OfferingsAccelerate Your Signature Banking Applications with IBM Storage Offerings
Accelerate Your Signature Banking Applications with IBM Storage OfferingsPaula Koziol
 
Implementing a Disaster Recovery Solution using VMware Site Recovery Manager ...
Implementing a Disaster Recovery Solution using VMware Site Recovery Manager ...Implementing a Disaster Recovery Solution using VMware Site Recovery Manager ...
Implementing a Disaster Recovery Solution using VMware Site Recovery Manager ...Paula Koziol
 
IBM Storage and VMware – A Winning Combination
IBM Storage and VMware – A Winning CombinationIBM Storage and VMware – A Winning Combination
IBM Storage and VMware – A Winning CombinationPaula Koziol
 
Scalable Data Computing for Healthcare and Life Sciences Industry
Scalable Data Computing for Healthcare and Life Sciences Industry Scalable Data Computing for Healthcare and Life Sciences Industry
Scalable Data Computing for Healthcare and Life Sciences Industry Paula Koziol
 
Future Proof Your Data: IBM Storage at VeeamON
Future Proof Your Data: IBM Storage at VeeamONFuture Proof Your Data: IBM Storage at VeeamON
Future Proof Your Data: IBM Storage at VeeamONPaula Koziol
 
IBM Storage at SAPPHIRE 2017
IBM Storage at SAPPHIRE 2017IBM Storage at SAPPHIRE 2017
IBM Storage at SAPPHIRE 2017Paula Koziol
 
Optimize Your VMware SDDC with IBM Infrastructure
Optimize Your VMware SDDC with IBM InfrastructureOptimize Your VMware SDDC with IBM Infrastructure
Optimize Your VMware SDDC with IBM InfrastructurePaula Koziol
 
IBM in Surveillance: Solutions that Deliver Innovation
IBM in Surveillance: Solutions that Deliver InnovationIBM in Surveillance: Solutions that Deliver Innovation
IBM in Surveillance: Solutions that Deliver InnovationPaula Koziol
 

More from Paula Koziol (20)

Delivering Modern Data Protection for VMware Environments
Delivering Modern Data Protection for VMware EnvironmentsDelivering Modern Data Protection for VMware Environments
Delivering Modern Data Protection for VMware Environments
 
IBM Storage for SAP HANA Deployments
IBM Storage for SAP HANA DeploymentsIBM Storage for SAP HANA Deployments
IBM Storage for SAP HANA Deployments
 
IBM Power Systems at FIS InFocus 2019
IBM Power Systems at FIS InFocus 2019IBM Power Systems at FIS InFocus 2019
IBM Power Systems at FIS InFocus 2019
 
IBM Storage at FIS InFocus 2019
IBM Storage at FIS InFocus 2019IBM Storage at FIS InFocus 2019
IBM Storage at FIS InFocus 2019
 
Unlock Real Value from Back Up Data with IBM Spectrum Protect Plus
Unlock Real Value from Back Up Data with IBM Spectrum Protect PlusUnlock Real Value from Back Up Data with IBM Spectrum Protect Plus
Unlock Real Value from Back Up Data with IBM Spectrum Protect Plus
 
A Winning Combination: IBM Storage and VMware
A Winning Combination: IBM Storage and VMwareA Winning Combination: IBM Storage and VMware
A Winning Combination: IBM Storage and VMware
 
Data Protection Modernization - Restore, Reuse, Reinvent
Data Protection Modernization - Restore, Reuse, ReinventData Protection Modernization - Restore, Reuse, Reinvent
Data Protection Modernization - Restore, Reuse, Reinvent
 
IBM Storage at Fiserv Forum 2018
IBM Storage at Fiserv Forum 2018IBM Storage at Fiserv Forum 2018
IBM Storage at Fiserv Forum 2018
 
IBM Storage at FIS Connect 2018
IBM Storage at FIS Connect 2018 IBM Storage at FIS Connect 2018
IBM Storage at FIS Connect 2018
 
Addressing VMware Data Backup and Availability Challenges with IBM Spectrum P...
Addressing VMware Data Backup and Availability Challenges with IBM Spectrum P...Addressing VMware Data Backup and Availability Challenges with IBM Spectrum P...
Addressing VMware Data Backup and Availability Challenges with IBM Spectrum P...
 
IBM & Veeam: Bridging the availability gap
IBM & Veeam: Bridging the availability gapIBM & Veeam: Bridging the availability gap
IBM & Veeam: Bridging the availability gap
 
Transform to Cognitive Healthcare with IBM Software Defined Infrastructure an...
Transform to Cognitive Healthcare with IBM Software Defined Infrastructure an...Transform to Cognitive Healthcare with IBM Software Defined Infrastructure an...
Transform to Cognitive Healthcare with IBM Software Defined Infrastructure an...
 
Accelerate Your Signature Banking Applications with IBM Storage Offerings
Accelerate Your Signature Banking Applications with IBM Storage OfferingsAccelerate Your Signature Banking Applications with IBM Storage Offerings
Accelerate Your Signature Banking Applications with IBM Storage Offerings
 
Implementing a Disaster Recovery Solution using VMware Site Recovery Manager ...
Implementing a Disaster Recovery Solution using VMware Site Recovery Manager ...Implementing a Disaster Recovery Solution using VMware Site Recovery Manager ...
Implementing a Disaster Recovery Solution using VMware Site Recovery Manager ...
 
IBM Storage and VMware – A Winning Combination
IBM Storage and VMware – A Winning CombinationIBM Storage and VMware – A Winning Combination
IBM Storage and VMware – A Winning Combination
 
Scalable Data Computing for Healthcare and Life Sciences Industry
Scalable Data Computing for Healthcare and Life Sciences Industry Scalable Data Computing for Healthcare and Life Sciences Industry
Scalable Data Computing for Healthcare and Life Sciences Industry
 
Future Proof Your Data: IBM Storage at VeeamON
Future Proof Your Data: IBM Storage at VeeamONFuture Proof Your Data: IBM Storage at VeeamON
Future Proof Your Data: IBM Storage at VeeamON
 
IBM Storage at SAPPHIRE 2017
IBM Storage at SAPPHIRE 2017IBM Storage at SAPPHIRE 2017
IBM Storage at SAPPHIRE 2017
 
Optimize Your VMware SDDC with IBM Infrastructure
Optimize Your VMware SDDC with IBM InfrastructureOptimize Your VMware SDDC with IBM Infrastructure
Optimize Your VMware SDDC with IBM Infrastructure
 
IBM in Surveillance: Solutions that Deliver Innovation
IBM in Surveillance: Solutions that Deliver InnovationIBM in Surveillance: Solutions that Deliver Innovation
IBM in Surveillance: Solutions that Deliver Innovation
 

Recently uploaded

08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 

Recently uploaded (20)

08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 

AI Scalability for the Next Decade

  • 1. AI Scalability for a Decade Spark Summit Europe 10/2019 Dave McDonnell Manager, Business Development IBM Storage Division
  • 2. § The Path to AI § Challenges § IA and AI
  • 4. AI is transforming every industry Autonomous driving Collision avoidance Route optimization Location-based advertising Customer Experience Stock forecasting Buyer behavior Clinical trials Drug discovery Genomics Experimental sensor capture Hypothesis modeling Seismic analysis exploration Smart metering / Usage forecasting Market prediction Fraud detection Risk mitigation Threat detection/assessment Video surveillance Social media monitoring Traffic flow analysis Manufacturing quality control Supply chain optimization Warranty analysis
  • 5. Cognitive applications require cognitive infrastructure Scalability Cost Management Simplicity Security Governance …
  • 7. What is AI? Laptop R Programming Python Programming Concept Understanding Defining a Problem Building Model Proof of Concept with GPU Technology POC in the Cloud POC On Premise Does it work? Can it do something meaningful for me today? Which AI opportunities are next? Expanding Growth in Cloud Usage GPU Workstation GPU Server How to improve? More applications More GPU workstations/servers USB Drives. Disk Array #1. Disk Array #2. etc.. Challenges Searching for information Sharing information Performance Scaling Back-Up Archiving Security Governance
  • 8. Inefficient workload management results in poor server utilization rates and throughput. Replicated data Difficult to scale Lack of security with open-source frameworks and applications introduces risk. Multiple Spark teams each with dedicated servers = wasted capacity and high administrative overhead. Scaling Spark is challenging
  • 9. 47% Advanced data management 50% Data volume and quality 44% Skills gap Top 3 challenges for organizations deploying AI workloads
  • 10. The Problem •Data needs to be labelled •Its manual, too much data, time taking •Data is scarce and expensive to aggerate •Developing custom models quickly •Data scientists are required •Not all Data scientists are created equal •Deploying custom solutions around these Models •Need application developers with skills in OpenCV •Leveraging GPUs •Train and Infer faster •Need to deploy models at the Edge – Nvidia TX2/Xavier/T4 •Keep up with CUDA drivers
  • 12. 80% of data is either inaccessible, untrusted or unanalyzed 81% of users do not understand the data required for AI (information architecture) 12 No amount of AI algorithmic sophistication will overcome a lack of data [architecture] … bad data is simply paralyzing…“ There is no Without an ”
  • 13. Let’s Help You Get There.. Software is your friend A workflow view AI and Big Data Analytics at Scale
  • 14. © IBM Corporation 2019 14 Cloud Object Storage and Spectrum Discover and Spectrum Scale • Ingests & indexes system metadata via Action Agent SDK • Extracts labels from images • Adds as custom tags • Global ingest of IOT data from vehicles • Geo-dispersed COS • Searches for images with labeled as having ‘Pothole’ feature SORT & EXTRACTINGEST CURATE TRAIN Stop sign Data Scientist • Enriches data catalog with new tags derived from analysis via Action Agent SDK • Trains a model Pothole Data Scientist Tools like Power Vision AI
  • 15. Is your data stranded? Complex workflows can lead to data isolation Replication sprawl Time to deliver Synch issues Custody chain Performance disparities High labor cost Mgt. Complication Optimization nightmares ? Ingest Preparation Training Inference
  • 16. The End-to-End Enterprise Data Pipeline Machine Learning and Deep Learning do happen in a silo EDGE INSIGHTS Ingest Organize Analyze Prepare Train Inference Streams/ NFS/S3 Data Input Integrate new data with existing repositories Correlate data from data lake for newer insights and views Use select datasets to identify patterns and train models for future decision making ETL, Tagging BI, HPC Sample Workloads Data as the shared asset between various Analytics and AI stages in an E2E enterprise data pipeline
  • 17. Hadoop HDFS Data Lake Cloud Based Data Providers IOT Deep Learning ImpactHDFS Transparency Connector /ESS Enabling Agility, Straight-Thru Processing for Improved Data Engineering ETL Spark AI Grid with Watson ML Accelerator Reduce cost and improve service levels
  • 18. © IBM Corporation 2018 18 IBM Elastic Storage Server (ESS) Integrated scale-out data management for file and object data Optimal building block for high-performance, scalable, reliable enterprise Spectrum Scale storage • Faster data access with choice to scale-up or out • Easy to deploy clusters with unified system GUI • Simplified storage administration with IBM Spectrum Control integration One solution for all your Spectrum Scale data needs • Single repository of data with unified file and object support • Anywhere access with multi-protocol support: NFS 4.0, SMB, OpenStack Swift, Cinder, and Manila • Ideal for Big Data Analytics with full Hadoop transparency Ready for business critical data • Disaster recovery with synchronous or asynchronous replication • Ensure reliability and fast rebuild times using Spectrum Scale RAID’s dispersed data and erasure code • Five 99999s of availability ESS 5U84 Storage ESS 5U84 Storage ESS 5U84 Storage ESS 5U84 Storage ESS 5U84 Storage ESS 5U84 Storage
  • 19. Power Systems LC922 – Delivering enhanced price-performance for Apache Spark Reduce operating costs and deliver results faster compared to tested Intel Xeon systems 1. Results are based IBM Internal Measurements running four concurrent streams of 99 TPC-DS like queries against a 3TB dataset. Results valid as of 4/25/18 and conducted under laboratory condition with speculative execution controls to mitigate user-to-kernel and user-to-user side-channel attacks on both systems, individual results can vary based on workload size, use of storage subsystems & other conditions 2. Hardware: 4 nodes IBM Power LC922 (2x20-core/2.7 GHz/512 GB memory) using 12 x 8TB HDD, 10 GbE two-port, RHEL 7.5 LE for Power9 and 4 nodes of Intel Xeon Gold 6140; 36 cores (2 x 18c chips) at 2.3 GHz; 512 GB memory, 12 x 8TB HDDs, 10Gbps NIC, Red Hat Enterprise Linux 7.5 3. Software: Apache Spark 2.3.0 located at http://spark.apache.org/downloads.html ; and open source Hadoop HDP 2.7.5 4. Pricing is based on Power LC922 http://www-03.ibm.com/systems/power/hardware/linux-lc.html and publicly available x86 pricing. 5. Apache®, Apache Spark®, and associated logos are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries. No endorsement by The Apache Software Foundation is implied by the use of these marks. Intel Xeon SP Gold 6140 server 22.6 QpH Power LC922 29.5 QpH Power LC922 $43,948 Intel Xeon SP Gold 6150 server $53,548 18% LOWER Price2,3,4 30% MORE Performance1 Power LC922 Delivers 1.6x Price-performance +
  • 20. Dedicated Resources IBM Spectrum Conductor A complete enterprise-grade solution for Data Engineering and Analytics • Dynamic Sharing with guaranteed SLA • Rapidly deploy & support multiple concurrent instances and versions of Spark, Notebooks, Anaconda, MongoDB & other services • Proven at scale: • 5K hosts, 150K cores, >1B tasks/day • High performance workload and resource manager: • 30-224% faster IBM Spectrum Conductor / Spark + AI Summit / 2019-Apr-23 / © 2019 IBM Corporation 20 Secure Multitenant Shared Resources Shared Resources Dedicated Resource Silos
  • 21. IBM Spectrum Conductor / Spark + AI Summit / 2019-Apr-23 / © 2019 IBM Corporation 21 Superior performance and scalability compared to competing orchestration / resource management solutions 30 to 224% faster than YARN 25 and 88% faster than Apache Mesos Consistent and predictable delivering 77% relative standard deviation (RSD) YARN & Mesos are relatively unpredictable 777% and 304% RSD respectively Audited results 2607 327 899 574 1673 253 582 177 1660 202 478 458 0 500 1000 1500 2000 2500 3000 Case 1: Sync interactive multi-user Case 2: Asynchonous batch multi-user Case 3: Mixed multi- user Case 4: Mixed multi- tenant Throughput of Spark SMB-2 benchmark workload on various Resource Managers (Jobs/hour - higher is better) IBM Spectrum Conductor Apache YARN v2.7.3 Apache Mesos v1.0.1 Audited benchmark results https://stacresearch.com/news/2017/05/19/IBM170405 IBM Spectrum Conductor Better performance and predictability
  • 22. IBM PowerAI Vision: “Point-and-Click” AI for Images & Video Label Image or Video Data Auto-Train AI Model Package & Deploy AI Model
  • 23. Analytics in Sports 23© Copyright IBM Corporation 2018
  • 24. Leading the Pack in AI Infrastructure IBM Systems Reference Architecture for AI IBM PowerAI IBM Spectrum Computing IBM Storage IBM Accelerated Compute Platform IBM Power Servers IBM Spectrum Computing IBM Spectrum Scale & ESS IBM Storage Solutions for AI / ML / DL IBM Spectrum Scale IBM Cloud Object Storage IBM Spectrum Discover All Built upon Data Infrastructure with: Wells Fargo: Financial Risk Modeling “Wells Fargo data scientists build, enhance, & validate hundreds of models each day, speed is critical, along with scalability, as they deal with greater amounts of data & more complicated models. Academically, people talk about fancy algorithms. But in real life, how efficiently the models run in distributed environments is critical. IBM is a very good partner & we are very pleased with their solution.” Richard Liu, Quantitative Analytics Manager, Wells Fargo, IBM Think18
  • 25. How Do You Get AI Scalability for a Decade ? Plan for the Future • Where is the biggest pain today? • What does it look like in 3, 5, and 10 years? • Think workflow Software Is Your Friend • Capabilities, Efficiency, Balance, Cost, Scalability • Usability Hybrid MultiCloud World • Cloud or on premise? It’s both ! • Drive the linkage with Data Science and IT (information architecture) There is no Without an
  • 26.
  • 27. AI takes flight “In any business, differentiation is everything. Data is the source of differentiation. How we started on the path to today’s Delta: we had to get the basics first. The foundation. We now have the foundation, the data infrastructure. In order to improve the processes we have. In the past, we had the data, but we didn’t have the sourcing, or the data infrastructure to get at the data. So we built this data ocean with billions of data points and turned it into action for Delta to better serve customers.” “That’s a great point. You can’t do enterprise wide analytics and AI (Artificial Intelligence) until you have the right “data basics” first, i.e. you have to have the foundation information architecture and data infrastructure in place. Most companies have random acts of digital and AI all over. But until you can pull them together, and re-imagine how the work is to be done, you can’t scale any of it.” Ed Bastian, CEO, Delta Airlines Ginni Rometty, CEO, IBM “In 2010, we had 5,600 maintenance cancellations, at least one every day that year. In 2018, we had just 55 maintenance cancellations. That’s a 99% improvement.” – Ed Bastian, CEO, Delta Airlines IBM Keynote Jan. 9, 2019