Big Data: Infrastructure Implications for “The Enterprise of Things” - StampedeCon 2014

Hype, Hopes, Hell & Hadoop!
Big Data: Reality Check and Infrastructure
Implications of “The Enterprise of Everything”!
Jean-Luc Chatelain, EVP & CTO !StampedeCon 2014!
2!
© 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change. ddn.com
2! And now, a quick word from my sponsor J!
3!
© 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change. ddn.com
DDN | Who We Are!
•  Main Office: Santa Clara, California, USA!
•  Employees: ~550 in 20 Countries!
•  Installed Base: End Customers in 50 Countries!
•  Go To Market: Partner & Reseller Assisted, Direct!
•  DDN: World’s Largest Private Storage Company!
!
We Design, Deploy and Optimize Storage Systems that Solve
HPC, Big Data and Cloud Business Challenges at Scale!
World-Renowned & Award-Winning
All !Time!Winner!
4!
© 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change. ddn.com
Big Data & Cloud Infrastructure !
DDN’s Award-Winning Product Portfolio!
Analytics Reference
Architectures
EXAScaler™
10Ks of Clients
1TB/s+, HSM
Linux HPC Clients
NFS & CIFS [2014]
Petascale
Lustre® Storage
Enterprise
Scale-Out File Storage
GRIDScaler™
~10K Clients
1TB/s+, HSM
Linux/Windows HPC Clients
NFS & CIFS
SFA12KX™
48GB/s, 1.7M IOPS!
1,680 Drives in 2
Racks!
Optional Embedded
Computing!
SFA7700™
13GB/s; 600K
IOPS!
•  7700X!
•  7700E!
!
Storage Fusion Architecture™ Core Storage Platforms!
SATA! SSD!
Flexible Drive Configuration!
SAS!
SFX™ Automated Flash Caching!
WOS® 3.0
32 Trillion Unique Objects
Geo-Replicated Cloud Storage
256 Million Objects/Second
Self-Healing Cloud
Embedded metadata mgmt
Cloud Foundation
Big Data Platform!
Management!
DirectMon®!
Cloud
Tiering
Infinite Memory Engine™
Distributed File System Buffer Cache
WOS7000
60 Drives in 4U!
Self-Contained Servers!
!
Adaptive Transparent Flash Cache !
SFX API Gives Users Control!
[pre-staging, alignment, bypass]!
S3/Swift
Hype & Hopes!
6!
© 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change. ddn.com
Hype!
2011! 2014!
#bigdata in the trough of disillusion is great news for the enterprise!!
Today!
7!
© 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change. ddn.com
Back To The Future?!
The term “Big Data” coined circa 1999(1)!
•  Pervasive in some existing markets since late 90’s!
–  HPC sensu latissimo!
–  Life Sciences!
–  Intelligence!
–  ASP (remember that word?)!
!
Is there anything new here? Why the hype?!
(1) A Personal Perspective on the Origin(s) and Development of Big Data" Diebold 2012!
8!
© 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change. ddn.com
Is There a #bigdata Definition? !
For some yes; for others no – or maybe there are multiple definitions!
•  It is “a basket of
technologies”!
•  It creates “a mindset
change in decision
making”!
“Data sets that exceed the boundaries and sizes of current infrastructure
capabilities, forcing technologists to take a non-traditional approach”!
Normal
Processing!
Capabilities!
File/Object Size, Content Volume!
Activity:IOPS!
Lots of
data
Large file
sizes
Lots of
transactions
9!
© 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change. ddn.com
#bigdata: 2 Dimensions of the 3 V’s

!
Petabytes of Data!
but also!
Trillions of Information
Objects!
GB/s to TB/s!
but also!
Millions of Information!
Object per second!
Structured & Unstructured!
but also!
Streams & Batches
workloads!
The “trillions” & “millions” are the primary drivers of complexity "
and challenge “Time to Results”!
Velocity!Volume! Variety!
Remember . . .!
1ms lost per operation on a billion operations workload= 11.5 days lost!!
10!
© 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change. ddn.com
So, is #bigdata the new thing?!
11!
© 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change. ddn.com
Quiz!!
12!
© 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change. ddn.com
The Dawn of a Telemetry Revolution!
Internet
of
Things!
Social!
Sensors!
Telemetry
Revolution!
The Birth of a!
Mindset Change in!
Business Decision
Making!
Hell!
14!
© 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change. ddn.com
Governance, Regulation, Compliance!
The Universe of Big Data is
a massive black hole into
which GRC has fallen"
•  Governance!
•  Regulation!
•  Compliance!
•  Security!
•  Privacy!
Now, welcome to the era of shadow data and"
behold the plague of hyper-scalability!
15!
© 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change. ddn.com
Tackling #bigdata Is Non-trivial!
Value extraction (insights
driving business results) is
only done on 1% of total
enterprise data!
Time to value & time to result is
business critical !
–  Inadequate infrastructure =
failure & credibility loss!
The cardinality
dimensions of the 3V’s
are the infrastructure
killers!
Material: network, compute,
storage!
–  Human: DBA, sysadmin &
storadmin!
Today #bigdata project cannot
live in IT or it will fail!
Dare to be different!
#bigdata nullifies the feature
race and favors the benefit race!
16!
© 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change. ddn.com
Let’s Talk Real #bignumbers!
HPC is a forward looking time machine that eats #bigdata for lunch!
•  Enterprise’s
#bigdata problems
of today were HPC
problems 3 to 5
years ago!
•  HPC & WEB
architectures are
converging!
17!
© 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change. ddn.com
The #bigdata Effect on Existing IT Infrastructures!
18!
© 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change. ddn.com
Top 3 #bigdata Infrastructure Challenges!
19!
© 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change. ddn.com
The Scalability Devil Effect on Typical Analytics!
•  Economics of large capacity EDW storage!
•  Scalability of NAS/SAN file systems!
•  Bandwidth demand of OLAP engine!
•  IOPS demand of modelization!
•  Memory requirements of visualization!
•  MPP drives I/O blending!
Structured
Data
Unstructured
Data
ETL
ETL
EDW
NAS/SAN
ETL
ETL
OLAP
Engine
Semantic
Engine
Model
Visualize
Report!
Hadoop!
21!
© 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change. ddn.com
Hadoop!
•  IS NOT a person or the solution to world famine or a BI
platform or an analytics platform or an EDW or a CEP
engine or …..!
•  IS a growing basket of technologies facilitating BI and/or
analytics especially if there is a lot of unstructured data!
•  IS at the core of many “science projects”!
•  IS in the infancy of deployment in the traditional enterprise!
•  HDFS “data lake” concept is very important!
22!
© 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change. ddn.com
BI & Analytics Today!
Database
File System
ETL
(primary)
Enterprise
Data
Warehouse
Reporting
&
Visualization
ETL
(secondary)
Analytics
CEP
Business
Auditing
&
Planning
23!
© 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change. ddn.com
Hadoop Effect!
Database
ETL
Enterprise
Data
Warehouse
Reporting
&
Visualization
Analytics
CEP
Business
Auditing
&
Planning
Buiness
Data
Warehouse
24!
© 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change. ddn.com
24!
#bigdata “At Work” with DDN

Case Studies!
25!
© 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change. ddn.com
Accelerating Fraud Awareness!
Harnessing Hadoop and Big Data!
DDN helps PayPal’s Financial Linking
System achieve 200–250ms
processing and customer
transparency!
!
“On the cost side, the same
performance at 3-4 times less cost,
that’s clearly important. The fact is,
you’ve got scalability you didn’t have
previously.”!
Ryan Quick, Principal Architect, PayPal!
26!
© 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change. ddn.com
Accelerating Financial Insights!
“Other technologies paled in
comparison to the performance levels
achieved with DDN’s SFA12K.” !
Brian Alexseychuk, Managing Director of Infrastructure!
!
!
•  Resolved scaling challenges and
parallelized workflows!
•  Exceeded competitors on metrics such
as scalability, speed, density, and TCO!
•  Improved revenues, reduced trade
slippage by 70% & cut telecom expenses!
27!
© 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change. ddn.com
Accelerating Time To Cure!
“If you can serve some of the fastest
computers on the planet, then you
can help us.”!
Phil Butcher, Head IT!
!
!
“If you need 10K cores to perform an
extra layer of analysis in an hour …
you need a real solution that can
address everything from very small
to extremely large data sets.”!
Tim Cutts, Head of Scientific Computing!
28!
© 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change. ddn.com
Accelerating Intelligence Insights!
Naval Research Lab 

Large Data Program!
!
Application!
•  Deep storage & fast distributed search !
•  Super-HD, 2/3-D, and streaming data!
DDN enables rapid threat detection by speeding
up real-time data and imagery up to 500%.!
In Conclusion!
30!
© 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others.
Any statements or representations around future events are subject to change. ddn.com
2 Faces of #bigdata = 

Opportunities for Innovation!
Technology!
–  Hyper-scalability: DB & FS!
–  Privacy (masking, obfuscation)!
–  Keyless security!
–  Visualization and navigation of
large datasets!
–  HDFS persistence!
–  Provenance!
–  In-memory computing!
–  In-Storage Processing!
–  GraphDB on MPP!
–  Brute force or machine
learning?!
–  Predictive & prescriptive
analytics!
Business!
–  Agility!
–  Narrow casted solutions with
higher stickiness!
–  Data driven business decision!
–  Retain existing customers and
gain new ones!
Information is
the currency of
today’s global
business!
1 of 30

Recommended

Why Infrastructure Matters for Big Data & Analytics by
Why Infrastructure Matters for Big Data & AnalyticsWhy Infrastructure Matters for Big Data & Analytics
Why Infrastructure Matters for Big Data & AnalyticsRick Perret
17.5K views17 slides
Big Data Analytics Infrastructure for Dummies by
Big Data Analytics Infrastructure for DummiesBig Data Analytics Infrastructure for Dummies
Big Data Analytics Infrastructure for DummiesPatrick Bouillaud
2.6K views51 slides
Big Data Infrastructure and Analytics Solution on FITAT2013 by
Big Data Infrastructure and Analytics Solution on FITAT2013Big Data Infrastructure and Analytics Solution on FITAT2013
Big Data Infrastructure and Analytics Solution on FITAT2013Erdenebayar Erdenebileg
2.6K views28 slides
Big Data World Forum by
Big Data World ForumBig Data World Forum
Big Data World Forumbigdatawf
1.2K views27 slides
Big data ibm keynote d advani presentation by
Big data ibm keynote d advani presentationBig data ibm keynote d advani presentation
Big data ibm keynote d advani presentationMassTLC
3.4K views22 slides
Big Data World Forum by
Big Data World ForumBig Data World Forum
Big Data World Forumbigdatawf
786 views15 slides

More Related Content

What's hot

IBM-Why Big Data? by
IBM-Why Big Data?IBM-Why Big Data?
IBM-Why Big Data?Kun Le
2.8K views97 slides
Record manager 8.0 presentation by
Record manager 8.0  presentationRecord manager 8.0  presentation
Record manager 8.0 presentationAndrey Karpov
745 views36 slides
Analyzing Big Data - Jeff Scheel by
Analyzing Big Data - Jeff ScheelAnalyzing Big Data - Jeff Scheel
Analyzing Big Data - Jeff ScheelKangaroot
1.7K views33 slides
Value proposition for big data isv partners 0714 by
Value proposition for big data isv partners 0714Value proposition for big data isv partners 0714
Value proposition for big data isv partners 0714Niu Bai
3.7K views43 slides
Overview - IBM Big Data Platform by
Overview - IBM Big Data PlatformOverview - IBM Big Data Platform
Overview - IBM Big Data PlatformVikas Manoria
21.7K views33 slides
Telco Big Data Workshop Sample by
Telco Big Data Workshop SampleTelco Big Data Workshop Sample
Telco Big Data Workshop SampleAlan Quayle
8.2K views167 slides

What's hot(20)

IBM-Why Big Data? by Kun Le
IBM-Why Big Data?IBM-Why Big Data?
IBM-Why Big Data?
Kun Le2.8K views
Record manager 8.0 presentation by Andrey Karpov
Record manager 8.0  presentationRecord manager 8.0  presentation
Record manager 8.0 presentation
Andrey Karpov745 views
Analyzing Big Data - Jeff Scheel by Kangaroot
Analyzing Big Data - Jeff ScheelAnalyzing Big Data - Jeff Scheel
Analyzing Big Data - Jeff Scheel
Kangaroot1.7K views
Value proposition for big data isv partners 0714 by Niu Bai
Value proposition for big data isv partners 0714Value proposition for big data isv partners 0714
Value proposition for big data isv partners 0714
Niu Bai3.7K views
Overview - IBM Big Data Platform by Vikas Manoria
Overview - IBM Big Data PlatformOverview - IBM Big Data Platform
Overview - IBM Big Data Platform
Vikas Manoria21.7K views
Telco Big Data Workshop Sample by Alan Quayle
Telco Big Data Workshop SampleTelco Big Data Workshop Sample
Telco Big Data Workshop Sample
Alan Quayle8.2K views
Accelerate Digital Transformation with Data Virtualization in Banking, Financ... by Denodo
Accelerate Digital Transformation with Data Virtualization in Banking, Financ...Accelerate Digital Transformation with Data Virtualization in Banking, Financ...
Accelerate Digital Transformation with Data Virtualization in Banking, Financ...
Denodo 110 views
Telco Big Data 2012 Highlights by Alan Quayle
Telco Big Data 2012 HighlightsTelco Big Data 2012 Highlights
Telco Big Data 2012 Highlights
Alan Quayle2K views
Big Data Use Cases for Different Verticals and Adoption Patterns - Impetus We... by Impetus Technologies
Big Data Use Cases for Different Verticals and Adoption Patterns - Impetus We...Big Data Use Cases for Different Verticals and Adoption Patterns - Impetus We...
Big Data Use Cases for Different Verticals and Adoption Patterns - Impetus We...
IBM Smarter Analytics by Adrian Turcu
IBM Smarter AnalyticsIBM Smarter Analytics
IBM Smarter Analytics
Adrian Turcu1.7K views
Making Hadoop Ready for the Enterprise by DataWorks Summit
Making Hadoop Ready for the Enterprise Making Hadoop Ready for the Enterprise
Making Hadoop Ready for the Enterprise
DataWorks Summit2.5K views
How to Crunch Petabytes with Hadoop and Big Data Using InfoSphere BigInsights... by DATAVERSITY
How to Crunch Petabytes with Hadoop and Big Data Using InfoSphere BigInsights...How to Crunch Petabytes with Hadoop and Big Data Using InfoSphere BigInsights...
How to Crunch Petabytes with Hadoop and Big Data Using InfoSphere BigInsights...
DATAVERSITY2.6K views
Ανδρέας Τσαγκάρης, 5th Digital Banking Forum by Starttech Ventures
Ανδρέας Τσαγκάρης, 5th Digital Banking ForumΑνδρέας Τσαγκάρης, 5th Digital Banking Forum
Ανδρέας Τσαγκάρης, 5th Digital Banking Forum
Starttech Ventures469 views
Why Infrastructure matters?! by Gabi Bauer
Why Infrastructure matters?!Why Infrastructure matters?!
Why Infrastructure matters?!
Gabi Bauer834 views
1524 how ibm's big data solution can help you gain insight into your data cen... by IBM
1524 how ibm's big data solution can help you gain insight into your data cen...1524 how ibm's big data solution can help you gain insight into your data cen...
1524 how ibm's big data solution can help you gain insight into your data cen...
IBM1.2K views
The Data Axioms lecture-overview-big data-usama-9-2015 by CMR WORLD TECH
The Data Axioms lecture-overview-big data-usama-9-2015The Data Axioms lecture-overview-big data-usama-9-2015
The Data Axioms lecture-overview-big data-usama-9-2015
CMR WORLD TECH573 views
Ibm big data-platform by IBM Sverige
Ibm big data-platformIbm big data-platform
Ibm big data-platform
IBM Sverige4K views

Similar to Big Data: Infrastructure Implications for “The Enterprise of Things” - StampedeCon 2014

Hype, Hopes, Hell & Hadoop (#bigdata and the enterprise of everything) by
Hype, Hopes, Hell & Hadoop (#bigdata and the enterprise of everything)Hype, Hopes, Hell & Hadoop (#bigdata and the enterprise of everything)
Hype, Hopes, Hell & Hadoop (#bigdata and the enterprise of everything)jlchatelain
933 views31 slides
Ddn Vision by
Ddn VisionDdn Vision
Ddn Visioninside-BigData.com
832 views6 slides
DDN Service Strategy by
DDN Service StrategyDDN Service Strategy
DDN Service Strategyinside-BigData.com
994 views9 slides
Optimizing Lustre and GPFS with DDN by
Optimizing Lustre and GPFS with DDNOptimizing Lustre and GPFS with DDN
Optimizing Lustre and GPFS with DDNinside-BigData.com
3.6K views15 slides
Getting Started with Big Data for Business Managers by
Getting Started with Big Data for Business ManagersGetting Started with Big Data for Business Managers
Getting Started with Big Data for Business ManagersDatameer
1.1K views30 slides
Big Data Management: A Unified Approach to Drive Business Results by
Big Data Management: A Unified Approach to Drive Business ResultsBig Data Management: A Unified Approach to Drive Business Results
Big Data Management: A Unified Approach to Drive Business ResultsCA Technologies
1.6K views30 slides

Similar to Big Data: Infrastructure Implications for “The Enterprise of Things” - StampedeCon 2014(20)

Hype, Hopes, Hell & Hadoop (#bigdata and the enterprise of everything) by jlchatelain
Hype, Hopes, Hell & Hadoop (#bigdata and the enterprise of everything)Hype, Hopes, Hell & Hadoop (#bigdata and the enterprise of everything)
Hype, Hopes, Hell & Hadoop (#bigdata and the enterprise of everything)
jlchatelain933 views
Getting Started with Big Data for Business Managers by Datameer
Getting Started with Big Data for Business ManagersGetting Started with Big Data for Business Managers
Getting Started with Big Data for Business Managers
Datameer1.1K views
Big Data Management: A Unified Approach to Drive Business Results by CA Technologies
Big Data Management: A Unified Approach to Drive Business ResultsBig Data Management: A Unified Approach to Drive Business Results
Big Data Management: A Unified Approach to Drive Business Results
CA Technologies1.6K views
Integrating Structure and Analytics with Unstructured Data by DATAVERSITY
Integrating Structure and Analytics with Unstructured DataIntegrating Structure and Analytics with Unstructured Data
Integrating Structure and Analytics with Unstructured Data
DATAVERSITY2.1K views
The New Database Frontier: Harnessing the Cloud by Inside Analysis
The New Database Frontier: Harnessing the CloudThe New Database Frontier: Harnessing the Cloud
The New Database Frontier: Harnessing the Cloud
Inside Analysis635 views
Action from Insight - Joining the 2 Percent Who are Getting Big Data Right by StampedeCon
Action from Insight - Joining the 2 Percent Who are Getting Big Data RightAction from Insight - Joining the 2 Percent Who are Getting Big Data Right
Action from Insight - Joining the 2 Percent Who are Getting Big Data Right
StampedeCon599 views
HP Enterprise Software: Making your applications and information work for you by HP Enterprise Italia
HP Enterprise Software: Making your applications and information work for youHP Enterprise Software: Making your applications and information work for you
HP Enterprise Software: Making your applications and information work for you
Extending BI with Big Data Analytics by Datameer
Extending BI with Big Data AnalyticsExtending BI with Big Data Analytics
Extending BI with Big Data Analytics
Datameer4.4K views
CIO priorities and Data Virtualization: Balancing the Yin and Yang of the IT by Denodo
CIO priorities and Data Virtualization: Balancing the Yin and Yang of the ITCIO priorities and Data Virtualization: Balancing the Yin and Yang of the IT
CIO priorities and Data Virtualization: Balancing the Yin and Yang of the IT
Denodo 132 views
软实力与创新竞争力 by Lin Haiqiu
软实力与创新竞争力软实力与创新竞争力
软实力与创新竞争力
Lin Haiqiu577 views
Presumption of Abundance: Architecting the Future of Success by Inside Analysis
Presumption of Abundance: Architecting the Future of SuccessPresumption of Abundance: Architecting the Future of Success
Presumption of Abundance: Architecting the Future of Success
Inside Analysis421 views
Benefiting from Big Data - A New Approach for the Telecom Industry by Persontyle
Benefiting from Big Data - A New Approach for the Telecom Industry  Benefiting from Big Data - A New Approach for the Telecom Industry
Benefiting from Big Data - A New Approach for the Telecom Industry
Persontyle9.5K views
The 4 Biggest Trends In Big Data and Analytics Right For 2021 by Bernard Marr
The 4 Biggest Trends In Big Data and Analytics Right For 2021The 4 Biggest Trends In Big Data and Analytics Right For 2021
The 4 Biggest Trends In Big Data and Analytics Right For 2021
Bernard Marr17.7K views

More from StampedeCon

Why Should We Trust You-Interpretability of Deep Neural Networks - StampedeCo... by
Why Should We Trust You-Interpretability of Deep Neural Networks - StampedeCo...Why Should We Trust You-Interpretability of Deep Neural Networks - StampedeCo...
Why Should We Trust You-Interpretability of Deep Neural Networks - StampedeCo...StampedeCon
2.6K views34 slides
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017 by
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017StampedeCon
638 views51 slides
Predicting Outcomes When Your Outcomes are Graphs - StampedeCon AI Summit 2017 by
Predicting Outcomes When Your Outcomes are Graphs - StampedeCon AI Summit 2017Predicting Outcomes When Your Outcomes are Graphs - StampedeCon AI Summit 2017
Predicting Outcomes When Your Outcomes are Graphs - StampedeCon AI Summit 2017StampedeCon
397 views19 slides
Novel Semi-supervised Probabilistic ML Approach to SNP Variant Calling - Stam... by
Novel Semi-supervised Probabilistic ML Approach to SNP Variant Calling - Stam...Novel Semi-supervised Probabilistic ML Approach to SNP Variant Calling - Stam...
Novel Semi-supervised Probabilistic ML Approach to SNP Variant Calling - Stam...StampedeCon
417 views19 slides
How to Talk about AI to Non-analaysts - Stampedecon AI Summit 2017 by
How to Talk about AI to Non-analaysts - Stampedecon AI Summit 2017How to Talk about AI to Non-analaysts - Stampedecon AI Summit 2017
How to Talk about AI to Non-analaysts - Stampedecon AI Summit 2017StampedeCon
394 views32 slides
Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017 by
Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017
Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017StampedeCon
1.4K views62 slides

More from StampedeCon(20)

Why Should We Trust You-Interpretability of Deep Neural Networks - StampedeCo... by StampedeCon
Why Should We Trust You-Interpretability of Deep Neural Networks - StampedeCo...Why Should We Trust You-Interpretability of Deep Neural Networks - StampedeCo...
Why Should We Trust You-Interpretability of Deep Neural Networks - StampedeCo...
StampedeCon2.6K views
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017 by StampedeCon
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
StampedeCon638 views
Predicting Outcomes When Your Outcomes are Graphs - StampedeCon AI Summit 2017 by StampedeCon
Predicting Outcomes When Your Outcomes are Graphs - StampedeCon AI Summit 2017Predicting Outcomes When Your Outcomes are Graphs - StampedeCon AI Summit 2017
Predicting Outcomes When Your Outcomes are Graphs - StampedeCon AI Summit 2017
StampedeCon397 views
Novel Semi-supervised Probabilistic ML Approach to SNP Variant Calling - Stam... by StampedeCon
Novel Semi-supervised Probabilistic ML Approach to SNP Variant Calling - Stam...Novel Semi-supervised Probabilistic ML Approach to SNP Variant Calling - Stam...
Novel Semi-supervised Probabilistic ML Approach to SNP Variant Calling - Stam...
StampedeCon417 views
How to Talk about AI to Non-analaysts - Stampedecon AI Summit 2017 by StampedeCon
How to Talk about AI to Non-analaysts - Stampedecon AI Summit 2017How to Talk about AI to Non-analaysts - Stampedecon AI Summit 2017
How to Talk about AI to Non-analaysts - Stampedecon AI Summit 2017
StampedeCon394 views
Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017 by StampedeCon
Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017
Getting Started with Keras and TensorFlow - StampedeCon AI Summit 2017
StampedeCon1.4K views
Foundations of Machine Learning - StampedeCon AI Summit 2017 by StampedeCon
Foundations of Machine Learning - StampedeCon AI Summit 2017Foundations of Machine Learning - StampedeCon AI Summit 2017
Foundations of Machine Learning - StampedeCon AI Summit 2017
StampedeCon574 views
Don't Start from Scratch: Transfer Learning for Novel Computer Vision Problem... by StampedeCon
Don't Start from Scratch: Transfer Learning for Novel Computer Vision Problem...Don't Start from Scratch: Transfer Learning for Novel Computer Vision Problem...
Don't Start from Scratch: Transfer Learning for Novel Computer Vision Problem...
StampedeCon392 views
Bringing the Whole Elephant Into View Can Cognitive Systems Bring Real Soluti... by StampedeCon
Bringing the Whole Elephant Into View Can Cognitive Systems Bring Real Soluti...Bringing the Whole Elephant Into View Can Cognitive Systems Bring Real Soluti...
Bringing the Whole Elephant Into View Can Cognitive Systems Bring Real Soluti...
StampedeCon221 views
Automated AI The Next Frontier in Analytics - StampedeCon AI Summit 2017 by StampedeCon
Automated AI The Next Frontier in Analytics - StampedeCon AI Summit 2017Automated AI The Next Frontier in Analytics - StampedeCon AI Summit 2017
Automated AI The Next Frontier in Analytics - StampedeCon AI Summit 2017
StampedeCon574 views
AI in the Enterprise: Past, Present & Future - StampedeCon AI Summit 2017 by StampedeCon
AI in the Enterprise: Past,  Present &  Future - StampedeCon AI Summit 2017AI in the Enterprise: Past,  Present &  Future - StampedeCon AI Summit 2017
AI in the Enterprise: Past, Present & Future - StampedeCon AI Summit 2017
StampedeCon563 views
A Different Data Science Approach - StampedeCon AI Summit 2017 by StampedeCon
A Different Data Science Approach - StampedeCon AI Summit 2017A Different Data Science Approach - StampedeCon AI Summit 2017
A Different Data Science Approach - StampedeCon AI Summit 2017
StampedeCon278 views
Graph in Customer 360 - StampedeCon Big Data Conference 2017 by StampedeCon
Graph in Customer 360 - StampedeCon Big Data Conference 2017Graph in Customer 360 - StampedeCon Big Data Conference 2017
Graph in Customer 360 - StampedeCon Big Data Conference 2017
StampedeCon1.2K views
End-to-end Big Data Projects with Python - StampedeCon Big Data Conference 2017 by StampedeCon
End-to-end Big Data Projects with Python - StampedeCon Big Data Conference 2017End-to-end Big Data Projects with Python - StampedeCon Big Data Conference 2017
End-to-end Big Data Projects with Python - StampedeCon Big Data Conference 2017
StampedeCon1.8K views
Doing Big Data Using Amazon's Analogs - StampedeCon Big Data Conference 2017 by StampedeCon
Doing Big Data Using Amazon's Analogs - StampedeCon Big Data Conference 2017Doing Big Data Using Amazon's Analogs - StampedeCon Big Data Conference 2017
Doing Big Data Using Amazon's Analogs - StampedeCon Big Data Conference 2017
StampedeCon120 views
Enabling New Business Capabilities with Cloud-based Streaming Data Architectu... by StampedeCon
Enabling New Business Capabilities with Cloud-based Streaming Data Architectu...Enabling New Business Capabilities with Cloud-based Streaming Data Architectu...
Enabling New Business Capabilities with Cloud-based Streaming Data Architectu...
StampedeCon169 views
Big Data Meets IoT: Lessons From the Cloud on Polling, Collecting, and Analyz... by StampedeCon
Big Data Meets IoT: Lessons From the Cloud on Polling, Collecting, and Analyz...Big Data Meets IoT: Lessons From the Cloud on Polling, Collecting, and Analyz...
Big Data Meets IoT: Lessons From the Cloud on Polling, Collecting, and Analyz...
StampedeCon666 views
Innovation in the Data Warehouse - StampedeCon 2016 by StampedeCon
Innovation in the Data Warehouse - StampedeCon 2016Innovation in the Data Warehouse - StampedeCon 2016
Innovation in the Data Warehouse - StampedeCon 2016
StampedeCon914 views
Creating a Data Driven Organization - StampedeCon 2016 by StampedeCon
Creating a Data Driven Organization - StampedeCon 2016Creating a Data Driven Organization - StampedeCon 2016
Creating a Data Driven Organization - StampedeCon 2016
StampedeCon1.1K views
Using The Internet of Things for Population Health Management - StampedeCon 2016 by StampedeCon
Using The Internet of Things for Population Health Management - StampedeCon 2016Using The Internet of Things for Population Health Management - StampedeCon 2016
Using The Internet of Things for Population Health Management - StampedeCon 2016
StampedeCon1.1K views

Recently uploaded

2024: A Travel Odyssey The Role of Generative AI in the Tourism Universe by
2024: A Travel Odyssey The Role of Generative AI in the Tourism Universe2024: A Travel Odyssey The Role of Generative AI in the Tourism Universe
2024: A Travel Odyssey The Role of Generative AI in the Tourism UniverseSimone Puorto
13 views61 slides
Data Integrity for Banking and Financial Services by
Data Integrity for Banking and Financial ServicesData Integrity for Banking and Financial Services
Data Integrity for Banking and Financial ServicesPrecisely
29 views26 slides
"Running students' code in isolation. The hard way", Yurii Holiuk by
"Running students' code in isolation. The hard way", Yurii Holiuk "Running students' code in isolation. The hard way", Yurii Holiuk
"Running students' code in isolation. The hard way", Yurii Holiuk Fwdays
24 views34 slides
"Node.js Development in 2024: trends and tools", Nikita Galkin by
"Node.js Development in 2024: trends and tools", Nikita Galkin "Node.js Development in 2024: trends and tools", Nikita Galkin
"Node.js Development in 2024: trends and tools", Nikita Galkin Fwdays
17 views38 slides
Uni Systems for Power Platform.pptx by
Uni Systems for Power Platform.pptxUni Systems for Power Platform.pptx
Uni Systems for Power Platform.pptxUni Systems S.M.S.A.
58 views21 slides
TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f... by
TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...
TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...TrustArc
72 views29 slides

Recently uploaded(20)

2024: A Travel Odyssey The Role of Generative AI in the Tourism Universe by Simone Puorto
2024: A Travel Odyssey The Role of Generative AI in the Tourism Universe2024: A Travel Odyssey The Role of Generative AI in the Tourism Universe
2024: A Travel Odyssey The Role of Generative AI in the Tourism Universe
Simone Puorto13 views
Data Integrity for Banking and Financial Services by Precisely
Data Integrity for Banking and Financial ServicesData Integrity for Banking and Financial Services
Data Integrity for Banking and Financial Services
Precisely29 views
"Running students' code in isolation. The hard way", Yurii Holiuk by Fwdays
"Running students' code in isolation. The hard way", Yurii Holiuk "Running students' code in isolation. The hard way", Yurii Holiuk
"Running students' code in isolation. The hard way", Yurii Holiuk
Fwdays24 views
"Node.js Development in 2024: trends and tools", Nikita Galkin by Fwdays
"Node.js Development in 2024: trends and tools", Nikita Galkin "Node.js Development in 2024: trends and tools", Nikita Galkin
"Node.js Development in 2024: trends and tools", Nikita Galkin
Fwdays17 views
TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f... by TrustArc
TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...
TrustArc Webinar - Managing Online Tracking Technology Vendors_ A Checklist f...
TrustArc72 views
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLive by Network Automation Forum
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLiveAutomating a World-Class Technology Conference; Behind the Scenes of CiscoLive
Automating a World-Class Technology Conference; Behind the Scenes of CiscoLive
GDG Cloud Southlake 28 Brad Taylor and Shawn Augenstein Old Problems in the N... by James Anderson
GDG Cloud Southlake 28 Brad Taylor and Shawn Augenstein Old Problems in the N...GDG Cloud Southlake 28 Brad Taylor and Shawn Augenstein Old Problems in the N...
GDG Cloud Southlake 28 Brad Taylor and Shawn Augenstein Old Problems in the N...
James Anderson126 views
"Surviving highload with Node.js", Andrii Shumada by Fwdays
"Surviving highload with Node.js", Andrii Shumada "Surviving highload with Node.js", Andrii Shumada
"Surviving highload with Node.js", Andrii Shumada
Fwdays33 views
STPI OctaNE CoE Brochure.pdf by madhurjyapb
STPI OctaNE CoE Brochure.pdfSTPI OctaNE CoE Brochure.pdf
STPI OctaNE CoE Brochure.pdf
madhurjyapb14 views
【USB韌體設計課程】精選講義節錄-USB的列舉過程_艾鍗學院 by IttrainingIttraining
【USB韌體設計課程】精選講義節錄-USB的列舉過程_艾鍗學院【USB韌體設計課程】精選講義節錄-USB的列舉過程_艾鍗學院
【USB韌體設計課程】精選講義節錄-USB的列舉過程_艾鍗學院
STKI Israeli Market Study 2023 corrected forecast 2023_24 v3.pdf by Dr. Jimmy Schwarzkopf
STKI Israeli Market Study 2023   corrected forecast 2023_24 v3.pdfSTKI Israeli Market Study 2023   corrected forecast 2023_24 v3.pdf
STKI Israeli Market Study 2023 corrected forecast 2023_24 v3.pdf
Case Study Copenhagen Energy and Business Central.pdf by Aitana
Case Study Copenhagen Energy and Business Central.pdfCase Study Copenhagen Energy and Business Central.pdf
Case Study Copenhagen Energy and Business Central.pdf
Aitana17 views

Big Data: Infrastructure Implications for “The Enterprise of Things” - StampedeCon 2014

  • 1. Hype, Hopes, Hell & Hadoop! Big Data: Reality Check and Infrastructure Implications of “The Enterprise of Everything”! Jean-Luc Chatelain, EVP & CTO !StampedeCon 2014!
  • 2. 2! © 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. ddn.com 2! And now, a quick word from my sponsor J!
  • 3. 3! © 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. ddn.com DDN | Who We Are! •  Main Office: Santa Clara, California, USA! •  Employees: ~550 in 20 Countries! •  Installed Base: End Customers in 50 Countries! •  Go To Market: Partner & Reseller Assisted, Direct! •  DDN: World’s Largest Private Storage Company! ! We Design, Deploy and Optimize Storage Systems that Solve HPC, Big Data and Cloud Business Challenges at Scale! World-Renowned & Award-Winning All !Time!Winner!
  • 4. 4! © 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. ddn.com Big Data & Cloud Infrastructure ! DDN’s Award-Winning Product Portfolio! Analytics Reference Architectures EXAScaler™ 10Ks of Clients 1TB/s+, HSM Linux HPC Clients NFS & CIFS [2014] Petascale Lustre® Storage Enterprise Scale-Out File Storage GRIDScaler™ ~10K Clients 1TB/s+, HSM Linux/Windows HPC Clients NFS & CIFS SFA12KX™ 48GB/s, 1.7M IOPS! 1,680 Drives in 2 Racks! Optional Embedded Computing! SFA7700™ 13GB/s; 600K IOPS! •  7700X! •  7700E! ! Storage Fusion Architecture™ Core Storage Platforms! SATA! SSD! Flexible Drive Configuration! SAS! SFX™ Automated Flash Caching! WOS® 3.0 32 Trillion Unique Objects Geo-Replicated Cloud Storage 256 Million Objects/Second Self-Healing Cloud Embedded metadata mgmt Cloud Foundation Big Data Platform! Management! DirectMon®! Cloud Tiering Infinite Memory Engine™ Distributed File System Buffer Cache WOS7000 60 Drives in 4U! Self-Contained Servers! ! Adaptive Transparent Flash Cache ! SFX API Gives Users Control! [pre-staging, alignment, bypass]! S3/Swift
  • 6. 6! © 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. ddn.com Hype! 2011! 2014! #bigdata in the trough of disillusion is great news for the enterprise!! Today!
  • 7. 7! © 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. ddn.com Back To The Future?! The term “Big Data” coined circa 1999(1)! •  Pervasive in some existing markets since late 90’s! –  HPC sensu latissimo! –  Life Sciences! –  Intelligence! –  ASP (remember that word?)! ! Is there anything new here? Why the hype?! (1) A Personal Perspective on the Origin(s) and Development of Big Data" Diebold 2012!
  • 8. 8! © 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. ddn.com Is There a #bigdata Definition? ! For some yes; for others no – or maybe there are multiple definitions! •  It is “a basket of technologies”! •  It creates “a mindset change in decision making”! “Data sets that exceed the boundaries and sizes of current infrastructure capabilities, forcing technologists to take a non-traditional approach”! Normal Processing! Capabilities! File/Object Size, Content Volume! Activity:IOPS! Lots of data Large file sizes Lots of transactions
  • 9. 9! © 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. ddn.com #bigdata: 2 Dimensions of the 3 V’s
 ! Petabytes of Data! but also! Trillions of Information Objects! GB/s to TB/s! but also! Millions of Information! Object per second! Structured & Unstructured! but also! Streams & Batches workloads! The “trillions” & “millions” are the primary drivers of complexity " and challenge “Time to Results”! Velocity!Volume! Variety! Remember . . .! 1ms lost per operation on a billion operations workload= 11.5 days lost!!
  • 10. 10! © 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. ddn.com So, is #bigdata the new thing?!
  • 11. 11! © 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. ddn.com Quiz!!
  • 12. 12! © 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. ddn.com The Dawn of a Telemetry Revolution! Internet of Things! Social! Sensors! Telemetry Revolution! The Birth of a! Mindset Change in! Business Decision Making!
  • 13. Hell!
  • 14. 14! © 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. ddn.com Governance, Regulation, Compliance! The Universe of Big Data is a massive black hole into which GRC has fallen" •  Governance! •  Regulation! •  Compliance! •  Security! •  Privacy! Now, welcome to the era of shadow data and" behold the plague of hyper-scalability!
  • 15. 15! © 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. ddn.com Tackling #bigdata Is Non-trivial! Value extraction (insights driving business results) is only done on 1% of total enterprise data! Time to value & time to result is business critical ! –  Inadequate infrastructure = failure & credibility loss! The cardinality dimensions of the 3V’s are the infrastructure killers! Material: network, compute, storage! –  Human: DBA, sysadmin & storadmin! Today #bigdata project cannot live in IT or it will fail! Dare to be different! #bigdata nullifies the feature race and favors the benefit race!
  • 16. 16! © 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. ddn.com Let’s Talk Real #bignumbers! HPC is a forward looking time machine that eats #bigdata for lunch! •  Enterprise’s #bigdata problems of today were HPC problems 3 to 5 years ago! •  HPC & WEB architectures are converging!
  • 17. 17! © 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. ddn.com The #bigdata Effect on Existing IT Infrastructures!
  • 18. 18! © 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. ddn.com Top 3 #bigdata Infrastructure Challenges!
  • 19. 19! © 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. ddn.com The Scalability Devil Effect on Typical Analytics! •  Economics of large capacity EDW storage! •  Scalability of NAS/SAN file systems! •  Bandwidth demand of OLAP engine! •  IOPS demand of modelization! •  Memory requirements of visualization! •  MPP drives I/O blending! Structured Data Unstructured Data ETL ETL EDW NAS/SAN ETL ETL OLAP Engine Semantic Engine Model Visualize Report!
  • 21. 21! © 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. ddn.com Hadoop! •  IS NOT a person or the solution to world famine or a BI platform or an analytics platform or an EDW or a CEP engine or …..! •  IS a growing basket of technologies facilitating BI and/or analytics especially if there is a lot of unstructured data! •  IS at the core of many “science projects”! •  IS in the infancy of deployment in the traditional enterprise! •  HDFS “data lake” concept is very important!
  • 22. 22! © 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. ddn.com BI & Analytics Today! Database File System ETL (primary) Enterprise Data Warehouse Reporting & Visualization ETL (secondary) Analytics CEP Business Auditing & Planning
  • 23. 23! © 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. ddn.com Hadoop Effect! Database ETL Enterprise Data Warehouse Reporting & Visualization Analytics CEP Business Auditing & Planning Buiness Data Warehouse
  • 24. 24! © 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. ddn.com 24! #bigdata “At Work” with DDN
 Case Studies!
  • 25. 25! © 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. ddn.com Accelerating Fraud Awareness! Harnessing Hadoop and Big Data! DDN helps PayPal’s Financial Linking System achieve 200–250ms processing and customer transparency! ! “On the cost side, the same performance at 3-4 times less cost, that’s clearly important. The fact is, you’ve got scalability you didn’t have previously.”! Ryan Quick, Principal Architect, PayPal!
  • 26. 26! © 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. ddn.com Accelerating Financial Insights! “Other technologies paled in comparison to the performance levels achieved with DDN’s SFA12K.” ! Brian Alexseychuk, Managing Director of Infrastructure! ! ! •  Resolved scaling challenges and parallelized workflows! •  Exceeded competitors on metrics such as scalability, speed, density, and TCO! •  Improved revenues, reduced trade slippage by 70% & cut telecom expenses!
  • 27. 27! © 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. ddn.com Accelerating Time To Cure! “If you can serve some of the fastest computers on the planet, then you can help us.”! Phil Butcher, Head IT! ! ! “If you need 10K cores to perform an extra layer of analysis in an hour … you need a real solution that can address everything from very small to extremely large data sets.”! Tim Cutts, Head of Scientific Computing!
  • 28. 28! © 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. ddn.com Accelerating Intelligence Insights! Naval Research Lab 
 Large Data Program! ! Application! •  Deep storage & fast distributed search ! •  Super-HD, 2/3-D, and streaming data! DDN enables rapid threat detection by speeding up real-time data and imagery up to 500%.!
  • 30. 30! © 2014 DataDirect Networks, Inc. * Other names and brands may be claimed as the property of others. Any statements or representations around future events are subject to change. ddn.com 2 Faces of #bigdata = 
 Opportunities for Innovation! Technology! –  Hyper-scalability: DB & FS! –  Privacy (masking, obfuscation)! –  Keyless security! –  Visualization and navigation of large datasets! –  HDFS persistence! –  Provenance! –  In-memory computing! –  In-Storage Processing! –  GraphDB on MPP! –  Brute force or machine learning?! –  Predictive & prescriptive analytics! Business! –  Agility! –  Narrow casted solutions with higher stickiness! –  Data driven business decision! –  Retain existing customers and gain new ones! Information is the currency of today’s global business!