SlideShare a Scribd company logo
1 of 23
The Journey to Big Data Analytics
Dr. Stefan Radtke
CTO Isilon Storage Division, EMEA
Dell EMC
IDC Conference, Madrid, January 31st 2017
2© Copyright 2016 Dell EMC All rights reserved.
Welcome!
Dr. Stefan Radtke
CTO Isilon, EMEA
Dell EMC | Storage Division
- 1995-2011 : 17 Years for IBM in various technical roles
- 2012-2013 : Global Architect, EMC Global Alliances
- 2013-2016 : CTO, EMEA, Isilon Storage Division, EMC
- 2016-today : CTO, EMEA, Isilon Storage Division, Dell EMC
Phone: +49-176-34434460
E-Mail: Stefan.Radtke@dell.com
LinkedIn: http://de.linkedin.com/in/drstefanradtke
Blog: http://stefanradtke.blogspot.com
3© Copyright 2016 Dell EMC All rights reserved.
Analytics affects all Industries
Smart factories, process control, supply/labour efficiency, and automation control
Smart meters and grids, preventative maintenance, and environmental monitoring
Smart infrastructure, traffic optimization, maintenance, and fleet tracking
Hospital patient monitoring, home healthcare, and remote diagnosis
Wearables, automotive, smart home, and entertainment disruptions
“just-in-time” management, promotions, and Location-based advertising
Manufacturing
Retail
Energy
Transportation/
Infrastructure
Consumer
Healthcare
4© Copyright 2016 Dell EMC All rights reserved.
Step 2: Define question to be answered
Step 3: Use Business Intelligence (BI)
tool’s graphical user interface (GUI) to
construct query
Step 4: BI tool creates SQL
Step 5: SQL is run against data
warehouse to create report
DW
Traditional BI Engagement Process
Step 1: Pre-build data schema
(schema-on-load)
5© Copyright 2016 Dell EMC All rights reserved.
Evolution Of The Analytic Questions
• How many widgets did I
sell last month?
• What were sales by zip
code for Christmas last
year?
• How many of Product X
were returned last
month?
• What were company
revenues and profits for
the past quarter?
• How many employees
did I hire last year?
What Happened?
(Descriptive/BI)
What Will Happen?
(Predictive)
• How many widgets will I
sell next month?
• What will be sales by zip
code over this Christmas
season?
• How many of Product X will
be returned next month?
• What are projected
company revenues and
profits for next quarter?
• How many employees will I
need to hire next year?
What Should I do?
(Prescriptive)
• Order [5,0000] component Z to
support widget sales for next month
• Hire [Y] new sales reps by these zip
code to handle projected Christmas
sales
• Set aside [$125K] in financial
reserve to cover Product X returns
• Sell the following product mix to
achieve quarterly revenue and
margin goals
• Increase hiring pipeline by 35% to
achieve hiring goals
6© Copyright 2016 Dell EMC All rights reserved.
Step 1: Define Hypothesis to test or
Prediction to be made
Step 3: Build schema (schema-on-
query)
Step 4: Visualize the data (Tableau,
Spotfire, ggplot2,…)
Step 6: Evaluate model results
(probabilities, confidence levels)
Data Science Engagement ProcessRepeat
Step 5: Build analytic models (SAS,
R, MADlib, Mahout,…)
Kronos
Historical
Google
Trends
Physician
Notes
Local
Events
Weather
Forecast
Epic
Lawson
CDC
Step 2: Gather data…and more data
(Data Lake: SQL + Hadoop)
7© Copyright 2016 Dell EMC All rights reserved.
Why we need to collect
ALL data !
8© Copyright 2016 Dell EMC All rights reserved.
Holistic Data Collection
Data Lake
9© Copyright 2016 Dell EMC All rights reserved.
Companies understand the
Value of Information but
many of them don’t know
how to start the
journey.
10© Copyright 2016 Dell EMC All rights reserved.
Brainstorm the right questions to ask
11© Copyright 2016 Dell EMC All rights reserved.
Do not filter any Data Sources
• Brainstorming predictive and
prescriptive questions typically
uncovers numerous new data sources
that are worthy of consideration. And
this is a key point: ALL data sources
are worthy of consideration!
• Do NOT filter the data sources at
this point in the process.
12© Copyright 2016 Dell EMC All rights reserved.
But how do we get “Smart”
First consideration: What is the business initiative or “what” we want to accomplish?
For example, Reduce traffic congestions.
Some key questions/parameters to consider:
• Traffic flow decisions: New roads? New lanes? New turn lanes? New bike lanes? Pedestrian crossings?
Railroad crossings? Bus stops?
• Road repair and maintenance decisions: Fixing potholes? Repaving surfaces? Materials and
equipment needed? When to fix potholes and repave streets?
• Construction permits decisions: Types of permits needed? Impact on traffic flow? Length of time to
complete the work? Number of employees to consider?
• Events management decisions: Traffic (cars and pedestrians) attending proposed event? Impact on
normal traffic flow? Date, time, location and duration of events?
• Parks decisions: Location of parks? Size of parks? Hours of operation? Park equipment maintenance?
• Schools decisions: Location and size of new schools? Hours of operations? Location of stoplights and
stop signs?
13© Copyright 2016 Dell EMC All rights reserved.
Data Assessment: Value vs. Feasibility
Business Initiative: Improve Traffic Flow
Impact
Feasibility
14© Copyright 2016 Dell EMC All rights reserved.
Prioritization Matrix
Hi
Hi
Lo Implementation Feasibility
BusinessValue
C
A
F
B
D
E
Use Cases (Decisions)
A. Optimize Traffic Flow
B. Improve Road Repair and
Maintenance
C. Optimize Construction
Permits
D. Improve Events
Management
E. Optimize Park Hours and
Activities
F. Optimize School Hours and
Activities
Business Initiative: “Smart” City Initiative
15© Copyright 2016 Dell EMC All rights reserved.
KEEPING UP WITH NEW
TECHNOLOGIES AND TOOLS
Product
Performance
& Reliability
Customer
Sentiment
Analysis
Supply
Chain
Optimization
Competitive
War Games
IDENTIFYING
THE RIGHT USE CASE
FINDING, CURATING, AND
GOVERNING THE DATA
Why do most efforts to stand up Big
Data Initiatives stall or fail ?
16© Copyright 2016 Dell EMC All rights reserved.
Demonstrate
the potential
value using
data science
techniques
Strategic Approach
Align business
and IT goals
around big data
Identify strategic
opportunities for
big data
analytics
Prioritize key
use cases by
assessing
feasibility and
ROI
Recommend the
appropriate
analytics
engagement and
deployment
roadmap
1 2 3 4 5
Workshop
Objectives
17© Copyright 2016 Dell EMC All rights reserved.
Get help for the Journey
Platform for
Big Data and
Analytics Solutions
BUSINESS
TECHNOLOGY
DEPLOYASSESS PROVE
Big Data
Proof of
Value
Big Data
Proof of
Technology
Big Data
Applied Analytics
Implementation
Big Data
Technology
Implementation
Big Data
Vision
Workshop
Big Data
Technology
Advisory
Dell EMC Service Offerings
18© Copyright 2016 Dell EMC All rights reserved.
My prediction: market is looking for well
integrated systems. Easy to use and
with flexible deployment options.
19© Copyright 2016 Dell EMC All rights reserved.
Advance Analytic (Data Science) Models
Graph AnalyticsForecasting
Time Decomposition
Association Analytics
Behavioral Analytics
20© Copyright 2016 Dell EMC All rights reserved.
Complex and growing Ecosystem
21© Copyright 2016 Dell EMC All rights reserved.
DATA LAKE
COMPUTE:
EMC Converged Platform
Integrated and Converged Infrastructure
PLATFORM MANAGER
ADMINISTRATION ANALYTICS CATALOG DATA CATALOG
Rich UX
INFRASTRUCTURE SOFTWARE OPTIONS
YOUR WORKSPACE
COMPUTE,
NETWORK,
STORAGE
CONVERGED
INFRASTRUCTUREDATA SETS
DATA CURATOR
ENRICH
INGEST
INDEX
FIND AND
INGEST DATA
AT LEAST ONE HADOOP DISTRIBUTION
DATA GOVERNOR
LINEAGE
QUALITY
SECURITY
GOVERN/
PUBLISH
DATA, TOOLS, APPS
Integrated Analytics 3rd Party Openness
DATA SCIENCE
Pivotal Big Data Suite
EXTENSION PACKS
R, RStudio, Anaconda
python, jupyter
PUBLISHED
TOOLS AND
DATASETS
22© Copyright 2016 Dell EMC All rights reserved.
Thanks for your Attention!
Dr. Stefan Radtke
CTO Isilon, EMEA
Dell EMC | Storage Division
- 1995-2011 : 17 Years for IBM in various technical roles
- 2012-2013 : Global Architect, EMC Global Alliances
- 2013-2016 : CTO, EMEA, Isilon Storage Division, EMC
- 2016-today : CTO, EMEA, Isilon Storage Division, Dell EMC
Phone: +49-176-34434460
E-Mail: Stefan.Radtke@dell.com
LinkedIn: http://de.linkedin.com/in/drstefanradtke
Blog: http://stefanradtke.blogspot.com
The Journey to Big Data Analytics

More Related Content

What's hot

Digital Transformation Templates.ppt
Digital Transformation Templates.pptDigital Transformation Templates.ppt
Digital Transformation Templates.pptOlusegun Mosugu
 
Future of Data and AI in Retail - NRF 2023
Future of Data and AI in Retail - NRF 2023Future of Data and AI in Retail - NRF 2023
Future of Data and AI in Retail - NRF 2023Rob Saker
 
Champion the Indian SaaS Story.pptx
Champion the Indian SaaS Story.pptxChampion the Indian SaaS Story.pptx
Champion the Indian SaaS Story.pptxSaaSBOOMi
 
Cloud proposition for banking
Cloud proposition for bankingCloud proposition for banking
Cloud proposition for bankingAjay Kumar Uppal
 
Enterprise Digital Transformation: The Geisinger Experience
Enterprise Digital Transformation: The Geisinger ExperienceEnterprise Digital Transformation: The Geisinger Experience
Enterprise Digital Transformation: The Geisinger ExperienceDamo Consulting Inc.
 
Essential Metadata Strategies
Essential Metadata StrategiesEssential Metadata Strategies
Essential Metadata StrategiesDATAVERSITY
 
Consulting toolkit preparing dummy pack
Consulting toolkit   preparing dummy packConsulting toolkit   preparing dummy pack
Consulting toolkit preparing dummy packchrisdoran
 
ChatGPT OpenAI Primer for Business
ChatGPT OpenAI Primer for BusinessChatGPT OpenAI Primer for Business
ChatGPT OpenAI Primer for BusinessDion Hinchcliffe
 
DBS Bank: Changing the Way We Hire, Work and Deliver Digital Experiences to o...
DBS Bank: Changing the Way We Hire, Work and Deliver Digital Experiences to o...DBS Bank: Changing the Way We Hire, Work and Deliver Digital Experiences to o...
DBS Bank: Changing the Way We Hire, Work and Deliver Digital Experiences to o...VMware Tanzu
 
CX Measurement: How Customer-Obsessed Companies Embrace Journey-Based Metrics
CX Measurement: How Customer-Obsessed Companies Embrace Journey-Based MetricsCX Measurement: How Customer-Obsessed Companies Embrace Journey-Based Metrics
CX Measurement: How Customer-Obsessed Companies Embrace Journey-Based MetricsPointillist
 
Digital Transformation Toolkit - Overview and Approach
Digital Transformation Toolkit - Overview and ApproachDigital Transformation Toolkit - Overview and Approach
Digital Transformation Toolkit - Overview and ApproachPeterFranz6
 
Digital Transformation Strategy & Framework | By ex-McKinsey
Digital Transformation Strategy & Framework | By ex-McKinseyDigital Transformation Strategy & Framework | By ex-McKinsey
Digital Transformation Strategy & Framework | By ex-McKinseyAurelien Domont, MBA
 
Deloitte Automotive IT Strategy Teaser Slides
Deloitte Automotive IT Strategy Teaser SlidesDeloitte Automotive IT Strategy Teaser Slides
Deloitte Automotive IT Strategy Teaser SlidesDeloitte Deutschland
 
The Transformation Journey with Cloud Technology
The Transformation Journey with Cloud TechnologyThe Transformation Journey with Cloud Technology
The Transformation Journey with Cloud TechnologyAmazon Web Services
 
Digital Transformation Strategy and Plan Template
Digital Transformation Strategy and Plan TemplateDigital Transformation Strategy and Plan Template
Digital Transformation Strategy and Plan TemplateAurelien Domont, MBA
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...DATAVERSITY
 
Selling Cloud Services To Earn More Revenue PowerPoint Presentation Slides
Selling Cloud Services To Earn More Revenue PowerPoint Presentation SlidesSelling Cloud Services To Earn More Revenue PowerPoint Presentation Slides
Selling Cloud Services To Earn More Revenue PowerPoint Presentation SlidesSlideTeam
 
Digital Transformation: Re-imagine from the outside In
Digital Transformation: Re-imagine from the outside InDigital Transformation: Re-imagine from the outside In
Digital Transformation: Re-imagine from the outside Inaccenture
 

What's hot (20)

Séminaire RCU
Séminaire RCUSéminaire RCU
Séminaire RCU
 
Digital Transformation Templates.ppt
Digital Transformation Templates.pptDigital Transformation Templates.ppt
Digital Transformation Templates.ppt
 
Future of Data and AI in Retail - NRF 2023
Future of Data and AI in Retail - NRF 2023Future of Data and AI in Retail - NRF 2023
Future of Data and AI in Retail - NRF 2023
 
Champion the Indian SaaS Story.pptx
Champion the Indian SaaS Story.pptxChampion the Indian SaaS Story.pptx
Champion the Indian SaaS Story.pptx
 
Cloud proposition for banking
Cloud proposition for bankingCloud proposition for banking
Cloud proposition for banking
 
Enterprise Digital Transformation: The Geisinger Experience
Enterprise Digital Transformation: The Geisinger ExperienceEnterprise Digital Transformation: The Geisinger Experience
Enterprise Digital Transformation: The Geisinger Experience
 
Essential Metadata Strategies
Essential Metadata StrategiesEssential Metadata Strategies
Essential Metadata Strategies
 
Consulting toolkit preparing dummy pack
Consulting toolkit   preparing dummy packConsulting toolkit   preparing dummy pack
Consulting toolkit preparing dummy pack
 
ChatGPT OpenAI Primer for Business
ChatGPT OpenAI Primer for BusinessChatGPT OpenAI Primer for Business
ChatGPT OpenAI Primer for Business
 
DBS Bank: Changing the Way We Hire, Work and Deliver Digital Experiences to o...
DBS Bank: Changing the Way We Hire, Work and Deliver Digital Experiences to o...DBS Bank: Changing the Way We Hire, Work and Deliver Digital Experiences to o...
DBS Bank: Changing the Way We Hire, Work and Deliver Digital Experiences to o...
 
CX Measurement: How Customer-Obsessed Companies Embrace Journey-Based Metrics
CX Measurement: How Customer-Obsessed Companies Embrace Journey-Based MetricsCX Measurement: How Customer-Obsessed Companies Embrace Journey-Based Metrics
CX Measurement: How Customer-Obsessed Companies Embrace Journey-Based Metrics
 
Digital Transformation Toolkit - Overview and Approach
Digital Transformation Toolkit - Overview and ApproachDigital Transformation Toolkit - Overview and Approach
Digital Transformation Toolkit - Overview and Approach
 
Digital Transformation Strategy & Framework | By ex-McKinsey
Digital Transformation Strategy & Framework | By ex-McKinseyDigital Transformation Strategy & Framework | By ex-McKinsey
Digital Transformation Strategy & Framework | By ex-McKinsey
 
Deloitte Automotive IT Strategy Teaser Slides
Deloitte Automotive IT Strategy Teaser SlidesDeloitte Automotive IT Strategy Teaser Slides
Deloitte Automotive IT Strategy Teaser Slides
 
Is AI generation the next platform shift?
Is AI generation the next platform shift?Is AI generation the next platform shift?
Is AI generation the next platform shift?
 
The Transformation Journey with Cloud Technology
The Transformation Journey with Cloud TechnologyThe Transformation Journey with Cloud Technology
The Transformation Journey with Cloud Technology
 
Digital Transformation Strategy and Plan Template
Digital Transformation Strategy and Plan TemplateDigital Transformation Strategy and Plan Template
Digital Transformation Strategy and Plan Template
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
 
Selling Cloud Services To Earn More Revenue PowerPoint Presentation Slides
Selling Cloud Services To Earn More Revenue PowerPoint Presentation SlidesSelling Cloud Services To Earn More Revenue PowerPoint Presentation Slides
Selling Cloud Services To Earn More Revenue PowerPoint Presentation Slides
 
Digital Transformation: Re-imagine from the outside In
Digital Transformation: Re-imagine from the outside InDigital Transformation: Re-imagine from the outside In
Digital Transformation: Re-imagine from the outside In
 

Viewers also liked

GIS Barrier to Adoption by Southern Precision
GIS Barrier to Adoption by Southern Precision GIS Barrier to Adoption by Southern Precision
GIS Barrier to Adoption by Southern Precision Amanda Woods
 
Setting up a big data platform at kelkoo
Setting up a big data platform at kelkooSetting up a big data platform at kelkoo
Setting up a big data platform at kelkooFabrice dos Santos
 
Performance and Application of GIS and Big Data ETL Processes Using FME
Performance and Application of GIS and Big Data ETL Processes Using FMEPerformance and Application of GIS and Big Data ETL Processes Using FME
Performance and Application of GIS and Big Data ETL Processes Using FMESafe Software
 
Intro to Big Data in Urban GIS Research
Intro to Big Data in Urban GIS ResearchIntro to Big Data in Urban GIS Research
Intro to Big Data in Urban GIS ResearchRobert Goodspeed
 
THT10839_OpenWorldSF2015 CSP Location Data Monetization V1.0
THT10839_OpenWorldSF2015 CSP Location Data Monetization V1.0THT10839_OpenWorldSF2015 CSP Location Data Monetization V1.0
THT10839_OpenWorldSF2015 CSP Location Data Monetization V1.0Srini Alavala
 
Film company case study
Film company case studyFilm company case study
Film company case studycallison1318
 
Mergers & Acquisitions 2013 V blank
Mergers & Acquisitions 2013  V blankMergers & Acquisitions 2013  V blank
Mergers & Acquisitions 2013 V blankchrismaupetit
 
Business Unusual: How the Dell-EMC Merger Sends Shockwaves Across the Global ...
Business Unusual: How the Dell-EMC Merger Sends Shockwaves Across the Global ...Business Unusual: How the Dell-EMC Merger Sends Shockwaves Across the Global ...
Business Unusual: How the Dell-EMC Merger Sends Shockwaves Across the Global ...Dana Gardner
 
Top mergers
Top mergersTop mergers
Top mergersMd Saim
 
Merger & Acquisition by top Technology companies
Merger & Acquisition by top Technology companiesMerger & Acquisition by top Technology companies
Merger & Acquisition by top Technology companiesSatyajit Paul
 
Merger of american airlines and us airways
Merger of american airlines and us airwaysMerger of american airlines and us airways
Merger of american airlines and us airwaysReena Sisodiya
 
Gillette and p&g
Gillette and p&gGillette and p&g
Gillette and p&gPankaj Baid
 
Merger and Acqusition
Merger and AcqusitionMerger and Acqusition
Merger and Acqusitionnitesh gunjan
 
Corporate Restructuring - A Financial Perspective
Corporate Restructuring - A Financial PerspectiveCorporate Restructuring - A Financial Perspective
Corporate Restructuring - A Financial PerspectiveDr. Mohamed Ibrahim
 

Viewers also liked (20)

GIS Barrier to Adoption by Southern Precision
GIS Barrier to Adoption by Southern Precision GIS Barrier to Adoption by Southern Precision
GIS Barrier to Adoption by Southern Precision
 
Open in Geo? Opening Geo!
Open in Geo? Opening Geo!Open in Geo? Opening Geo!
Open in Geo? Opening Geo!
 
Setting up a big data platform at kelkoo
Setting up a big data platform at kelkooSetting up a big data platform at kelkoo
Setting up a big data platform at kelkoo
 
Performance and Application of GIS and Big Data ETL Processes Using FME
Performance and Application of GIS and Big Data ETL Processes Using FMEPerformance and Application of GIS and Big Data ETL Processes Using FME
Performance and Application of GIS and Big Data ETL Processes Using FME
 
Big Data use cases in telcos
Big Data use cases in telcosBig Data use cases in telcos
Big Data use cases in telcos
 
Get Big Geo Data
Get Big Geo DataGet Big Geo Data
Get Big Geo Data
 
Intro to Big Data in Urban GIS Research
Intro to Big Data in Urban GIS ResearchIntro to Big Data in Urban GIS Research
Intro to Big Data in Urban GIS Research
 
THT10839_OpenWorldSF2015 CSP Location Data Monetization V1.0
THT10839_OpenWorldSF2015 CSP Location Data Monetization V1.0THT10839_OpenWorldSF2015 CSP Location Data Monetization V1.0
THT10839_OpenWorldSF2015 CSP Location Data Monetization V1.0
 
Google Big Table
Google Big TableGoogle Big Table
Google Big Table
 
Film company case study
Film company case studyFilm company case study
Film company case study
 
Mergers & Acquisitions 2013 V blank
Mergers & Acquisitions 2013  V blankMergers & Acquisitions 2013  V blank
Mergers & Acquisitions 2013 V blank
 
Business Unusual: How the Dell-EMC Merger Sends Shockwaves Across the Global ...
Business Unusual: How the Dell-EMC Merger Sends Shockwaves Across the Global ...Business Unusual: How the Dell-EMC Merger Sends Shockwaves Across the Global ...
Business Unusual: How the Dell-EMC Merger Sends Shockwaves Across the Global ...
 
Top mergers
Top mergersTop mergers
Top mergers
 
Merger and ecquisition
Merger and ecquisitionMerger and ecquisition
Merger and ecquisition
 
Merger & Acquisition by top Technology companies
Merger & Acquisition by top Technology companiesMerger & Acquisition by top Technology companies
Merger & Acquisition by top Technology companies
 
Merger of american airlines and us airways
Merger of american airlines and us airwaysMerger of american airlines and us airways
Merger of american airlines and us airways
 
Gillette and p&g
Gillette and p&gGillette and p&g
Gillette and p&g
 
Disney M&A Case Study
Disney M&A Case StudyDisney M&A Case Study
Disney M&A Case Study
 
Merger and Acqusition
Merger and AcqusitionMerger and Acqusition
Merger and Acqusition
 
Corporate Restructuring - A Financial Perspective
Corporate Restructuring - A Financial PerspectiveCorporate Restructuring - A Financial Perspective
Corporate Restructuring - A Financial Perspective
 

Similar to The Journey to Big Data Analytics

The Data Lake: Empowering Your Data Science Team
The Data Lake: Empowering Your Data Science TeamThe Data Lake: Empowering Your Data Science Team
The Data Lake: Empowering Your Data Science TeamSenturus
 
Energy Central Webinar on June 14, 2016
Energy Central Webinar on June 14, 2016Energy Central Webinar on June 14, 2016
Energy Central Webinar on June 14, 2016OMNETRIC
 
Create your Big Data vision and Hadoop-ify your data warehouse
Create your Big Data vision and Hadoop-ify your data warehouseCreate your Big Data vision and Hadoop-ify your data warehouse
Create your Big Data vision and Hadoop-ify your data warehouseJeff Kelly
 
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTXCustomer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTXtsigitnist02
 
Advanced IT Analytics: A Look at Real Adoptions in the Real World
Advanced IT Analytics: A Look at Real Adoptions in the Real WorldAdvanced IT Analytics: A Look at Real Adoptions in the Real World
Advanced IT Analytics: A Look at Real Adoptions in the Real WorldEnterprise Management Associates
 
Why Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A LieWhy Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A LieSunil Ranka
 
Big Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on DataBig Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on DataMatt Stubbs
 
Big Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on DataBig Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on DataMatt Stubbs
 
The new dominant companies are running on data
The new dominant companies are running on data The new dominant companies are running on data
The new dominant companies are running on data SnapLogic
 
10 Lessons Learned from Meeting with 150 Banks Across the Globe
10 Lessons Learned from Meeting with 150 Banks Across the Globe10 Lessons Learned from Meeting with 150 Banks Across the Globe
10 Lessons Learned from Meeting with 150 Banks Across the GlobeDataWorks Summit
 
Enterprise as a Discipline
Enterprise as a DisciplineEnterprise as a Discipline
Enterprise as a DisciplineJohn Gøtze
 
MongoDB World 2019: Data Digital Decoupling
MongoDB World 2019: Data Digital DecouplingMongoDB World 2019: Data Digital Decoupling
MongoDB World 2019: Data Digital DecouplingMongoDB
 
Digital Transformation: How to Run Best-in-Class IT Operations in a World of ...
Digital Transformation: How to Run Best-in-Class IT Operations in a World of ...Digital Transformation: How to Run Best-in-Class IT Operations in a World of ...
Digital Transformation: How to Run Best-in-Class IT Operations in a World of ...Precisely
 
Capgemini’s Data WARP: Accelerate your Journey to Insights
Capgemini’s Data WARP: Accelerate your Journey to InsightsCapgemini’s Data WARP: Accelerate your Journey to Insights
Capgemini’s Data WARP: Accelerate your Journey to InsightsCapgemini
 
Réinventez le Data Management avec la Data Virtualization de Denodo
Réinventez le Data Management avec la Data Virtualization de DenodoRéinventez le Data Management avec la Data Virtualization de Denodo
Réinventez le Data Management avec la Data Virtualization de DenodoDenodo
 
Cloud Choices Quantifying the Cost and Risk Implications of Cloud
Cloud Choices Quantifying the Cost and Risk Implications of CloudCloud Choices Quantifying the Cost and Risk Implications of Cloud
Cloud Choices Quantifying the Cost and Risk Implications of CloudAmazon Web Services
 

Similar to The Journey to Big Data Analytics (20)

The Data Lake: Empowering Your Data Science Team
The Data Lake: Empowering Your Data Science TeamThe Data Lake: Empowering Your Data Science Team
The Data Lake: Empowering Your Data Science Team
 
Energy Central Webinar on June 14, 2016
Energy Central Webinar on June 14, 2016Energy Central Webinar on June 14, 2016
Energy Central Webinar on June 14, 2016
 
Create your Big Data vision and Hadoop-ify your data warehouse
Create your Big Data vision and Hadoop-ify your data warehouseCreate your Big Data vision and Hadoop-ify your data warehouse
Create your Big Data vision and Hadoop-ify your data warehouse
 
Energy Management Solution - iARMS-EMS/PMS
Energy Management Solution - iARMS-EMS/PMSEnergy Management Solution - iARMS-EMS/PMS
Energy Management Solution - iARMS-EMS/PMS
 
Kba symposium 6th april
Kba symposium 6th aprilKba symposium 6th april
Kba symposium 6th april
 
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTXCustomer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
 
Advanced IT Analytics: A Look at Real Adoptions in the Real World
Advanced IT Analytics: A Look at Real Adoptions in the Real WorldAdvanced IT Analytics: A Look at Real Adoptions in the Real World
Advanced IT Analytics: A Look at Real Adoptions in the Real World
 
Why Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A LieWhy Everything You Know About bigdata Is A Lie
Why Everything You Know About bigdata Is A Lie
 
Big Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on DataBig Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on Data
 
Big Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on DataBig Data LDN 2017: The New Dominant Companies Are Running on Data
Big Data LDN 2017: The New Dominant Companies Are Running on Data
 
The new dominant companies are running on data
The new dominant companies are running on data The new dominant companies are running on data
The new dominant companies are running on data
 
10 Lessons Learned from Meeting with 150 Banks Across the Globe
10 Lessons Learned from Meeting with 150 Banks Across the Globe10 Lessons Learned from Meeting with 150 Banks Across the Globe
10 Lessons Learned from Meeting with 150 Banks Across the Globe
 
Enterprise as a Discipline
Enterprise as a DisciplineEnterprise as a Discipline
Enterprise as a Discipline
 
Big Data & Analytics Day
Big Data & Analytics Day Big Data & Analytics Day
Big Data & Analytics Day
 
MongoDB World 2019: Data Digital Decoupling
MongoDB World 2019: Data Digital DecouplingMongoDB World 2019: Data Digital Decoupling
MongoDB World 2019: Data Digital Decoupling
 
Digital Transformation: How to Run Best-in-Class IT Operations in a World of ...
Digital Transformation: How to Run Best-in-Class IT Operations in a World of ...Digital Transformation: How to Run Best-in-Class IT Operations in a World of ...
Digital Transformation: How to Run Best-in-Class IT Operations in a World of ...
 
Capgemini’s Data WARP: Accelerate your Journey to Insights
Capgemini’s Data WARP: Accelerate your Journey to InsightsCapgemini’s Data WARP: Accelerate your Journey to Insights
Capgemini’s Data WARP: Accelerate your Journey to Insights
 
Taming Big Data With Modern Software Architecture
Taming Big Data  With Modern Software ArchitectureTaming Big Data  With Modern Software Architecture
Taming Big Data With Modern Software Architecture
 
Réinventez le Data Management avec la Data Virtualization de Denodo
Réinventez le Data Management avec la Data Virtualization de DenodoRéinventez le Data Management avec la Data Virtualization de Denodo
Réinventez le Data Management avec la Data Virtualization de Denodo
 
Cloud Choices Quantifying the Cost and Risk Implications of Cloud
Cloud Choices Quantifying the Cost and Risk Implications of CloudCloud Choices Quantifying the Cost and Risk Implications of Cloud
Cloud Choices Quantifying the Cost and Risk Implications of Cloud
 

Recently uploaded

Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfchwongval
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Seán Kennedy
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Boston Institute of Analytics
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdfHuman37
 
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...GQ Research
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Seán Kennedy
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanMYRABACSAFRA2
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...Boston Institute of Analytics
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Thomas Poetter
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfJohn Sterrett
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degreeyuu sss
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)jennyeacort
 
Vision, Mission, Goals and Objectives ppt..pptx
Vision, Mission, Goals and Objectives ppt..pptxVision, Mission, Goals and Objectives ppt..pptx
Vision, Mission, Goals and Objectives ppt..pptxellehsormae
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhThiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhYasamin16
 

Recently uploaded (20)

Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdf
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
 
20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf20240419 - Measurecamp Amsterdam - SAM.pdf
20240419 - Measurecamp Amsterdam - SAM.pdf
 
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
Biometric Authentication: The Evolution, Applications, Benefits and Challenge...
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population Mean
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
 
DBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdfDBA Basics: Getting Started with Performance Tuning.pdf
DBA Basics: Getting Started with Performance Tuning.pdf
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
 
Vision, Mission, Goals and Objectives ppt..pptx
Vision, Mission, Goals and Objectives ppt..pptxVision, Mission, Goals and Objectives ppt..pptx
Vision, Mission, Goals and Objectives ppt..pptx
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhThiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
 

The Journey to Big Data Analytics

  • 1. The Journey to Big Data Analytics Dr. Stefan Radtke CTO Isilon Storage Division, EMEA Dell EMC IDC Conference, Madrid, January 31st 2017
  • 2. 2© Copyright 2016 Dell EMC All rights reserved. Welcome! Dr. Stefan Radtke CTO Isilon, EMEA Dell EMC | Storage Division - 1995-2011 : 17 Years for IBM in various technical roles - 2012-2013 : Global Architect, EMC Global Alliances - 2013-2016 : CTO, EMEA, Isilon Storage Division, EMC - 2016-today : CTO, EMEA, Isilon Storage Division, Dell EMC Phone: +49-176-34434460 E-Mail: Stefan.Radtke@dell.com LinkedIn: http://de.linkedin.com/in/drstefanradtke Blog: http://stefanradtke.blogspot.com
  • 3. 3© Copyright 2016 Dell EMC All rights reserved. Analytics affects all Industries Smart factories, process control, supply/labour efficiency, and automation control Smart meters and grids, preventative maintenance, and environmental monitoring Smart infrastructure, traffic optimization, maintenance, and fleet tracking Hospital patient monitoring, home healthcare, and remote diagnosis Wearables, automotive, smart home, and entertainment disruptions “just-in-time” management, promotions, and Location-based advertising Manufacturing Retail Energy Transportation/ Infrastructure Consumer Healthcare
  • 4. 4© Copyright 2016 Dell EMC All rights reserved. Step 2: Define question to be answered Step 3: Use Business Intelligence (BI) tool’s graphical user interface (GUI) to construct query Step 4: BI tool creates SQL Step 5: SQL is run against data warehouse to create report DW Traditional BI Engagement Process Step 1: Pre-build data schema (schema-on-load)
  • 5. 5© Copyright 2016 Dell EMC All rights reserved. Evolution Of The Analytic Questions • How many widgets did I sell last month? • What were sales by zip code for Christmas last year? • How many of Product X were returned last month? • What were company revenues and profits for the past quarter? • How many employees did I hire last year? What Happened? (Descriptive/BI) What Will Happen? (Predictive) • How many widgets will I sell next month? • What will be sales by zip code over this Christmas season? • How many of Product X will be returned next month? • What are projected company revenues and profits for next quarter? • How many employees will I need to hire next year? What Should I do? (Prescriptive) • Order [5,0000] component Z to support widget sales for next month • Hire [Y] new sales reps by these zip code to handle projected Christmas sales • Set aside [$125K] in financial reserve to cover Product X returns • Sell the following product mix to achieve quarterly revenue and margin goals • Increase hiring pipeline by 35% to achieve hiring goals
  • 6. 6© Copyright 2016 Dell EMC All rights reserved. Step 1: Define Hypothesis to test or Prediction to be made Step 3: Build schema (schema-on- query) Step 4: Visualize the data (Tableau, Spotfire, ggplot2,…) Step 6: Evaluate model results (probabilities, confidence levels) Data Science Engagement ProcessRepeat Step 5: Build analytic models (SAS, R, MADlib, Mahout,…) Kronos Historical Google Trends Physician Notes Local Events Weather Forecast Epic Lawson CDC Step 2: Gather data…and more data (Data Lake: SQL + Hadoop)
  • 7. 7© Copyright 2016 Dell EMC All rights reserved. Why we need to collect ALL data !
  • 8. 8© Copyright 2016 Dell EMC All rights reserved. Holistic Data Collection Data Lake
  • 9. 9© Copyright 2016 Dell EMC All rights reserved. Companies understand the Value of Information but many of them don’t know how to start the journey.
  • 10. 10© Copyright 2016 Dell EMC All rights reserved. Brainstorm the right questions to ask
  • 11. 11© Copyright 2016 Dell EMC All rights reserved. Do not filter any Data Sources • Brainstorming predictive and prescriptive questions typically uncovers numerous new data sources that are worthy of consideration. And this is a key point: ALL data sources are worthy of consideration! • Do NOT filter the data sources at this point in the process.
  • 12. 12© Copyright 2016 Dell EMC All rights reserved. But how do we get “Smart” First consideration: What is the business initiative or “what” we want to accomplish? For example, Reduce traffic congestions. Some key questions/parameters to consider: • Traffic flow decisions: New roads? New lanes? New turn lanes? New bike lanes? Pedestrian crossings? Railroad crossings? Bus stops? • Road repair and maintenance decisions: Fixing potholes? Repaving surfaces? Materials and equipment needed? When to fix potholes and repave streets? • Construction permits decisions: Types of permits needed? Impact on traffic flow? Length of time to complete the work? Number of employees to consider? • Events management decisions: Traffic (cars and pedestrians) attending proposed event? Impact on normal traffic flow? Date, time, location and duration of events? • Parks decisions: Location of parks? Size of parks? Hours of operation? Park equipment maintenance? • Schools decisions: Location and size of new schools? Hours of operations? Location of stoplights and stop signs?
  • 13. 13© Copyright 2016 Dell EMC All rights reserved. Data Assessment: Value vs. Feasibility Business Initiative: Improve Traffic Flow Impact Feasibility
  • 14. 14© Copyright 2016 Dell EMC All rights reserved. Prioritization Matrix Hi Hi Lo Implementation Feasibility BusinessValue C A F B D E Use Cases (Decisions) A. Optimize Traffic Flow B. Improve Road Repair and Maintenance C. Optimize Construction Permits D. Improve Events Management E. Optimize Park Hours and Activities F. Optimize School Hours and Activities Business Initiative: “Smart” City Initiative
  • 15. 15© Copyright 2016 Dell EMC All rights reserved. KEEPING UP WITH NEW TECHNOLOGIES AND TOOLS Product Performance & Reliability Customer Sentiment Analysis Supply Chain Optimization Competitive War Games IDENTIFYING THE RIGHT USE CASE FINDING, CURATING, AND GOVERNING THE DATA Why do most efforts to stand up Big Data Initiatives stall or fail ?
  • 16. 16© Copyright 2016 Dell EMC All rights reserved. Demonstrate the potential value using data science techniques Strategic Approach Align business and IT goals around big data Identify strategic opportunities for big data analytics Prioritize key use cases by assessing feasibility and ROI Recommend the appropriate analytics engagement and deployment roadmap 1 2 3 4 5 Workshop Objectives
  • 17. 17© Copyright 2016 Dell EMC All rights reserved. Get help for the Journey Platform for Big Data and Analytics Solutions BUSINESS TECHNOLOGY DEPLOYASSESS PROVE Big Data Proof of Value Big Data Proof of Technology Big Data Applied Analytics Implementation Big Data Technology Implementation Big Data Vision Workshop Big Data Technology Advisory Dell EMC Service Offerings
  • 18. 18© Copyright 2016 Dell EMC All rights reserved. My prediction: market is looking for well integrated systems. Easy to use and with flexible deployment options.
  • 19. 19© Copyright 2016 Dell EMC All rights reserved. Advance Analytic (Data Science) Models Graph AnalyticsForecasting Time Decomposition Association Analytics Behavioral Analytics
  • 20. 20© Copyright 2016 Dell EMC All rights reserved. Complex and growing Ecosystem
  • 21. 21© Copyright 2016 Dell EMC All rights reserved. DATA LAKE COMPUTE: EMC Converged Platform Integrated and Converged Infrastructure PLATFORM MANAGER ADMINISTRATION ANALYTICS CATALOG DATA CATALOG Rich UX INFRASTRUCTURE SOFTWARE OPTIONS YOUR WORKSPACE COMPUTE, NETWORK, STORAGE CONVERGED INFRASTRUCTUREDATA SETS DATA CURATOR ENRICH INGEST INDEX FIND AND INGEST DATA AT LEAST ONE HADOOP DISTRIBUTION DATA GOVERNOR LINEAGE QUALITY SECURITY GOVERN/ PUBLISH DATA, TOOLS, APPS Integrated Analytics 3rd Party Openness DATA SCIENCE Pivotal Big Data Suite EXTENSION PACKS R, RStudio, Anaconda python, jupyter PUBLISHED TOOLS AND DATASETS
  • 22. 22© Copyright 2016 Dell EMC All rights reserved. Thanks for your Attention! Dr. Stefan Radtke CTO Isilon, EMEA Dell EMC | Storage Division - 1995-2011 : 17 Years for IBM in various technical roles - 2012-2013 : Global Architect, EMC Global Alliances - 2013-2016 : CTO, EMEA, Isilon Storage Division, EMC - 2016-today : CTO, EMEA, Isilon Storage Division, Dell EMC Phone: +49-176-34434460 E-Mail: Stefan.Radtke@dell.com LinkedIn: http://de.linkedin.com/in/drstefanradtke Blog: http://stefanradtke.blogspot.com

Editor's Notes

  1. IoT solutions have found their way into many different vertical industries as shown in the slide. As these different industries have specific needs, it is best to categorize the type of IoT solution which you want to build, and define what type of data you will need to collect for analyses. -Manufacturing uses IoT solutions in machine to machine communications in what are called “smart factories”. IoT elevates automation to a new level providing unparalleled process control, control , quality and efficiency. - Retail uses IoT solutions in everything from learning customer trends to “real-time” priding and inventory control with RFID tagging. Energy companies deploy “Smart grids” and use IoT for telemetry , metering and general data collection Transportation companies used IoT for global logistics to analyze traffic pattern, increase efficiency and save on variable costs like fuel Consumers are now seeing IoT in wearable devices like fitness trackers, and collision avoidance in their automobiles Healthcare uses IoT solutions in all types of solutions from patient monitoring to drug companies collecting remote data on trials
  2. Time-to-analysis bottlenecked by need to decide questions (queries) before developing the schema
  3. In order to gain meaning from Big Data, you need “Data Science” Business Intelligence is different than data science BI reports on historical performance – retrospective reporting and on-going business monitoring What happened last quarter? How many did we sell? Data science is about predicting the future and understanding why things happen What is the optimal solution? What will happen next? Data science provides a new approach to uncovering and acting on the insights buried across the wealth of available data sources
  4. Ade range of devices (traffic lights, parking meters, weather instruments, etc.) and video cameras (traffic, pedestrian and bike traffic flow) generating data about city operations. A citizen could combine these sensor and video-generated data with other data sources, such as social media (Facebook, Instagram, Yelp) + citizen comments (emails, phone calls) + city reports (police blotters, fire reports, emergency services, construction permits, work orders, building hours, etc.) + local events (concerts, sporting events, farmers markets, parades, festivals, etc.) to create a rich perspective on the city’s activities, problems and overall economic and social vitality.
  5. In order to gain meaning from Big Data, you need “Data Science” Business Intelligence is different than data science BI reports on historical performance – retrospective reporting and on-going business monitoring What happened last quarter? How many did we sell? Data science is about predicting the future and understanding why things happen What is the optimal solution? What will happen next? Data science provides a new approach to uncovering and acting on the insights buried across the wealth of available data sources
  6. Once you know the decisions, the next step is to brainstorm the questions stakeholders need to answer in support of key decisions. This process will help to identify variables and metrics that might be better predictors of the decisions we are trying to make. While most organizations have a good handle on the “descriptive” (What happened?) questions, the business stakeholders struggle with the “predictive” (what is likely to happen?) and the “prescriptive” (what should I do?) questions (see Figure 1).
  7. Getting smart starts by understanding the city’s key business initiative or business objective (i.e., “what” we want to accomplish). For example, let’s identify and understand the decisions that city management (our key business stakeholder in this example) needs to make to support the business initiative of “Improving traffic flow.” This could include: Traffic flow decisions: New roads? New lanes? New turn lanes? New bike lanes? Pedestrian crossings? Railroad crossings? Bus stops? Road repair and maintenance decisions: Fixing potholes? Repaving surfaces? Materials and equipment needed? When to fix potholes and repave streets? Construction permits decisions: Types of permits needed? Impact on traffic flow? Length of time to complete the work? Number of employees to consider? Events management decisions: Traffic (cars and pedestrians) attending proposed event? Impact on normal traffic flow? Date, time, location and duration of events? Parks decisions: Location of parks? Size of parks? Hours of operation? Park equipment maintenance? Schools decisions: Location and size of new schools? Hours of operations? Location of stoplights and stop signs?
  8. Property of William D Schmarzo
  9. However, enabling big data initiatives is difficult. Many enterprise IT departments were not built to support the third platform and they struggle to deploy and maintain a big data infrastructure which is based on an ever-expanding suite of big data technologies and tools. Data all over the company in different formats with different levels of governance and security… A never-ending list of new technologies and applications that is changing every day… and their interactions with the infrastructure to run optimally Once enabled, customers see the potential and the numbers of Projects, questions and objectives arise quickly. Use cases explode across the organization as the benefits of analytics are understood, The organization struggles to prioritize and to focus on the use case with the most impact and feasibility for the business. and the prioritization and rapid execution become paramount – enabling them to become an Agile Digital Business Customers are very good at building a physical Hadoop cluster, but struggle to take the next steps: They often focus on technical activities vs. business outcomes They have little or no data governance, but need it desperately They struggle with managing and deploying the various tools They can’t easily expand restrictive physical infrastructures, slowing analytics to a crawl They cannot keep up with a never-ending array of emerging technologies Ultimately, most efforts to stand up big data initiatives stall or fail due to the complexity of the infrastructure, tools, technologies and culture changes required. BUT EXTRACTING BUSINESS VALUE IS HARD EXPECT DEPLOYMENT, MANAGEMENT, AND GOVERNANCE COMPLEXITY
  10. The Workshop Objectives are to align business and IT goals around big data, identify strategic opportunities for big data analytics, prioritize key use cases by assessing feasibility and ROI, demonstrate the potential value using data science techniques, and to recommend the appropriate analytics engagement and deployment roadmap.
  11. Technology Track Some clients have already made progress implementing certain data and analytics use cases, and now the IT organization seeks to expand its capabilities and operationalize the processes, to meet growing demands for better/faster data and analytics. But what often happens is that IT hits a technology wall, because the underlying infrastructure, tools and processes don’t support the new demands of the business. Some typical scenarios we see are gaps in the Big Data capabilities within the IT environment, and long delays in delivery of incoming requests for data and analytics. Uniquely, we help you understand your technology gaps in context to your business goals. The point is that you need to make the right recommendations about where to invest. Big Data Technology Advisory: For customers who need to document and/or understand the existing technical environment and it’s limitations with respect to big data For customers who want a technology roadmap for specific big data capabilities Identifies gaps in current infrastructure Produces a plan for activating technical big data use cases Big Data Proof of Technology: Pilots a known technology use on EMC equipment within the customer’s IT environment Demonstrates how the data lake approach and functionality works with existing customer systems and data sources Big Data Technology Implementation: Install and configure core data lake architecture Automate data ingest, data preparation and analytics execution around a specific technology use case Implementation of Platforms for Big Data and Analytics Solutions: Implement data lakes and analytics and app/dev capabilities into production environments Integration of analytic results into management applications Define and implement business rules and policies as they relate to data Overview of Business Track Our point of view is that success starts with aligning IT and the business around a single strategic business initiative within a 9-12 month timeframe. This helps us identify an analytics use case that will accelerate a current business goal or solve a current problem. You need to deliver the right analytic recommendations to the data science teams – the workhorses of your Big Data ecosystem – to help them surface insights that can drive business value. We have a unique methodology to identify and prioritize a single analytics use case with the best combination of implementation feasibility and business value. It’s a 3-week engagement that applies research, interviews, data science expertise and techniques to your business – culminating in a 1-day workshop to identify and agree on the best analytics use case and path forward to solving a business problem. This approach sets us apart from the “bring in a bunch of technology and see what it can do” approach that’s pushed by many vendors. We call this a Big Data Vision Workshop. The next step is for you to understand the ROI of your use case, so you know in advance, how it will pay off, how much, and over what timeframe. We’ll prove out your prioritized use case ahead of time, on-site, with your data, on a real analytics environment to generate the required analytic lift. We then model an app/process for you that would leverage that insight to achieve the business objective. We call this a Proof of Value. EMC’s proven end-to-end methodology includes: Inventory, evaluation, and prioritization of data sources Data preparations, integration and enrichment Analytic models, visualizations, rules, scoring, etc. Documentation of actionable insights Future state data architecture, IT process, organization and governance recommendations Implementation Roadmap Confirmed analytic lift and high-level business case / ROI For an applied analytics implementation, we can configure the architecture for production in your environment, and build your application, so you can implement your solution quickly and easily – speeding time-to-value while establishing a path for future use cases. We have both the consulting and technology components under one roof to help you assess, prove, and deploy, your use case.
  12. Understand content, scope and relationships within the data Diagrams 1 & 2: Average net income ranges from category 1 of $192,728 to category 7 of $29,958. Category 4 does not have data. Diagram 3: Cardmembers located at mid to upper categories (1-3) and the lowest category (7). The lowest category however, are the youngest population with high peak at age 21. This could be a student group. Diagram 4: Cardmember household size are congregated at the 1 to 3 ranges
  13. Converged Infrastructure – Blocks and Racks for delivering storage, compute, and local storage - This is quota allocated to a workspace - The BDL Leverages Isilon as the optimal multi-protocol Data Lake for storing raw content – and then making it accessible to the BDL via its HDFS protocol support The Data Curator is built on a key notion of Data Awareness of the BDL It allows for the indexing of content in the Lake as well as anywhere in the enterprise and beyond Data Scientists can search for, and even sample, data in the lake and across the enterprise that will help them build their model. Statistic – 80% of an data scientist’s time is spent looking for and getting the right data Once found, the Data Curator has an ingestion capability that supports the transfer, blending, and wrangling of that “right data” into a Data Scientists private workspace or sandbox for further preparation The Platform Manager provisions analytic applications and curated data sets into the Users workspace. The Data Governor supports the ability to have policy-driven data stewardship for Security, Lineage, and Quality