SlideShare a Scribd company logo
1© Copyright 2014 EMC Corporation. All rights reserved.
Data Science + Data Engineering
Annika Jimenez
Secret Weapon of the Strategic Enterprise
2© Copyright 2014 EMC Corporation. All rights reserved.
Agenda
 Data Science: What is it and why do we do it?
 The Importance of Data Engineering
 An Example: Kaiser Code-a-thon
 Transforming Your Enterprise with Pivotal
 Pivotal Data Labs: Data Engineering + Data Science
 Closing Advice
3© Copyright 2014 EMC Corporation. All rights reserved.
What Matters: Apps. Data. Analytics.
Apps power business, and
those apps generate data
Analytic insights from that
data drive new app
functionality, which in-turn
drives new data
The faster you can move
around the cycle, the faster
you learn, innovate and pull
away from the competition
4© Copyright 2014 EMC Corporation. All rights reserved.
Primary Motions for Pivotal
Agile: data-driven apps and
rapid time to value
Data Lake: store everything,
analyze anything
Enterprise PaaS: revolutionary
software development and
speed; build the right thing
5© Copyright 2014 EMC Corporation. All rights reserved.
DATA SCIENCE
The use of statistical and machine learning
techniques on big, multi-structured data
– in a distributed computing environment –
to identify correlations and causal relationships,
classify and predict events, identify patterns and
anomalies, and infer probabilities,
interest, and sentiment.
6© Copyright 2014 EMC Corporation. All rights reserved.
But, why do
we use Data Science?
7© Copyright 2014 EMC Corporation. All rights reserved.
 BI – show dashboard
8© Copyright 2014 EMC Corporation. All rights reserved.
Is the Goal Any of These Things?
A. Cool Visualizations
B. Custom Querying
C. Decision Enablement
D. Insights
E. All of the above…
NO
9© Copyright 2014 EMC Corporation. All rights reserved.
DRIVE AUTOMATED
LOW LATENCY ACTIONS
IN RESPONSE TO
EVENTS OF INTEREST
10© Copyright 2014 EMC Corporation. All rights reserved.
YOUR
DATA
DATA
SCIENCE
+= MODELS
11© Copyright 2014 EMC Corporation. All rights reserved.
Drive
Automated
Low Latency
Actions
Production
Data Feeds
Low
Latency
Model
Scoring
API
Availability
or Push to
Apps
Business
Logic
Applicatio
n
Response
New
Events
(aka, Data)Model
Operationalization
(“O16N”)
12© Copyright 2014 EMC Corporation. All rights reserved.
Data Science Value Chain
Instrumen-
tation
Logs
Capture
Store
Transform
& Prepare
Access Model Dev. Deploy Apps
Process
Change
Product
Engineer
Data
Engineer
DBA
Data
Engineer
Data
Engineer Data
Scientist
Data
Engineer
Application
Developer
PMO
13© Copyright 2014 EMC Corporation. All rights reserved.
→ Kaiser Blog for Full Story
14© Copyright 2014 EMC Corporation. All rights reserved.
Code-a-Thon Details – Logistics
 24-Hour Data Science Code-a-Thon
 5 resources per vendor
 Vendors were asked to be prepared
for any use in the domain
 A 15-minute presentation to senior
leaders, executives, doctors and
pharmacists
 Teams were required to use Tableau
in their presentation
15© Copyright 2014 EMC Corporation. All rights reserved.
Code-a-Thon – Pivotal Team
Hulya Emir-Farinas
Data Scientist
Noah Zimmerman
Data Scientist
Jacque Istok
Application Developer
Dillon Woods
Application Developer
Randy Williard
Big Data Engineer
Jemish Patel
Big Data Engineer
Adam Shook
Big Data Engineer
Roy Mims
Coordinator
16© Copyright 2014 EMC Corporation. All rights reserved.
The Day of the Code-a-Thon…
17© Copyright 2014 EMC Corporation. All rights reserved.
Key Insight
18© Copyright 2014 EMC Corporation. All rights reserved.
Asthma Population Management Application
19© Copyright 2014 EMC Corporation. All rights reserved.
Asthma Management Application
20© Copyright 2014 EMC Corporation. All rights reserved.
What Did We Learn in 2013?
 Pivotal has a world-class Data Science team, the best there is
 Data Science alone is good, but Data Science + Expert Data
Engineering and Architecture is great
 Corollary: Data Science + Data Engineering + Apps trumps
everything
– This is the path to rapid value creation and ROI
21© Copyright 2014 EMC Corporation. All rights reserved.
DATA
SCIENCE
DATA
ENGINEERING
PIVOTAL
LABS
Data Science + Data Engineering +
Pivotal Labs = The Magic in the Middle
22© Copyright 2014 EMC Corporation. All rights reserved.
What Is Pivotal Data Labs?
Data Science Data Engineering
+
23© Copyright 2014 EMC Corporation. All rights reserved.
Pivotal Data Scientists are technical
professionals with strong programming
skills, anchored in
vertical/horizontal domains or in
specialized academic research, able to
identify real-world problems
requiring predictive analytics, formulate
these mathematically, and solve them
by applying machine learning and
statistical algorithms, on Big Data,
in Pivotal and third-party
technologies.
Pivotal Data Engineers are Big Data
experts and industry veterans with a
passion to leverage these skills to
drive business value for Pivotal
customers. They posses expert
knowledge and
skills with the Pivotal data products
and excel at architecting enterprise
scale solutions to the most
demanding data and analytic
challenges.
Data Science Data Engineering
+
What Is Pivotal Data Labs?
24© Copyright 2014 EMC Corporation. All rights reserved.
What is a
“Data
Scientist”?
ProgrammingSkills
Mathematical/Statistical Skills
25© Copyright 2014 EMC Corporation. All rights reserved.
What is a
“Data
Engineer”?
ProgrammingSkills
Architectural Skills
26© Copyright 2014 EMC Corporation. All rights reserved.
World’s Leading Experts
Pivotal Labs – Pivotal Data Labs
BATCH BATCH
NEAR TIME NEAR TIMEHAWQGreenplum DB
Pivotal HD
REAL TIME REAL TIMEGemFire XDGemFire
27© Copyright 2014 EMC Corporation. All rights reserved.
Pivotal One
SOLUTIONS
Pivotal One
SERVICES
S
PivotalOne
PIVOTAL
MySQL
Elastic Runtime
Services:
Java, Spring, Ruby,
Node.JS
Value-adds:
Installation,
Management
& Monitoring
(Core OSS)
• Data Lake Solutions (Security Analytics, Corp Comm Analytics, Business)
• RTI for Telco (RTI4T)
PIVOTAL
GemFire
XD
PIVOTAL
Data Dispatch
Coming in 2014
Pivotal HD
Hadoop+Que
ry
GPDB
MPP
Analytics
GemFire
In-Memory
Grid
Spring
App
Framework
RabbitMQ
, Redis…
Pivotal One
MARKETPLACE
Pivotal Data Labs in Data Fabric
Building Towards Pivotal One
28© Copyright 2014 EMC Corporation. All rights reserved.
Introducing Pivotal Data Labs
Our Charter:
Pivotal Data Labs is Pivotal’s differentiated and highly opinionated
data-centric service delivery organization.
Our Goals:
Expedite customer time-to-value and ROI, by driving business-aligned
innovation and solutions assurance within Pivotal’s Data Fabric
technologies.
Drive customer adoption and autonomy across the full spectrum of
Pivotal Data technologies through best-in-class data science and data
engineering services, with a deep emphasis on knowledge transfer.
29© Copyright 2014 EMC Corporation. All rights reserved.
Highly-Opinionated & Differentiated
 “Highly-Opinionated” – Highly prescriptive in our
counsel of data best practices to customers and
partners, drawing from best-in-class talent and deep
experience operating on the Pivotal Data Fabric
 “Differentiated” – An expert Data services
business that is unique in its class and unlike the
Data services available elsewhere
30© Copyright 2014 EMC Corporation. All rights reserved.
What Will PDL Deliver For Customers?
Accelerated time-to-value
and real ROI for customers
31© Copyright 2014 EMC Corporation. All rights reserved.
How Do We Do This?
 Best-in-class Data Science to drive value creation on
Pivotal stack and customer data
 Best-in-class Data Engineering to drive pragmatic, well-
designed, customized architecture for end-to-end Pivotal
stack
 Assured solutions success in Pivotal data service delivery
 Operationally-optimized predictive models
 Collaboration with Pivotal Labs to deliver data-driven
applications
32© Copyright 2014 EMC Corporation. All rights reserved.
INSTALL VERIFY ENABLE
We will verify the installation
making sure it’s fully operational
in your environment and ready
to give you the Pivotal
advantage.
Our experts work with your staff
to plan, install and fully-
configure the Pivotal software
based on your environment and
requirements.
Lastly, we’ll train your people
and conduct knowledge transfer
to make sure you are
comfortable using and
supporting Pivotal software.
Getting Started with Pivotal Software
Engagement Management – site prep, project management, customer support overview
Hardware Validation / Installation
Software Installation / Verification
Training & Knowledge
Transfer
PIVOTAL
ONBOARDIN
G
SERVICES
33© Copyright 2014 EMC Corporation. All rights reserved.
PIVOTAL
SOLUTION
ASSURANCE
INCEPTION ADVISE SUPPORT
Leveraging expert-services in
Pivotal Data Labs, Pivotal Labs,
and Certified Pivotal Partners
we’ll work with your architects
and developers to assure that
your system design and
development is aligned with
Pivotal best practices.
Getting off the ground with a
well-formed plan, a solid team
and realistic expectations are
fundamental to overall success.
Our experts help with design,
guidance, oversight and
lessons from other customers to
get your initiative going in the
best direction from the start.
.
We’ll act as your conduit to
Certified Professional Services,
Customer Support and Pivotal
R&D to make you successful,
quickly determine answers and
bring in specialized expertise
where needed.
Leverage Pivotal Data Experts for Success
Engagement / Success Alignment – Regular meetings for status and guidance, Resource advice
Architecture Design
Implementation Design and
Assistance
Resource Assistance,
Expedited Response
34© Copyright 2014 EMC Corporation. All rights reserved.
DISCOVERY INSIGHTS RESULTS
Once the data is understood, we
set ourselves apart by making
optimal use of Pivotal’s Data
Fabric, our analytical tools and
our data science experts to build
models creating deep actionable
insights on key events of interest
in any use case.
Getting the most from your data
requires understanding what
you have today and discovering
what your data can do for you.
Combining our data scientists
with your data starts the path to
value creation.
Driving insights into actionable
results is enabled through data
scoring and model code
optimization, documentation
and knowledge transfer.
Value Creation With Predictive Insights
Engagement / Business / Technical Alignment – Regular meetings for status and validation
Data exploration, readiness
assessment, and prep Combining domain knowledge and
data science for predictive modeling
Code documentation, and
knowledge sharing
DATA
SCIENCE
LABS
35© Copyright 2014 EMC Corporation. All rights reserved.
DATA
LAB INCEPTION IMPLEMENT EXCEL
Delivered by a dedicated team
drawing from Pivotal Data Labs’
experts in architecture, data
engineering, data science, and
application development,
implementations of Pivotal
technologies will be targeted to
maximize value creation against
your specific technical and
business goals.
Getting off the ground with a
well-formed holistic
architectural, data science, and
application plan is fundamental
to overall success. Our experts
will drive this process
leveraging deep experience in
these areas, to streamline the
path to success for your
initiative.
Years of experience building
successful data projects give us
the know-how to quickly and
efficiently work through the
challenging phases of any
project including design,
scaling, integration and
production readiness.
Engagement / Business / Technical Alignment – Regular meetings for status and guidance
Data platform architecting
and strategic analytiic use-
case roadmaps
Data and Application Fabric
deployments, Data Science
modeling & App development Knowledge sharing,
training and expert
assistance
Deep Partnering to Maximize Value Creation
36© Copyright 2014 EMC Corporation. All rights reserved.
Pivotal Data Labs + Pivotal Labs =
The Magic in the Middle
RAPID VALUE!
PIVOTAL
LABS
*
PIVOTAL DATA LABS
37© Copyright 2014 EMC Corporation. All rights reserved.
My Advice to Enterprises
1. Know your data and its potential value
2. Get “vision” and question status quo
3. Understand the technical paradigm shift underway
4. Hire or grow your Data Dream Team:
Data Scientists and Data Engineers
5. Clear the path to operationalization (aka, value)
6. Manage the disruption, don’t reject it
Pivotal data science_data_engineering_secret_weapons_of_the_strategic_enterprise

More Related Content

What's hot

Industrial IoT and OT/IT Convergence
Industrial IoT and OT/IT ConvergenceIndustrial IoT and OT/IT Convergence
Industrial IoT and OT/IT Convergence
Michelle Holley
 

What's hot (20)

Hitachi Cloud and Solutions
 Hitachi Cloud and Solutions Hitachi Cloud and Solutions
Hitachi Cloud and Solutions
 
Dell NVIDIA AI Roadshow - South Western Ontario
Dell NVIDIA AI Roadshow - South Western OntarioDell NVIDIA AI Roadshow - South Western Ontario
Dell NVIDIA AI Roadshow - South Western Ontario
 
Advantages of Converged Infrastructures
Advantages of Converged InfrastructuresAdvantages of Converged Infrastructures
Advantages of Converged Infrastructures
 
Industry edge communications edition spring 2013
Industry edge communications edition spring 2013Industry edge communications edition spring 2013
Industry edge communications edition spring 2013
 
Smart Enterprise Drivers 2020 - Strategic Realities Reshaping the Smart Enter...
Smart Enterprise Drivers 2020 - Strategic Realities Reshaping the Smart Enter...Smart Enterprise Drivers 2020 - Strategic Realities Reshaping the Smart Enter...
Smart Enterprise Drivers 2020 - Strategic Realities Reshaping the Smart Enter...
 
The Enterprise Business Case for Cloud Transformation: Introducing Everest Gr...
The Enterprise Business Case for Cloud Transformation: Introducing Everest Gr...The Enterprise Business Case for Cloud Transformation: Introducing Everest Gr...
The Enterprise Business Case for Cloud Transformation: Introducing Everest Gr...
 
Nec smart enterprise_trends_2014-slides
Nec smart enterprise_trends_2014-slidesNec smart enterprise_trends_2014-slides
Nec smart enterprise_trends_2014-slides
 
Top 10 Reasons for Colocation
Top 10 Reasons for ColocationTop 10 Reasons for Colocation
Top 10 Reasons for Colocation
 
2016 asl hitachi
2016 asl hitachi2016 asl hitachi
2016 asl hitachi
 
Value Journal - March 2021
Value Journal - March 2021Value Journal - March 2021
Value Journal - March 2021
 
Possibility Thinking about Cloud Computing
Possibility Thinking about Cloud ComputingPossibility Thinking about Cloud Computing
Possibility Thinking about Cloud Computing
 
Apresentação Portuguesa 2011 Ntt V1 Dez
Apresentação Portuguesa 2011 Ntt V1 DezApresentação Portuguesa 2011 Ntt V1 Dez
Apresentação Portuguesa 2011 Ntt V1 Dez
 
Experiencing the Live IIoT
Experiencing the Live IIoTExperiencing the Live IIoT
Experiencing the Live IIoT
 
Industrial IoT and OT/IT Convergence
Industrial IoT and OT/IT ConvergenceIndustrial IoT and OT/IT Convergence
Industrial IoT and OT/IT Convergence
 
Utilities Transformation: Improving the Time to Value of Technology
Utilities Transformation: Improving the Time to Value of TechnologyUtilities Transformation: Improving the Time to Value of Technology
Utilities Transformation: Improving the Time to Value of Technology
 
The world of Machine Learning, Deep Learning and PowerAI
The world of Machine Learning, Deep Learning and PowerAIThe world of Machine Learning, Deep Learning and PowerAI
The world of Machine Learning, Deep Learning and PowerAI
 
How to utilize cloud in your corporate IT strategy
How to utilize cloud in your corporate IT strategy How to utilize cloud in your corporate IT strategy
How to utilize cloud in your corporate IT strategy
 
Quelle stratégie pour EMC en 2015 ? Repensons l'IT
Quelle stratégie pour EMC en 2015 ? Repensons l'ITQuelle stratégie pour EMC en 2015 ? Repensons l'IT
Quelle stratégie pour EMC en 2015 ? Repensons l'IT
 
Riding the Cloud
Riding the Cloud Riding the Cloud
Riding the Cloud
 
Bladetec_Manifesto
Bladetec_ManifestoBladetec_Manifesto
Bladetec_Manifesto
 

Viewers also liked

Value Delivered - Predictive Analytics 2015 v3
Value Delivered - Predictive Analytics 2015 v3Value Delivered - Predictive Analytics 2015 v3
Value Delivered - Predictive Analytics 2015 v3
Roger Moore
 
Beautiful quotestoliveby
Beautiful quotestolivebyBeautiful quotestoliveby
Beautiful quotestoliveby
Chandan Dubey
 
Ppp burgernomics etc
Ppp burgernomics etcPpp burgernomics etc
Ppp burgernomics etc
Travis Klein
 
тестээр үнэлэх
тестээр үнэлэхтестээр үнэлэх
тестээр үнэлэх
pvsa_8990
 

Viewers also liked (20)

EMC's IT Transformation Journey ( EMC Forum 2014 )
EMC's IT Transformation Journey ( EMC Forum 2014 )EMC's IT Transformation Journey ( EMC Forum 2014 )
EMC's IT Transformation Journey ( EMC Forum 2014 )
 
Value Delivered - Predictive Analytics 2015 v3
Value Delivered - Predictive Analytics 2015 v3Value Delivered - Predictive Analytics 2015 v3
Value Delivered - Predictive Analytics 2015 v3
 
The Investee Club
The Investee ClubThe Investee Club
The Investee Club
 
Analytics: The widening divide
Analytics: The widening divideAnalytics: The widening divide
Analytics: The widening divide
 
Business Design Concepts
Business Design ConceptsBusiness Design Concepts
Business Design Concepts
 
Sales Talent Academy
Sales Talent AcademySales Talent Academy
Sales Talent Academy
 
Project Management Journey
Project Management JourneyProject Management Journey
Project Management Journey
 
WORLD-CLASS EXCELLENCE MODEL Congreso internacional PMI Bogotá 1a3 nov2012 h
WORLD-CLASS EXCELLENCE MODEL Congreso internacional PMI Bogotá 1a3 nov2012 hWORLD-CLASS EXCELLENCE MODEL Congreso internacional PMI Bogotá 1a3 nov2012 h
WORLD-CLASS EXCELLENCE MODEL Congreso internacional PMI Bogotá 1a3 nov2012 h
 
Introduction to Business Design - Rotman DesignWorks
Introduction to Business Design - Rotman DesignWorksIntroduction to Business Design - Rotman DesignWorks
Introduction to Business Design - Rotman DesignWorks
 
The big data value chain r1-31 oct13
The big data value chain r1-31 oct13The big data value chain r1-31 oct13
The big data value chain r1-31 oct13
 
Beautiful quotestoliveby
Beautiful quotestolivebyBeautiful quotestoliveby
Beautiful quotestoliveby
 
UP THERE, EVERYWHERE Partners
UP THERE, EVERYWHERE PartnersUP THERE, EVERYWHERE Partners
UP THERE, EVERYWHERE Partners
 
Math
MathMath
Math
 
Ppp burgernomics etc
Ppp burgernomics etcPpp burgernomics etc
Ppp burgernomics etc
 
тестээр үнэлэх
тестээр үнэлэхтестээр үнэлэх
тестээр үнэлэх
 
Be well happy
Be well happyBe well happy
Be well happy
 
Fenice display system
Fenice display systemFenice display system
Fenice display system
 
IT Financial Transparency: EMC’s Successful Journey to Achieving Enterprise C...
IT Financial Transparency: EMC’s Successful Journey to Achieving Enterprise C...IT Financial Transparency: EMC’s Successful Journey to Achieving Enterprise C...
IT Financial Transparency: EMC’s Successful Journey to Achieving Enterprise C...
 
HTTP 완벽가이드- 18 웹 호스팅
HTTP 완벽가이드- 18 웹 호스팅HTTP 완벽가이드- 18 웹 호스팅
HTTP 완벽가이드- 18 웹 호스팅
 
Atlassian Crowd
Atlassian CrowdAtlassian Crowd
Atlassian Crowd
 

Similar to Pivotal data science_data_engineering_secret_weapons_of_the_strategic_enterprise

ML_CORP_DECK_Partners
ML_CORP_DECK_PartnersML_CORP_DECK_Partners
ML_CORP_DECK_Partners
Lloyd SOLDATT
 
Bilytica - Corporate Introduction - Jan 2015
Bilytica - Corporate Introduction - Jan 2015Bilytica - Corporate Introduction - Jan 2015
Bilytica - Corporate Introduction - Jan 2015
Hannah Naser
 
GTL Corporate Presentation
GTL Corporate PresentationGTL Corporate Presentation
GTL Corporate Presentation
sourav1981
 
Gateway Corporate Presentation
Gateway Corporate PresentationGateway Corporate Presentation
Gateway Corporate Presentation
Ravi Krishna
 
Imaginea Overview
Imaginea OverviewImaginea Overview
Imaginea Overview
Jimit Shah
 

Similar to Pivotal data science_data_engineering_secret_weapons_of_the_strategic_enterprise (20)

CFO ERP Considerations: Cloud, On-Premise, and Beyond - Emtec, Inc.
CFO ERP Considerations: Cloud, On-Premise, and Beyond - Emtec, Inc.CFO ERP Considerations: Cloud, On-Premise, and Beyond - Emtec, Inc.
CFO ERP Considerations: Cloud, On-Premise, and Beyond - Emtec, Inc.
 
DSS Company Profile English Aug 2014
DSS Company Profile English Aug 2014DSS Company Profile English Aug 2014
DSS Company Profile English Aug 2014
 
The Cloud Foundry Story
The Cloud Foundry StoryThe Cloud Foundry Story
The Cloud Foundry Story
 
Have your cake and eat it too: adopting technologies without sacrificing - Pa...
Have your cake and eat it too: adopting technologies without sacrificing - Pa...Have your cake and eat it too: adopting technologies without sacrificing - Pa...
Have your cake and eat it too: adopting technologies without sacrificing - Pa...
 
ML_CORP_DECK_Partners
ML_CORP_DECK_PartnersML_CORP_DECK_Partners
ML_CORP_DECK_Partners
 
Bilytica - Corporate Introduction - Jan 2015
Bilytica - Corporate Introduction - Jan 2015Bilytica - Corporate Introduction - Jan 2015
Bilytica - Corporate Introduction - Jan 2015
 
Iasa Architect responsibilities in the cloud
Iasa Architect responsibilities in the cloudIasa Architect responsibilities in the cloud
Iasa Architect responsibilities in the cloud
 
Pure App + Patterns + Prolifics = Feeding Change
Pure App + Patterns + Prolifics = Feeding Change Pure App + Patterns + Prolifics = Feeding Change
Pure App + Patterns + Prolifics = Feeding Change
 
Best Practices for Managing IaaS, PaaS, and Container-Based Deployments - App...
Best Practices for Managing IaaS, PaaS, and Container-Based Deployments - App...Best Practices for Managing IaaS, PaaS, and Container-Based Deployments - App...
Best Practices for Managing IaaS, PaaS, and Container-Based Deployments - App...
 
CEPTES - Your Trusted Salesforce Partner
CEPTES - Your Trusted Salesforce Partner CEPTES - Your Trusted Salesforce Partner
CEPTES - Your Trusted Salesforce Partner
 
Gateway Technolabs Corporate Presentation
Gateway Technolabs Corporate PresentationGateway Technolabs Corporate Presentation
Gateway Technolabs Corporate Presentation
 
GTL Corporate Presentation
GTL Corporate PresentationGTL Corporate Presentation
GTL Corporate Presentation
 
Implementing PeopleSoft 9.2 During the Age of the Cloud
Implementing PeopleSoft 9.2 During the Age of the CloudImplementing PeopleSoft 9.2 During the Age of the Cloud
Implementing PeopleSoft 9.2 During the Age of the Cloud
 
Gateway Corporate Presentation
Gateway Corporate PresentationGateway Corporate Presentation
Gateway Corporate Presentation
 
Herding Cats in the Digital World
Herding Cats in the Digital WorldHerding Cats in the Digital World
Herding Cats in the Digital World
 
Delivering Enterprise Applications: Faster. Cheaper. Better
Delivering Enterprise Applications: Faster. Cheaper. BetterDelivering Enterprise Applications: Faster. Cheaper. Better
Delivering Enterprise Applications: Faster. Cheaper. Better
 
IPS Corporate Presentation
IPS Corporate PresentationIPS Corporate Presentation
IPS Corporate Presentation
 
CSC - Presentation at Hortonworks Booth - Strata 2014
CSC - Presentation at Hortonworks Booth - Strata 2014CSC - Presentation at Hortonworks Booth - Strata 2014
CSC - Presentation at Hortonworks Booth - Strata 2014
 
Best Software Development Company |Salesforce Consulting Services in Singapor...
Best Software Development Company |Salesforce Consulting Services in Singapor...Best Software Development Company |Salesforce Consulting Services in Singapor...
Best Software Development Company |Salesforce Consulting Services in Singapor...
 
Imaginea Overview
Imaginea OverviewImaginea Overview
Imaginea Overview
 

More from EMC

Modern infrastructure for business data lake
Modern infrastructure for business data lakeModern infrastructure for business data lake
Modern infrastructure for business data lake
EMC
 
Virtualization Myths Infographic
Virtualization Myths Infographic Virtualization Myths Infographic
Virtualization Myths Infographic
EMC
 
Data Science and Big Data Analytics Book from EMC Education Services
Data Science and Big Data Analytics Book from EMC Education ServicesData Science and Big Data Analytics Book from EMC Education Services
Data Science and Big Data Analytics Book from EMC Education Services
EMC
 

More from EMC (20)

INDUSTRY-LEADING TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD
INDUSTRY-LEADING  TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUDINDUSTRY-LEADING  TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD
INDUSTRY-LEADING TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD
 
Cloud Foundry Summit Berlin Keynote
Cloud Foundry Summit Berlin Keynote Cloud Foundry Summit Berlin Keynote
Cloud Foundry Summit Berlin Keynote
 
EMC GLOBAL DATA PROTECTION INDEX
EMC GLOBAL DATA PROTECTION INDEX EMC GLOBAL DATA PROTECTION INDEX
EMC GLOBAL DATA PROTECTION INDEX
 
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIO
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIOTransforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIO
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIO
 
Citrix ready-webinar-xtremio
Citrix ready-webinar-xtremioCitrix ready-webinar-xtremio
Citrix ready-webinar-xtremio
 
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES
 
EMC with Mirantis Openstack
EMC with Mirantis OpenstackEMC with Mirantis Openstack
EMC with Mirantis Openstack
 
Modern infrastructure for business data lake
Modern infrastructure for business data lakeModern infrastructure for business data lake
Modern infrastructure for business data lake
 
Force Cyber Criminals to Shop Elsewhere
Force Cyber Criminals to Shop ElsewhereForce Cyber Criminals to Shop Elsewhere
Force Cyber Criminals to Shop Elsewhere
 
Pivotal : Moments in Container History
Pivotal : Moments in Container History Pivotal : Moments in Container History
Pivotal : Moments in Container History
 
Data Lake Protection - A Technical Review
Data Lake Protection - A Technical ReviewData Lake Protection - A Technical Review
Data Lake Protection - A Technical Review
 
Mobile E-commerce: Friend or Foe
Mobile E-commerce: Friend or FoeMobile E-commerce: Friend or Foe
Mobile E-commerce: Friend or Foe
 
Virtualization Myths Infographic
Virtualization Myths Infographic Virtualization Myths Infographic
Virtualization Myths Infographic
 
Intelligence-Driven GRC for Security
Intelligence-Driven GRC for SecurityIntelligence-Driven GRC for Security
Intelligence-Driven GRC for Security
 
The Trust Paradox: Access Management and Trust in an Insecure Age
The Trust Paradox: Access Management and Trust in an Insecure AgeThe Trust Paradox: Access Management and Trust in an Insecure Age
The Trust Paradox: Access Management and Trust in an Insecure Age
 
EMC Technology Day - SRM University 2015
EMC Technology Day - SRM University 2015EMC Technology Day - SRM University 2015
EMC Technology Day - SRM University 2015
 
EMC Academic Summit 2015
EMC Academic Summit 2015EMC Academic Summit 2015
EMC Academic Summit 2015
 
Data Science and Big Data Analytics Book from EMC Education Services
Data Science and Big Data Analytics Book from EMC Education ServicesData Science and Big Data Analytics Book from EMC Education Services
Data Science and Big Data Analytics Book from EMC Education Services
 
Using EMC Symmetrix Storage in VMware vSphere Environments
Using EMC Symmetrix Storage in VMware vSphere EnvironmentsUsing EMC Symmetrix Storage in VMware vSphere Environments
Using EMC Symmetrix Storage in VMware vSphere Environments
 
Using EMC VNX storage with VMware vSphereTechBook
Using EMC VNX storage with VMware vSphereTechBookUsing EMC VNX storage with VMware vSphereTechBook
Using EMC VNX storage with VMware vSphereTechBook
 

Recently uploaded

Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Peter Udo Diehl
 

Recently uploaded (20)

Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo DiehlFuture Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
Future Visions: Predictions to Guide and Time Tech Innovation, Peter Udo Diehl
 
Powerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara LaskowskaPowerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara Laskowska
 
Designing for Hardware Accessibility at Comcast
Designing for Hardware Accessibility at ComcastDesigning for Hardware Accessibility at Comcast
Designing for Hardware Accessibility at Comcast
 
AI presentation and introduction - Retrieval Augmented Generation RAG 101
AI presentation and introduction - Retrieval Augmented Generation RAG 101AI presentation and introduction - Retrieval Augmented Generation RAG 101
AI presentation and introduction - Retrieval Augmented Generation RAG 101
 
Strategic AI Integration in Engineering Teams
Strategic AI Integration in Engineering TeamsStrategic AI Integration in Engineering Teams
Strategic AI Integration in Engineering Teams
 
Syngulon - Selection technology May 2024.pdf
Syngulon - Selection technology May 2024.pdfSyngulon - Selection technology May 2024.pdf
Syngulon - Selection technology May 2024.pdf
 
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdfWhere to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
 
Salesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
Salesforce Adoption – Metrics, Methods, and Motivation, Antone KomSalesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
Salesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
 
Introduction to Open Source RAG and RAG Evaluation
Introduction to Open Source RAG and RAG EvaluationIntroduction to Open Source RAG and RAG Evaluation
Introduction to Open Source RAG and RAG Evaluation
 
IESVE for Early Stage Design and Planning
IESVE for Early Stage Design and PlanningIESVE for Early Stage Design and Planning
IESVE for Early Stage Design and Planning
 
Agentic RAG What it is its types applications and implementation.pdf
Agentic RAG What it is its types applications and implementation.pdfAgentic RAG What it is its types applications and implementation.pdf
Agentic RAG What it is its types applications and implementation.pdf
 
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptxIOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
 
Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...
Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...
Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...
 
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
Choosing the Right FDO Deployment Model for Your Application _ Geoffrey at In...
 
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
SOQL 201 for Admins & Developers: Slice & Dice Your Org’s Data With Aggregate...
 
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya HalderCustom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
Custom Approval Process: A New Perspective, Pavel Hrbacek & Anindya Halder
 
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptxUnpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
Unpacking Value Delivery - Agile Oxford Meetup - May 2024.pptx
 
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
Measures in SQL (a talk at SF Distributed Systems meetup, 2024-05-22)
 
Optimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through ObservabilityOptimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through Observability
 
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
ASRock Industrial FDO Solutions in Action for Industrial Edge AI _ Kenny at A...
 

Pivotal data science_data_engineering_secret_weapons_of_the_strategic_enterprise

  • 1. 1© Copyright 2014 EMC Corporation. All rights reserved. Data Science + Data Engineering Annika Jimenez Secret Weapon of the Strategic Enterprise
  • 2. 2© Copyright 2014 EMC Corporation. All rights reserved. Agenda  Data Science: What is it and why do we do it?  The Importance of Data Engineering  An Example: Kaiser Code-a-thon  Transforming Your Enterprise with Pivotal  Pivotal Data Labs: Data Engineering + Data Science  Closing Advice
  • 3. 3© Copyright 2014 EMC Corporation. All rights reserved. What Matters: Apps. Data. Analytics. Apps power business, and those apps generate data Analytic insights from that data drive new app functionality, which in-turn drives new data The faster you can move around the cycle, the faster you learn, innovate and pull away from the competition
  • 4. 4© Copyright 2014 EMC Corporation. All rights reserved. Primary Motions for Pivotal Agile: data-driven apps and rapid time to value Data Lake: store everything, analyze anything Enterprise PaaS: revolutionary software development and speed; build the right thing
  • 5. 5© Copyright 2014 EMC Corporation. All rights reserved. DATA SCIENCE The use of statistical and machine learning techniques on big, multi-structured data – in a distributed computing environment – to identify correlations and causal relationships, classify and predict events, identify patterns and anomalies, and infer probabilities, interest, and sentiment.
  • 6. 6© Copyright 2014 EMC Corporation. All rights reserved. But, why do we use Data Science?
  • 7. 7© Copyright 2014 EMC Corporation. All rights reserved.  BI – show dashboard
  • 8. 8© Copyright 2014 EMC Corporation. All rights reserved. Is the Goal Any of These Things? A. Cool Visualizations B. Custom Querying C. Decision Enablement D. Insights E. All of the above… NO
  • 9. 9© Copyright 2014 EMC Corporation. All rights reserved. DRIVE AUTOMATED LOW LATENCY ACTIONS IN RESPONSE TO EVENTS OF INTEREST
  • 10. 10© Copyright 2014 EMC Corporation. All rights reserved. YOUR DATA DATA SCIENCE += MODELS
  • 11. 11© Copyright 2014 EMC Corporation. All rights reserved. Drive Automated Low Latency Actions Production Data Feeds Low Latency Model Scoring API Availability or Push to Apps Business Logic Applicatio n Response New Events (aka, Data)Model Operationalization (“O16N”)
  • 12. 12© Copyright 2014 EMC Corporation. All rights reserved. Data Science Value Chain Instrumen- tation Logs Capture Store Transform & Prepare Access Model Dev. Deploy Apps Process Change Product Engineer Data Engineer DBA Data Engineer Data Engineer Data Scientist Data Engineer Application Developer PMO
  • 13. 13© Copyright 2014 EMC Corporation. All rights reserved. → Kaiser Blog for Full Story
  • 14. 14© Copyright 2014 EMC Corporation. All rights reserved. Code-a-Thon Details – Logistics  24-Hour Data Science Code-a-Thon  5 resources per vendor  Vendors were asked to be prepared for any use in the domain  A 15-minute presentation to senior leaders, executives, doctors and pharmacists  Teams were required to use Tableau in their presentation
  • 15. 15© Copyright 2014 EMC Corporation. All rights reserved. Code-a-Thon – Pivotal Team Hulya Emir-Farinas Data Scientist Noah Zimmerman Data Scientist Jacque Istok Application Developer Dillon Woods Application Developer Randy Williard Big Data Engineer Jemish Patel Big Data Engineer Adam Shook Big Data Engineer Roy Mims Coordinator
  • 16. 16© Copyright 2014 EMC Corporation. All rights reserved. The Day of the Code-a-Thon…
  • 17. 17© Copyright 2014 EMC Corporation. All rights reserved. Key Insight
  • 18. 18© Copyright 2014 EMC Corporation. All rights reserved. Asthma Population Management Application
  • 19. 19© Copyright 2014 EMC Corporation. All rights reserved. Asthma Management Application
  • 20. 20© Copyright 2014 EMC Corporation. All rights reserved. What Did We Learn in 2013?  Pivotal has a world-class Data Science team, the best there is  Data Science alone is good, but Data Science + Expert Data Engineering and Architecture is great  Corollary: Data Science + Data Engineering + Apps trumps everything – This is the path to rapid value creation and ROI
  • 21. 21© Copyright 2014 EMC Corporation. All rights reserved. DATA SCIENCE DATA ENGINEERING PIVOTAL LABS Data Science + Data Engineering + Pivotal Labs = The Magic in the Middle
  • 22. 22© Copyright 2014 EMC Corporation. All rights reserved. What Is Pivotal Data Labs? Data Science Data Engineering +
  • 23. 23© Copyright 2014 EMC Corporation. All rights reserved. Pivotal Data Scientists are technical professionals with strong programming skills, anchored in vertical/horizontal domains or in specialized academic research, able to identify real-world problems requiring predictive analytics, formulate these mathematically, and solve them by applying machine learning and statistical algorithms, on Big Data, in Pivotal and third-party technologies. Pivotal Data Engineers are Big Data experts and industry veterans with a passion to leverage these skills to drive business value for Pivotal customers. They posses expert knowledge and skills with the Pivotal data products and excel at architecting enterprise scale solutions to the most demanding data and analytic challenges. Data Science Data Engineering + What Is Pivotal Data Labs?
  • 24. 24© Copyright 2014 EMC Corporation. All rights reserved. What is a “Data Scientist”? ProgrammingSkills Mathematical/Statistical Skills
  • 25. 25© Copyright 2014 EMC Corporation. All rights reserved. What is a “Data Engineer”? ProgrammingSkills Architectural Skills
  • 26. 26© Copyright 2014 EMC Corporation. All rights reserved. World’s Leading Experts Pivotal Labs – Pivotal Data Labs BATCH BATCH NEAR TIME NEAR TIMEHAWQGreenplum DB Pivotal HD REAL TIME REAL TIMEGemFire XDGemFire
  • 27. 27© Copyright 2014 EMC Corporation. All rights reserved. Pivotal One SOLUTIONS Pivotal One SERVICES S PivotalOne PIVOTAL MySQL Elastic Runtime Services: Java, Spring, Ruby, Node.JS Value-adds: Installation, Management & Monitoring (Core OSS) • Data Lake Solutions (Security Analytics, Corp Comm Analytics, Business) • RTI for Telco (RTI4T) PIVOTAL GemFire XD PIVOTAL Data Dispatch Coming in 2014 Pivotal HD Hadoop+Que ry GPDB MPP Analytics GemFire In-Memory Grid Spring App Framework RabbitMQ , Redis… Pivotal One MARKETPLACE Pivotal Data Labs in Data Fabric Building Towards Pivotal One
  • 28. 28© Copyright 2014 EMC Corporation. All rights reserved. Introducing Pivotal Data Labs Our Charter: Pivotal Data Labs is Pivotal’s differentiated and highly opinionated data-centric service delivery organization. Our Goals: Expedite customer time-to-value and ROI, by driving business-aligned innovation and solutions assurance within Pivotal’s Data Fabric technologies. Drive customer adoption and autonomy across the full spectrum of Pivotal Data technologies through best-in-class data science and data engineering services, with a deep emphasis on knowledge transfer.
  • 29. 29© Copyright 2014 EMC Corporation. All rights reserved. Highly-Opinionated & Differentiated  “Highly-Opinionated” – Highly prescriptive in our counsel of data best practices to customers and partners, drawing from best-in-class talent and deep experience operating on the Pivotal Data Fabric  “Differentiated” – An expert Data services business that is unique in its class and unlike the Data services available elsewhere
  • 30. 30© Copyright 2014 EMC Corporation. All rights reserved. What Will PDL Deliver For Customers? Accelerated time-to-value and real ROI for customers
  • 31. 31© Copyright 2014 EMC Corporation. All rights reserved. How Do We Do This?  Best-in-class Data Science to drive value creation on Pivotal stack and customer data  Best-in-class Data Engineering to drive pragmatic, well- designed, customized architecture for end-to-end Pivotal stack  Assured solutions success in Pivotal data service delivery  Operationally-optimized predictive models  Collaboration with Pivotal Labs to deliver data-driven applications
  • 32. 32© Copyright 2014 EMC Corporation. All rights reserved. INSTALL VERIFY ENABLE We will verify the installation making sure it’s fully operational in your environment and ready to give you the Pivotal advantage. Our experts work with your staff to plan, install and fully- configure the Pivotal software based on your environment and requirements. Lastly, we’ll train your people and conduct knowledge transfer to make sure you are comfortable using and supporting Pivotal software. Getting Started with Pivotal Software Engagement Management – site prep, project management, customer support overview Hardware Validation / Installation Software Installation / Verification Training & Knowledge Transfer PIVOTAL ONBOARDIN G SERVICES
  • 33. 33© Copyright 2014 EMC Corporation. All rights reserved. PIVOTAL SOLUTION ASSURANCE INCEPTION ADVISE SUPPORT Leveraging expert-services in Pivotal Data Labs, Pivotal Labs, and Certified Pivotal Partners we’ll work with your architects and developers to assure that your system design and development is aligned with Pivotal best practices. Getting off the ground with a well-formed plan, a solid team and realistic expectations are fundamental to overall success. Our experts help with design, guidance, oversight and lessons from other customers to get your initiative going in the best direction from the start. . We’ll act as your conduit to Certified Professional Services, Customer Support and Pivotal R&D to make you successful, quickly determine answers and bring in specialized expertise where needed. Leverage Pivotal Data Experts for Success Engagement / Success Alignment – Regular meetings for status and guidance, Resource advice Architecture Design Implementation Design and Assistance Resource Assistance, Expedited Response
  • 34. 34© Copyright 2014 EMC Corporation. All rights reserved. DISCOVERY INSIGHTS RESULTS Once the data is understood, we set ourselves apart by making optimal use of Pivotal’s Data Fabric, our analytical tools and our data science experts to build models creating deep actionable insights on key events of interest in any use case. Getting the most from your data requires understanding what you have today and discovering what your data can do for you. Combining our data scientists with your data starts the path to value creation. Driving insights into actionable results is enabled through data scoring and model code optimization, documentation and knowledge transfer. Value Creation With Predictive Insights Engagement / Business / Technical Alignment – Regular meetings for status and validation Data exploration, readiness assessment, and prep Combining domain knowledge and data science for predictive modeling Code documentation, and knowledge sharing DATA SCIENCE LABS
  • 35. 35© Copyright 2014 EMC Corporation. All rights reserved. DATA LAB INCEPTION IMPLEMENT EXCEL Delivered by a dedicated team drawing from Pivotal Data Labs’ experts in architecture, data engineering, data science, and application development, implementations of Pivotal technologies will be targeted to maximize value creation against your specific technical and business goals. Getting off the ground with a well-formed holistic architectural, data science, and application plan is fundamental to overall success. Our experts will drive this process leveraging deep experience in these areas, to streamline the path to success for your initiative. Years of experience building successful data projects give us the know-how to quickly and efficiently work through the challenging phases of any project including design, scaling, integration and production readiness. Engagement / Business / Technical Alignment – Regular meetings for status and guidance Data platform architecting and strategic analytiic use- case roadmaps Data and Application Fabric deployments, Data Science modeling & App development Knowledge sharing, training and expert assistance Deep Partnering to Maximize Value Creation
  • 36. 36© Copyright 2014 EMC Corporation. All rights reserved. Pivotal Data Labs + Pivotal Labs = The Magic in the Middle RAPID VALUE! PIVOTAL LABS * PIVOTAL DATA LABS
  • 37. 37© Copyright 2014 EMC Corporation. All rights reserved. My Advice to Enterprises 1. Know your data and its potential value 2. Get “vision” and question status quo 3. Understand the technical paradigm shift underway 4. Hire or grow your Data Dream Team: Data Scientists and Data Engineers 5. Clear the path to operationalization (aka, value) 6. Manage the disruption, don’t reject it