More Related Content More from Matt Stubbs (20) Big Data LDN 2018: DATAOPS: BRINGING DEVOPS TO DATA INTEGRATION1. 1© StreamSets, Inc. All rights reserved.
Bringing DevOps to Data Integration
Girish Pancha | CEO
2. 2© StreamSets, Inc. All rights reserved. 2© StreamSets, Inc. All rights reserved.
data drift (noun) — unexpected,
unannounced and unending
changes to data structure,
infrastructure, and semantics.
Data Drift is the
silent killer of
Data Analytics
3. 3© StreamSets, Inc. All rights reserved. 3© StreamSets, Inc. All rights reserved.
Data Analytics is being transformed…
REPORTS
DATA MINING
ETL
DATA
INTEGRATION
ON-PREM,
EDW
4. 4© StreamSets, Inc. All rights reserved. 4© StreamSets, Inc. All rights reserved.
…by AI, self-service, edge-to-cloud, need for speed
MACHINE
LEARNING
STREAM
COMPUTE
DATA PREP
CLOUD,
DATA LAKE
DATA MINING
ETL
DATA
INTEGRATION
ON-PREM,
EDW
1010101010
1010101010
REPORTS APPLICATIONSAND
5. 5© StreamSets, Inc. All rights reserved. 5© StreamSets, Inc. All rights reserved.
DATA OPS
This digital transformation requires DataOps
REPORTS APPLICATIONSAND
MACHINE
LEARNING
STREAM
COMPUTE
DATA PREP
CLOUD,
DATA LAKE
DATA MINING
ETL
DATA
INTEGRATION
ON-PREM,
EDW
1010101010
1010101010
6. 6© StreamSets, Inc. All rights reserved. 6© StreamSets, Inc. All rights reserved.
OLD WAY
NEW WAY
FIXED PLATFORM
APPLICATION
DOWNTIME
AD-HOC DATA
PIPELINES
OPERATIONAL
BLINDNESS
ARCHITECTURAL
AGILITY
APPLICATION
EFFICIENCY
ENGINEERING
PRODUCTIVITY
BUSINESS
CONFIDENCE
7. 7© StreamSets, Inc. All rights reserved. 7© StreamSets, Inc. All rights reserved.
AUTOMATION
DataOps applies DevOps practices to data analytics
MONITORINGDRIFT AWARENESS
8. 8© StreamSets, Inc. All rights reserved. 8© StreamSets, Inc. All rights reserved.
DATAOPS
MATURITY
MODEL
TRADITIONAL ANALYTICS
Fixed Platform
Application Downtime
Ad-hoc Data Pipelines
Operational Blindness
00
NOT
AW
ARE
9. 9© StreamSets, Inc. All rights reserved. 9© StreamSets, Inc. All rights reserved.
TRADITIONAL ANALYTICS
Fixed Platform
Application Downtime
Ad-hoc Data Pipelines
Operational Blindness
01
MODERN ANALYTICS
Integration Automation
System MonitoringSYSTEM
AW
ARE
00
NOT
AW
ARE
DATAOPS
MATURITY
MODEL
10. 10© StreamSets, Inc. All rights reserved. 10© StreamSets, Inc. All rights reserved.
TRADITIONAL ANALYTICS
Fixed Platform
Application Downtime
Ad-hoc Data Pipelines
Operational Blindness
DATAOPS
MATURITY
MODEL
02
APPLICATION
AW
ARE
MODERN ANALYTICS
Application Automation
Data Monitoring
01
SYSTEM
AW
ARE
MODERN ANALYTICS
Integration Automation
System Monitoring
00
NOT
AW
ARE
11. 11© StreamSets, Inc. All rights reserved. 11© StreamSets, Inc. All rights reserved.
03
BUSINESSAW
ARE
MODERN ANALYTICS
Entity-Centric Automation
Decision Monitoring
DATAOPS
MATURITY
MODEL
TRADITIONAL ANALYTICS
Fixed Platform
Application Downtime
Ad-hoc Data Pipelines
Operational Blindness
02
APPLICATION
AW
ARE
MODERN ANALYTICS
Application Automation
Data Monitoring
01
SYSTEM
AW
ARE
MODERN ANALYTICS
Integration Automation
System Monitoring
00
NOT
AW
ARE
12. 12© StreamSets, Inc. All rights reserved. 12© StreamSets, Inc. All rights reserved.
StreamSets is where DevOps meets Data Integration
MACHINE
LEARNING
STREAM
COMPUTE
DATA PREP
CLOUD,
DATA LAKE
DATA MINING
ETL
DATA
INTEGRATION
ON-PREM,
EDW
1010101010
1010101010
REPORTS APPLICATIONSAND
13. 13© StreamSets, Inc. All rights reserved. 13© StreamSets, Inc. All rights reserved.
03
BUSINESSAW
ARE
DECOUPLED EVERYTHING, INTRUMENTED EVERYWHERE
DATAOPS
MATURITY
STARTS WITH
SMART PIPELINES
TRADITIONAL ANALYTICS
Fixed Platform
Application Downtime
Ad-hoc Data Pipelines
Operational Blindness
02
APPLICATION
AW
ARE
01
SYSTEM
AW
ARE
00
NOT
AW
ARE
MODERN ANALYTICS
Architectural Agility
MODERN ANALYTICS
Engineering Productivity
Application Efficiency
MODERN ANALYTICS
Business Confidence
14. 14© StreamSets, Inc. All rights reserved. 14© StreamSets, Inc. All rights reserved.
DataOps in the Wild
SNOWFLAKE, DATABRICKS
AMAZON, AZURE
CDC DATA
REAL-TIME OPERATIONS
REDUCE EDW SPEND
IMPROVE CLAIMS FRAUD
DETECTION
FINSERV
SPLUNK, HADOOP
ON PREMISE, GOOGLE
LOG SHIPPING, EDGE DATA
INSIDER THREAT DETECTION
SECURE NETWORKS
MINIMIZE RISK AND
PROTECT BRAND
TELCO
CLOUDERA, KAFKA
ON PREMISE
R&D, CLINICAL DATA
GENOMICS & PROTEOMICS
DEMOCRATIZE DATA
REDUCE TIME TO DRUG
DISCOVERY AND APPROVAL
PHARMA
15. 15© StreamSets, Inc. All rights reserved. 15© StreamSets, Inc. All rights reserved.
Where does your DataOps journey
begin?
MACHINE LEARNING
AI & DATA SCIENCE
CLOUD DATA
WAREHOUSING
STREAMING
DATA
APPLICATIONS
© StreamSets, Inc. All rights reserved.
Data Analytics is being transformed
by AI, self-service, edge-to-cloud, need for speed
16. 16© StreamSets, Inc. All rights reserved.
Thank you
16© StreamSets, Inc. All rights reserved.
Girish Pancha
e: girish@streamsets.com
t: @girishpancha