SlideShare a Scribd company logo
1
Learn More About Unravel
info@unraveldata.com
2
July 16, 2019
George Demarest
Senior Director of Product Marketing
Understanding DataOps
and its Impact on
Application Quality
3
Data Applications: The Perfect Storm of Human Error
Swiss Cheese Model of Human Error
Latent failures
Organizational
Challenges
Latent failures
Immature Supervision
Preconditions for
unsafe acts
Latent failures
Unsafe acts
Active failures
Incident!
4
Data Applications: The Perfect Storm of Human Error
Swiss Cheese Model of Human Error in Data Applications
Latent failures
Organizational
Challenges
Latent failures
Immature Supervision
Preconditions for
unsafe acts
Latent failures
Unsafe acts
Active failures
OOM
Incident!
“We don’t need a CDO/CAO”
“Developers will allocate their own cluster resources”
“Big Data is experimental. Data team: govern thyself”
“I need max memory for all of my jobs”
5
DataOps is a Thing
6
DataOps is a Thing
https://www.dataopsmanifesto.org/
7
Gartner Definition of DataOps
DataOps is a collaborative data management practice focused on
improving the communication, integration and automation of data flows
between data managers and consumers across an organization.
The goal of DataOps is to create predictable delivery and change
management of data, data models and related artifacts.
DataOps uses technology to automate data delivery with the appropriate
levels of security, quality and metadata to improve the use and value of
data in a dynamic environment.
8
Are We Losing the Battle of Complexity?
Multiple layers of
complexity make data
applications difficult to
tune, troubleshoot,
operationalize, and scale.
Only intelligent automation
can win this fight.
9
Managing distributed apps is hard and expensive
Hard to identify root cause Large “attack surface”
10
Without AI, DataOps is a manual, logistical challenge
One complete correlated view
with intelligent automation.
Multiple tools, no complete
view, no intelligence.
DataOps Without AI AI-Powered DataOps
11
Unravel is AI Powered Automation for DataOps
Without Unravel With Unravel
• One full-stack, cross platform console
• One complete correlated view
• Automatic, lightweight data/log collection
• Built-in AI/ML powered recommendations,
insights, remediation
• Chargeback/showback reporting
• Multiple tools
• Manual data/log collection
• Fragmented view, multiple consoles
• Minimal intelligence; manual tuning
and troubleshooting
• Manual cost analysis
12
Where is the value for DataOps created?
Business Value Accelerate business decisions though timely data driven insights
Performance Guarantee modern data application SLAs
Throughput Optimize cluster performance and job completion times
Quality Minimize failed jobs
Efficiency Minimize big data cluster and resource contention
Productivity Autonomous remediation scales Ops teams
13
Essential Elements of an AI-Powered DataOps
• Data Collection and Correlation
- Observe and collect all relevant data
- Correlate collected data and derived metadata
• Operational Data Model
- Monitoring, troubleshooting, tuning, and managing
requires an operational data model
- Richer, more powerful than a CMDB
• Analytics
- Basic and advanced statistical analysis – correlate,
classify, extrapolate from operational metadata
- Predictive analytics and forecasting for capacity and
growth
- Pattern and anomaly detection, root-cause analysis
- Prescriptive analytics and recommendations
- Context, topology and coded expertise
• Automation
- Auto-tuning of applications
- Autonomous resource allocation and optimization
- Cluster load balancing and job scheduling
- Automatic response to alerts and recovery from failures
14
Our Solution – Extensible Data Operations Platform
15
Automated DataOps Use Cases for Unravel
Automated Cloud Cost Management
• Optimize cost by right-sizing cloud images
• Optimize cost by choosing the optimal
price plan
Automated Workload Management
• Eliminate CPU, Memory, Network I/O and
Disk I/O contention
• Correctly size VM’s and Cloud Images
• Place VM’s in the best Hosts and Clusters
Automated Root Cause Analysis
• Intelligent analysis of application
failures
• Use Unravel data model and learned
app behaviors to automate RCA
Automated Performance Optimization
• Automatically learn the performance
characteristics apps and supporting stack
• Automatically optimize for a chosen KPI
(performance, efficiency)
16
Example: Root Cause Analysis of App Failures
16
Challenge
• Many levels of correlated stack traces
• Identifying the root cause is hard and time consuming
17
Resolution
• Reduce troubleshooting time from days to seconds
• Improve productivity of data scientists and analysts
Automated Root Cause Analysis of Failures
18
Error
Template
Extraction
Feature
vectors
Learning
Algorithm
for
Predictive
Model
Container
Logs
Predictive
Model
Root causes
Automated Root Cause Analysis of Failures
19
Unravel insights and
recommendations for
spark application tuning
20
Unravel is AI Powered Automation for DataOps
Without Unravel With Unravel
• One full-stack, cross platform console
• One complete correlated view
• Automatic, lightweight data/log collection
• Built-in AI/ML powered recommendations,
insights, remediation
• Chargeback/showback reporting
• Multiple tools
• Manual data/log collection
• Fragmented view, multiple consoles
• Minimal intelligence; manual tuning
and troubleshooting
• Manual cost analysis
21
Unravel – What makes us different
FULL-STACK
COVERAGE
Only Unravel works across your
entire ecosystem to demystify and
simplify operations.
AI-DRIVEN
RECOMMENDATIONS
Unravel does more than monitor – it
shows you how to make things
better.
AUTOMATED TUNING AND
REMEDIATION
Unravel operationalizes big data by
automating it.
FULLY-EXTENSIBLE
FOR CLOUD ADOPTION
Only Unravel works future-proofs
your cloud adoption choices
22
Learn More About Unravel
info@unraveldata.com

More Related Content

What's hot

Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to Mesh
Jeffrey T. Pollock
 
Designing An Enterprise Data Fabric
Designing An Enterprise Data FabricDesigning An Enterprise Data Fabric
Designing An Enterprise Data Fabric
Alan McSweeney
 

What's hot (20)

TechEvent Databricks on Azure
TechEvent Databricks on AzureTechEvent Databricks on Azure
TechEvent Databricks on Azure
 
DW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptxDW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptx
 
Introducing Databricks Delta
Introducing Databricks DeltaIntroducing Databricks Delta
Introducing Databricks Delta
 
Data Architecture Strategies: Data Architecture for Digital Transformation
Data Architecture Strategies: Data Architecture for Digital TransformationData Architecture Strategies: Data Architecture for Digital Transformation
Data Architecture Strategies: Data Architecture for Digital Transformation
 
Data Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to MeshData Mesh Part 4 Monolith to Mesh
Data Mesh Part 4 Monolith to Mesh
 
Architect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureArchitect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh Architecture
 
Designing An Enterprise Data Fabric
Designing An Enterprise Data FabricDesigning An Enterprise Data Fabric
Designing An Enterprise Data Fabric
 
DataOps - The Foundation for Your Agile Data Architecture
DataOps - The Foundation for Your Agile Data ArchitectureDataOps - The Foundation for Your Agile Data Architecture
DataOps - The Foundation for Your Agile Data Architecture
 
Azure data platform overview
Azure data platform overviewAzure data platform overview
Azure data platform overview
 
Lakehouse in Azure
Lakehouse in AzureLakehouse in Azure
Lakehouse in Azure
 
Data Architecture Best Practices for Advanced Analytics
Data Architecture Best Practices for Advanced AnalyticsData Architecture Best Practices for Advanced Analytics
Data Architecture Best Practices for Advanced Analytics
 
Data Mesh using Microsoft Fabric
Data Mesh using Microsoft FabricData Mesh using Microsoft Fabric
Data Mesh using Microsoft Fabric
 
Introdution to Dataops and AIOps (or MLOps)
Introdution to Dataops and AIOps (or MLOps)Introdution to Dataops and AIOps (or MLOps)
Introdution to Dataops and AIOps (or MLOps)
 
Owning Your Own (Data) Lake House
Owning Your Own (Data) Lake HouseOwning Your Own (Data) Lake House
Owning Your Own (Data) Lake House
 
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)
 
DevOps + DataOps = Digital Transformation
DevOps + DataOps = Digital Transformation DevOps + DataOps = Digital Transformation
DevOps + DataOps = Digital Transformation
 
Data Mesh
Data MeshData Mesh
Data Mesh
 
Databricks on AWS.pptx
Databricks on AWS.pptxDatabricks on AWS.pptx
Databricks on AWS.pptx
 
Data Engineering.pdf
Data Engineering.pdfData Engineering.pdf
Data Engineering.pdf
 
Databricks Platform.pptx
Databricks Platform.pptxDatabricks Platform.pptx
Databricks Platform.pptx
 

Similar to Understanding DataOps and Its Impact on Application Quality

HyperconvergedFantasyAnalytics
HyperconvergedFantasyAnalyticsHyperconvergedFantasyAnalytics
HyperconvergedFantasyAnalytics
Jerry Jermann
 

Similar to Understanding DataOps and Its Impact on Application Quality (20)

Doing DevOps for Big Data? What You Need to Know About AIOps
Doing DevOps for Big Data? What You Need to Know About AIOpsDoing DevOps for Big Data? What You Need to Know About AIOps
Doing DevOps for Big Data? What You Need to Know About AIOps
 
Doing DevOps for Big Data? What You Need to Know About AIOps
Doing DevOps for Big Data? What You Need to Know About AIOpsDoing DevOps for Big Data? What You Need to Know About AIOps
Doing DevOps for Big Data? What You Need to Know About AIOps
 
Machine Data Analytics
Machine Data AnalyticsMachine Data Analytics
Machine Data Analytics
 
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...
 
Get ready for_an_autonomous_data_driven_future_ext
Get ready for_an_autonomous_data_driven_future_extGet ready for_an_autonomous_data_driven_future_ext
Get ready for_an_autonomous_data_driven_future_ext
 
Data Analytics in Digital Transformation
Data Analytics in Digital TransformationData Analytics in Digital Transformation
Data Analytics in Digital Transformation
 
ICP for Data- Enterprise platform for AI, ML and Data Science
ICP for Data- Enterprise platform for AI, ML and Data ScienceICP for Data- Enterprise platform for AI, ML and Data Science
ICP for Data- Enterprise platform for AI, ML and Data Science
 
Effective Cost Management for Amazon EMR
Effective Cost Management for Amazon EMREffective Cost Management for Amazon EMR
Effective Cost Management for Amazon EMR
 
HyperconvergedFantasyAnalytics
HyperconvergedFantasyAnalyticsHyperconvergedFantasyAnalytics
HyperconvergedFantasyAnalytics
 
Application Modernization
Application ModernizationApplication Modernization
Application Modernization
 
Innovating With Data and Analytics
Innovating With Data and AnalyticsInnovating With Data and Analytics
Innovating With Data and Analytics
 
NZS-4555 - IT Analytics Keynote - IT Analytics for the Enterprise
NZS-4555 - IT Analytics Keynote - IT Analytics for the EnterpriseNZS-4555 - IT Analytics Keynote - IT Analytics for the Enterprise
NZS-4555 - IT Analytics Keynote - IT Analytics for the Enterprise
 
The Shifting Landscape of Data Integration
The Shifting Landscape of Data IntegrationThe Shifting Landscape of Data Integration
The Shifting Landscape of Data Integration
 
Top Business Intelligence Trends for 2016 by Panorama Software
Top Business Intelligence Trends for 2016 by Panorama SoftwareTop Business Intelligence Trends for 2016 by Panorama Software
Top Business Intelligence Trends for 2016 by Panorama Software
 
SOUG Day - autonomous what is next
SOUG Day - autonomous what is nextSOUG Day - autonomous what is next
SOUG Day - autonomous what is next
 
Initiate Edinburgh 2019 - Big Data Meets AI
Initiate Edinburgh 2019 - Big Data Meets AIInitiate Edinburgh 2019 - Big Data Meets AI
Initiate Edinburgh 2019 - Big Data Meets AI
 
Which Change Data Capture Strategy is Right for You?
Which Change Data Capture Strategy is Right for You?Which Change Data Capture Strategy is Right for You?
Which Change Data Capture Strategy is Right for You?
 
New usage model for real-time analytics by Dr. WILLIAM L. BAIN at Big Data S...
 New usage model for real-time analytics by Dr. WILLIAM L. BAIN at Big Data S... New usage model for real-time analytics by Dr. WILLIAM L. BAIN at Big Data S...
New usage model for real-time analytics by Dr. WILLIAM L. BAIN at Big Data S...
 
Fueling AI & Machine Learning: Legacy Data as a Competitive Advantage
Fueling AI & Machine Learning: Legacy Data as a Competitive AdvantageFueling AI & Machine Learning: Legacy Data as a Competitive Advantage
Fueling AI & Machine Learning: Legacy Data as a Competitive Advantage
 
DoneDeal - AWS Data Analytics Platform
DoneDeal - AWS Data Analytics PlatformDoneDeal - AWS Data Analytics Platform
DoneDeal - AWS Data Analytics Platform
 

More from DevOps.com

Comparing Microsoft SQL Server 2019 Performance Across Various Kubernetes Pla...
Comparing Microsoft SQL Server 2019 Performance Across Various Kubernetes Pla...Comparing Microsoft SQL Server 2019 Performance Across Various Kubernetes Pla...
Comparing Microsoft SQL Server 2019 Performance Across Various Kubernetes Pla...
DevOps.com
 
Comparing Microsoft SQL Server 2019 Performance Across Various Kubernetes Pla...
Comparing Microsoft SQL Server 2019 Performance Across Various Kubernetes Pla...Comparing Microsoft SQL Server 2019 Performance Across Various Kubernetes Pla...
Comparing Microsoft SQL Server 2019 Performance Across Various Kubernetes Pla...
DevOps.com
 

More from DevOps.com (20)

Modernizing on IBM Z Made Easier With Open Source Software
Modernizing on IBM Z Made Easier With Open Source SoftwareModernizing on IBM Z Made Easier With Open Source Software
Modernizing on IBM Z Made Easier With Open Source Software
 
Comparing Microsoft SQL Server 2019 Performance Across Various Kubernetes Pla...
Comparing Microsoft SQL Server 2019 Performance Across Various Kubernetes Pla...Comparing Microsoft SQL Server 2019 Performance Across Various Kubernetes Pla...
Comparing Microsoft SQL Server 2019 Performance Across Various Kubernetes Pla...
 
Comparing Microsoft SQL Server 2019 Performance Across Various Kubernetes Pla...
Comparing Microsoft SQL Server 2019 Performance Across Various Kubernetes Pla...Comparing Microsoft SQL Server 2019 Performance Across Various Kubernetes Pla...
Comparing Microsoft SQL Server 2019 Performance Across Various Kubernetes Pla...
 
Next Generation Vulnerability Assessment Using Datadog and Snyk
Next Generation Vulnerability Assessment Using Datadog and SnykNext Generation Vulnerability Assessment Using Datadog and Snyk
Next Generation Vulnerability Assessment Using Datadog and Snyk
 
Vulnerability Discovery in the Cloud
Vulnerability Discovery in the CloudVulnerability Discovery in the Cloud
Vulnerability Discovery in the Cloud
 
2021 Open Source Governance: Top Ten Trends and Predictions
2021 Open Source Governance: Top Ten Trends and Predictions2021 Open Source Governance: Top Ten Trends and Predictions
2021 Open Source Governance: Top Ten Trends and Predictions
 
A New Year’s Ransomware Resolution
A New Year’s Ransomware ResolutionA New Year’s Ransomware Resolution
A New Year’s Ransomware Resolution
 
Getting Started with Runtime Security on Azure Kubernetes Service (AKS)
Getting Started with Runtime Security on Azure Kubernetes Service (AKS)Getting Started with Runtime Security on Azure Kubernetes Service (AKS)
Getting Started with Runtime Security on Azure Kubernetes Service (AKS)
 
Don't Panic! Effective Incident Response
Don't Panic! Effective Incident ResponseDon't Panic! Effective Incident Response
Don't Panic! Effective Incident Response
 
Creating a Culture of Chaos: Chaos Engineering Is Not Just Tools, It's Culture
Creating a Culture of Chaos: Chaos Engineering Is Not Just Tools, It's CultureCreating a Culture of Chaos: Chaos Engineering Is Not Just Tools, It's Culture
Creating a Culture of Chaos: Chaos Engineering Is Not Just Tools, It's Culture
 
Role Based Access Controls (RBAC) for SSH and Kubernetes Access with Teleport
Role Based Access Controls (RBAC) for SSH and Kubernetes Access with TeleportRole Based Access Controls (RBAC) for SSH and Kubernetes Access with Teleport
Role Based Access Controls (RBAC) for SSH and Kubernetes Access with Teleport
 
Monitoring Serverless Applications with Datadog
Monitoring Serverless Applications with DatadogMonitoring Serverless Applications with Datadog
Monitoring Serverless Applications with Datadog
 
Deliver your App Anywhere … Publicly or Privately
Deliver your App Anywhere … Publicly or PrivatelyDeliver your App Anywhere … Publicly or Privately
Deliver your App Anywhere … Publicly or Privately
 
Securing medical apps in the age of covid final
Securing medical apps in the age of covid finalSecuring medical apps in the age of covid final
Securing medical apps in the age of covid final
 
How to Build a Healthy On-Call Culture
How to Build a Healthy On-Call CultureHow to Build a Healthy On-Call Culture
How to Build a Healthy On-Call Culture
 
The Evolving Role of the Developer in 2021
The Evolving Role of the Developer in 2021The Evolving Role of the Developer in 2021
The Evolving Role of the Developer in 2021
 
Service Mesh: Two Big Words But Do You Need It?
Service Mesh: Two Big Words But Do You Need It?Service Mesh: Two Big Words But Do You Need It?
Service Mesh: Two Big Words But Do You Need It?
 
Secure Data Sharing in OpenShift Environments
Secure Data Sharing in OpenShift EnvironmentsSecure Data Sharing in OpenShift Environments
Secure Data Sharing in OpenShift Environments
 
How to Govern Identities and Access in Cloud Infrastructure: AppsFlyer Case S...
How to Govern Identities and Access in Cloud Infrastructure: AppsFlyer Case S...How to Govern Identities and Access in Cloud Infrastructure: AppsFlyer Case S...
How to Govern Identities and Access in Cloud Infrastructure: AppsFlyer Case S...
 
Elevate Your Enterprise Python and R AI, ML Software Strategy with Anaconda T...
Elevate Your Enterprise Python and R AI, ML Software Strategy with Anaconda T...Elevate Your Enterprise Python and R AI, ML Software Strategy with Anaconda T...
Elevate Your Enterprise Python and R AI, ML Software Strategy with Anaconda T...
 

Recently uploaded

Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
Safe Software
 

Recently uploaded (20)

Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
 
Salesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
Salesforce Adoption – Metrics, Methods, and Motivation, Antone KomSalesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
Salesforce Adoption – Metrics, Methods, and Motivation, Antone Kom
 
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
From Siloed Products to Connected Ecosystem: Building a Sustainable and Scala...
 
Key Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdfKey Trends Shaping the Future of Infrastructure.pdf
Key Trends Shaping the Future of Infrastructure.pdf
 
IoT Analytics Company Presentation May 2024
IoT Analytics Company Presentation May 2024IoT Analytics Company Presentation May 2024
IoT Analytics Company Presentation May 2024
 
Powerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara LaskowskaPowerful Start- the Key to Project Success, Barbara Laskowska
Powerful Start- the Key to Project Success, Barbara Laskowska
 
Introduction to Open Source RAG and RAG Evaluation
Introduction to Open Source RAG and RAG EvaluationIntroduction to Open Source RAG and RAG Evaluation
Introduction to Open Source RAG and RAG Evaluation
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptxIOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
IOS-PENTESTING-BEGINNERS-PRACTICAL-GUIDE-.pptx
 
Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........Bits & Pixels using AI for Good.........
Bits & Pixels using AI for Good.........
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 
Optimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through ObservabilityOptimizing NoSQL Performance Through Observability
Optimizing NoSQL Performance Through Observability
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi
 
Speed Wins: From Kafka to APIs in Minutes
Speed Wins: From Kafka to APIs in MinutesSpeed Wins: From Kafka to APIs in Minutes
Speed Wins: From Kafka to APIs in Minutes
 
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
Behind the Scenes From the Manager's Chair: Decoding the Secrets of Successfu...
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
AI revolution and Salesforce, Jiří Karpíšek
AI revolution and Salesforce, Jiří KarpíšekAI revolution and Salesforce, Jiří Karpíšek
AI revolution and Salesforce, Jiří Karpíšek
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
 
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
 

Understanding DataOps and Its Impact on Application Quality

  • 1. 1 Learn More About Unravel info@unraveldata.com
  • 2. 2 July 16, 2019 George Demarest Senior Director of Product Marketing Understanding DataOps and its Impact on Application Quality
  • 3. 3 Data Applications: The Perfect Storm of Human Error Swiss Cheese Model of Human Error Latent failures Organizational Challenges Latent failures Immature Supervision Preconditions for unsafe acts Latent failures Unsafe acts Active failures Incident!
  • 4. 4 Data Applications: The Perfect Storm of Human Error Swiss Cheese Model of Human Error in Data Applications Latent failures Organizational Challenges Latent failures Immature Supervision Preconditions for unsafe acts Latent failures Unsafe acts Active failures OOM Incident! “We don’t need a CDO/CAO” “Developers will allocate their own cluster resources” “Big Data is experimental. Data team: govern thyself” “I need max memory for all of my jobs”
  • 6. 6 DataOps is a Thing https://www.dataopsmanifesto.org/
  • 7. 7 Gartner Definition of DataOps DataOps is a collaborative data management practice focused on improving the communication, integration and automation of data flows between data managers and consumers across an organization. The goal of DataOps is to create predictable delivery and change management of data, data models and related artifacts. DataOps uses technology to automate data delivery with the appropriate levels of security, quality and metadata to improve the use and value of data in a dynamic environment.
  • 8. 8 Are We Losing the Battle of Complexity? Multiple layers of complexity make data applications difficult to tune, troubleshoot, operationalize, and scale. Only intelligent automation can win this fight.
  • 9. 9 Managing distributed apps is hard and expensive Hard to identify root cause Large “attack surface”
  • 10. 10 Without AI, DataOps is a manual, logistical challenge One complete correlated view with intelligent automation. Multiple tools, no complete view, no intelligence. DataOps Without AI AI-Powered DataOps
  • 11. 11 Unravel is AI Powered Automation for DataOps Without Unravel With Unravel • One full-stack, cross platform console • One complete correlated view • Automatic, lightweight data/log collection • Built-in AI/ML powered recommendations, insights, remediation • Chargeback/showback reporting • Multiple tools • Manual data/log collection • Fragmented view, multiple consoles • Minimal intelligence; manual tuning and troubleshooting • Manual cost analysis
  • 12. 12 Where is the value for DataOps created? Business Value Accelerate business decisions though timely data driven insights Performance Guarantee modern data application SLAs Throughput Optimize cluster performance and job completion times Quality Minimize failed jobs Efficiency Minimize big data cluster and resource contention Productivity Autonomous remediation scales Ops teams
  • 13. 13 Essential Elements of an AI-Powered DataOps • Data Collection and Correlation - Observe and collect all relevant data - Correlate collected data and derived metadata • Operational Data Model - Monitoring, troubleshooting, tuning, and managing requires an operational data model - Richer, more powerful than a CMDB • Analytics - Basic and advanced statistical analysis – correlate, classify, extrapolate from operational metadata - Predictive analytics and forecasting for capacity and growth - Pattern and anomaly detection, root-cause analysis - Prescriptive analytics and recommendations - Context, topology and coded expertise • Automation - Auto-tuning of applications - Autonomous resource allocation and optimization - Cluster load balancing and job scheduling - Automatic response to alerts and recovery from failures
  • 14. 14 Our Solution – Extensible Data Operations Platform
  • 15. 15 Automated DataOps Use Cases for Unravel Automated Cloud Cost Management • Optimize cost by right-sizing cloud images • Optimize cost by choosing the optimal price plan Automated Workload Management • Eliminate CPU, Memory, Network I/O and Disk I/O contention • Correctly size VM’s and Cloud Images • Place VM’s in the best Hosts and Clusters Automated Root Cause Analysis • Intelligent analysis of application failures • Use Unravel data model and learned app behaviors to automate RCA Automated Performance Optimization • Automatically learn the performance characteristics apps and supporting stack • Automatically optimize for a chosen KPI (performance, efficiency)
  • 16. 16 Example: Root Cause Analysis of App Failures 16 Challenge • Many levels of correlated stack traces • Identifying the root cause is hard and time consuming
  • 17. 17 Resolution • Reduce troubleshooting time from days to seconds • Improve productivity of data scientists and analysts Automated Root Cause Analysis of Failures
  • 19. 19 Unravel insights and recommendations for spark application tuning
  • 20. 20 Unravel is AI Powered Automation for DataOps Without Unravel With Unravel • One full-stack, cross platform console • One complete correlated view • Automatic, lightweight data/log collection • Built-in AI/ML powered recommendations, insights, remediation • Chargeback/showback reporting • Multiple tools • Manual data/log collection • Fragmented view, multiple consoles • Minimal intelligence; manual tuning and troubleshooting • Manual cost analysis
  • 21. 21 Unravel – What makes us different FULL-STACK COVERAGE Only Unravel works across your entire ecosystem to demystify and simplify operations. AI-DRIVEN RECOMMENDATIONS Unravel does more than monitor – it shows you how to make things better. AUTOMATED TUNING AND REMEDIATION Unravel operationalizes big data by automating it. FULLY-EXTENSIBLE FOR CLOUD ADOPTION Only Unravel works future-proofs your cloud adoption choices
  • 22. 22 Learn More About Unravel info@unraveldata.com