SlideShare a Scribd company logo
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
AIOps: steps towards autonomous
operations
E N T 3 0 1 - R
Ajit Zadgaonkar
Global DevOps Leader
AWS Professional
Services
Stephen Felisan
CIO, Edmunds.com
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
First, a little about me…
Likes
• Operational efficiency
• Architecting for operability
• Making new mistakes
Dislikes
• Engineering Vs business breakdowns
• Processes killing creativity
• Alert fatigue
Ajit Zadgaonkar
AWS Professional Services
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Agenda
The battle of Ops world
AiOps
See it in action
Words of Wisdom
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
What we are really trying to achieve …
Infrastructure
Support
Innovation
Infrastructure
Support
Innovation
Innovation
Support
✅
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Efficiency expressed in numbers …
440x
Shorter Lead Times
60x
Fewer Failures
21%
Less unplanned rework
44%
More new Work
* Source: Puppet State of DevOps Report
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Nay Ops : Aye Ops
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
4 Challenges in Ops
Lack of Information
Information overload
Data correlation
Dynamic Trends
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Ops done better
Root cause identification
Freedom from white noise
Data correlation map
Proactive remediation
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
The Key metric : 1. MTTR
The right Needle in the haystack
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
The Key metric : 2. Proactive Actions
# problems auto~magically avoided
“Innovation and operations don’t have
to be inversely proportional”
Ajit Zadgaonkar
AI believer, Ops sufferer
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
How to get life back in 4 steps ?
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Step 1. Set up for Observability
Infrastructure Logs
Platform Logs
Application Logs
Code telemetry
Events
Vended Metrics
API and Users
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Step 2. Set up delivery, Collection, and Storage
ingestion
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Step 3. Querying and patterns mining
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Step 4.
Alerting, Notification, and remediation
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
AiOps
Predictive &
Preventive Insights
Patterns,
tools & data
ingestion
Threat intel
Budgets
Planned events
Purposeful Actions
Timely
Recovery
Decisive
Response
New Pattern
discovery
Continuous Learning
Infrastructure Logs
Platform Logs
Application Logs
Code telemetry
Events
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Take Ah! Ops to AiOPs
Track every resource possible
Alert on services, availability, and response times
Capture, learn, and improve
Socialize insights with everyone on the team
Plan metrics and monitoring, track patterns
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Nay Ops : Aye Ops : AIOps
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
How Edmunds.com optimized Ops
Stephen Felisan, CIO, Edmunds.com
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Container Stability Prediction
• Removes bad containers
• Focuses on actions and
remediation
• Started with Tensor Flow, moving
to ML
Outcomes:
*Improved user experience
*95% reduction in on-call
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Breakout repeats
Tuesday, November 27
AIOps: steps towards autonomous operations
1:00 PM – 2:00 PM | Mirage, Montego D
Thursday, November 29
AIOps: steps towards autonomous operations
2:30 PM – 3:30 PM | Venetian, Level 4, Lando 4202
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Related breakouts
Tuesday, November 27
How to Ace Digital Transformation: Best Practices and Lessons Learned - ARC322
7:00 PM –8:00 PM | Aria East, Plaza Level, Orovada 3, T1
Tuesday, November 27
Moving to DevOps the Amazon Way - DEV210-R
8:30 AM – 9:3 0AM | Bellagio, Level 1, Grand Ballroom 2, T1
Monday, November 26
Enterprise DevOps: Patterns of Efficiency - ENT311
3:15 PM– 4:15 PM | Bellagio, Level 1, Bellagio Ballroom 6
Thank you!
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Ajit Zadgaonkar
zadajit@amazon.com
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.

More Related Content

What's hot

Building-a-Data-Lake-on-AWS
Building-a-Data-Lake-on-AWSBuilding-a-Data-Lake-on-AWS
Building-a-Data-Lake-on-AWS
Amazon Web Services
 
Running a High-Performance Kubernetes Cluster with Amazon EKS (CON318-R1) - A...
Running a High-Performance Kubernetes Cluster with Amazon EKS (CON318-R1) - A...Running a High-Performance Kubernetes Cluster with Amazon EKS (CON318-R1) - A...
Running a High-Performance Kubernetes Cluster with Amazon EKS (CON318-R1) - A...
Amazon Web Services
 
Using AIOps to reduce incidents volume
Using AIOps to reduce incidents volumeUsing AIOps to reduce incidents volume
Using AIOps to reduce incidents volume
Amazon Web Services
 
Introduction to AWS Lake Formation.pptx
Introduction to AWS Lake Formation.pptxIntroduction to AWS Lake Formation.pptx
Introduction to AWS Lake Formation.pptx
SwathiPonugumati
 
Building Data Lakes for Analytics on AWS
Building Data Lakes for Analytics on AWSBuilding Data Lakes for Analytics on AWS
Building Data Lakes for Analytics on AWS
Amazon Web Services
 
The AWS Playbook for Cloud Readiness & Large Scale Migrations
The AWS Playbook for Cloud Readiness & Large Scale MigrationsThe AWS Playbook for Cloud Readiness & Large Scale Migrations
The AWS Playbook for Cloud Readiness & Large Scale Migrations
Amazon Web Services
 
Introduction to Amazon EKS
Introduction to Amazon EKSIntroduction to Amazon EKS
Introduction to Amazon EKS
Amazon Web Services
 
AWS Well-Architected Framework: Operational Excellence Pillar
AWS Well-Architected Framework: Operational Excellence PillarAWS Well-Architected Framework: Operational Excellence Pillar
AWS Well-Architected Framework: Operational Excellence Pillar
Jonathan LaCour
 
AWS Cloud Adoption Framework
AWS Cloud Adoption Framework AWS Cloud Adoption Framework
AWS Cloud Adoption Framework
Amazon Web Services
 
Instrumenting Kubernetes for Observability Using AWS X-Ray and Amazon CloudWa...
Instrumenting Kubernetes for Observability Using AWS X-Ray and Amazon CloudWa...Instrumenting Kubernetes for Observability Using AWS X-Ray and Amazon CloudWa...
Instrumenting Kubernetes for Observability Using AWS X-Ray and Amazon CloudWa...
Amazon Web Services
 
FinOps - AWS Cost and Operational Efficiency - Pop-up Loft Tel Aviv
FinOps - AWS Cost and Operational Efficiency - Pop-up Loft Tel AvivFinOps - AWS Cost and Operational Efficiency - Pop-up Loft Tel Aviv
FinOps - AWS Cost and Operational Efficiency - Pop-up Loft Tel Aviv
Amazon Web Services
 
[NEW LAUNCH!] Introducing Amazon Forecast (AIM344) - AWS re:Invent 2018
[NEW LAUNCH!] Introducing Amazon Forecast  (AIM344) - AWS re:Invent 2018[NEW LAUNCH!] Introducing Amazon Forecast  (AIM344) - AWS re:Invent 2018
[NEW LAUNCH!] Introducing Amazon Forecast (AIM344) - AWS re:Invent 2018
Amazon Web Services
 
Building Highly Scalable Retail Order Management Systems with Serverless
Building Highly Scalable Retail Order Management Systems with ServerlessBuilding Highly Scalable Retail Order Management Systems with Serverless
Building Highly Scalable Retail Order Management Systems with Serverless
Amazon Web Services
 
AIOps: Anomalies Detection of Distributed Traces
AIOps: Anomalies Detection of Distributed TracesAIOps: Anomalies Detection of Distributed Traces
AIOps: Anomalies Detection of Distributed Traces
Jorge Cardoso
 
NET304_Deep Dive into the New Network Load Balancer
NET304_Deep Dive into the New Network Load BalancerNET304_Deep Dive into the New Network Load Balancer
NET304_Deep Dive into the New Network Load Balancer
Amazon Web Services
 
The future of AIOps
The future of AIOpsThe future of AIOps
The future of AIOps
GAVS Technologies
 
HigherEducation-Cloud Operating Model and Approach Forward.pdf
HigherEducation-Cloud Operating Model and Approach Forward.pdfHigherEducation-Cloud Operating Model and Approach Forward.pdf
HigherEducation-Cloud Operating Model and Approach Forward.pdfAmazon Web Services
 
Security and governance with AWS Control Tower and AWS Organizations - SEC204...
Security and governance with AWS Control Tower and AWS Organizations - SEC204...Security and governance with AWS Control Tower and AWS Organizations - SEC204...
Security and governance with AWS Control Tower and AWS Organizations - SEC204...
Amazon Web Services
 
Setting up a Cloud Center of Excellence (CCoE) for Enterprise Customers
Setting up a Cloud Center of Excellence (CCoE) for Enterprise CustomersSetting up a Cloud Center of Excellence (CCoE) for Enterprise Customers
Setting up a Cloud Center of Excellence (CCoE) for Enterprise Customers
Ali Asgar Juzer
 
Cloud Adoption Framework Define Your Cloud Strategy and Accelerate Results
Cloud Adoption Framework Define Your Cloud Strategy and Accelerate Results Cloud Adoption Framework Define Your Cloud Strategy and Accelerate Results
Cloud Adoption Framework Define Your Cloud Strategy and Accelerate Results Amazon Web Services
 

What's hot (20)

Building-a-Data-Lake-on-AWS
Building-a-Data-Lake-on-AWSBuilding-a-Data-Lake-on-AWS
Building-a-Data-Lake-on-AWS
 
Running a High-Performance Kubernetes Cluster with Amazon EKS (CON318-R1) - A...
Running a High-Performance Kubernetes Cluster with Amazon EKS (CON318-R1) - A...Running a High-Performance Kubernetes Cluster with Amazon EKS (CON318-R1) - A...
Running a High-Performance Kubernetes Cluster with Amazon EKS (CON318-R1) - A...
 
Using AIOps to reduce incidents volume
Using AIOps to reduce incidents volumeUsing AIOps to reduce incidents volume
Using AIOps to reduce incidents volume
 
Introduction to AWS Lake Formation.pptx
Introduction to AWS Lake Formation.pptxIntroduction to AWS Lake Formation.pptx
Introduction to AWS Lake Formation.pptx
 
Building Data Lakes for Analytics on AWS
Building Data Lakes for Analytics on AWSBuilding Data Lakes for Analytics on AWS
Building Data Lakes for Analytics on AWS
 
The AWS Playbook for Cloud Readiness & Large Scale Migrations
The AWS Playbook for Cloud Readiness & Large Scale MigrationsThe AWS Playbook for Cloud Readiness & Large Scale Migrations
The AWS Playbook for Cloud Readiness & Large Scale Migrations
 
Introduction to Amazon EKS
Introduction to Amazon EKSIntroduction to Amazon EKS
Introduction to Amazon EKS
 
AWS Well-Architected Framework: Operational Excellence Pillar
AWS Well-Architected Framework: Operational Excellence PillarAWS Well-Architected Framework: Operational Excellence Pillar
AWS Well-Architected Framework: Operational Excellence Pillar
 
AWS Cloud Adoption Framework
AWS Cloud Adoption Framework AWS Cloud Adoption Framework
AWS Cloud Adoption Framework
 
Instrumenting Kubernetes for Observability Using AWS X-Ray and Amazon CloudWa...
Instrumenting Kubernetes for Observability Using AWS X-Ray and Amazon CloudWa...Instrumenting Kubernetes for Observability Using AWS X-Ray and Amazon CloudWa...
Instrumenting Kubernetes for Observability Using AWS X-Ray and Amazon CloudWa...
 
FinOps - AWS Cost and Operational Efficiency - Pop-up Loft Tel Aviv
FinOps - AWS Cost and Operational Efficiency - Pop-up Loft Tel AvivFinOps - AWS Cost and Operational Efficiency - Pop-up Loft Tel Aviv
FinOps - AWS Cost and Operational Efficiency - Pop-up Loft Tel Aviv
 
[NEW LAUNCH!] Introducing Amazon Forecast (AIM344) - AWS re:Invent 2018
[NEW LAUNCH!] Introducing Amazon Forecast  (AIM344) - AWS re:Invent 2018[NEW LAUNCH!] Introducing Amazon Forecast  (AIM344) - AWS re:Invent 2018
[NEW LAUNCH!] Introducing Amazon Forecast (AIM344) - AWS re:Invent 2018
 
Building Highly Scalable Retail Order Management Systems with Serverless
Building Highly Scalable Retail Order Management Systems with ServerlessBuilding Highly Scalable Retail Order Management Systems with Serverless
Building Highly Scalable Retail Order Management Systems with Serverless
 
AIOps: Anomalies Detection of Distributed Traces
AIOps: Anomalies Detection of Distributed TracesAIOps: Anomalies Detection of Distributed Traces
AIOps: Anomalies Detection of Distributed Traces
 
NET304_Deep Dive into the New Network Load Balancer
NET304_Deep Dive into the New Network Load BalancerNET304_Deep Dive into the New Network Load Balancer
NET304_Deep Dive into the New Network Load Balancer
 
The future of AIOps
The future of AIOpsThe future of AIOps
The future of AIOps
 
HigherEducation-Cloud Operating Model and Approach Forward.pdf
HigherEducation-Cloud Operating Model and Approach Forward.pdfHigherEducation-Cloud Operating Model and Approach Forward.pdf
HigherEducation-Cloud Operating Model and Approach Forward.pdf
 
Security and governance with AWS Control Tower and AWS Organizations - SEC204...
Security and governance with AWS Control Tower and AWS Organizations - SEC204...Security and governance with AWS Control Tower and AWS Organizations - SEC204...
Security and governance with AWS Control Tower and AWS Organizations - SEC204...
 
Setting up a Cloud Center of Excellence (CCoE) for Enterprise Customers
Setting up a Cloud Center of Excellence (CCoE) for Enterprise CustomersSetting up a Cloud Center of Excellence (CCoE) for Enterprise Customers
Setting up a Cloud Center of Excellence (CCoE) for Enterprise Customers
 
Cloud Adoption Framework Define Your Cloud Strategy and Accelerate Results
Cloud Adoption Framework Define Your Cloud Strategy and Accelerate Results Cloud Adoption Framework Define Your Cloud Strategy and Accelerate Results
Cloud Adoption Framework Define Your Cloud Strategy and Accelerate Results
 

Similar to AIOps: Steps Towards Autonomous Operations (DEV301-R1) - AWS re:Invent 2018

Digital Transformation Playbook in Five Steps (ARC322) - AWS re:Invent 2018
Digital Transformation Playbook in Five Steps (ARC322) - AWS re:Invent 2018Digital Transformation Playbook in Five Steps (ARC322) - AWS re:Invent 2018
Digital Transformation Playbook in Five Steps (ARC322) - AWS re:Invent 2018
Amazon Web Services
 
Moving to DevOps the Amazon Way (DEV210-R1) - AWS re:Invent 2018
Moving to DevOps the Amazon Way (DEV210-R1) - AWS re:Invent 2018Moving to DevOps the Amazon Way (DEV210-R1) - AWS re:Invent 2018
Moving to DevOps the Amazon Way (DEV210-R1) - AWS re:Invent 2018
Amazon Web Services
 
Building a Monitoring Plan.pdf
Building a Monitoring Plan.pdfBuilding a Monitoring Plan.pdf
Building a Monitoring Plan.pdf
Amazon Web Services
 
The Future of Enterprise IT - Lessons Learned
The Future of Enterprise IT - Lessons LearnedThe Future of Enterprise IT - Lessons Learned
The Future of Enterprise IT - Lessons Learned
Amazon Web Services
 
Operating at Scale- Preparing for the Journey [Portuguese]
Operating at Scale- Preparing for the Journey [Portuguese]Operating at Scale- Preparing for the Journey [Portuguese]
Operating at Scale- Preparing for the Journey [Portuguese]
Amazon Web Services
 
Releasing Mission-Critical Software at Amazon (DEV209-R1) - AWS re:Invent 2018
Releasing Mission-Critical Software at Amazon (DEV209-R1) - AWS re:Invent 2018Releasing Mission-Critical Software at Amazon (DEV209-R1) - AWS re:Invent 2018
Releasing Mission-Critical Software at Amazon (DEV209-R1) - AWS re:Invent 2018
Amazon Web Services
 
Operating at Scale: Preparing for the Journey
Operating at Scale: Preparing for the JourneyOperating at Scale: Preparing for the Journey
Operating at Scale: Preparing for the Journey
Amazon Web Services
 
Leadership Session: The Future of Enterprise IT (ENT220-L) - AWS re:Invent 2018
Leadership Session:  The Future of Enterprise IT (ENT220-L) - AWS re:Invent 2018Leadership Session:  The Future of Enterprise IT (ENT220-L) - AWS re:Invent 2018
Leadership Session: The Future of Enterprise IT (ENT220-L) - AWS re:Invent 2018
Amazon Web Services
 
Enterprise Cloud Adoption
Enterprise Cloud Adoption Enterprise Cloud Adoption
Enterprise Cloud Adoption
Tom Laszewski
 
Continuously Delivering Your Software on AWS - Adrian White - AWS TechShift A...
Continuously Delivering Your Software on AWS - Adrian White - AWS TechShift A...Continuously Delivering Your Software on AWS - Adrian White - AWS TechShift A...
Continuously Delivering Your Software on AWS - Adrian White - AWS TechShift A...
Amazon Web Services
 
Trends in Digital Transformation (ARC212) - AWS re:Invent 2018
Trends in Digital Transformation (ARC212) - AWS re:Invent 2018Trends in Digital Transformation (ARC212) - AWS re:Invent 2018
Trends in Digital Transformation (ARC212) - AWS re:Invent 2018
Amazon Web Services
 
How Trupanion Became an AI-driven Company for Pets
How Trupanion Became an AI-driven Company for PetsHow Trupanion Became an AI-driven Company for Pets
How Trupanion Became an AI-driven Company for Pets
Amazon Web Services
 
Life of a Code Change to a Tier 1 Service - AWS Online Tech Talks
Life of a Code Change to a Tier 1 Service - AWS Online Tech TalksLife of a Code Change to a Tier 1 Service - AWS Online Tech Talks
Life of a Code Change to a Tier 1 Service - AWS Online Tech Talks
Amazon Web Services
 
Transforming Product Development- Transformation Day Philadelphia 2018
Transforming Product Development- Transformation Day Philadelphia 2018Transforming Product Development- Transformation Day Philadelphia 2018
Transforming Product Development- Transformation Day Philadelphia 2018
Amazon Web Services
 
Your road to a Well Architected solution in the Cloud - Tel Aviv Summit 2018
Your road to a Well Architected solution in the Cloud - Tel Aviv Summit 2018Your road to a Well Architected solution in the Cloud - Tel Aviv Summit 2018
Your road to a Well Architected solution in the Cloud - Tel Aviv Summit 2018
Amazon Web Services
 
Hitchhiker's Guide to Cloud Ops
Hitchhiker's Guide to Cloud Ops Hitchhiker's Guide to Cloud Ops
Hitchhiker's Guide to Cloud Ops
Amazon Web Services
 
Practical Human-in-the-Loop Machine Learning
 Practical Human-in-the-Loop Machine Learning Practical Human-in-the-Loop Machine Learning
Practical Human-in-the-Loop Machine Learning
Amazon Web Services
 
Transforming Product Development- AWS Transformation Day Raleigh 2018.pdf
Transforming Product Development- AWS Transformation Day Raleigh 2018.pdfTransforming Product Development- AWS Transformation Day Raleigh 2018.pdf
Transforming Product Development- AWS Transformation Day Raleigh 2018.pdf
Amazon Web Services
 
Transforming Product Development - AWS Transformation Day Boston 2018
Transforming Product Development - AWS Transformation Day Boston 2018Transforming Product Development - AWS Transformation Day Boston 2018
Transforming Product Development - AWS Transformation Day Boston 2018
Amazon Web Services
 
TECHTalks - Boston MA - Tim Harney
TECHTalks - Boston MA - Tim HarneyTECHTalks - Boston MA - Tim Harney
TECHTalks - Boston MA - Tim Harney
EagleDream Technologies
 

Similar to AIOps: Steps Towards Autonomous Operations (DEV301-R1) - AWS re:Invent 2018 (20)

Digital Transformation Playbook in Five Steps (ARC322) - AWS re:Invent 2018
Digital Transformation Playbook in Five Steps (ARC322) - AWS re:Invent 2018Digital Transformation Playbook in Five Steps (ARC322) - AWS re:Invent 2018
Digital Transformation Playbook in Five Steps (ARC322) - AWS re:Invent 2018
 
Moving to DevOps the Amazon Way (DEV210-R1) - AWS re:Invent 2018
Moving to DevOps the Amazon Way (DEV210-R1) - AWS re:Invent 2018Moving to DevOps the Amazon Way (DEV210-R1) - AWS re:Invent 2018
Moving to DevOps the Amazon Way (DEV210-R1) - AWS re:Invent 2018
 
Building a Monitoring Plan.pdf
Building a Monitoring Plan.pdfBuilding a Monitoring Plan.pdf
Building a Monitoring Plan.pdf
 
The Future of Enterprise IT - Lessons Learned
The Future of Enterprise IT - Lessons LearnedThe Future of Enterprise IT - Lessons Learned
The Future of Enterprise IT - Lessons Learned
 
Operating at Scale- Preparing for the Journey [Portuguese]
Operating at Scale- Preparing for the Journey [Portuguese]Operating at Scale- Preparing for the Journey [Portuguese]
Operating at Scale- Preparing for the Journey [Portuguese]
 
Releasing Mission-Critical Software at Amazon (DEV209-R1) - AWS re:Invent 2018
Releasing Mission-Critical Software at Amazon (DEV209-R1) - AWS re:Invent 2018Releasing Mission-Critical Software at Amazon (DEV209-R1) - AWS re:Invent 2018
Releasing Mission-Critical Software at Amazon (DEV209-R1) - AWS re:Invent 2018
 
Operating at Scale: Preparing for the Journey
Operating at Scale: Preparing for the JourneyOperating at Scale: Preparing for the Journey
Operating at Scale: Preparing for the Journey
 
Leadership Session: The Future of Enterprise IT (ENT220-L) - AWS re:Invent 2018
Leadership Session:  The Future of Enterprise IT (ENT220-L) - AWS re:Invent 2018Leadership Session:  The Future of Enterprise IT (ENT220-L) - AWS re:Invent 2018
Leadership Session: The Future of Enterprise IT (ENT220-L) - AWS re:Invent 2018
 
Enterprise Cloud Adoption
Enterprise Cloud Adoption Enterprise Cloud Adoption
Enterprise Cloud Adoption
 
Continuously Delivering Your Software on AWS - Adrian White - AWS TechShift A...
Continuously Delivering Your Software on AWS - Adrian White - AWS TechShift A...Continuously Delivering Your Software on AWS - Adrian White - AWS TechShift A...
Continuously Delivering Your Software on AWS - Adrian White - AWS TechShift A...
 
Trends in Digital Transformation (ARC212) - AWS re:Invent 2018
Trends in Digital Transformation (ARC212) - AWS re:Invent 2018Trends in Digital Transformation (ARC212) - AWS re:Invent 2018
Trends in Digital Transformation (ARC212) - AWS re:Invent 2018
 
How Trupanion Became an AI-driven Company for Pets
How Trupanion Became an AI-driven Company for PetsHow Trupanion Became an AI-driven Company for Pets
How Trupanion Became an AI-driven Company for Pets
 
Life of a Code Change to a Tier 1 Service - AWS Online Tech Talks
Life of a Code Change to a Tier 1 Service - AWS Online Tech TalksLife of a Code Change to a Tier 1 Service - AWS Online Tech Talks
Life of a Code Change to a Tier 1 Service - AWS Online Tech Talks
 
Transforming Product Development- Transformation Day Philadelphia 2018
Transforming Product Development- Transformation Day Philadelphia 2018Transforming Product Development- Transformation Day Philadelphia 2018
Transforming Product Development- Transformation Day Philadelphia 2018
 
Your road to a Well Architected solution in the Cloud - Tel Aviv Summit 2018
Your road to a Well Architected solution in the Cloud - Tel Aviv Summit 2018Your road to a Well Architected solution in the Cloud - Tel Aviv Summit 2018
Your road to a Well Architected solution in the Cloud - Tel Aviv Summit 2018
 
Hitchhiker's Guide to Cloud Ops
Hitchhiker's Guide to Cloud Ops Hitchhiker's Guide to Cloud Ops
Hitchhiker's Guide to Cloud Ops
 
Practical Human-in-the-Loop Machine Learning
 Practical Human-in-the-Loop Machine Learning Practical Human-in-the-Loop Machine Learning
Practical Human-in-the-Loop Machine Learning
 
Transforming Product Development- AWS Transformation Day Raleigh 2018.pdf
Transforming Product Development- AWS Transformation Day Raleigh 2018.pdfTransforming Product Development- AWS Transformation Day Raleigh 2018.pdf
Transforming Product Development- AWS Transformation Day Raleigh 2018.pdf
 
Transforming Product Development - AWS Transformation Day Boston 2018
Transforming Product Development - AWS Transformation Day Boston 2018Transforming Product Development - AWS Transformation Day Boston 2018
Transforming Product Development - AWS Transformation Day Boston 2018
 
TECHTalks - Boston MA - Tim Harney
TECHTalks - Boston MA - Tim HarneyTECHTalks - Boston MA - Tim Harney
TECHTalks - Boston MA - Tim Harney
 

More from Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
Amazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
Amazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
Amazon Web Services
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Amazon Web Services
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
Amazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
Amazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Amazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
Amazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Amazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
Amazon Web Services
 

More from Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

AIOps: Steps Towards Autonomous Operations (DEV301-R1) - AWS re:Invent 2018

  • 1.
  • 2. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. AIOps: steps towards autonomous operations E N T 3 0 1 - R Ajit Zadgaonkar Global DevOps Leader AWS Professional Services Stephen Felisan CIO, Edmunds.com
  • 3. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. First, a little about me… Likes • Operational efficiency • Architecting for operability • Making new mistakes Dislikes • Engineering Vs business breakdowns • Processes killing creativity • Alert fatigue Ajit Zadgaonkar AWS Professional Services
  • 4. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Agenda The battle of Ops world AiOps See it in action Words of Wisdom
  • 5. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 6. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. What we are really trying to achieve … Infrastructure Support Innovation Infrastructure Support Innovation Innovation Support ✅
  • 7. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Efficiency expressed in numbers … 440x Shorter Lead Times 60x Fewer Failures 21% Less unplanned rework 44% More new Work * Source: Puppet State of DevOps Report
  • 8. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Nay Ops : Aye Ops
  • 9. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. 4 Challenges in Ops Lack of Information Information overload Data correlation Dynamic Trends
  • 10. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Ops done better Root cause identification Freedom from white noise Data correlation map Proactive remediation
  • 11. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. The Key metric : 1. MTTR The right Needle in the haystack
  • 12. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. The Key metric : 2. Proactive Actions # problems auto~magically avoided
  • 13. “Innovation and operations don’t have to be inversely proportional” Ajit Zadgaonkar AI believer, Ops sufferer
  • 14. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. How to get life back in 4 steps ?
  • 15. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Step 1. Set up for Observability Infrastructure Logs Platform Logs Application Logs Code telemetry Events Vended Metrics API and Users
  • 16. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Step 2. Set up delivery, Collection, and Storage ingestion
  • 17. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Step 3. Querying and patterns mining
  • 18. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Step 4. Alerting, Notification, and remediation
  • 19. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. AiOps Predictive & Preventive Insights Patterns, tools & data ingestion Threat intel Budgets Planned events Purposeful Actions Timely Recovery Decisive Response New Pattern discovery Continuous Learning Infrastructure Logs Platform Logs Application Logs Code telemetry Events
  • 20. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 21. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 22. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 23. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 24. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 25. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Take Ah! Ops to AiOPs Track every resource possible Alert on services, availability, and response times Capture, learn, and improve Socialize insights with everyone on the team Plan metrics and monitoring, track patterns
  • 26. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Nay Ops : Aye Ops : AIOps
  • 27. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. How Edmunds.com optimized Ops Stephen Felisan, CIO, Edmunds.com
  • 28. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Container Stability Prediction • Removes bad containers • Focuses on actions and remediation • Started with Tensor Flow, moving to ML Outcomes: *Improved user experience *95% reduction in on-call
  • 29. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Breakout repeats Tuesday, November 27 AIOps: steps towards autonomous operations 1:00 PM – 2:00 PM | Mirage, Montego D Thursday, November 29 AIOps: steps towards autonomous operations 2:30 PM – 3:30 PM | Venetian, Level 4, Lando 4202
  • 30. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Related breakouts Tuesday, November 27 How to Ace Digital Transformation: Best Practices and Lessons Learned - ARC322 7:00 PM –8:00 PM | Aria East, Plaza Level, Orovada 3, T1 Tuesday, November 27 Moving to DevOps the Amazon Way - DEV210-R 8:30 AM – 9:3 0AM | Bellagio, Level 1, Grand Ballroom 2, T1 Monday, November 26 Enterprise DevOps: Patterns of Efficiency - ENT311 3:15 PM– 4:15 PM | Bellagio, Level 1, Bellagio Ballroom 6
  • 31. Thank you! © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Ajit Zadgaonkar zadajit@amazon.com
  • 32. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.