SlideShare a Scribd company logo
1 of 28
S U M M I T
SYDNEY
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
AIOps: Steps towards
autonomous operations
Sri (Srichakri) Nadendla
Enterprise Solutions Architect
Amazon Web Services
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Agenda
• Effective operational practices
• Enablers for autonomous operations
• Demo using Amazon Sagemaker
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Operations
School of hard knocks
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Whatever can go wrong,
will go wrong.
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Operations objectives
Keep it
safe
Keep the
lights on
Reliability
(Availability + Performance)
Security
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Ops holy grail
Prevent Correct Baseline
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Enablers to effective operations
Collection Patterns
Actions
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Foundations to observability
Alerts and
notifications
Data, tools
and patterns
Ingestion
Metrics,
events
and logs
Threat intel Budgets
Planned events
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Step 1 – Enable instrumentation
Metrics,
events
and logs
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
AWS services in the context
Instrumentation
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Step 2 – Ingest and store
Data
storage
Ingestion
Metrics,
events
and logs
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
AWS services in the context
Instrumentation Ingestion Storage
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Step 3 – Query and pattern mining
Data
Storage
Ingestion
Metrics,
events
and logs
Threat intel Budgets
Planned events
Analysis
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
AWS services in the context
Instrumentation Ingestion Storage Analysis
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Step 4 – Alerts and Remediation
Alerts
& notifications
Data, tools
& patterns
Ingestion
Metrics,
events
and logs
Threat intel Budgets
Planned events
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
AWS services in the context
Instrumentation Ingestion Storage Analysis Alerts &
Actions
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Some challenges
Metrics
galore
Dashboards
fatigue
Manual
correlation
Static
thresholds
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Responsiveness matters
Root cause
identification
Dynamic
detection
Proactive
remediation
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Path to autonomous operations
Data, tools
& patterns
Ingestion
Metrics,
events
and logs
Threat intel Budgets
Planned events
Predictive,
actionable and
automated
remediation
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Key techniques
Correlation Anomaly
detection
Forecasting
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Amazon SageMaker: Build, train, and deploy
machine learning models at scale
1
2
3
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Artificial Intelligence
amplifies the possibilities of
human-machine collaboration
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Additional considerations
Shift left
(CI/CD)
Runbook
invocation
Knowledge
assist
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
GOOD intentions should have GOALs
Mean time
between
failures
Proactive
actions
# Problems
avoided
Time to detect
Time to resolve
Mean time to
recover
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
What we are really trying to achieve …
Infrastructure
Support
Innovation
Infrastructure
Support
Innovation
Innovation
Support
✅
Thank you!
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Sri (Srichakri) Nadendla
nadendls@amazon.com

More Related Content

What's hot

OpsRamp Webinar - State of AIOps for Incident Management and Beyond
OpsRamp Webinar - State of AIOps for Incident Management and BeyondOpsRamp Webinar - State of AIOps for Incident Management and Beyond
OpsRamp Webinar - State of AIOps for Incident Management and BeyondOpsRamp
 
ITSM and ITOM Coming Together
ITSM and ITOM Coming TogetherITSM and ITOM Coming Together
ITSM and ITOM Coming TogetherOpsRamp
 
Agile Network India | Agility Day @Noida | SRE & AIOps | Murugan Muthayan
Agile Network India | Agility Day @Noida | SRE & AIOps | Murugan MuthayanAgile Network India | Agility Day @Noida | SRE & AIOps | Murugan Muthayan
Agile Network India | Agility Day @Noida | SRE & AIOps | Murugan MuthayanAgileNetwork
 
AIOps, IT Analytics, and Business Performance: What’s Needed and What Works
AIOps, IT Analytics, and Business Performance: What’s Needed and What Works AIOps, IT Analytics, and Business Performance: What’s Needed and What Works
AIOps, IT Analytics, and Business Performance: What’s Needed and What Works Enterprise Management Associates
 
AIOps-Driven Network Performance Management: The First Step Toward Self-Heali...
AIOps-Driven Network Performance Management: The First Step Toward Self-Heali...AIOps-Driven Network Performance Management: The First Step Toward Self-Heali...
AIOps-Driven Network Performance Management: The First Step Toward Self-Heali...Enterprise Management Associates
 
Context is Critical: How Richer Data Yields Richer Results in AIOps | Bhanu S...
Context is Critical: How Richer Data Yields Richer Results in AIOps | Bhanu S...Context is Critical: How Richer Data Yields Richer Results in AIOps | Bhanu S...
Context is Critical: How Richer Data Yields Richer Results in AIOps | Bhanu S...OpsRamp
 
Doing DevOps for Big Data? What You Need to Know About AIOps
Doing DevOps for Big Data? What You Need to Know About AIOpsDoing DevOps for Big Data? What You Need to Know About AIOps
Doing DevOps for Big Data? What You Need to Know About AIOpsDevOps.com
 
No Ops? Or Yes, Ops! The Future of Operations in a DevOps World
No Ops? Or Yes, Ops! The Future of Operations in a DevOps WorldNo Ops? Or Yes, Ops! The Future of Operations in a DevOps World
No Ops? Or Yes, Ops! The Future of Operations in a DevOps WorldOpsRamp
 
2019 Performance Monitoring and Management Trends and Insights
2019 Performance Monitoring and Management Trends and Insights2019 Performance Monitoring and Management Trends and Insights
2019 Performance Monitoring and Management Trends and InsightsOpsRamp
 
Doing DevOps for Big Data? What You Need to Know About AIOps
Doing DevOps for Big Data? What You Need to Know About AIOpsDoing DevOps for Big Data? What You Need to Know About AIOps
Doing DevOps for Big Data? What You Need to Know About AIOpsDevOps.com
 
Bringing AIOps to Hybrid Cloud Monitoring and Management
Bringing AIOps to Hybrid Cloud Monitoring and ManagementBringing AIOps to Hybrid Cloud Monitoring and Management
Bringing AIOps to Hybrid Cloud Monitoring and ManagementOpsRamp
 
What Does Artificial Intelligence Have to Do with IT Operations?
What Does Artificial Intelligence Have to Do with IT Operations?What Does Artificial Intelligence Have to Do with IT Operations?
What Does Artificial Intelligence Have to Do with IT Operations?Precisely
 
OpsRamp Platform Fall Release
OpsRamp Platform Fall ReleaseOpsRamp Platform Fall Release
OpsRamp Platform Fall ReleaseOpsRamp
 
OpsRamp Platform Winter 2020 Release
OpsRamp Platform Winter 2020 ReleaseOpsRamp Platform Winter 2020 Release
OpsRamp Platform Winter 2020 ReleaseOpsRamp
 
Six IIoT Success Stories Powered by the PI System
Six IIoT Success Stories Powered by the PI SystemSix IIoT Success Stories Powered by the PI System
Six IIoT Success Stories Powered by the PI SystemOSIsoft, LLC
 
10 IT Infrastructure & Operations Trends for 2019
10 IT Infrastructure & Operations Trends for 201910 IT Infrastructure & Operations Trends for 2019
10 IT Infrastructure & Operations Trends for 2019OpsRamp
 

What's hot (19)

OpsRamp Webinar - State of AIOps for Incident Management and Beyond
OpsRamp Webinar - State of AIOps for Incident Management and BeyondOpsRamp Webinar - State of AIOps for Incident Management and Beyond
OpsRamp Webinar - State of AIOps for Incident Management and Beyond
 
ITSM and ITOM Coming Together
ITSM and ITOM Coming TogetherITSM and ITOM Coming Together
ITSM and ITOM Coming Together
 
Unifying IT with Outcome-Aware AIOps
Unifying IT with Outcome-Aware AIOps  Unifying IT with Outcome-Aware AIOps
Unifying IT with Outcome-Aware AIOps
 
Agile Network India | Agility Day @Noida | SRE & AIOps | Murugan Muthayan
Agile Network India | Agility Day @Noida | SRE & AIOps | Murugan MuthayanAgile Network India | Agility Day @Noida | SRE & AIOps | Murugan Muthayan
Agile Network India | Agility Day @Noida | SRE & AIOps | Murugan Muthayan
 
AIOps, IT Analytics, and Business Performance: What’s Needed and What Works
AIOps, IT Analytics, and Business Performance: What’s Needed and What Works AIOps, IT Analytics, and Business Performance: What’s Needed and What Works
AIOps, IT Analytics, and Business Performance: What’s Needed and What Works
 
AIOps-Driven Network Performance Management: The First Step Toward Self-Heali...
AIOps-Driven Network Performance Management: The First Step Toward Self-Heali...AIOps-Driven Network Performance Management: The First Step Toward Self-Heali...
AIOps-Driven Network Performance Management: The First Step Toward Self-Heali...
 
Context is Critical: How Richer Data Yields Richer Results in AIOps | Bhanu S...
Context is Critical: How Richer Data Yields Richer Results in AIOps | Bhanu S...Context is Critical: How Richer Data Yields Richer Results in AIOps | Bhanu S...
Context is Critical: How Richer Data Yields Richer Results in AIOps | Bhanu S...
 
The future of AIOps
The future of AIOpsThe future of AIOps
The future of AIOps
 
Doing DevOps for Big Data? What You Need to Know About AIOps
Doing DevOps for Big Data? What You Need to Know About AIOpsDoing DevOps for Big Data? What You Need to Know About AIOps
Doing DevOps for Big Data? What You Need to Know About AIOps
 
AIOps in 2020: A Beginner's Guide
AIOps in 2020: A Beginner's GuideAIOps in 2020: A Beginner's Guide
AIOps in 2020: A Beginner's Guide
 
No Ops? Or Yes, Ops! The Future of Operations in a DevOps World
No Ops? Or Yes, Ops! The Future of Operations in a DevOps WorldNo Ops? Or Yes, Ops! The Future of Operations in a DevOps World
No Ops? Or Yes, Ops! The Future of Operations in a DevOps World
 
2019 Performance Monitoring and Management Trends and Insights
2019 Performance Monitoring and Management Trends and Insights2019 Performance Monitoring and Management Trends and Insights
2019 Performance Monitoring and Management Trends and Insights
 
Doing DevOps for Big Data? What You Need to Know About AIOps
Doing DevOps for Big Data? What You Need to Know About AIOpsDoing DevOps for Big Data? What You Need to Know About AIOps
Doing DevOps for Big Data? What You Need to Know About AIOps
 
Bringing AIOps to Hybrid Cloud Monitoring and Management
Bringing AIOps to Hybrid Cloud Monitoring and ManagementBringing AIOps to Hybrid Cloud Monitoring and Management
Bringing AIOps to Hybrid Cloud Monitoring and Management
 
What Does Artificial Intelligence Have to Do with IT Operations?
What Does Artificial Intelligence Have to Do with IT Operations?What Does Artificial Intelligence Have to Do with IT Operations?
What Does Artificial Intelligence Have to Do with IT Operations?
 
OpsRamp Platform Fall Release
OpsRamp Platform Fall ReleaseOpsRamp Platform Fall Release
OpsRamp Platform Fall Release
 
OpsRamp Platform Winter 2020 Release
OpsRamp Platform Winter 2020 ReleaseOpsRamp Platform Winter 2020 Release
OpsRamp Platform Winter 2020 Release
 
Six IIoT Success Stories Powered by the PI System
Six IIoT Success Stories Powered by the PI SystemSix IIoT Success Stories Powered by the PI System
Six IIoT Success Stories Powered by the PI System
 
10 IT Infrastructure & Operations Trends for 2019
10 IT Infrastructure & Operations Trends for 201910 IT Infrastructure & Operations Trends for 2019
10 IT Infrastructure & Operations Trends for 2019
 

Similar to AIOps - Steps Towards Autonomous Operations - AWS Summit Sydney 2019

Automate Security Event Management Using Trust-Based Decision Models - AWS Su...
Automate Security Event Management Using Trust-Based Decision Models - AWS Su...Automate Security Event Management Using Trust-Based Decision Models - AWS Su...
Automate Security Event Management Using Trust-Based Decision Models - AWS Su...Amazon Web Services
 
AWS Summit Singapore 2019 | Realising Business Value with AWS Analytics Services
AWS Summit Singapore 2019 | Realising Business Value with AWS Analytics ServicesAWS Summit Singapore 2019 | Realising Business Value with AWS Analytics Services
AWS Summit Singapore 2019 | Realising Business Value with AWS Analytics ServicesAWS Summits
 
Automatic-Labelling-and-Model-Tuning-with-Amazon-SageMaker
Automatic-Labelling-and-Model-Tuning-with-Amazon-SageMakerAutomatic-Labelling-and-Model-Tuning-with-Amazon-SageMaker
Automatic-Labelling-and-Model-Tuning-with-Amazon-SageMakerAmazon Web Services
 
Monitoring with container insights & anomaly detection - SVC308 - New York AW...
Monitoring with container insights & anomaly detection - SVC308 - New York AW...Monitoring with container insights & anomaly detection - SVC308 - New York AW...
Monitoring with container insights & anomaly detection - SVC308 - New York AW...Amazon Web Services
 
Automatic Labelling and Model Tuning with Amazon SageMaker - AWS Summit Sydney
Automatic Labelling and Model Tuning with Amazon SageMaker - AWS Summit SydneyAutomatic Labelling and Model Tuning with Amazon SageMaker - AWS Summit Sydney
Automatic Labelling and Model Tuning with Amazon SageMaker - AWS Summit SydneyAmazon Web Services
 
The Theory and Practice, Practice, Practice of AWS Operations - AWS Summit Sy...
The Theory and Practice, Practice, Practice of AWS Operations - AWS Summit Sy...The Theory and Practice, Practice, Practice of AWS Operations - AWS Summit Sy...
The Theory and Practice, Practice, Practice of AWS Operations - AWS Summit Sy...Amazon Web Services
 
Optimize deep learning training and inferencing using GPU and Amazon SageMake...
Optimize deep learning training and inferencing using GPU and Amazon SageMake...Optimize deep learning training and inferencing using GPU and Amazon SageMake...
Optimize deep learning training and inferencing using GPU and Amazon SageMake...Amazon Web Services
 
Predicting Demand In A Diverse Retail Environment - AWS Summit Sydney
Predicting Demand In A Diverse Retail Environment - AWS Summit SydneyPredicting Demand In A Diverse Retail Environment - AWS Summit Sydney
Predicting Demand In A Diverse Retail Environment - AWS Summit SydneyAmazon Web Services
 
Rendi le tue app più smart con i servizi AI di AWS
Rendi le tue app più smart con i servizi AI di AWSRendi le tue app più smart con i servizi AI di AWS
Rendi le tue app più smart con i servizi AI di AWSAmazon Web Services
 
Industry 4.0 in the cloud - SVC214 - Chicago AWS Summit
Industry 4.0 in the cloud - SVC214 - Chicago AWS SummitIndustry 4.0 in the cloud - SVC214 - Chicago AWS Summit
Industry 4.0 in the cloud - SVC214 - Chicago AWS SummitAmazon Web Services
 
AWS IoT services - Extract value for industrial applications - SVC205 - Santa...
AWS IoT services - Extract value for industrial applications - SVC205 - Santa...AWS IoT services - Extract value for industrial applications - SVC205 - Santa...
AWS IoT services - Extract value for industrial applications - SVC205 - Santa...Amazon Web Services
 
Threat detection and mitigation at AWS - SEC201 - Atlanta AWS Summit
Threat detection and mitigation at AWS - SEC201 - Atlanta AWS SummitThreat detection and mitigation at AWS - SEC201 - Atlanta AWS Summit
Threat detection and mitigation at AWS - SEC201 - Atlanta AWS SummitAmazon Web Services
 
The Zen of governance - Establish guardrails and empower builders - SVC201 - ...
The Zen of governance - Establish guardrails and empower builders - SVC201 - ...The Zen of governance - Establish guardrails and empower builders - SVC201 - ...
The Zen of governance - Establish guardrails and empower builders - SVC201 - ...Amazon Web Services
 
Take action on your security & compliance alerts with AWS Security Hub - SEC2...
Take action on your security & compliance alerts with AWS Security Hub - SEC2...Take action on your security & compliance alerts with AWS Security Hub - SEC2...
Take action on your security & compliance alerts with AWS Security Hub - SEC2...Amazon Web Services
 
Introduction to AWS Global Accelerator - SVC212 - New York AWS Summit
Introduction to AWS Global Accelerator - SVC212 - New York AWS SummitIntroduction to AWS Global Accelerator - SVC212 - New York AWS Summit
Introduction to AWS Global Accelerator - SVC212 - New York AWS SummitAmazon Web Services
 
Drive digital transformation with AI
Drive digital transformation with AIDrive digital transformation with AI
Drive digital transformation with AIAmazon Web Services
 
Cloud Operating Models for Accelerated Cloud Transformation - AWS Summit Sydney
Cloud Operating Models for Accelerated Cloud Transformation - AWS Summit SydneyCloud Operating Models for Accelerated Cloud Transformation - AWS Summit Sydney
Cloud Operating Models for Accelerated Cloud Transformation - AWS Summit SydneyAmazon Web Services
 
AWS IoT services: Extract value for industrial applications - SVC202 - Mexico...
AWS IoT services: Extract value for industrial applications - SVC202 - Mexico...AWS IoT services: Extract value for industrial applications - SVC202 - Mexico...
AWS IoT services: Extract value for industrial applications - SVC202 - Mexico...Amazon Web Services
 
Build Accurate Training Datasets with Amazon SageMaker Ground Truth - AIM302 ...
Build Accurate Training Datasets with Amazon SageMaker Ground Truth - AIM302 ...Build Accurate Training Datasets with Amazon SageMaker Ground Truth - AIM302 ...
Build Accurate Training Datasets with Amazon SageMaker Ground Truth - AIM302 ...Amazon Web Services
 
AWS及客戶在AI/ML的數位運行過程中得到的重要經驗與學習
AWS及客戶在AI/ML的數位運行過程中得到的重要經驗與學習AWS及客戶在AI/ML的數位運行過程中得到的重要經驗與學習
AWS及客戶在AI/ML的數位運行過程中得到的重要經驗與學習Amazon Web Services
 

Similar to AIOps - Steps Towards Autonomous Operations - AWS Summit Sydney 2019 (20)

Automate Security Event Management Using Trust-Based Decision Models - AWS Su...
Automate Security Event Management Using Trust-Based Decision Models - AWS Su...Automate Security Event Management Using Trust-Based Decision Models - AWS Su...
Automate Security Event Management Using Trust-Based Decision Models - AWS Su...
 
AWS Summit Singapore 2019 | Realising Business Value with AWS Analytics Services
AWS Summit Singapore 2019 | Realising Business Value with AWS Analytics ServicesAWS Summit Singapore 2019 | Realising Business Value with AWS Analytics Services
AWS Summit Singapore 2019 | Realising Business Value with AWS Analytics Services
 
Automatic-Labelling-and-Model-Tuning-with-Amazon-SageMaker
Automatic-Labelling-and-Model-Tuning-with-Amazon-SageMakerAutomatic-Labelling-and-Model-Tuning-with-Amazon-SageMaker
Automatic-Labelling-and-Model-Tuning-with-Amazon-SageMaker
 
Monitoring with container insights & anomaly detection - SVC308 - New York AW...
Monitoring with container insights & anomaly detection - SVC308 - New York AW...Monitoring with container insights & anomaly detection - SVC308 - New York AW...
Monitoring with container insights & anomaly detection - SVC308 - New York AW...
 
Automatic Labelling and Model Tuning with Amazon SageMaker - AWS Summit Sydney
Automatic Labelling and Model Tuning with Amazon SageMaker - AWS Summit SydneyAutomatic Labelling and Model Tuning with Amazon SageMaker - AWS Summit Sydney
Automatic Labelling and Model Tuning with Amazon SageMaker - AWS Summit Sydney
 
The Theory and Practice, Practice, Practice of AWS Operations - AWS Summit Sy...
The Theory and Practice, Practice, Practice of AWS Operations - AWS Summit Sy...The Theory and Practice, Practice, Practice of AWS Operations - AWS Summit Sy...
The Theory and Practice, Practice, Practice of AWS Operations - AWS Summit Sy...
 
Optimize deep learning training and inferencing using GPU and Amazon SageMake...
Optimize deep learning training and inferencing using GPU and Amazon SageMake...Optimize deep learning training and inferencing using GPU and Amazon SageMake...
Optimize deep learning training and inferencing using GPU and Amazon SageMake...
 
Predicting Demand In A Diverse Retail Environment - AWS Summit Sydney
Predicting Demand In A Diverse Retail Environment - AWS Summit SydneyPredicting Demand In A Diverse Retail Environment - AWS Summit Sydney
Predicting Demand In A Diverse Retail Environment - AWS Summit Sydney
 
Rendi le tue app più smart con i servizi AI di AWS
Rendi le tue app più smart con i servizi AI di AWSRendi le tue app più smart con i servizi AI di AWS
Rendi le tue app più smart con i servizi AI di AWS
 
Industry 4.0 in the cloud - SVC214 - Chicago AWS Summit
Industry 4.0 in the cloud - SVC214 - Chicago AWS SummitIndustry 4.0 in the cloud - SVC214 - Chicago AWS Summit
Industry 4.0 in the cloud - SVC214 - Chicago AWS Summit
 
AWS IoT services - Extract value for industrial applications - SVC205 - Santa...
AWS IoT services - Extract value for industrial applications - SVC205 - Santa...AWS IoT services - Extract value for industrial applications - SVC205 - Santa...
AWS IoT services - Extract value for industrial applications - SVC205 - Santa...
 
Threat detection and mitigation at AWS - SEC201 - Atlanta AWS Summit
Threat detection and mitigation at AWS - SEC201 - Atlanta AWS SummitThreat detection and mitigation at AWS - SEC201 - Atlanta AWS Summit
Threat detection and mitigation at AWS - SEC201 - Atlanta AWS Summit
 
The Zen of governance - Establish guardrails and empower builders - SVC201 - ...
The Zen of governance - Establish guardrails and empower builders - SVC201 - ...The Zen of governance - Establish guardrails and empower builders - SVC201 - ...
The Zen of governance - Establish guardrails and empower builders - SVC201 - ...
 
Take action on your security & compliance alerts with AWS Security Hub - SEC2...
Take action on your security & compliance alerts with AWS Security Hub - SEC2...Take action on your security & compliance alerts with AWS Security Hub - SEC2...
Take action on your security & compliance alerts with AWS Security Hub - SEC2...
 
Introduction to AWS Global Accelerator - SVC212 - New York AWS Summit
Introduction to AWS Global Accelerator - SVC212 - New York AWS SummitIntroduction to AWS Global Accelerator - SVC212 - New York AWS Summit
Introduction to AWS Global Accelerator - SVC212 - New York AWS Summit
 
Drive digital transformation with AI
Drive digital transformation with AIDrive digital transformation with AI
Drive digital transformation with AI
 
Cloud Operating Models for Accelerated Cloud Transformation - AWS Summit Sydney
Cloud Operating Models for Accelerated Cloud Transformation - AWS Summit SydneyCloud Operating Models for Accelerated Cloud Transformation - AWS Summit Sydney
Cloud Operating Models for Accelerated Cloud Transformation - AWS Summit Sydney
 
AWS IoT services: Extract value for industrial applications - SVC202 - Mexico...
AWS IoT services: Extract value for industrial applications - SVC202 - Mexico...AWS IoT services: Extract value for industrial applications - SVC202 - Mexico...
AWS IoT services: Extract value for industrial applications - SVC202 - Mexico...
 
Build Accurate Training Datasets with Amazon SageMaker Ground Truth - AIM302 ...
Build Accurate Training Datasets with Amazon SageMaker Ground Truth - AIM302 ...Build Accurate Training Datasets with Amazon SageMaker Ground Truth - AIM302 ...
Build Accurate Training Datasets with Amazon SageMaker Ground Truth - AIM302 ...
 
AWS及客戶在AI/ML的數位運行過程中得到的重要經驗與學習
AWS及客戶在AI/ML的數位運行過程中得到的重要經驗與學習AWS及客戶在AI/ML的數位運行過程中得到的重要經驗與學習
AWS及客戶在AI/ML的數位運行過程中得到的重要經驗與學習
 

More from Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateAmazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSAmazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsAmazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareAmazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSAmazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAmazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareAmazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 

More from Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

AIOps - Steps Towards Autonomous Operations - AWS Summit Sydney 2019

  • 1. S U M M I T SYDNEY
  • 2. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T AIOps: Steps towards autonomous operations Sri (Srichakri) Nadendla Enterprise Solutions Architect Amazon Web Services
  • 3. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Agenda • Effective operational practices • Enablers for autonomous operations • Demo using Amazon Sagemaker
  • 4. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Operations School of hard knocks
  • 5. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Whatever can go wrong, will go wrong.
  • 6. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Operations objectives Keep it safe Keep the lights on Reliability (Availability + Performance) Security
  • 7. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Ops holy grail Prevent Correct Baseline
  • 8. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Enablers to effective operations Collection Patterns Actions
  • 9. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Foundations to observability Alerts and notifications Data, tools and patterns Ingestion Metrics, events and logs Threat intel Budgets Planned events
  • 10. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Step 1 – Enable instrumentation Metrics, events and logs
  • 11. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T AWS services in the context Instrumentation
  • 12. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Step 2 – Ingest and store Data storage Ingestion Metrics, events and logs
  • 13. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T AWS services in the context Instrumentation Ingestion Storage
  • 14. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Step 3 – Query and pattern mining Data Storage Ingestion Metrics, events and logs Threat intel Budgets Planned events Analysis
  • 15. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T AWS services in the context Instrumentation Ingestion Storage Analysis
  • 16. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Step 4 – Alerts and Remediation Alerts & notifications Data, tools & patterns Ingestion Metrics, events and logs Threat intel Budgets Planned events
  • 17. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T AWS services in the context Instrumentation Ingestion Storage Analysis Alerts & Actions
  • 18. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Some challenges Metrics galore Dashboards fatigue Manual correlation Static thresholds
  • 19. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Responsiveness matters Root cause identification Dynamic detection Proactive remediation
  • 20. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Path to autonomous operations Data, tools & patterns Ingestion Metrics, events and logs Threat intel Budgets Planned events Predictive, actionable and automated remediation
  • 21. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Key techniques Correlation Anomaly detection Forecasting
  • 22. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Amazon SageMaker: Build, train, and deploy machine learning models at scale 1 2 3
  • 23. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 24. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Artificial Intelligence amplifies the possibilities of human-machine collaboration
  • 25. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Additional considerations Shift left (CI/CD) Runbook invocation Knowledge assist
  • 26. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T GOOD intentions should have GOALs Mean time between failures Proactive actions # Problems avoided Time to detect Time to resolve Mean time to recover
  • 27. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T What we are really trying to achieve … Infrastructure Support Innovation Infrastructure Support Innovation Innovation Support ✅
  • 28. Thank you! S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Sri (Srichakri) Nadendla nadendls@amazon.com