Learn to leverage AI to streamline your cloud operations with Anomaly Detection and Root Cause Analysis.
Use AI to maintain cloud governance, optimize cloud workloads, and simplify cloud operations. Find out more, request a demo, or set up a free trial at www.yotascale.com.
Human Factors of XR: Using Human Factors to Design XR Systems
Managing AWS Costs with Anomaly Detection and Root Cause Analysis
1. Autonomous Cloud Operations
Managing AWS Costs with Anomaly
Detection and Root Cause Analysis
Imran Moin
Chief Product Officer
imran@yotascale.com
2. AGENDA
• Company Overview
• Key Pain Points in managing AWS cloud infrastructure
• YotaScale Solution Overview
• Deep dive into Anomaly Detection
• Real world cost anomalies found by YotaScale
• Live Product Demonstration
• Q&A
10. • Detect Anomalies
• Root Cause Analysis
• Intelligent Workflow
Anomaly Detection
Live Monitoring
PREVENT RUNAWAY COSTS
• Contextually aware
corrective action
• Deep library of best
practices
• EC2 & PaaS Support
Continuous
Optimization
Up to 40% Savings
OPTIMALLY EFFICIENT
INFRASTRUCTURE
• Scorecard
• 100% tag hygiene
• Slice and dice analysis
• Accountability
& transparency
Contextual
Analytics
Org Benchmark
ACCOUNTABILITY
& TRANSPARENCY
Through the use of
machine learning,
YotaScale processes
millions of data signals
and provides
contextually relevant
anomaly detection and
optimization
recommendations that
reduce your cloud spend
11. YotaScale Anomaly Detection Overview
● YotaScale’s ML/AI powered Anomaly Detection can detect cost anomalies happening across
any possible dimension
● Customers get alerted real-time via Email, Slack, etc.
● Quick time to resolution due to YotaScale’s Root Cause Analysis (RCA)
DETECT/ ALERT
Detect and Alert on
real-time cost anomalies
PROVIDE RCA
Provide Root Cause on
what caused that anomaly
REMEDIATE
Suggest possible fixes to
the customer
12. Key Features for Anomaly Detection
Identify and Customize
Anomalies
● Sophisticated ML Models
● Customizable Dimensions
● Severity Per Anomaly
Provide Root Cause Analysis
● ML Models find correlations /
causations for each anomaly
● Linked to business events
(positive or negative)
Suggest Possible Fixes
● Identify Solutions
● Manual Scripts
● Approval based
implementation
● Automation
Workflow Integration
● Single Sign-On (SSO)
● Slack Integration
● JIRA Integration
13. Closed Feedback Loop on Anomaly Models
● Customer actions for each cost anomaly
○ Dismiss
○ Resolve
○ Snooze
● Anomaly ML models fine-tuned based on customer feedback
Actions for every
anomaly
Dismiss
Anomalies
Resolve
Anomalies
14. Remediation for Cost Anomalies (Future Roadmap)
Out of Band
(manual)
instructions
Out of Band
(manual) script
In Band
(manual) Script
Approval Based
Implementation
Automation
(Autonomous
execution by
YotaScale)
15. With real-time anomaly detection, root
cause and remediation YotaScale caught
this anomaly in time and saved
thousands of dollars.
“Our virus scanning
engine died. We could
not figure out the right
host and in the process
spun up hundreds of
machines. YotaScale
detected the issue in
realtime.”
Jonathan Monette
Senior Architect
“Our virus scanning
engine died. We could
not figure out the right
host and in the process
spun up hundreds of
machines. YotaScale
detected the issue in
real-time.”
Senior Application Architect
YotaScale’s Anomaly
Detection discovers
applications and services
and alerts you to
significant changes.
16. ANOMALY DETECTION ROOT CAUSE SAVES DAYS OF TROUBLESHOOTING
“Our API gateway team
saw an unusual amount of
requests resulting in a
huge spike in resource
provisioning. YotaScale
pinpointed the exact issue
saving valuable cycles”
YotaScale was able to pinpoint the exact
issue and save days of investigative work
on where to go look.John Smithan
Lead Site Reliability Engineer
“Our API gateway team
saw an unusual amount of
requests resulting in a
huge spike in resource
provisioning. YotaScale
pinpointed the exact issue
saving valuable cycles”
Lead Site Reliability Engineer
Going beyond alerting,
YotaScale can provide a
detailed analysis of the
resources that caused an
anomaly.
17. Key Benefits of Anomaly Detection
1. Get real-time notifications about any unusual cost spikes across any business
critical dimension
2. Serves as insurance policy against runaway cloud costs - can save up to
10-20% of yearly cloud spend
3. Helps troubleshoot root cause of cost spikes and save valuable time for
CloudOps, Finance and Engineering teams