PLATFORM
OBSERVABILITY
SCCM
DCM
Symantec EP
Security
DCM
Group Policy
Tanium
Tenable
Adaptiva
Active Directory
Aternity Cloud
Exploit / Credential Guard
Elastic
Winlogbeat
Cisco AnyConnect
Complex Tech Stack
• Challenge #1: Numerous requests (from different channels) and interruptions
• Challenge #2: No easy way to determine which part of the tech stack is causing the issue
• Challenge #3: Recurring issues
• Challenge #4: Long resolution times and lost issues
• Challenge #5: Repetitive, time-consuming tasks
OPS Challenges
Platform Observability
“is when you infer the internal state of a system only by observing the data it generates, such as logs, metrics, and traces”
When observability is implemented well, a system will not require operations teams to spend much effort on understanding its internal state.
Monitoring: something you do to determine the state of an application, a system, a service…
Observability: based on the acquisition of data that allows you to ask questions you didn’t know you had and solve problems you never thought of
Observability is all about data: at scale, at speed, and analytics-driven
Monitoring tells you when something is wrong. Observability lets you ask why.
Monitoring: Things we are aware of but DON’T understand
Observability: Things we are NOT aware of and DON’T understand
Observability Impacts & Importance on Effective SRE/DevOps
Cloud Migration
• Hybrid cloud monitoring
• Cloud cost management
• Cloud capacity planning
Cloud Monitoring
• Cloud services monitoring
• Kubernetes & container monitoring
• Serverless monitoring
• KPI monitoring with custom metrics
• Observability-as-a-service
App Optimization
• Application modernization
• Microservices monitoring & troubleshooting
• Business SLA monitoring
• DevOps application lifecycle monitoring
Higher visibility
Better workflow
Faster alerts
Finding out unknown issues
Reduced operational cost
Increased developer velocity
Three Pillars of Observability
Metrics: What is happening? (Detect)
• quantify performance
• produce alerts, such as when a system is down or load balancers reach capacity
• monitor events for anomalous activities
Traces: Where is it happening? (Troubleshoot)
• API Queries
• Server-to-Server Workload
• Internal API Calls
• Frontend API Traffic
Events / Logs: Why is it happening? (Pinpoint)
• answer the “who, what, where, when, and how”
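As a rough illustration of how the three signal types differ, the sketch below has one fake request handler emit a metric, a trace span, and a structured log event. Everything here is hypothetical and not from the slides: the `handle_request` function, the in-memory `metrics`, `spans`, and `logs` sinks, and the field names stand in for whatever telemetry backend a platform actually uses.

```python
import json
import time
import uuid
from collections import Counter

# Hypothetical in-memory sinks standing in for a real telemetry backend.
metrics = Counter()
spans = []
logs = []


def handle_request(path, fail=False):
    """Handle one fake request and emit a metric, a trace span, and a log event."""
    trace_id = uuid.uuid4().hex
    start = time.perf_counter()
    status = 500 if fail else 200

    # Metric: a counter helps answer "what is happening?" (e.g. error volume).
    metrics[(path, status)] += 1

    # Trace span: an ID plus timing helps answer "where is it happening?".
    duration_ms = (time.perf_counter() - start) * 1000
    spans.append({"trace_id": trace_id, "name": path, "duration_ms": duration_ms})

    # Log event: structured context helps answer "why is it happening?".
    logs.append(json.dumps({
        "trace_id": trace_id,
        "path": path,
        "status": status,
        "message": "upstream timeout" if fail else "ok",
    }))
    return trace_id


ok_id = handle_request("/login")
bad_id = handle_request("/login", fail=True)
```

Sharing one `trace_id` across the span and the log event is the design choice that matters: it lets an operator move from an alerting metric to the slow span and then to the log line explaining it.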
Centralized Observability – Key benefits
Current Benefits
• Faster issue detection and resolution for improved end-user satisfaction
• Improved performance and platform reliability
• Lower infrastructure costs through right-sizing and identifying bottlenecks
• Improved time to market by enabling agile, frequent product delivery through automated CI/CD pipelines without impacting quality
Outlook (Gartner)
• The observability landscape is in its early stages. Enterprises are becoming frustrated with the limitations of monitoring tools and, despite decades of investment in them, continue to rely on customers to notice outages
• By 2024, over 30% of enterprises implementing distributed systems will have adopted observability techniques to improve service performance, up from less than 10% in 2020.
Observability Stack
• Log Analysis
• Real User Monitoring
• Infrastructure Monitoring
• Search
• Dashboards
• Machine Learning
• Alerting
• Logs
• Metrics
• Traces
Applications
Infrastructure
Cloud
Middleware
Network
Social
Business
Etc.
• Proactively detect issues
• Reduce MTTD and MTTR
• Continually optimize
• Deliver on SLAs
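To make "Reduce MTTD and MTTR" measurable, a minimal sketch of how the two figures could be computed from incident timestamps follows. The incident records are invented, and the definitions are an assumption: MTTD is taken as the mean time from incident start to detection, and MTTR as the mean time from incident start to resolution (some teams measure MTTR from detection instead).

```python
from datetime import datetime, timedelta

# Hypothetical incident records: (started, detected, resolved).
incidents = [
    (datetime(2024, 1, 1, 9, 0), datetime(2024, 1, 1, 9, 10), datetime(2024, 1, 1, 10, 0)),
    (datetime(2024, 1, 2, 14, 0), datetime(2024, 1, 2, 14, 20), datetime(2024, 1, 2, 14, 50)),
]


def mean_delta(deltas):
    """Average a collection of timedelta values."""
    deltas = list(deltas)
    return sum(deltas, timedelta()) / len(deltas)


# Mean time to detect: incident start until detection.
mttd = mean_delta(detected - started for started, detected, _ in incidents)
# Mean time to resolve: incident start until resolution (assumed definition).
mttr = mean_delta(resolved - started for started, _, resolved in incidents)
```

Tracking both numbers per quarter gives a simple way to check whether the observability stack is actually shortening detection and resolution.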
Any Data / Any Source
Open Access
Data
Common data model
UI and Use Cases
Continuous Feedback Loop
DEV Ops
Monitoring
Service Desk
Issue
Triage
Postmortem
Incident Review
Error Budget decisions
Monitor data using dashboards
Proactive reaction to automated alarms
Application Metrics and preventive prediction of application load
Postmortem Analysis
Avoidance of future errors
Simplification of development process
Acceleration of Provisioning
CI/CD pipelines
Monitor data using dashboards
Proactive reaction to automated alarms
Application Metrics and preventive prediction of application load
Releasing new features
Expected system changes
Planned downtime
Service Level Objectives and Indicators (SLOs and SLIs)
• Define service quality in advance
• Determine metrics and indicators
Risk Acceptance and Mitigation Plan
• Assess risk viability and reliability
• Evaluate status against the pre-set standard
Automation
• Automate repetitive tasks
• Ensure faster delivery and execution of tasks
Proactive Monitoring
• Prepare a continuous improvement plan to minimize incidents
• Establish well-monitored environments
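The SLO, SLI, and error budget items on this slide can be tied together with a small calculation. The sketch below is illustrative only: the `error_budget` helper, the 99.9% target, and the request counts are assumptions, and the SLI is assumed to be a simple success ratio over a measurement window.

```python
def error_budget(slo_target, total_requests, failed_requests):
    """Return (sli, allowed_failures, remaining_budget_fraction).

    Assumes the SLI is the fraction of successful requests in the window
    and that slo_target is strictly below 1.0.
    """
    sli = (total_requests - failed_requests) / total_requests
    # The error budget is the share of requests the SLO allows to fail.
    allowed_failures = (1 - slo_target) * total_requests
    remaining = 1 - failed_requests / allowed_failures
    return sli, allowed_failures, remaining


# Hypothetical window: 1,000,000 requests, 400 failures, 99.9% SLO.
sli, allowed, remaining = error_budget(0.999, 1_000_000, 400)
```

With these made-up numbers the SLO allows roughly 1,000 failures, so about 60% of the budget is left. A team might use that remaining fraction to decide whether to keep releasing new features or to pause and focus on reliability.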
Observability is a Framework
• Observability is not a tool but a framework that enables quick interrogation of services to identify the underlying cause of issues, even when those issues have never occurred before. It has great potential to improve operational efficiency and, ultimately, the overall stability of our platforms.
• The more data points ingested into the observability platform, the more insights and value we can gain from it.
• It requires a willingness to adopt new ways of working, shifting away from existing siloed technical domains that lack transparency toward full visibility into the inner workings of the platforms that the Workplace and Collaboration teams depend on.
Quotes on Monitoring vs Observability
“Monitoring is a single plane for the most part. You set up rules, aggregations, and alerts on when a known scenario plays out (e.g., a trajectory towards 100% disk usage is an indication of an issue in the imminent future). Observability, on the other hand, is the means to map an environment or context and the ability to fluidly traverse that map, thus reaching a greater awareness of ‘what is.’” - Ryan Sheldrake, field CTO, Lacework
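The disk-usage example in the quote above is a typical monitoring rule for a known scenario. A minimal sketch of such a rule follows, assuming hourly usage samples and a plain least-squares trend line; the `hours_until_full` helper, the sample values, and the 24-hour alert threshold are all hypothetical.

```python
def hours_until_full(samples):
    """Estimate hours from the last sample until usage reaches 100%.

    samples: list of (hour, used_percent) pairs.
    Returns None when there is no upward trend to extrapolate.
    """
    n = len(samples)
    mean_t = sum(t for t, _ in samples) / n
    mean_u = sum(u for _, u in samples) / n
    var_t = sum((t - mean_t) ** 2 for t, _ in samples)
    if var_t == 0:
        return None  # all samples taken at the same time
    # Least-squares slope: percentage points gained per hour.
    slope = sum((t - mean_t) * (u - mean_u) for t, u in samples) / var_t
    if slope <= 0:
        return None  # usage is flat or shrinking
    intercept = mean_u - slope * mean_t
    last_t = samples[-1][0]
    return (100 - intercept) / slope - last_t


# Hypothetical samples: usage grows two percentage points per hour.
samples = [(0, 70.0), (1, 72.0), (2, 74.0), (3, 76.0)]
eta = hours_until_full(samples)
alert = eta is not None and eta < 24  # alert if the disk fills within a day
```

A rule like this only catches the scenario it was written for, which is the limitation the quote contrasts with observability.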
“Observability is about putting mechanisms in place that allow teams to actively debug their system. It is based on exploring properties and patterns not defined in advance. The main purpose of observability is to use the system’s outputs to gather insights and act on them.” - Parveen Arora, co-founder and director, VVnT SeQuor
“The key difference between observability and monitoring is that compared to monitoring, site observability gives a more complete assessment of the overall environment in which the application resides and hence is more effective in fulfilling the key success factor for an application – that of site reliability.” - Sushant Mehta, senior manager application development, Diyar United Company
“Monitoring is the process of using observability. When monitoring occurs, one has already decided which events and applications will be tracked. Observability creates the potential to monitor different events along the pipeline and the overall software development lifecycle. As processes get built, the potential for observability should be included across a broad spectrum. Monitoring finds specific events across the system and creates artifacts and reports that can be integrated into overall metrics.” - Mark Peters, technical lead, Novetta
Ref: https://enterprisersproject.com/article/2021/9/devops-monitoring-vs-observability


What is Platform Observability? An Overview


Editor's Notes

  1. Observability is the ability to understand a system's internal state by analyzing the data it generates, such as logs, metrics, and traces