SlideShare a Scribd company logo
1 of 27
Download to read offline
Data Rules for Observability
Dave McAllister - NGINX
Every company is on a cloud journey
To increase velocity, agility and responsiveness
Retain & Optimize Lift & Shift Re-Factor Re-Architect /
Cloud-Native
DEV OPS DEV OPS DEV OPS DEV OPS
Cloud Managed e.g. RDS,
DynamoDB, SaaS
Cloud First Architecture
Tightly Coupled Apps,
Slow Deployment Cycles
Primarily using
Cloud IaaS
More Modular, but
Dependent App Components
Loosely Coupled Microservices,
and Serverless Functions
VM VM VM
VM VM VM VM VM VM
Private Public
VM VM VM VM VM VM
Private Public Private Public
“By 2025, 85% of organizations will run containers in production, up from less than 30% in 2020” – Gartner, Dec 14, 2020
Observability Challenges
● Microservices create complex interactions.
● Failures don't exactlyrepeat.
● Debugging multitenancy ispainful.
● Monitoring no longer can save us alone
Cynefin Framework
4
© 2 0 2 0 S P L U N K I N C .
Use all of your data
to avoid blind spots
©2021 F5
6
The more observable a system,
the quicker we can understand
why it’s acting up and fix it
Observability is a Data
Problem
Metrics
Do I have
a problem?
Traces
Where is the
problem?
Logs
Why is the problem
happening?
Observability
DETECT TROUBLESHOOT ROOT CAUSE
Full-Stack Visibility
& Context-Rich Insights
Data is the driving factor for
observability
• AI/ML-driven Directed Troubleshooting
• Unlimited Cardinality
• Streaming data, incl. Monitoring and Alerting
• Full-fidelity metrics and traces
• Open standards, open source data ingest
Dealing with the noise
• Filter the Signals
• Linear, Low-pass, Band-pass, All-pass
• Sample the signals
• Random, Head-based, Tail-based,
Post-predictive, Dimensionality reduction
• Improve the visualization
• Smart aggregation
Problems with observability sampling
• Leads to alert storms caused by the
cascading nature of failures
• Leads to needle-in-the-haystack scenarios
and long MTTR
• Siloed infrastructure and application insights
• Routinely miss trace data when
troubleshooting edge cases and intermittent
issues
Trace sampling
No awareness of service
dependencies
Simplistic triaging
Siloed from infrastructure
Typical trace sampling
Typically observes ~1-5% of transactions
Byte-Code
Agent
Head-Based Sampling
Microservices
Tail-based sampling misses too…
START TRACE END TRACE
Sampling No Sampling
But wait! My metrics tell me everything
Your metrics are usually not sampled,
for your infrastructure
But can be for your application traces
Leading to bad duration results and
potential missed alerts
TL;DR: Data Completeness
• Your ability to use observability is dependent on your data integrity
• Don’t let the “chosen data” bias your results
• Keep it all. Otherwise you can’t track customer happiness
• Real-time matters
©2021 F5
14
Operate at the speed and
resolution of your app and
infrastructure
©2021 F5
15
The resolution and speed of the data directly
impact the insights you gain
©2021 F5
16
• Interchangeable?
• Accuracy is that the measure
is correct
• Precise means it is consistent
with other measurements
Observability depends on both
But aggregation and analysis
can skew this
Discussing accuracy
and precision
Missing the point
10 sec average =13.9
95% = 27.05
First 5 sec average =16.4
95% = 29.2
Second 5 sec average =11.4
95% = 19.4
Data resolution ≠ Reporting resolution
• But both can be problematic
• Always deliver all data points regardless of reporting
• Finer granularity means more potential precision
Minute Minute
Second Second
Area of actuality
Adding in concepts of native and chart
resolution
• In Observability
• Native resolution is our data collection interval
• Chart resolution is the aggregation points that our
graphs use
Hint: we want speedy data collection and sufficient chart
resolution
Complexity
Drift and Skew
• Ephemeral Behavior
• Cloud-compute Elasticity
©2021 F5
21
Accurate
Timestamps?
• Network latencies get
lower
• Event frequencies are
higher
• Chrony on AWS/GCP
• ~10s to 100s of
accuracy
• May not always order
events properly
Image: ClockWork.io
©2021 F5
22
Aligned
traces?
• Spans may start ahead of parent
spans starts
• Spans may start after parent span
ends
• Span durations can be impacted,
resulting in lack of precision
Image: ClockWork.io
©2021 F5
23
Predictive and
response alerting
Predictive behavior
• Sometimes you want to know what’s coming
• Prediction is only as good as the data precision
and accuracy
• Historic versus Sudden Change
• (Trend) Stationary
• Expect false positives (and negatives)
TL;DR: Data Preciseness
• Observability is only as useful as your data's precision and accuracy
• Your consideration of the data needs to account for elastic, ephemeral
and skew
• Prediction is a target, but
Keep in mind the difference between extrapolation and interpolation
The most effective debugging tool is still careful thought, coupled
with judiciously placed print statements.
-Brian Kernighan Unix for Beginners 1979
Observability is the new print statement
Closing Thoughts
Thanks for listening
• https://www.linkedin.com/in/davemc

More Related Content

Similar to Short Data Rules for Observability.pdf

Cloud Cmputing Security
Cloud Cmputing SecurityCloud Cmputing Security
Cloud Cmputing SecurityDevyani Vaidya
 
Serverless Architecture
Serverless ArchitectureServerless Architecture
Serverless ArchitectureDirk Weibel
 
Observability - The good, the bad and the ugly Xp Days 2019 Kiev Ukraine
Observability -  The good, the bad and the ugly Xp Days 2019 Kiev Ukraine Observability -  The good, the bad and the ugly Xp Days 2019 Kiev Ukraine
Observability - The good, the bad and the ugly Xp Days 2019 Kiev Ukraine Aleksandr Tavgen
 
Using Time Series for Full Observability of a SaaS Platform
Using Time Series for Full Observability of a SaaS PlatformUsing Time Series for Full Observability of a SaaS Platform
Using Time Series for Full Observability of a SaaS PlatformDevOps.com
 
Making Money in the Cloud
Making Money in the CloudMaking Money in the Cloud
Making Money in the CloudGravitant, Inc.
 
ThousandEyes EMEA - Why 74% of IT Teams Are Not Ready for the Cloud
ThousandEyes EMEA - Why 74% of IT Teams Are Not Ready for the CloudThousandEyes EMEA - Why 74% of IT Teams Are Not Ready for the Cloud
ThousandEyes EMEA - Why 74% of IT Teams Are Not Ready for the CloudThousandEyes
 
Kalix: Tackling the The Cloud to Edge Continuum
Kalix: Tackling the The Cloud to Edge ContinuumKalix: Tackling the The Cloud to Edge Continuum
Kalix: Tackling the The Cloud to Edge ContinuumJonas Bonér
 
Maturing IoT solutions with Microsoft Azure (Sam Vanhoutte & Glenn Colpaert a...
Maturing IoT solutions with Microsoft Azure (Sam Vanhoutte & Glenn Colpaert a...Maturing IoT solutions with Microsoft Azure (Sam Vanhoutte & Glenn Colpaert a...
Maturing IoT solutions with Microsoft Azure (Sam Vanhoutte & Glenn Colpaert a...Codit
 
SDN's managing security across the virtual network final
SDN's managing security across the virtual network finalSDN's managing security across the virtual network final
SDN's managing security across the virtual network finalAlgoSec
 
Gartner IT Symposium 2013: Delivering IT-as-a-Service with Cloud Brokering an...
Gartner IT Symposium 2013: Delivering IT-as-a-Service with Cloud Brokering an...Gartner IT Symposium 2013: Delivering IT-as-a-Service with Cloud Brokering an...
Gartner IT Symposium 2013: Delivering IT-as-a-Service with Cloud Brokering an...Gravitant, Inc.
 
SplunkLive! London - Splunk App for Stream & MINT Breakout
SplunkLive! London - Splunk App for Stream & MINT BreakoutSplunkLive! London - Splunk App for Stream & MINT Breakout
SplunkLive! London - Splunk App for Stream & MINT BreakoutSplunk
 
Evaluating the Cloud
Evaluating the CloudEvaluating the Cloud
Evaluating the CloudSociusPartner
 
Event Driven Microservices architecture
Event Driven Microservices architectureEvent Driven Microservices architecture
Event Driven Microservices architectureNikhilBarthwal4
 
Migrating to the Cloud – Is Application Performance Monitoring still required?
Migrating to the Cloud – Is Application Performance Monitoring still required?Migrating to the Cloud – Is Application Performance Monitoring still required?
Migrating to the Cloud – Is Application Performance Monitoring still required?eG Innovations
 
10 Key Steps for Moving from Legacy Infrastructure to the Cloud
10 Key Steps for Moving from Legacy Infrastructure to the Cloud10 Key Steps for Moving from Legacy Infrastructure to the Cloud
10 Key Steps for Moving from Legacy Infrastructure to the CloudNGINX, Inc.
 
Acceleration_and_Security_draft_v2
Acceleration_and_Security_draft_v2Acceleration_and_Security_draft_v2
Acceleration_and_Security_draft_v2Srinivasa Addepalli
 
Cloud Billing: Enabling consumers for pay for what they use
Cloud Billing: Enabling consumers for pay for what they useCloud Billing: Enabling consumers for pay for what they use
Cloud Billing: Enabling consumers for pay for what they useEduardo Mendez Polo
 
Network Centric Cloud: Competing in a IT World with a Telecom Approach
Network Centric Cloud: Competing in a IT World with a Telecom ApproachNetwork Centric Cloud: Competing in a IT World with a Telecom Approach
Network Centric Cloud: Competing in a IT World with a Telecom ApproachEduardo Mendez Polo
 
20160000 Cloud Discovery Event - Cloud Access Security Brokers
20160000 Cloud Discovery Event - Cloud Access Security Brokers20160000 Cloud Discovery Event - Cloud Access Security Brokers
20160000 Cloud Discovery Event - Cloud Access Security BrokersRobin Vermeirsch
 

Similar to Short Data Rules for Observability.pdf (20)

Cloud Cmputing Security
Cloud Cmputing SecurityCloud Cmputing Security
Cloud Cmputing Security
 
Serverless Architecture
Serverless ArchitectureServerless Architecture
Serverless Architecture
 
Observability - The good, the bad and the ugly Xp Days 2019 Kiev Ukraine
Observability -  The good, the bad and the ugly Xp Days 2019 Kiev Ukraine Observability -  The good, the bad and the ugly Xp Days 2019 Kiev Ukraine
Observability - The good, the bad and the ugly Xp Days 2019 Kiev Ukraine
 
Using Time Series for Full Observability of a SaaS Platform
Using Time Series for Full Observability of a SaaS PlatformUsing Time Series for Full Observability of a SaaS Platform
Using Time Series for Full Observability of a SaaS Platform
 
Making Money in the Cloud
Making Money in the CloudMaking Money in the Cloud
Making Money in the Cloud
 
ThousandEyes EMEA - Why 74% of IT Teams Are Not Ready for the Cloud
ThousandEyes EMEA - Why 74% of IT Teams Are Not Ready for the CloudThousandEyes EMEA - Why 74% of IT Teams Are Not Ready for the Cloud
ThousandEyes EMEA - Why 74% of IT Teams Are Not Ready for the Cloud
 
Kalix: Tackling the The Cloud to Edge Continuum
Kalix: Tackling the The Cloud to Edge ContinuumKalix: Tackling the The Cloud to Edge Continuum
Kalix: Tackling the The Cloud to Edge Continuum
 
Maturing IoT solutions with Microsoft Azure (Sam Vanhoutte & Glenn Colpaert a...
Maturing IoT solutions with Microsoft Azure (Sam Vanhoutte & Glenn Colpaert a...Maturing IoT solutions with Microsoft Azure (Sam Vanhoutte & Glenn Colpaert a...
Maturing IoT solutions with Microsoft Azure (Sam Vanhoutte & Glenn Colpaert a...
 
SDN's managing security across the virtual network final
SDN's managing security across the virtual network finalSDN's managing security across the virtual network final
SDN's managing security across the virtual network final
 
Gartner IT Symposium 2013: Delivering IT-as-a-Service with Cloud Brokering an...
Gartner IT Symposium 2013: Delivering IT-as-a-Service with Cloud Brokering an...Gartner IT Symposium 2013: Delivering IT-as-a-Service with Cloud Brokering an...
Gartner IT Symposium 2013: Delivering IT-as-a-Service with Cloud Brokering an...
 
SplunkLive! London - Splunk App for Stream & MINT Breakout
SplunkLive! London - Splunk App for Stream & MINT BreakoutSplunkLive! London - Splunk App for Stream & MINT Breakout
SplunkLive! London - Splunk App for Stream & MINT Breakout
 
Time-resource v&v for complex systems
Time-resource v&v for complex systemsTime-resource v&v for complex systems
Time-resource v&v for complex systems
 
Evaluating the Cloud
Evaluating the CloudEvaluating the Cloud
Evaluating the Cloud
 
Event Driven Microservices architecture
Event Driven Microservices architectureEvent Driven Microservices architecture
Event Driven Microservices architecture
 
Migrating to the Cloud – Is Application Performance Monitoring still required?
Migrating to the Cloud – Is Application Performance Monitoring still required?Migrating to the Cloud – Is Application Performance Monitoring still required?
Migrating to the Cloud – Is Application Performance Monitoring still required?
 
10 Key Steps for Moving from Legacy Infrastructure to the Cloud
10 Key Steps for Moving from Legacy Infrastructure to the Cloud10 Key Steps for Moving from Legacy Infrastructure to the Cloud
10 Key Steps for Moving from Legacy Infrastructure to the Cloud
 
Acceleration_and_Security_draft_v2
Acceleration_and_Security_draft_v2Acceleration_and_Security_draft_v2
Acceleration_and_Security_draft_v2
 
Cloud Billing: Enabling consumers for pay for what they use
Cloud Billing: Enabling consumers for pay for what they useCloud Billing: Enabling consumers for pay for what they use
Cloud Billing: Enabling consumers for pay for what they use
 
Network Centric Cloud: Competing in a IT World with a Telecom Approach
Network Centric Cloud: Competing in a IT World with a Telecom ApproachNetwork Centric Cloud: Competing in a IT World with a Telecom Approach
Network Centric Cloud: Competing in a IT World with a Telecom Approach
 
20160000 Cloud Discovery Event - Cloud Access Security Brokers
20160000 Cloud Discovery Event - Cloud Access Security Brokers20160000 Cloud Discovery Event - Cloud Access Security Brokers
20160000 Cloud Discovery Event - Cloud Access Security Brokers
 

More from Dave McAllister

Murphys laws for Observability
Murphys laws for ObservabilityMurphys laws for Observability
Murphys laws for ObservabilityDave McAllister
 
Working with Hybrid Clouds and Data Architectures
Working with Hybrid Clouds and Data ArchitecturesWorking with Hybrid Clouds and Data Architectures
Working with Hybrid Clouds and Data ArchitecturesDave McAllister
 
Devising your Data Movement Strategy for IoT
Devising your Data Movement Strategy for IoTDevising your Data Movement Strategy for IoT
Devising your Data Movement Strategy for IoTDave McAllister
 
Open Source examples from Adobe : Oscon kiosk
Open Source examples from Adobe : Oscon kioskOpen Source examples from Adobe : Oscon kiosk
Open Source examples from Adobe : Oscon kioskDave McAllister
 
Rootstech-The Basics of Gamification
Rootstech-The Basics of GamificationRootstech-The Basics of Gamification
Rootstech-The Basics of GamificationDave McAllister
 
Ostt eu-what about open-v2.5
Ostt eu-what about open-v2.5Ostt eu-what about open-v2.5
Ostt eu-what about open-v2.5Dave McAllister
 

More from Dave McAllister (10)

Murphys laws for Observability
Murphys laws for ObservabilityMurphys laws for Observability
Murphys laws for Observability
 
Observe 2020-d mc
Observe 2020-d mcObserve 2020-d mc
Observe 2020-d mc
 
Working with Hybrid Clouds and Data Architectures
Working with Hybrid Clouds and Data ArchitecturesWorking with Hybrid Clouds and Data Architectures
Working with Hybrid Clouds and Data Architectures
 
Devising your Data Movement Strategy for IoT
Devising your Data Movement Strategy for IoTDevising your Data Movement Strategy for IoT
Devising your Data Movement Strategy for IoT
 
Open Source examples from Adobe : Oscon kiosk
Open Source examples from Adobe : Oscon kioskOpen Source examples from Adobe : Oscon kiosk
Open Source examples from Adobe : Oscon kiosk
 
Star 2013-pdfa-pdfa
Star 2013-pdfa-pdfaStar 2013-pdfa-pdfa
Star 2013-pdfa-pdfa
 
Rootstech-The Basics of Gamification
Rootstech-The Basics of GamificationRootstech-The Basics of Gamification
Rootstech-The Basics of Gamification
 
Ostt eu-what about open-v2.5
Ostt eu-what about open-v2.5Ostt eu-what about open-v2.5
Ostt eu-what about open-v2.5
 
Frand friend or foe
Frand  friend or foeFrand  friend or foe
Frand friend or foe
 
Open life in the cloud
Open life in the cloudOpen life in the cloud
Open life in the cloud
 

Recently uploaded

Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一fhwihughh
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...ThinkInnovation
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...limedy534
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhijennyeacort
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfLars Albertsson
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceSapana Sha
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)jennyeacort
 

Recently uploaded (20)

Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
办理学位证纽约大学毕业证(NYU毕业证书)原版一比一
 
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
Predictive Analysis - Using Insight-informed Data to Determine Factors Drivin...
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
Industrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdfIndustrialised data - the key to AI success.pdf
Industrialised data - the key to AI success.pdf
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts Service
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
Call Us ➥97111√47426🤳Call Girls in Aerocity (Delhi NCR)
 

Short Data Rules for Observability.pdf

  • 1. Data Rules for Observability Dave McAllister - NGINX
  • 2.
  • 3. Every company is on a cloud journey To increase velocity, agility and responsiveness Retain & Optimize Lift & Shift Re-Factor Re-Architect / Cloud-Native DEV OPS DEV OPS DEV OPS DEV OPS Cloud Managed e.g. RDS, DynamoDB, SaaS Cloud First Architecture Tightly Coupled Apps, Slow Deployment Cycles Primarily using Cloud IaaS More Modular, but Dependent App Components Loosely Coupled Microservices, and Serverless Functions VM VM VM VM VM VM VM VM VM Private Public VM VM VM VM VM VM Private Public Private Public “By 2025, 85% of organizations will run containers in production, up from less than 30% in 2020” – Gartner, Dec 14, 2020
  • 4. Observability Challenges ● Microservices create complex interactions. ● Failures don't exactlyrepeat. ● Debugging multitenancy ispainful. ● Monitoring no longer can save us alone Cynefin Framework 4
  • 5. © 2 0 2 0 S P L U N K I N C . Use all of your data to avoid blind spots
  • 6. ©2021 F5 6 The more observable a system, the quicker we can understand why it’s acting up and fix it Observability is a Data Problem Metrics Do I have a problem? Traces Where is the problem? Logs Why is the problem happening? Observability DETECT TROUBLESHOOT ROOT CAUSE Full-Stack Visibility & Context-Rich Insights
  • 7. Data is the driving factor for observability • AI/ML-driven Directed Troubleshooting • Unlimited Cardinality • Streaming data, incl. Monitoring and Alerting • Full-fidelity metrics and traces • Open standards, open source data ingest
  • 8. Dealing with the noise • Filter the Signals • Linear, Low-pass, Band-pass, All-pass • Sample the signals • Random, Head-based, Tail-based, Post-predictive, Dimensionality reduction • Improve the visualization • Smart aggregation
  • 9. Problems with observability sampling • Leads to alert storms caused by the cascading nature of failures • Leads to needle-in-the-haystack scenarios and long MTTR • Siloed infrastructure and application insights • Routinely miss trace data when troubleshooting edge cases and intermittent issues Trace sampling No awareness of service dependencies Simplistic triaging Siloed from infrastructure
  • 10. Typical trace sampling Typically observes ~1-5% of transactions Byte-Code Agent Head-Based Sampling Microservices Tail-based sampling misses too… START TRACE END TRACE
  • 12. But wait! My metrics tell me everything Your metrics are usually not sampled, for your infrastructure But can be for your application traces Leading to bad duration results and potential missed alerts
  • 13. TL;DR: Data Completeness • Your ability to use observability is dependent on your data integrity • Don’t let the “chosen data” bias your results • Keep it all. Otherwise you can’t track customer happiness • Real-time matters
  • 14. ©2021 F5 14 Operate at the speed and resolution of your app and infrastructure
  • 15. ©2021 F5 15 The resolution and speed of the data directly impact the insights you gain
  • 16. ©2021 F5 16 • Interchangeable? • Accuracy is that the measure is correct • Precise means it is consistent with other measurements Observability depends on both But aggregation and analysis can skew this Discussing accuracy and precision
  • 17. Missing the point 10 sec average =13.9 95% = 27.05 First 5 sec average =16.4 95% = 29.2 Second 5 sec average =11.4 95% = 19.4
  • 18. Data resolution ≠ Reporting resolution • But both can be problematic • Always deliver all data points regardless of reporting • Finer granularity means more potential precision Minute Minute Second Second Area of actuality
  • 19. Adding in concepts of native and chart resolution • In Observability • Native resolution is our data collection interval • Chart resolution is the aggregation points that our graphs use Hint: we want speedy data collection and sufficient chart resolution
  • 20. Complexity Drift and Skew • Ephemeral Behavior • Cloud-compute Elasticity
  • 21. ©2021 F5 21 Accurate Timestamps? • Network latencies get lower • Event frequencies are higher • Chrony on AWS/GCP • ~10s to 100s of accuracy • May not always order events properly Image: ClockWork.io
  • 22. ©2021 F5 22 Aligned traces? • Spans may start ahead of parent spans starts • Spans may start after parent span ends • Span durations can be impacted, resulting in lack of precision Image: ClockWork.io
  • 24. Predictive behavior • Sometimes you want to know what’s coming • Prediction is only as good as the data precision and accuracy • Historic versus Sudden Change • (Trend) Stationary • Expect false positives (and negatives)
  • 25. TL;DR: Data Preciseness • Observability is only as useful as your data's precision and accuracy • Your consideration of the data needs to account for elastic, ephemeral and skew • Prediction is a target, but Keep in mind the difference between extrapolation and interpolation
  • 26. The most effective debugging tool is still careful thought, coupled with judiciously placed print statements. -Brian Kernighan Unix for Beginners 1979 Observability is the new print statement Closing Thoughts
  • 27. Thanks for listening • https://www.linkedin.com/in/davemc