SlideShare a Scribd company logo
1 of 58
Download to read offline
@papa_fire
Troubleshooting with monitoring
Testing in production
DevOps monitoring
[something] testing [something]
monitoring [something] in
production
Leon Fayer
❖ @papa_fire
❖ leon@omniti.com
❖ fayerplay.com
❖ slideshare.net/LeonFayer1
THAT’S ME
WHO AM I?
๏ engineer for 20+ years
๏ professional cynic
๏ @ OmniTI
๏ build and operate big systems
๏ we are hiring!
๏ omniti.com/is/hiring
@papa_fire
I HATE TESTING
@papa_fire
testing is required
@papa_fire
testing is not enough
@papa_fire
> unit testing
> functional testing
> resilience testing
> performance testing
> …
@papa_fire
testing can give a false
sense of security
@papa_fire
testing is deterministic
@papa_fire
data problem
@papa_fire
> quantity of data
> frequency of data
> quality of data
@papa_fire
example
Wolfe+585
@papa_fire
example
Hubert Blaine Wolfeschlegelsteinhausenbergerdorffwelchevoralternwaren-
gewissenhaftschaferswessenschafewarenwohlgepflegeundsorgfaltigkeitbe
schutzenvorangreifendurchihrraubgierigfeindewelchevoralternzwolfhundert
tausendjahresvorandieerscheinenvonderersteerdemenschderraumschiff
genachtmittungsteinundsiebeniridiumelektrischmotorsgebrauchlichtalsseinur
sprungvonkraftgestartseinlangefahrthinzwischensternartigraumaufdersuchen
nachbarschaftdersternwelchegehabtbewohnbarplanetenkreisedrehensichundwo
hinderneuerassevonverstandigmenschlichkeitkonntefortpflanzenundsicher
freuenanlebenslanglichfreudeundruhemitnichteinfurchtvorangreifenvor
andererintelligentgeschopfsvonhinzwischensternartigraum, Sr.
@papa_fire
user problem
@papa_fire
“Users (n) - distributed fault injection
test suite for production
@papa_fire
example
Corrupted Blood bug
@papa_fire
example
@papa_fire
other factors
@papa_fire
> lack of foresight
	 	 	 (Y2K bug)
> too many use-cases
	 	 	 (female Tauren bug)
> change to assumptions
@papa_fire
testing is great for
“known knowns”
@papa_fire
testing is ok for
“known unknowns”
@papa_fire
testing is bad for
“unknown unknowns”
@papa_fire
enter monitoring
@papa_fire
why monitor?
@papa_fire
because testing isn’t
enough
@papa_fire
> software is never perfect
> systems are complex
> external dependency worry
> proactive is better than reactive
> …
@papa_fire
because things change
@papa_fire
because things change
in production
@papa_fire
what to monitor?
@papa_fire
in God we trust all others
we monitor
“
@papa_fire
> systems
> databases
> applications
> integration points
> performance
> user behavior
> …
@papa_fire
is it enough?
@papa_fire
is it too much?
@papa_fire
what is important?
@papa_fire
what is important?
(i.e. what to alert on)
@papa_fire
example
> servers up and running
> HTTP checks return 200
> tweets are lost
@papa_fire
s/system checks/unit tests/
@papa_fire
I don’t give a **** if the
datacenter is on fire as
long as I am still making
money
“ — CEO
@papa_fire
we monitor because things
change
@papa_fire
changes effect business
@papa_fire
top-down approach
> understand business
> define baseline
> correlate data
@papa_fire
example
๏ online marketing company
๏ major e-commerce component
๏ ~100 million users
๏ 1 billion emails/month
๏ 300,000 lines of code
๏5600+ metrics collected
@papa_fire
it all starts with a call …
@papa_fire
revenue
@papa_fire
revenue + traffic
@papa_fire
revenue + traffic + load time
@papa_fire
revenue + traffic + load time + db
@papa_fire
revenue + traffic + load time + db + email
@papa_fire
… email wasn’t monitored?
what if …
@papa_fire
… email wasn’t monitored?
(it would be after this)
what if …
@papa_fire
instrumentation
is never done
@papa_fire
example
> same symptoms
> higher decline rates
> all metrics are within norm
@papa_fire
example
> same symptoms
> higher decline rates
> all metrics are within norm
AmEx blocked
@papa_fire
tl;dr
@papa_fire
testing and monitoring
not
testing or monitoring
@papa_fire
understand the business
@papa_fire
continuous improvement
@papa_fire
{also bad at conclusions}
@papa_fire
THANK YOU
questions?

More Related Content

What's hot

MonoRails - GoGaRuCo 2012
MonoRails - GoGaRuCo 2012MonoRails - GoGaRuCo 2012
MonoRails - GoGaRuCo 2012
jackdanger
 
DevOps - Be Careful What You Wish For
DevOps - Be Careful What You Wish ForDevOps - Be Careful What You Wish For
DevOps - Be Careful What You Wish For
John Viner
 

What's hot (16)

Building Real Time App
Building Real Time AppBuilding Real Time App
Building Real Time App
 
When it all Goes Wrong |Nordic PGDay 2019 | Will Leinweber
When it all Goes Wrong |Nordic PGDay 2019 | Will LeinweberWhen it all Goes Wrong |Nordic PGDay 2019 | Will Leinweber
When it all Goes Wrong |Nordic PGDay 2019 | Will Leinweber
 
Performance tuning
Performance tuningPerformance tuning
Performance tuning
 
Monte Carlo presentation for FM
Monte Carlo presentation for FMMonte Carlo presentation for FM
Monte Carlo presentation for FM
 
Limiting WIP - Music City Tech 2021
Limiting WIP - Music City Tech 2021Limiting WIP - Music City Tech 2021
Limiting WIP - Music City Tech 2021
 
MonoRails - GoGaRuCo 2012
MonoRails - GoGaRuCo 2012MonoRails - GoGaRuCo 2012
MonoRails - GoGaRuCo 2012
 
TDC2016SP - Otimização Prematura: a Raíz de Todo o Mal
TDC2016SP - Otimização Prematura: a Raíz de Todo o MalTDC2016SP - Otimização Prematura: a Raíz de Todo o Mal
TDC2016SP - Otimização Prematura: a Raíz de Todo o Mal
 
Using the 5 WHYs to determine why your SAP Major Incident really happened
Using the 5 WHYs to determine why your SAP Major Incident really happenedUsing the 5 WHYs to determine why your SAP Major Incident really happened
Using the 5 WHYs to determine why your SAP Major Incident really happened
 
10 Deploys a Day - A Case Study of Continuous Delivery at Envato
10 Deploys a Day - A Case Study of Continuous Delivery at Envato10 Deploys a Day - A Case Study of Continuous Delivery at Envato
10 Deploys a Day - A Case Study of Continuous Delivery at Envato
 
Building on the Shoulders of Giants: the Story of Bitbucket Pipelines
Building on the Shoulders of Giants: the Story of Bitbucket PipelinesBuilding on the Shoulders of Giants: the Story of Bitbucket Pipelines
Building on the Shoulders of Giants: the Story of Bitbucket Pipelines
 
DevOps - Be Careful What You Wish For
DevOps - Be Careful What You Wish ForDevOps - Be Careful What You Wish For
DevOps - Be Careful What You Wish For
 
Magento Security Best Practises - MM17PL
Magento Security Best Practises - MM17PLMagento Security Best Practises - MM17PL
Magento Security Best Practises - MM17PL
 
Community IT Webinar - Planning for an Office Move
Community IT Webinar - Planning for an Office MoveCommunity IT Webinar - Planning for an Office Move
Community IT Webinar - Planning for an Office Move
 
Microservices: Why We Did It (and should you?)
Microservices: Why We Did It (and should you?) Microservices: Why We Did It (and should you?)
Microservices: Why We Did It (and should you?)
 
Functions.php - It's Not Just For Developers
Functions.php - It's Not Just For DevelopersFunctions.php - It's Not Just For Developers
Functions.php - It's Not Just For Developers
 
Defining Your Maintenance Strategy
Defining Your Maintenance StrategyDefining Your Maintenance Strategy
Defining Your Maintenance Strategy
 

Similar to Production testing through monitoring

Enough Blame for System Performance Issues
Enough Blame for System Performance IssuesEnough Blame for System Performance Issues
Enough Blame for System Performance Issues
Mahesh Vallampati
 

Similar to Production testing through monitoring (20)

DevOps: The Key to IT Performance
DevOps: The Key to IT PerformanceDevOps: The Key to IT Performance
DevOps: The Key to IT Performance
 
How Continuous Delivery and Lean Management Make your DevOps Amazeballs
How Continuous Delivery and Lean Management Make your DevOps AmazeballsHow Continuous Delivery and Lean Management Make your DevOps Amazeballs
How Continuous Delivery and Lean Management Make your DevOps Amazeballs
 
What I learned from 5 years of sciencing the crap out of DevOps
What I learned from 5 years of sciencing the crap out of DevOpsWhat I learned from 5 years of sciencing the crap out of DevOps
What I learned from 5 years of sciencing the crap out of DevOps
 
Continuous Delivery + DevOps = Awesome
Continuous Delivery + DevOps = AwesomeContinuous Delivery + DevOps = Awesome
Continuous Delivery + DevOps = Awesome
 
Continuous Delivery: Making DevOps Awesome
Continuous Delivery: Making DevOps AwesomeContinuous Delivery: Making DevOps Awesome
Continuous Delivery: Making DevOps Awesome
 
Defect Metrics for Organization and Project Health
Defect Metrics for Organization and Project HealthDefect Metrics for Organization and Project Health
Defect Metrics for Organization and Project Health
 
The caseforawesome
The caseforawesomeThe caseforawesome
The caseforawesome
 
The Data on DevOps: Making the Case for Awesome
The Data on DevOps: Making the Case for AwesomeThe Data on DevOps: Making the Case for Awesome
The Data on DevOps: Making the Case for Awesome
 
The DevOps Pay Raise: Quantifying Your Value to Move Up the Ladder
The DevOps Pay Raise: Quantifying Your Value to Move Up the LadderThe DevOps Pay Raise: Quantifying Your Value to Move Up the Ladder
The DevOps Pay Raise: Quantifying Your Value to Move Up the Ladder
 
Best Practices and Tools for Backing Up Salesforce Data
Best Practices and Tools for Backing Up Salesforce DataBest Practices and Tools for Backing Up Salesforce Data
Best Practices and Tools for Backing Up Salesforce Data
 
Tips & Tricks for Getting Things Done Using Analytics Data
Tips & Tricks for Getting Things Done Using Analytics DataTips & Tricks for Getting Things Done Using Analytics Data
Tips & Tricks for Getting Things Done Using Analytics Data
 
apidays LIVE Jakarta - Data Reporting and Analytics with API's by Tarush Agga...
apidays LIVE Jakarta - Data Reporting and Analytics with API's by Tarush Agga...apidays LIVE Jakarta - Data Reporting and Analytics with API's by Tarush Agga...
apidays LIVE Jakarta - Data Reporting and Analytics with API's by Tarush Agga...
 
Realising the true value of DevOps
Realising the true value of DevOpsRealising the true value of DevOps
Realising the true value of DevOps
 
DevOps and the Bottom Line
DevOps and the Bottom Line DevOps and the Bottom Line
DevOps and the Bottom Line
 
Shift left-devoxx-pl
Shift left-devoxx-plShift left-devoxx-pl
Shift left-devoxx-pl
 
To Estimate or Not to Estimate, is that the Question?
To Estimate or Not to Estimate, is that the Question?To Estimate or Not to Estimate, is that the Question?
To Estimate or Not to Estimate, is that the Question?
 
DevOps - the Future of Agile - Why/What/How - from Enterprise DevOps Israel 2015
DevOps - the Future of Agile - Why/What/How - from Enterprise DevOps Israel 2015DevOps - the Future of Agile - Why/What/How - from Enterprise DevOps Israel 2015
DevOps - the Future of Agile - Why/What/How - from Enterprise DevOps Israel 2015
 
Web Performance Culture and Tools at Etsy
Web Performance Culture and Tools at EtsyWeb Performance Culture and Tools at Etsy
Web Performance Culture and Tools at Etsy
 
Altitude NY 2018: Making Pinterest Fast
Altitude NY 2018: Making Pinterest FastAltitude NY 2018: Making Pinterest Fast
Altitude NY 2018: Making Pinterest Fast
 
Enough Blame for System Performance Issues
Enough Blame for System Performance IssuesEnough Blame for System Performance Issues
Enough Blame for System Performance Issues
 

More from Leon Fayer

More from Leon Fayer (12)

What kids can teach us about building effective teams
What kids can teach us about building effective teamsWhat kids can teach us about building effective teams
What kids can teach us about building effective teams
 
Как измерить успех
Как измерить успехКак измерить успех
Как измерить успех
 
Bias in tech
Bias in techBias in tech
Bias in tech
 
Developing applications for performance
Developing applications for performanceDeveloping applications for performance
Developing applications for performance
 
Building the right architecture for you
Building the right architecture for youBuilding the right architecture for you
Building the right architecture for you
 
Lost art of troubleshooting
Lost art of troubleshootingLost art of troubleshooting
Lost art of troubleshooting
 
Adventures in public speaking
Adventures in public speakingAdventures in public speaking
Adventures in public speaking
 
On call for developers
On call for developersOn call for developers
On call for developers
 
Database performance 101
Database performance 101Database performance 101
Database performance 101
 
What DevOps is Not
What DevOps is NotWhat DevOps is Not
What DevOps is Not
 
Breaking social dependency
Breaking social dependencyBreaking social dependency
Breaking social dependency
 
PHP performance 101: so you need to use a database
PHP performance 101: so you need to use a databasePHP performance 101: so you need to use a database
PHP performance 101: so you need to use a database
 

Recently uploaded

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
giselly40
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Recently uploaded (20)

Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 

Production testing through monitoring