SlideShare a Scribd company logo
How to Usher in a New
Monitoring System
Nikita Ostrovsky
▶ Devops Data Architect at Pulsepoint
▶ twitter.com/nikgrok
▶ github.com/nikgrok
▶ nostrovsky@pulsepoint.com
▶ Real-time ad exchange.
▶ 40+ Billion Impressions per day
▶ 150+ Billion bids made per day
▶ Log Aggr & Prometheus
▶ We’re Hiring!
▶ Devopsy DBA and Developers
Why a new Monitoring System?
▶ Alerting
▶ >80% Signal to Noise Ratio
▶ Troubleshooting
▶ >300% faster remediation times
▶ Capacity Planning
▶ Bottleneck Tracking
▶ Sprint Planning
▶ Core of your tech stack
Steps
▶ Find your pain points and Criteria
▶ Identify your Holy Beacon of Truth
▶ Try to go Use-Case by Use-Case
▶ Evangalize.
▶ Be prepared to fight some hard battles.
▶ Shutdown the old one and go out drinking.
Setting Criteria and Choosing a Tool
▶ If you want to build your own. Don’t! Stop!
Run Away!
▶ Figure out your pain points
▶ Scale
▶ Query Language
▶ Ability to Visualize
▶ Usability
▶ Alertability
▶ Robust, Scalable, Blah, Blah, Blah
▶ Don’t expect any buy-in during this step
Identify (or build) your Holy Beacon of
Truth
▶ Foundation of your Monitoring Platform
▶ Allows you to Flesh out Ownership
▶ Owner is your most important piece of metadata
▶ Dynamic
▶ Complete
▶ Metadata Support(tags, labels, etc)
▶ Possibly 2ndary metadata store
▶ Mesos/Kubernetes is not enough
▶ We love Consul for this
Try to go Use-Case by Use-Case
▶ Modular/Repeatable Solutions
▶ Seek out the easiest wins with the highest impact
▶ Be Customer Focused
▶ Every unique delivered Use-Case gives you
another person talking about it.
▶ Give other teams the tools to do it themselves.
Evangelize.
▶ Do demos/lunch and learns
▶ Be in every post-mortem
▶ Try to build highly visible dashboards
▶ And make them highly visible.
▶ Use instantly recognizable units.
▶ Vcore Seconds vs $
▶ Absolutely no new features or changes to the
old system unless necessary
Fight the hard battles
▶ As you progress, you will start to see some
push back
▶ Ownership
▶ Developers receiving alerts
▶ Division of responsibilities
▶ Stay confident.
▶ Your job is to deliver an API to the monitoring
system, not get people to migrate.
▶ Most battles fizzle out when you have >75%
Signal to Noise Ratio
Some Final Notes
▶ Be Prepared to Shave a lot of Yaks
▶ Stability is key once you start to deliver Use-
Cases
▶ Differentiate ephemeral metrics from key ones
early on
▶ Get a sense of Scale early on
▶ Try to compartmentalize solutions, but provide
a way to share metrics
▶ Talk to other teams
Kill the old one and go out drinking
Thank you!
▶ Nikita Ostrovsky
▶ Devops Data Architect at Pulsepoint
▶ Chaotic-Good Cleric
▶ twitter.com/nikgrok
▶ github.com/nikgrok
▶ nostrovsky@pulsepoint.com

More Related Content

Similar to How to Usher in a new Monitoring System

Metrics 4 faster feedback
Metrics 4 faster feedbackMetrics 4 faster feedback
Metrics 4 faster feedback
Kris Buytaert
 
Intro to Puppet Enterprise Webinar 07.27.2017
Intro to Puppet Enterprise Webinar 07.27.2017Intro to Puppet Enterprise Webinar 07.27.2017
Intro to Puppet Enterprise Webinar 07.27.2017
Claire Priester Papas
 
Agile Fundamentals and Best Practices (with Trello)
Agile Fundamentals and Best Practices (with Trello)Agile Fundamentals and Best Practices (with Trello)
Agile Fundamentals and Best Practices (with Trello)
Filippo Zanella
 
Moving to tdd bdd
Moving to tdd bddMoving to tdd bdd
Moving to tdd bdd
Kim Carter
 
Monitoring Far Beyond the Operating System - WeOp 2014
Monitoring Far Beyond the Operating System - WeOp 2014Monitoring Far Beyond the Operating System - WeOp 2014
Monitoring Far Beyond the Operating System - WeOp 2014
Marcus Vechiato
 
DevOps - Understanding Core Concepts (Old)
DevOps - Understanding Core Concepts (Old)DevOps - Understanding Core Concepts (Old)
DevOps - Understanding Core Concepts (Old)
Nitin Bhide
 
Luiz Fernando Testa Contador - Aplicando DevOps em grandes corporações
Luiz Fernando Testa Contador - Aplicando DevOps em grandes corporaçõesLuiz Fernando Testa Contador - Aplicando DevOps em grandes corporações
Luiz Fernando Testa Contador - Aplicando DevOps em grandes corporações
Agile Trends
 
How to Ace Your First 6 Months as a New PM by Empatico PM
How to Ace Your First 6 Months as a New PM by Empatico PMHow to Ace Your First 6 Months as a New PM by Empatico PM
How to Ace Your First 6 Months as a New PM by Empatico PM
Product School
 
What they don't tell you about micro-services
What they don't tell you about micro-servicesWhat they don't tell you about micro-services
What they don't tell you about micro-services
Daniel Rolnick
 
Experiment Your Way to Product Success: How User Acceptance Testing Can Save ...
Experiment Your Way to Product Success: How User Acceptance Testing Can Save ...Experiment Your Way to Product Success: How User Acceptance Testing Can Save ...
Experiment Your Way to Product Success: How User Acceptance Testing Can Save ...
Aggregage
 
The Final Frontier, Automating Dynamic Security Testing
The Final Frontier, Automating Dynamic Security TestingThe Final Frontier, Automating Dynamic Security Testing
The Final Frontier, Automating Dynamic Security Testing
Matt Tesauro
 
CD in Machine Learning Systems
CD in Machine Learning SystemsCD in Machine Learning Systems
CD in Machine Learning Systems
Thoughtworks
 
Services, tools & practices for a software house
Services, tools & practices for a software houseServices, tools & practices for a software house
Services, tools & practices for a software house
Paris Apostolopoulos
 
Intro to Puppet Enterprise for a Windows Environment - 08.23
Intro to Puppet Enterprise for a Windows Environment - 08.23Intro to Puppet Enterprise for a Windows Environment - 08.23
Intro to Puppet Enterprise for a Windows Environment - 08.23
Puppet
 
Test strategicaly
Test strategicalyTest strategicaly
Test strategicaly
Erik Lebel
 
Getting It Done
Getting It DoneGetting It Done
Getting It Done
Wez Furlong
 
Agile Release Management Best Practices
Agile Release Management Best PracticesAgile Release Management Best Practices
Agile Release Management Best Practices
Anmol Oberoi
 
DOES14: Scott Prugh, CSG - DevOps and Lean in Legacy Environments
DOES14: Scott Prugh, CSG - DevOps and Lean in Legacy EnvironmentsDOES14: Scott Prugh, CSG - DevOps and Lean in Legacy Environments
DOES14: Scott Prugh, CSG - DevOps and Lean in Legacy Environments
DevOps Enterprise Summmit
 
Managing Data Science Projects
Managing Data Science ProjectsManaging Data Science Projects
Managing Data Science Projects
Danielle Dean
 
The Most Important Thing: How Mozilla Does Security and What You Can Steal
The Most Important Thing: How Mozilla Does Security and What You Can StealThe Most Important Thing: How Mozilla Does Security and What You Can Steal
The Most Important Thing: How Mozilla Does Security and What You Can Steal
mozilla.presentations
 

Similar to How to Usher in a new Monitoring System (20)

Metrics 4 faster feedback
Metrics 4 faster feedbackMetrics 4 faster feedback
Metrics 4 faster feedback
 
Intro to Puppet Enterprise Webinar 07.27.2017
Intro to Puppet Enterprise Webinar 07.27.2017Intro to Puppet Enterprise Webinar 07.27.2017
Intro to Puppet Enterprise Webinar 07.27.2017
 
Agile Fundamentals and Best Practices (with Trello)
Agile Fundamentals and Best Practices (with Trello)Agile Fundamentals and Best Practices (with Trello)
Agile Fundamentals and Best Practices (with Trello)
 
Moving to tdd bdd
Moving to tdd bddMoving to tdd bdd
Moving to tdd bdd
 
Monitoring Far Beyond the Operating System - WeOp 2014
Monitoring Far Beyond the Operating System - WeOp 2014Monitoring Far Beyond the Operating System - WeOp 2014
Monitoring Far Beyond the Operating System - WeOp 2014
 
DevOps - Understanding Core Concepts (Old)
DevOps - Understanding Core Concepts (Old)DevOps - Understanding Core Concepts (Old)
DevOps - Understanding Core Concepts (Old)
 
Luiz Fernando Testa Contador - Aplicando DevOps em grandes corporações
Luiz Fernando Testa Contador - Aplicando DevOps em grandes corporaçõesLuiz Fernando Testa Contador - Aplicando DevOps em grandes corporações
Luiz Fernando Testa Contador - Aplicando DevOps em grandes corporações
 
How to Ace Your First 6 Months as a New PM by Empatico PM
How to Ace Your First 6 Months as a New PM by Empatico PMHow to Ace Your First 6 Months as a New PM by Empatico PM
How to Ace Your First 6 Months as a New PM by Empatico PM
 
What they don't tell you about micro-services
What they don't tell you about micro-servicesWhat they don't tell you about micro-services
What they don't tell you about micro-services
 
Experiment Your Way to Product Success: How User Acceptance Testing Can Save ...
Experiment Your Way to Product Success: How User Acceptance Testing Can Save ...Experiment Your Way to Product Success: How User Acceptance Testing Can Save ...
Experiment Your Way to Product Success: How User Acceptance Testing Can Save ...
 
The Final Frontier, Automating Dynamic Security Testing
The Final Frontier, Automating Dynamic Security TestingThe Final Frontier, Automating Dynamic Security Testing
The Final Frontier, Automating Dynamic Security Testing
 
CD in Machine Learning Systems
CD in Machine Learning SystemsCD in Machine Learning Systems
CD in Machine Learning Systems
 
Services, tools & practices for a software house
Services, tools & practices for a software houseServices, tools & practices for a software house
Services, tools & practices for a software house
 
Intro to Puppet Enterprise for a Windows Environment - 08.23
Intro to Puppet Enterprise for a Windows Environment - 08.23Intro to Puppet Enterprise for a Windows Environment - 08.23
Intro to Puppet Enterprise for a Windows Environment - 08.23
 
Test strategicaly
Test strategicalyTest strategicaly
Test strategicaly
 
Getting It Done
Getting It DoneGetting It Done
Getting It Done
 
Agile Release Management Best Practices
Agile Release Management Best PracticesAgile Release Management Best Practices
Agile Release Management Best Practices
 
DOES14: Scott Prugh, CSG - DevOps and Lean in Legacy Environments
DOES14: Scott Prugh, CSG - DevOps and Lean in Legacy EnvironmentsDOES14: Scott Prugh, CSG - DevOps and Lean in Legacy Environments
DOES14: Scott Prugh, CSG - DevOps and Lean in Legacy Environments
 
Managing Data Science Projects
Managing Data Science ProjectsManaging Data Science Projects
Managing Data Science Projects
 
The Most Important Thing: How Mozilla Does Security and What You Can Steal
The Most Important Thing: How Mozilla Does Security and What You Can StealThe Most Important Thing: How Mozilla Does Security and What You Can Steal
The Most Important Thing: How Mozilla Does Security and What You Can Steal
 

Recently uploaded

JavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green MasterplanJavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green Masterplan
Miro Wengner
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
akankshawande
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
Zilliz
 
Y-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PPY-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PP
c5vrf27qcz
 
Session 1 - Intro to Robotic Process Automation.pdf
Session 1 - Intro to Robotic Process Automation.pdfSession 1 - Intro to Robotic Process Automation.pdf
Session 1 - Intro to Robotic Process Automation.pdf
UiPathCommunity
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
ssuserfac0301
 
High performance Serverless Java on AWS- GoTo Amsterdam 2024
High performance Serverless Java on AWS- GoTo Amsterdam 2024High performance Serverless Java on AWS- GoTo Amsterdam 2024
High performance Serverless Java on AWS- GoTo Amsterdam 2024
Vadym Kazulkin
 
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
Jason Yip
 
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving
 
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansBiomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Neo4j
 
Leveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and StandardsLeveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and Standards
Neo4j
 
"Scaling RAG Applications to serve millions of users", Kevin Goedecke
"Scaling RAG Applications to serve millions of users",  Kevin Goedecke"Scaling RAG Applications to serve millions of users",  Kevin Goedecke
"Scaling RAG Applications to serve millions of users", Kevin Goedecke
Fwdays
 
GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)
Javier Junquera
 
Christine's Supplier Sourcing Presentaion.pptx
Christine's Supplier Sourcing Presentaion.pptxChristine's Supplier Sourcing Presentaion.pptx
Christine's Supplier Sourcing Presentaion.pptx
christinelarrosa
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
Jakub Marek
 
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
DanBrown980551
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Tosin Akinosho
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
Tatiana Kojar
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
Chart Kalyan
 
Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |
AstuteBusiness
 

Recently uploaded (20)

JavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green MasterplanJavaLand 2024: Application Development Green Masterplan
JavaLand 2024: Application Development Green Masterplan
 
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development ProvidersYour One-Stop Shop for Python Success: Top 10 US Python Development Providers
Your One-Stop Shop for Python Success: Top 10 US Python Development Providers
 
Fueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte WebinarFueling AI with Great Data with Airbyte Webinar
Fueling AI with Great Data with Airbyte Webinar
 
Y-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PPY-Combinator seed pitch deck template PP
Y-Combinator seed pitch deck template PP
 
Session 1 - Intro to Robotic Process Automation.pdf
Session 1 - Intro to Robotic Process Automation.pdfSession 1 - Intro to Robotic Process Automation.pdf
Session 1 - Intro to Robotic Process Automation.pdf
 
Taking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdfTaking AI to the Next Level in Manufacturing.pdf
Taking AI to the Next Level in Manufacturing.pdf
 
High performance Serverless Java on AWS- GoTo Amsterdam 2024
High performance Serverless Java on AWS- GoTo Amsterdam 2024High performance Serverless Java on AWS- GoTo Amsterdam 2024
High performance Serverless Java on AWS- GoTo Amsterdam 2024
 
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
[OReilly Superstream] Occupy the Space: A grassroots guide to engineering (an...
 
Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024Northern Engraving | Nameplate Manufacturing Process - 2024
Northern Engraving | Nameplate Manufacturing Process - 2024
 
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and BioinformaticiansBiomedical Knowledge Graphs for Data Scientists and Bioinformaticians
Biomedical Knowledge Graphs for Data Scientists and Bioinformaticians
 
Leveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and StandardsLeveraging the Graph for Clinical Trials and Standards
Leveraging the Graph for Clinical Trials and Standards
 
"Scaling RAG Applications to serve millions of users", Kevin Goedecke
"Scaling RAG Applications to serve millions of users",  Kevin Goedecke"Scaling RAG Applications to serve millions of users",  Kevin Goedecke
"Scaling RAG Applications to serve millions of users", Kevin Goedecke
 
GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)GNSS spoofing via SDR (Criptored Talks 2024)
GNSS spoofing via SDR (Criptored Talks 2024)
 
Christine's Supplier Sourcing Presentaion.pptx
Christine's Supplier Sourcing Presentaion.pptxChristine's Supplier Sourcing Presentaion.pptx
Christine's Supplier Sourcing Presentaion.pptx
 
Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)Main news related to the CCS TSI 2023 (2023/1695)
Main news related to the CCS TSI 2023 (2023/1695)
 
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
LF Energy Webinar: Carbon Data Specifications: Mechanisms to Improve Data Acc...
 
Monitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdfMonitoring and Managing Anomaly Detection on OpenShift.pdf
Monitoring and Managing Anomaly Detection on OpenShift.pdf
 
Skybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoptionSkybuffer SAM4U tool for SAP license adoption
Skybuffer SAM4U tool for SAP license adoption
 
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdfHow to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
How to Interpret Trends in the Kalyan Rajdhani Mix Chart.pdf
 
Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |Astute Business Solutions | Oracle Cloud Partner |
Astute Business Solutions | Oracle Cloud Partner |
 

How to Usher in a new Monitoring System

  • 1. How to Usher in a New Monitoring System
  • 2. Nikita Ostrovsky ▶ Devops Data Architect at Pulsepoint ▶ twitter.com/nikgrok ▶ github.com/nikgrok ▶ nostrovsky@pulsepoint.com
  • 3. ▶ Real-time ad exchange. ▶ 40+ Billion Impressions per day ▶ 150+ Billion bids made per day ▶ Log Aggr & Prometheus ▶ We’re Hiring! ▶ Devopsy DBA and Developers
  • 4. Why a new Monitoring System? ▶ Alerting ▶ >80% Signal to Noise Ratio ▶ Troubleshooting ▶ >300% faster remediation times ▶ Capacity Planning ▶ Bottleneck Tracking ▶ Sprint Planning ▶ Core of your tech stack
  • 5. Steps ▶ Find your pain points and Criteria ▶ Identify your Holy Beacon of Truth ▶ Try to go Use-Case by Use-Case ▶ Evangalize. ▶ Be prepared to fight some hard battles. ▶ Shutdown the old one and go out drinking.
  • 6. Setting Criteria and Choosing a Tool ▶ If you want to build your own. Don’t! Stop! Run Away! ▶ Figure out your pain points ▶ Scale ▶ Query Language ▶ Ability to Visualize ▶ Usability ▶ Alertability ▶ Robust, Scalable, Blah, Blah, Blah ▶ Don’t expect any buy-in during this step
  • 7. Identify (or build) your Holy Beacon of Truth ▶ Foundation of your Monitoring Platform ▶ Allows you to Flesh out Ownership ▶ Owner is your most important piece of metadata ▶ Dynamic ▶ Complete ▶ Metadata Support(tags, labels, etc) ▶ Possibly 2ndary metadata store ▶ Mesos/Kubernetes is not enough ▶ We love Consul for this
  • 8. Try to go Use-Case by Use-Case ▶ Modular/Repeatable Solutions ▶ Seek out the easiest wins with the highest impact ▶ Be Customer Focused ▶ Every unique delivered Use-Case gives you another person talking about it. ▶ Give other teams the tools to do it themselves.
  • 9. Evangelize. ▶ Do demos/lunch and learns ▶ Be in every post-mortem ▶ Try to build highly visible dashboards ▶ And make them highly visible. ▶ Use instantly recognizable units. ▶ Vcore Seconds vs $ ▶ Absolutely no new features or changes to the old system unless necessary
  • 10. Fight the hard battles ▶ As you progress, you will start to see some push back ▶ Ownership ▶ Developers receiving alerts ▶ Division of responsibilities ▶ Stay confident. ▶ Your job is to deliver an API to the monitoring system, not get people to migrate. ▶ Most battles fizzle out when you have >75% Signal to Noise Ratio
  • 11. Some Final Notes ▶ Be Prepared to Shave a lot of Yaks ▶ Stability is key once you start to deliver Use- Cases ▶ Differentiate ephemeral metrics from key ones early on ▶ Get a sense of Scale early on ▶ Try to compartmentalize solutions, but provide a way to share metrics ▶ Talk to other teams
  • 12. Kill the old one and go out drinking
  • 13. Thank you! ▶ Nikita Ostrovsky ▶ Devops Data Architect at Pulsepoint ▶ Chaotic-Good Cleric ▶ twitter.com/nikgrok ▶ github.com/nikgrok ▶ nostrovsky@pulsepoint.com