Monitoring MySQL with OpenTSDB

Sr. Database Operations Engineer at Box
Apr. 25, 2013
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
Monitoring MySQL with OpenTSDB
1 of 42

More Related Content

What's hot

HBaseCon2017 gohbase: Pure Go HBase ClientHBaseCon2017 gohbase: Pure Go HBase Client
HBaseCon2017 gohbase: Pure Go HBase ClientHBaseCon
Keynote: Apache HBase at Yahoo! ScaleKeynote: Apache HBase at Yahoo! Scale
Keynote: Apache HBase at Yahoo! ScaleHBaseCon
Gnocchi v3 brownbagGnocchi v3 brownbag
Gnocchi v3 brownbagGordon Chung
Gnocchi Profiling 2.1.xGnocchi Profiling 2.1.x
Gnocchi Profiling 2.1.xGordon Chung
Gnocchi v4 (preview)Gnocchi v4 (preview)
Gnocchi v4 (preview)Gordon Chung
Advanced Apache Cassandra Operations with JMXAdvanced Apache Cassandra Operations with JMX
Advanced Apache Cassandra Operations with JMXzznate

Similar to Monitoring MySQL with OpenTSDB

Why and How Powershell will rule the Command Line - Barcamp LA 4Why and How Powershell will rule the Command Line - Barcamp LA 4
Why and How Powershell will rule the Command Line - Barcamp LA 4Ilya Haykinson
Functional Hostnames and Why they are BadFunctional Hostnames and Why they are Bad
Functional Hostnames and Why they are BadPuppet
Digdagによる大規模データ処理の自動化とエラー処理Digdagによる大規模データ処理の自動化とエラー処理
Digdagによる大規模データ処理の自動化とエラー処理Sadayuki Furuhashi
The Proto-Burst Buffer: Experience with the flash-based file system on SDSC's...The Proto-Burst Buffer: Experience with the flash-based file system on SDSC's...
The Proto-Burst Buffer: Experience with the flash-based file system on SDSC's...Glenn K. Lockwood
Jk rubyslava 25Jk rubyslava 25
Jk rubyslava 25Jozef Képesi
Devoxx france 2015 influx dbDevoxx france 2015 influx db
Devoxx france 2015 influx dbNicolas Muller

Recently uploaded

Regulating Generative AI - LLMOps pipelines with TransparencyRegulating Generative AI - LLMOps pipelines with Transparency
Regulating Generative AI - LLMOps pipelines with TransparencyDebmalya Biswas
Nymity Framework: Privacy & Data Protection Update in 7 StatesNymity Framework: Privacy & Data Protection Update in 7 States
Nymity Framework: Privacy & Data Protection Update in 7 StatesTrustArc
Jino Clone: Develop a Car Wash Mobile App with Limited Resources Jino Clone: Develop a Car Wash Mobile App with Limited Resources
Jino Clone: Develop a Car Wash Mobile App with Limited Resources eSiteWorld TechnoLabs Pvt. Ltd.
Improving Employee Experiences on Cisco RoomOS Devices, Webex, and Microsoft ...Improving Employee Experiences on Cisco RoomOS Devices, Webex, and Microsoft ...
Improving Employee Experiences on Cisco RoomOS Devices, Webex, and Microsoft ...ThousandEyes
UiPath Tips and Techniques for Error Handling - Session 2UiPath Tips and Techniques for Error Handling - Session 2
UiPath Tips and Techniques for Error Handling - Session 2DianaGray10
Swiss Re Reinsurance Solutions - Automated Claims Experience – Insurer Innova...Swiss Re Reinsurance Solutions - Automated Claims Experience – Insurer Innova...
Swiss Re Reinsurance Solutions - Automated Claims Experience – Insurer Innova...The Digital Insurer

Recently uploaded(20)

Monitoring MySQL with OpenTSDB

Editor's Notes

  1. Will be talking about OpenTSDBHow OpenTSDB changed monitoring at boxHow we leverage it’s abilities for day-to-day management of MySQL DBs
  2. Youprobablyhave the perconacactigraphs and monitoring plugins
  3. Youaddsomeothernagioschecks for funedgecases
  4. And you use different tools from the percona toolkit like:StalkPoor man’s profiler (PMP)Query Digest
  5. Suddenly finding problems and correlating issues is difficultMaybe you don’t have a NOC yetMaybe you do, and they need better graphs
  6. IT’S BIGGER ON THE INSIDE – just kiddingFast!Easy to build graphs on the flyHella easy to scale – just add nodes (HBase or TSDs)Very easy to put data into it – NEXT SLIDES TALK ABOUT THIS YO
  7. Running threads follows the CPU spikes PERFECTLYBox has a “long query” killer that gets more aggressive as more threads stack upShould get a look at queries on the server
  8. Zoom in to get the exact time interval
  9. Know the exact time of a high stack upGo to check Box Anemometer to see what query is there
  10. This is the URL for thatCan easily paste this to anyone to see the same interactive graph
  11. If you prefer text, that’s also an option via APIYou can build cool tools using the APIWeek over Week graphsSimplifies anomaly detectionURL is pretty simpleEffectively just use “q?” and add “&ascii”
  12. Get audit log:LoginsTypes of statements issuedEtc.
  13. Get performance information about:Row and index change activityRow read activity
  14. Generate daily reports of:Are auto increments columns nearing a boundary on a table?Number of records in a tableSize of a datafile for a table
  15. Using pt-tcp-modelAllows us to identify when server stops doing work5min interval
  16. Aggregate graphs are the defaultDrill down only when problems in aggregate
  17. Aggregatesare thedefault–shift in thinking from lookingatspecificimportantservers.Zooming in on a timeslice was painfullymanual– I wroteup a patch to addmouse-zooming and upstreamed. Thiscementedopentsdb as a powerful monitoring tool for Box, overnightAuto-suggest for metricsisspotty– we wrote a quick cron job that dumps full metric list into JSON “Graphs aren’t pretty” – a few changes to the base GNUPlot options solved this. There’s also a “Smooth” option in the interface nowMigrating from POC – we had a single-node setup for the longest time until that fell over...a lotPlan for 3+ machines – it’s enough to run all the needed bits for a light-weight distributed HBase and TSD setupData pruning – ~4 bytes per metric before HDFS replication add up quicklymysql_tcollector - 370 metrics -- ~1.5k per server. X 30s interval = ~4.2MB/dayeither have a plan to prune old data or build out extra capacity and predict storage needs per server/metric added