Nagios Conference 2013 - Thomas Dunbar - Building Technology for Storage Systems Monitoring

778 views

Published on

Thomas Dunbar's presentation on Building Technology for Storage Systems Monitoring.
The presentation was given during the Nagios World Conference North America held Sept 20-Oct 2nd, 2013 in Saint Paul, MN. For more information on the conference (including photos and videos), visit: http://go.nagios.com/nwcna

Published in: Technology, Business
  • Be the first to comment

  • Be the first to like this

Nagios Conference 2013 - Thomas Dunbar - Building Technology for Storage Systems Monitoring

  1. 1. Building Technology for Storage Systems Monitoring Intermountain HealthCare Thomas Gwyn Dunbar Thomas.Dunbar@imail.org
  2. 2. References & Introduction * http://content.healthaffairs.org/content/30/6/1185.full.html * nagios.org, etc * Nagios: Building Enterprise-Grade Monitoring Infrastructure for Systems and Monitoring, 2nd ed., David Jacobsen * Unix Programming Environment, Kernighan & Pike * After Virtue, 3rd ed, Alasdair MacIntyre * Purgatorio, Dante - since Nagios ain’t gonna insist on sainthood 
  3. 3. IHC and IT Intermountain Healthcare is an internationally recognized, nonprofit system of 22 hospitals, a Medical Group with more than 185 physician clinics, and an affiliated health insurance company, SelectHealth. Our 33,000 employees serve patients and plan members in Utah and southeastern Idaho. IHC has an annual budget of around 5 billion dollars. Datacenters in Plano, TX and Salt Lake City, UT and Ogden, UT providing high availability systems with over 5 petabytes of storage (over 12000 spindles) using IBM DS8000 for tier 1 and Netapp for other storage. In-house developed applications run on top of multiple Oracle databases over 15TB in size. CA Service Desk/CA Spectrum/Xmatters; Nagios
  4. 4. Monitoring Trend at IHC
  5. 5. Storage’s Nagios Servers while SA team moving away from Nagios, Storage is moving to it: Using 3.5, with check_mk and pnp4nagios DNX, if need be Our own servers for business reasons Integration with CA Spectrum/Service Desk, etc
  6. 6. Storage Hardware Brocade switches, IBM DS SAN, SVC & Netapp
  7. 7. This Talk’s Perspective Comprehensive monitoring is a major, site specific application. Major applications become very difficult to replace (e.g. air traffic control, IHC systems) Hence, let’s consider fundamentals
  8. 8. Worldviews * What we look through, not what we look at * Tempts us to think it is the only way to see * Scientific: what can we know, and how * Technological: what can we build, and how * Context
  9. 9. Strategies * Building and Growth * Inputs and Feedback * Planning * Personality
  10. 10. Building Technology Coherence Clarity Continuity
  11. 11. Spectrum of Traps EventMessage: Thu 05 Sep, 2013 - 14:47:23 - Device ********** of type NetAppONTAPDev is no longer responding to primary management requests (e.g. SNMP) CA Spectrum and Nagios
  12. 12. Graphing: Time Series Down the road…correlation

×