SlideShare a Scribd company logo
1 of 32
Real World Uses for Nagios APIs 
Janice Singh 
janice.s.singh@nasa.gov
Agenda 
This presentation describes the Nagios 4 APIs and how 
the NASA Advanced Supercomputing at Ames 
Research Center is employing them to upgrade its 
graphical status display (the HUD) and explain why it’s 
worth trying to use them yourselves. 
Janice S Singh – janice.s.singh@nasa.gov 2
The HUD: 
Visualization of the Center Status 
Janice S Singh – janice.s.singh@nasa.gov 3
Monitored Resources 
• Pleiades 
– 11,176-node SGI ICE supercluster 
– 184,800 cores (plus 32,768 GPU cores) 
• Frontend systems 
• Hyperwall visualization cluster 
• Tape Storage - pDMF cluster 
• NFS servers for /home on computing systems 
• Lustre scratch filesystems with multiple servers 
• PBS (Portable Batch System) job scheduler 
Ref: http://www.nas.nasa.gov/hecc/ 
Janice S Singh – janice.s.singh@nasa.gov 4
Nagios 4 Application Programming Interface 
• No additional setup required 
• Returns JSON output – multi-language support … 
• Three kinds of APIs 
– Archive 
– Object 
– Status 
• Run from the cgi-bin directory 
• Each of the APIs have a help query 
– domain.com/nagios/cgi-bin/statusjson.cgi?query=help 
– Also gives help if there is an error in the query 
Janice S Singh – janice.s.singh@nasa.gov 5
JSON example 
http://lnxsrv78/nagios4/cgi-bin/objectjson.cgi? 
query=hostgroup&hostgroup=tools 
"data": { 
"hostgroup": { 
"group_name": "tools", 
"alias": "Tools Group", 
"members": [ 
"lamsdb", 
"lamsweb", 
"lnxsrv107", 
"nasrunner", 
"remedy", 
"reports" 
], 
"notes": "", 
"notes_url": "", 
"action_url": "" 
} 
} 
Janice S Singh – janice.s.singh@nasa.gov 6
Original Data Flow 
network firewall (The Enclave) 
nrpe nrpe 
Dedicated Nagios Node 
nsca 
nsca 
nrpe 
ssh 
Cluster 
Compute Node 
Remote Node 
Web Server 
nsca 
nagios 
nagios.cmd 
HUD format 
datagg 
nagios2.cmd 
nagios 
nagios 
nagios web 
interface HUD 
orange - pipe file 
green - text file 
purple - web site 
HUD buffer 
Janice Singh - janice.s.singh@nasa.gov 7 
downtime.log
Nagios 4 Benefits 
• Upgrading simplified configuration file 
– Frequent system configuration changes 
– Error prone 
– Time consuming 
• Was one file: 17,835 lines; now 23 files: 9,121 lines 
• Majority of the cleanup was using hostgroups 
• APIs eliminate datagg configuration file 
Janice S Singh – janice.s.singh@nasa.gov 8
Modified Data Flow 
network firewall (The Enclave) 
Cluster 
nrpe nrpe 
Dedicated Nagios Node 
nrdp 
nrdp 
nrpe 
ssh 
Compute Node 
Remote Node 
Web Server 
nagios 
nagpopd 
nagios 
nagios 
nagios web 
interface 
HUD green - flat file 
purple - web site 
HUD buffer 
Janice Singh - janice.s.singh@nasa.gov 9
Data Transfer with NRDP vs NSCA 
• Only using one pipe allows use of nrdp 
• Removing datagg layer allows using nagios 
as it was intended 
• nrdp’s larger file transfer simplifies process 
–Previously had to split/reassemble 
–Kernel limit may cause split/reassemble 
• No longer need to overload the perfdata 
Janice S Singh – janice.s.singh@nasa.gov 10
API Type - Archive 
• Gives historical information based on var/archives 
– Availability 
– Alerts 
– Notifications 
• Based on timestamps that you give it 
http://lnxsrv78/nagios4/cgi-bin/archivejson.cgi? 
query=availability&availabilityobjecttype=hosts& 
hostname=pbspl233b&starttime=-604800& 
endtime=-0 
Janice S Singh – janice.s.singh@nasa.gov 11
API Type - Object 
Mirrors what your nagios configuration is 
•Hosts 
•Services 
•Contacts 
•Commands 
•Dependencies 
•etc. 
http://lnxsrv78/nagios4/cgi-bin/objectjson.cgi? 
query=hostgroup&hostgroup=tools 
Janice S Singh – janice.s.singh@nasa.gov 12
API Type - Status 
Gives the current state of nagios checks 
•Host 
•Service 
•Comment 
•Downtime 
http://lnxsrv78/nagios4/cgi-bin/statusjson.cgi? 
query=hostlist&formatoptions=enumerate& 
hostgroup=tools 
Janice S Singh – janice.s.singh@nasa.gov 13
Status API Post Processing 
• The API return codes are different than nagios 
• nagpopd converts for HUD 
Status Code (From Nagios To Hud): 
Pending: 1 => 6 
Ok: 2 => 0 
Warning: 4 => 1 
Unknown: 8 => 3 
Critical: 16 => 2 
Janice S Singh – janice.s.singh@nasa.gov 14
API GUI Tool 
Tool to figure out the variables for the APIs 
•Display builds the query 
– Dropdowns provide only relevant variables 
– Displays and executes the query 
– Displays the resulting JSON 
– Hovering over the input gives you help tips 
•domain.com/nagios/jsonquery.html 
Janice S Singh – janice.s.singh@nasa.gov 15
API GUI Tool Screenshot 
Janice S Singh – janice.s.singh@nasa.gov 16
API GUI Tool Hover Example 
Janice S Singh – janice.s.singh@nasa.gov 17
NAS Use of APIs 
• nagpopd 
– datagg replacement 
– API for object model 
– API for status 
• Scheduled downtime handling 
Janice S Singh – janice.s.singh@nasa.gov 18
Using API for nagpopd 
Uses objectJSON: 
•Get the structure directly from the API 
•Eliminates separate HUD config file 
– Duplicate effort 
– Human errors 
– Inertia (resist making changes) 
•HUD configuration put into nagios config 
•HUD content uses custom variables 
Janice S Singh – janice.s.singh@nasa.gov 19
NAS Local Process (nagpopd) 
Prepares HUD interfacing file: 
•Object Model 
– Loaded at startup from API queries 
– Perl, but could be any OO language 
– Can apply to other processing needs 
– Specific processing via Service subclassing 
•Some objects created from custom variables 
– Some hosts form Domains 
– MultiServiceGroup for shared filesystem servers 
Janice S Singh – janice.s.singh@nasa.gov 20
Object Model 
NII 
System:: 
Main 
System:: 
Config 
System:: 
Encode 
System:: 
Log 
System:: 
Query 
System:: 
Service2Object 
Objects:: 
Domain 
Objects:: 
Host 
Objects:: 
HostGroup 
Objects:: 
MultiServiceGroup 
Objects:: 
Service 
Objects:: 
A_Service 
Objects:: 
B_Service 
Objects:: 
Z_Service …
API Queries 
• Object JSON used on startup to create the layout: 
– objectjson.cgi?query=hostlist&details=true 
– objectjson.cgi?query=hostgrouplist&details=true 
– objectjson.cgi?query=servicelist&details=true 
– objectjson.cgi?query=servicegrouplist&details=true 
• Status JSON queried in a loop to get latest data 
– statusjson.cgi?query=servicelist&details=true 
Janice S Singh – janice.s.singh@nasa.gov 22
Processing Status Information 
• Generic Service object: 
– Default process ::setStatus (no changes) 
– Default output ::writeHUDb (reformat for HUD) 
– Other output methods easily added 
• ::writeJSON (planned) 
• ::writeHTML (later version) 
• others: MySQL commands, etc 
• Service Subclass overrides methods: 
– Handles service unique process or output 
– One array maps service name to object.pm 
Janice S Singh – janice.s.singh@nasa.gov 23
Scheduled Downtime Handling 
• Old solution edited downtime.log 
• When host is down, nagios stops checking it 
• Used to sync with external program (schedule) … 
– Previous solution required shadow host 
• pleiades – actual host could be down 
• Pleiades – shadow never down 
– Now able to use APIs… 
Janice S Singh – janice.s.singh@nasa.gov 24 
Host_a 
host_a
External Program Use 
• External program (command line interface) 
$ schedule all 
ALEX 10/06/2014 10:00-10:25 10/06/2014 Raid Maintenance 
SUSAN 10/06/2014 10:00-10:25 10/06/2014 RAID maintenance 
REMEDY 10/06/2014 12:30-12:40 10/06/2014 Restart to resolve issue. 
$ 
• query=downtimelist&formatoptions=enumerate& 
details=true 
• Merges and updates nagios downtimelist … 
Janice S Singh – janice.s.singh@nasa.gov 25
Updating downtimelist 
• Use nagios external command feature 
– SCHEDULE_HOST_DOWNTIME;<host_name>; 
<start_time>;<end_time>;<fixed>;<trigger_id>; 
<duration>;<author>;<comment> 
– SCHEDULE_HOST_DOWNTIME;pioneer;1412626315; 
1412626233;1;0;7200;janice;just a test 
• Documentation described in: 
http://old.nagios.org/developerinfo/externalcommands/commandlist.Janice S Singh – janice.s.singh@nasa.gov 26
Hiccups 
Fixed by Nagios support 
•Custom variables didn’t show up in JSON output 
•Percent signs broke the JSON … sometimes fatally 
•JSON output was limited to 8k 
•Newlines didn’t show up in output 
Janice S Singh – janice.s.singh@nasa.gov 27
Hiccups 
• We have one plugin that outputs so much data it can’t be 
passed on the command line, so nrdp breaks. 
– Kernel limitation 
– Will have to send in packets 
• Having to have nsca and nrdp work at the same time 
Janice S Singh – janice.s.singh@nasa.gov 28
Future Plans 
• AJAX-style updates to only 
update the part of the page 
that needs it 
• Use the other information we 
get from the APIs 
– When a service is acknowledged 
– Use archive data to display 
alerts based on trends 
Janice S Singh – janice.s.singh@nasa.gov 29
Conclusion 
Using nagios 4 APIs has made our process much 
easier and will do more so in the future 
•Simplified configurations 
•Enabled object model 
•Improved the flow 
•Can communicate with external processes 
•Good customer support 
Janice S Singh – janice.s.singh@nasa.gov 30
Questions? 
Janice S Singh – janice.s.singh@nasa.gov 31
Thank You 
Janice Singh 
janice.s.singh@nasa.gov

More Related Content

What's hot

Dave Williams - Nagios Log Server - Practical Experience
Dave Williams - Nagios Log Server - Practical ExperienceDave Williams - Nagios Log Server - Practical Experience
Dave Williams - Nagios Log Server - Practical ExperienceNagios
 
Nagios Conference 2013 - Eric Stanley and Andy Brist - API and Nagios
Nagios Conference 2013 - Eric Stanley and Andy Brist - API and NagiosNagios Conference 2013 - Eric Stanley and Andy Brist - API and Nagios
Nagios Conference 2013 - Eric Stanley and Andy Brist - API and NagiosNagios
 
Nagios Conference 2013 - Eric Stanley - Whats New Core 4
Nagios Conference 2013 - Eric Stanley - Whats New Core 4Nagios Conference 2013 - Eric Stanley - Whats New Core 4
Nagios Conference 2013 - Eric Stanley - Whats New Core 4Nagios
 
Nagios Conference 2012 - Mike Weber - Failover
Nagios Conference 2012 - Mike Weber - FailoverNagios Conference 2012 - Mike Weber - Failover
Nagios Conference 2012 - Mike Weber - FailoverNagios
 
Andrew Nelson - Zabbix and SNMP on Linux
Andrew Nelson - Zabbix and SNMP on LinuxAndrew Nelson - Zabbix and SNMP on Linux
Andrew Nelson - Zabbix and SNMP on LinuxZabbix
 
Learning Nagios module 1
Learning Nagios module 1Learning Nagios module 1
Learning Nagios module 1Jayant Chutke
 
Matt Bruzek - Monitoring Your Public Cloud With Nagios
Matt Bruzek - Monitoring Your Public Cloud With NagiosMatt Bruzek - Monitoring Your Public Cloud With Nagios
Matt Bruzek - Monitoring Your Public Cloud With NagiosNagios
 
Monitoring NGINX (plus): key metrics and how-to
Monitoring NGINX (plus): key metrics and how-toMonitoring NGINX (plus): key metrics and how-to
Monitoring NGINX (plus): key metrics and how-toDatadog
 
Nagios Conference 2011 - Mike Weber - Training: Reducing Nagios Server Load ...
Nagios Conference 2011 - Mike Weber - Training:  Reducing Nagios Server Load ...Nagios Conference 2011 - Mike Weber - Training:  Reducing Nagios Server Load ...
Nagios Conference 2011 - Mike Weber - Training: Reducing Nagios Server Load ...Nagios
 
Maximizing PHP Performance with NGINX
Maximizing PHP Performance with NGINXMaximizing PHP Performance with NGINX
Maximizing PHP Performance with NGINXNGINX, Inc.
 
5 things you didn't know nginx could do
5 things you didn't know nginx could do5 things you didn't know nginx could do
5 things you didn't know nginx could dosarahnovotny
 
Tick Stack - Listen your infrastructure and please sleep
Tick Stack - Listen your infrastructure and please sleepTick Stack - Listen your infrastructure and please sleep
Tick Stack - Listen your infrastructure and please sleepGianluca Arbezzano
 
Keynote: Scaling Sensu Go
Keynote: Scaling Sensu GoKeynote: Scaling Sensu Go
Keynote: Scaling Sensu GoSensu Inc.
 
OSMC 2021 | Icinga-Installer – the easy way to your Icinga
OSMC 2021 | Icinga-Installer – the easy way to your IcingaOSMC 2021 | Icinga-Installer – the easy way to your Icinga
OSMC 2021 | Icinga-Installer – the easy way to your IcingaNETWAYS
 
NGINX Installation and Tuning
NGINX Installation and TuningNGINX Installation and Tuning
NGINX Installation and TuningNGINX, Inc.
 

What's hot (20)

Dave Williams - Nagios Log Server - Practical Experience
Dave Williams - Nagios Log Server - Practical ExperienceDave Williams - Nagios Log Server - Practical Experience
Dave Williams - Nagios Log Server - Practical Experience
 
Nagios Conference 2013 - Eric Stanley and Andy Brist - API and Nagios
Nagios Conference 2013 - Eric Stanley and Andy Brist - API and NagiosNagios Conference 2013 - Eric Stanley and Andy Brist - API and Nagios
Nagios Conference 2013 - Eric Stanley and Andy Brist - API and Nagios
 
Nrpe
NrpeNrpe
Nrpe
 
Nagios Conference 2013 - Eric Stanley - Whats New Core 4
Nagios Conference 2013 - Eric Stanley - Whats New Core 4Nagios Conference 2013 - Eric Stanley - Whats New Core 4
Nagios Conference 2013 - Eric Stanley - Whats New Core 4
 
Nagios Conference 2012 - Mike Weber - Failover
Nagios Conference 2012 - Mike Weber - FailoverNagios Conference 2012 - Mike Weber - Failover
Nagios Conference 2012 - Mike Weber - Failover
 
Nagios
NagiosNagios
Nagios
 
Andrew Nelson - Zabbix and SNMP on Linux
Andrew Nelson - Zabbix and SNMP on LinuxAndrew Nelson - Zabbix and SNMP on Linux
Andrew Nelson - Zabbix and SNMP on Linux
 
Learning Nagios module 1
Learning Nagios module 1Learning Nagios module 1
Learning Nagios module 1
 
Matt Bruzek - Monitoring Your Public Cloud With Nagios
Matt Bruzek - Monitoring Your Public Cloud With NagiosMatt Bruzek - Monitoring Your Public Cloud With Nagios
Matt Bruzek - Monitoring Your Public Cloud With Nagios
 
Monitoring NGINX (plus): key metrics and how-to
Monitoring NGINX (plus): key metrics and how-toMonitoring NGINX (plus): key metrics and how-to
Monitoring NGINX (plus): key metrics and how-to
 
Nagios Conference 2011 - Mike Weber - Training: Reducing Nagios Server Load ...
Nagios Conference 2011 - Mike Weber - Training:  Reducing Nagios Server Load ...Nagios Conference 2011 - Mike Weber - Training:  Reducing Nagios Server Load ...
Nagios Conference 2011 - Mike Weber - Training: Reducing Nagios Server Load ...
 
Maximizing PHP Performance with NGINX
Maximizing PHP Performance with NGINXMaximizing PHP Performance with NGINX
Maximizing PHP Performance with NGINX
 
Automating Zabbix with Puppet (Werner Dijkerman / 26-11-2015)
Automating Zabbix with Puppet (Werner Dijkerman / 26-11-2015)Automating Zabbix with Puppet (Werner Dijkerman / 26-11-2015)
Automating Zabbix with Puppet (Werner Dijkerman / 26-11-2015)
 
5 things you didn't know nginx could do
5 things you didn't know nginx could do5 things you didn't know nginx could do
5 things you didn't know nginx could do
 
Tick Stack - Listen your infrastructure and please sleep
Tick Stack - Listen your infrastructure and please sleepTick Stack - Listen your infrastructure and please sleep
Tick Stack - Listen your infrastructure and please sleep
 
Keynote: Scaling Sensu Go
Keynote: Scaling Sensu GoKeynote: Scaling Sensu Go
Keynote: Scaling Sensu Go
 
OSMC 2021 | Icinga-Installer – the easy way to your Icinga
OSMC 2021 | Icinga-Installer – the easy way to your IcingaOSMC 2021 | Icinga-Installer – the easy way to your Icinga
OSMC 2021 | Icinga-Installer – the easy way to your Icinga
 
NGINX Installation and Tuning
NGINX Installation and TuningNGINX Installation and Tuning
NGINX Installation and Tuning
 
Sensu
SensuSensu
Sensu
 
Nginx in production
Nginx in productionNginx in production
Nginx in production
 

Similar to Nagios Conference 2014 - Janice Singh - Real World Uses for Nagios APIs

Nagios Conference 2013 - Janice Singh - Visualization of Monitoring Data at t...
Nagios Conference 2013 - Janice Singh - Visualization of Monitoring Data at t...Nagios Conference 2013 - Janice Singh - Visualization of Monitoring Data at t...
Nagios Conference 2013 - Janice Singh - Visualization of Monitoring Data at t...Nagios
 
Icinga 2009 at OSMC
Icinga 2009 at OSMCIcinga 2009 at OSMC
Icinga 2009 at OSMCIcinga
 
OSMC 2011 | Case Study - Icinga at Hyves.nl by Jeffrey Lensen
OSMC 2011 | Case Study - Icinga at Hyves.nl by Jeffrey LensenOSMC 2011 | Case Study - Icinga at Hyves.nl by Jeffrey Lensen
OSMC 2011 | Case Study - Icinga at Hyves.nl by Jeffrey LensenNETWAYS
 
OSMC 2009 | Icinga by Icinga Team
OSMC 2009 | Icinga by Icinga TeamOSMC 2009 | Icinga by Icinga Team
OSMC 2009 | Icinga by Icinga TeamNETWAYS
 
Migrating deployment processes and Continuous Integration at SAP SE
Migrating deployment processes and Continuous Integration at SAP SEMigrating deployment processes and Continuous Integration at SAP SE
Migrating deployment processes and Continuous Integration at SAP SEB1 Systems GmbH
 
NGINX Plus R20 Webinar EMEA
NGINX Plus R20 Webinar EMEANGINX Plus R20 Webinar EMEA
NGINX Plus R20 Webinar EMEANGINX, Inc.
 
Running Airflow Workflows as ETL Processes on Hadoop
Running Airflow Workflows as ETL Processes on HadoopRunning Airflow Workflows as ETL Processes on Hadoop
Running Airflow Workflows as ETL Processes on Hadoopclairvoyantllc
 
DNS_Tutorial 2.pptx
DNS_Tutorial 2.pptxDNS_Tutorial 2.pptx
DNS_Tutorial 2.pptxviditsir
 
MySQL performance monitoring using Statsd and Graphite
MySQL performance monitoring using Statsd and GraphiteMySQL performance monitoring using Statsd and Graphite
MySQL performance monitoring using Statsd and GraphiteDB-Art
 
The Autobahn Has No Speed Limit - Your XPages Shouldn't Either!
The Autobahn Has No Speed Limit - Your XPages Shouldn't Either!The Autobahn Has No Speed Limit - Your XPages Shouldn't Either!
The Autobahn Has No Speed Limit - Your XPages Shouldn't Either!Teamstudio
 
SharePoint Saturday San Antonio: SharePoint 2010 Performance
SharePoint Saturday San Antonio: SharePoint 2010 PerformanceSharePoint Saturday San Antonio: SharePoint 2010 Performance
SharePoint Saturday San Antonio: SharePoint 2010 PerformanceBrian Culver
 
Using NGINX as an Effective and Highly Available Content Cache
Using NGINX as an Effective and Highly Available Content CacheUsing NGINX as an Effective and Highly Available Content Cache
Using NGINX as an Effective and Highly Available Content CacheKevin Jones
 
Automated Data Synchronization: Data Loader, Data Mirror & Beyond
Automated Data Synchronization: Data Loader, Data Mirror & BeyondAutomated Data Synchronization: Data Loader, Data Mirror & Beyond
Automated Data Synchronization: Data Loader, Data Mirror & BeyondJeremyOtt5
 
Automating your php infrastructure with the zend server api
Automating your php infrastructure with the zend server apiAutomating your php infrastructure with the zend server api
Automating your php infrastructure with the zend server apiYonni Mendes
 
NGINX Plus R20 Webinar
NGINX Plus R20 WebinarNGINX Plus R20 Webinar
NGINX Plus R20 WebinarNGINX, Inc.
 
AWS Game Analytics - GDC 2014
AWS Game Analytics - GDC 2014AWS Game Analytics - GDC 2014
AWS Game Analytics - GDC 2014Nate Wiger
 
Scaling 100PB Data Warehouse in Cloud
Scaling 100PB Data Warehouse in CloudScaling 100PB Data Warehouse in Cloud
Scaling 100PB Data Warehouse in CloudChangshu Liu
 
What's new in SQL Server 2017
What's new in SQL Server 2017What's new in SQL Server 2017
What's new in SQL Server 2017Hasan Savran
 
Dynamic SSL Certificates and Other New Features in NGINX Plus R18 and NGINX O...
Dynamic SSL Certificates and Other New Features in NGINX Plus R18 and NGINX O...Dynamic SSL Certificates and Other New Features in NGINX Plus R18 and NGINX O...
Dynamic SSL Certificates and Other New Features in NGINX Plus R18 and NGINX O...NGINX, Inc.
 

Similar to Nagios Conference 2014 - Janice Singh - Real World Uses for Nagios APIs (20)

Nagios Conference 2013 - Janice Singh - Visualization of Monitoring Data at t...
Nagios Conference 2013 - Janice Singh - Visualization of Monitoring Data at t...Nagios Conference 2013 - Janice Singh - Visualization of Monitoring Data at t...
Nagios Conference 2013 - Janice Singh - Visualization of Monitoring Data at t...
 
Icinga 2009 at OSMC
Icinga 2009 at OSMCIcinga 2009 at OSMC
Icinga 2009 at OSMC
 
OSMC 2011 | Case Study - Icinga at Hyves.nl by Jeffrey Lensen
OSMC 2011 | Case Study - Icinga at Hyves.nl by Jeffrey LensenOSMC 2011 | Case Study - Icinga at Hyves.nl by Jeffrey Lensen
OSMC 2011 | Case Study - Icinga at Hyves.nl by Jeffrey Lensen
 
OSMC 2009 | Icinga by Icinga Team
OSMC 2009 | Icinga by Icinga TeamOSMC 2009 | Icinga by Icinga Team
OSMC 2009 | Icinga by Icinga Team
 
Migrating deployment processes and Continuous Integration at SAP SE
Migrating deployment processes and Continuous Integration at SAP SEMigrating deployment processes and Continuous Integration at SAP SE
Migrating deployment processes and Continuous Integration at SAP SE
 
NGINX Plus R20 Webinar EMEA
NGINX Plus R20 Webinar EMEANGINX Plus R20 Webinar EMEA
NGINX Plus R20 Webinar EMEA
 
Running Airflow Workflows as ETL Processes on Hadoop
Running Airflow Workflows as ETL Processes on HadoopRunning Airflow Workflows as ETL Processes on Hadoop
Running Airflow Workflows as ETL Processes on Hadoop
 
Ci for all
Ci for allCi for all
Ci for all
 
DNS_Tutorial 2.pptx
DNS_Tutorial 2.pptxDNS_Tutorial 2.pptx
DNS_Tutorial 2.pptx
 
MySQL performance monitoring using Statsd and Graphite
MySQL performance monitoring using Statsd and GraphiteMySQL performance monitoring using Statsd and Graphite
MySQL performance monitoring using Statsd and Graphite
 
The Autobahn Has No Speed Limit - Your XPages Shouldn't Either!
The Autobahn Has No Speed Limit - Your XPages Shouldn't Either!The Autobahn Has No Speed Limit - Your XPages Shouldn't Either!
The Autobahn Has No Speed Limit - Your XPages Shouldn't Either!
 
SharePoint Saturday San Antonio: SharePoint 2010 Performance
SharePoint Saturday San Antonio: SharePoint 2010 PerformanceSharePoint Saturday San Antonio: SharePoint 2010 Performance
SharePoint Saturday San Antonio: SharePoint 2010 Performance
 
Using NGINX as an Effective and Highly Available Content Cache
Using NGINX as an Effective and Highly Available Content CacheUsing NGINX as an Effective and Highly Available Content Cache
Using NGINX as an Effective and Highly Available Content Cache
 
Automated Data Synchronization: Data Loader, Data Mirror & Beyond
Automated Data Synchronization: Data Loader, Data Mirror & BeyondAutomated Data Synchronization: Data Loader, Data Mirror & Beyond
Automated Data Synchronization: Data Loader, Data Mirror & Beyond
 
Automating your php infrastructure with the zend server api
Automating your php infrastructure with the zend server apiAutomating your php infrastructure with the zend server api
Automating your php infrastructure with the zend server api
 
NGINX Plus R20 Webinar
NGINX Plus R20 WebinarNGINX Plus R20 Webinar
NGINX Plus R20 Webinar
 
AWS Game Analytics - GDC 2014
AWS Game Analytics - GDC 2014AWS Game Analytics - GDC 2014
AWS Game Analytics - GDC 2014
 
Scaling 100PB Data Warehouse in Cloud
Scaling 100PB Data Warehouse in CloudScaling 100PB Data Warehouse in Cloud
Scaling 100PB Data Warehouse in Cloud
 
What's new in SQL Server 2017
What's new in SQL Server 2017What's new in SQL Server 2017
What's new in SQL Server 2017
 
Dynamic SSL Certificates and Other New Features in NGINX Plus R18 and NGINX O...
Dynamic SSL Certificates and Other New Features in NGINX Plus R18 and NGINX O...Dynamic SSL Certificates and Other New Features in NGINX Plus R18 and NGINX O...
Dynamic SSL Certificates and Other New Features in NGINX Plus R18 and NGINX O...
 

More from Nagios

Nagios XI Best Practices
Nagios XI Best PracticesNagios XI Best Practices
Nagios XI Best PracticesNagios
 
Trevor McDonald - Nagios XI Under The Hood
Trevor McDonald  - Nagios XI Under The HoodTrevor McDonald  - Nagios XI Under The Hood
Trevor McDonald - Nagios XI Under The HoodNagios
 
Sean Falzon - Nagios - Resilient Notifications
Sean Falzon - Nagios - Resilient NotificationsSean Falzon - Nagios - Resilient Notifications
Sean Falzon - Nagios - Resilient NotificationsNagios
 
Marcus Rochelle - Landis+Gyr - Monitoring with Nagios Enterprise Edition
Marcus Rochelle - Landis+Gyr - Monitoring with Nagios Enterprise EditionMarcus Rochelle - Landis+Gyr - Monitoring with Nagios Enterprise Edition
Marcus Rochelle - Landis+Gyr - Monitoring with Nagios Enterprise EditionNagios
 
Mike Weber - Nagios and Group Deployment of Service Checks
Mike Weber - Nagios and Group Deployment of Service ChecksMike Weber - Nagios and Group Deployment of Service Checks
Mike Weber - Nagios and Group Deployment of Service ChecksNagios
 
Bryan Heden - Agile Networks - Using Nagios XI as the platform for Monitoring...
Bryan Heden - Agile Networks - Using Nagios XI as the platform for Monitoring...Bryan Heden - Agile Networks - Using Nagios XI as the platform for Monitoring...
Bryan Heden - Agile Networks - Using Nagios XI as the platform for Monitoring...Nagios
 
Lee Myers - What To Do When Nagios Notification Don't Meet Your Needs.
Lee Myers - What To Do When Nagios Notification Don't Meet Your Needs.Lee Myers - What To Do When Nagios Notification Don't Meet Your Needs.
Lee Myers - What To Do When Nagios Notification Don't Meet Your Needs.Nagios
 
Eric Loyd - Fractal Nagios
Eric Loyd - Fractal NagiosEric Loyd - Fractal Nagios
Eric Loyd - Fractal NagiosNagios
 
Marcelo Perazolo, Lead Software Architect, IBM Corporation - Monitoring a Pow...
Marcelo Perazolo, Lead Software Architect, IBM Corporation - Monitoring a Pow...Marcelo Perazolo, Lead Software Architect, IBM Corporation - Monitoring a Pow...
Marcelo Perazolo, Lead Software Architect, IBM Corporation - Monitoring a Pow...Nagios
 
Thomas Schmainda - Tracking Boeing Satellites With Nagios - Nagios World Conf...
Thomas Schmainda - Tracking Boeing Satellites With Nagios - Nagios World Conf...Thomas Schmainda - Tracking Boeing Satellites With Nagios - Nagios World Conf...
Thomas Schmainda - Tracking Boeing Satellites With Nagios - Nagios World Conf...Nagios
 
Nagios World Conference 2015 - Scott Wilkerson Opening
Nagios World Conference 2015 - Scott Wilkerson OpeningNagios World Conference 2015 - Scott Wilkerson Opening
Nagios World Conference 2015 - Scott Wilkerson OpeningNagios
 
Nrpe - Nagios Remote Plugin Executor. NRPE plugin for Nagios Core
Nrpe - Nagios Remote Plugin Executor. NRPE plugin for Nagios CoreNrpe - Nagios Remote Plugin Executor. NRPE plugin for Nagios Core
Nrpe - Nagios Remote Plugin Executor. NRPE plugin for Nagios CoreNagios
 
Nagios Log Server - Features
Nagios Log Server - FeaturesNagios Log Server - Features
Nagios Log Server - FeaturesNagios
 
Nagios Network Analyzer - Features
Nagios Network Analyzer - FeaturesNagios Network Analyzer - Features
Nagios Network Analyzer - FeaturesNagios
 
Nagios Conference 2014 - Dorance Martinez Cortes - Customizing Nagios
Nagios Conference 2014 - Dorance Martinez Cortes - Customizing NagiosNagios Conference 2014 - Dorance Martinez Cortes - Customizing Nagios
Nagios Conference 2014 - Dorance Martinez Cortes - Customizing NagiosNagios
 
Nagios Conference 2014 - Mike Weber - Nagios Rapid Deployment Options
Nagios Conference 2014 - Mike Weber - Nagios Rapid Deployment OptionsNagios Conference 2014 - Mike Weber - Nagios Rapid Deployment Options
Nagios Conference 2014 - Mike Weber - Nagios Rapid Deployment OptionsNagios
 
Nagios Conference 2014 - Eric Mislivec - Getting Started With Nagios Core
Nagios Conference 2014 - Eric Mislivec - Getting Started With Nagios CoreNagios Conference 2014 - Eric Mislivec - Getting Started With Nagios Core
Nagios Conference 2014 - Eric Mislivec - Getting Started With Nagios CoreNagios
 
Nagios Conference 2014 - Trevor McDonald - Monitoring The Physical World With...
Nagios Conference 2014 - Trevor McDonald - Monitoring The Physical World With...Nagios Conference 2014 - Trevor McDonald - Monitoring The Physical World With...
Nagios Conference 2014 - Trevor McDonald - Monitoring The Physical World With...Nagios
 
Nagios Conference 2014 - Andy Brist - Nagios XI Failover and HA Solutions
Nagios Conference 2014 - Andy Brist - Nagios XI Failover and HA SolutionsNagios Conference 2014 - Andy Brist - Nagios XI Failover and HA Solutions
Nagios Conference 2014 - Andy Brist - Nagios XI Failover and HA SolutionsNagios
 
Nagios Conference 2014 - Shamas Demoret - An Overview of Nagios Solutions
Nagios Conference 2014 - Shamas Demoret - An Overview of Nagios SolutionsNagios Conference 2014 - Shamas Demoret - An Overview of Nagios Solutions
Nagios Conference 2014 - Shamas Demoret - An Overview of Nagios SolutionsNagios
 

More from Nagios (20)

Nagios XI Best Practices
Nagios XI Best PracticesNagios XI Best Practices
Nagios XI Best Practices
 
Trevor McDonald - Nagios XI Under The Hood
Trevor McDonald  - Nagios XI Under The HoodTrevor McDonald  - Nagios XI Under The Hood
Trevor McDonald - Nagios XI Under The Hood
 
Sean Falzon - Nagios - Resilient Notifications
Sean Falzon - Nagios - Resilient NotificationsSean Falzon - Nagios - Resilient Notifications
Sean Falzon - Nagios - Resilient Notifications
 
Marcus Rochelle - Landis+Gyr - Monitoring with Nagios Enterprise Edition
Marcus Rochelle - Landis+Gyr - Monitoring with Nagios Enterprise EditionMarcus Rochelle - Landis+Gyr - Monitoring with Nagios Enterprise Edition
Marcus Rochelle - Landis+Gyr - Monitoring with Nagios Enterprise Edition
 
Mike Weber - Nagios and Group Deployment of Service Checks
Mike Weber - Nagios and Group Deployment of Service ChecksMike Weber - Nagios and Group Deployment of Service Checks
Mike Weber - Nagios and Group Deployment of Service Checks
 
Bryan Heden - Agile Networks - Using Nagios XI as the platform for Monitoring...
Bryan Heden - Agile Networks - Using Nagios XI as the platform for Monitoring...Bryan Heden - Agile Networks - Using Nagios XI as the platform for Monitoring...
Bryan Heden - Agile Networks - Using Nagios XI as the platform for Monitoring...
 
Lee Myers - What To Do When Nagios Notification Don't Meet Your Needs.
Lee Myers - What To Do When Nagios Notification Don't Meet Your Needs.Lee Myers - What To Do When Nagios Notification Don't Meet Your Needs.
Lee Myers - What To Do When Nagios Notification Don't Meet Your Needs.
 
Eric Loyd - Fractal Nagios
Eric Loyd - Fractal NagiosEric Loyd - Fractal Nagios
Eric Loyd - Fractal Nagios
 
Marcelo Perazolo, Lead Software Architect, IBM Corporation - Monitoring a Pow...
Marcelo Perazolo, Lead Software Architect, IBM Corporation - Monitoring a Pow...Marcelo Perazolo, Lead Software Architect, IBM Corporation - Monitoring a Pow...
Marcelo Perazolo, Lead Software Architect, IBM Corporation - Monitoring a Pow...
 
Thomas Schmainda - Tracking Boeing Satellites With Nagios - Nagios World Conf...
Thomas Schmainda - Tracking Boeing Satellites With Nagios - Nagios World Conf...Thomas Schmainda - Tracking Boeing Satellites With Nagios - Nagios World Conf...
Thomas Schmainda - Tracking Boeing Satellites With Nagios - Nagios World Conf...
 
Nagios World Conference 2015 - Scott Wilkerson Opening
Nagios World Conference 2015 - Scott Wilkerson OpeningNagios World Conference 2015 - Scott Wilkerson Opening
Nagios World Conference 2015 - Scott Wilkerson Opening
 
Nrpe - Nagios Remote Plugin Executor. NRPE plugin for Nagios Core
Nrpe - Nagios Remote Plugin Executor. NRPE plugin for Nagios CoreNrpe - Nagios Remote Plugin Executor. NRPE plugin for Nagios Core
Nrpe - Nagios Remote Plugin Executor. NRPE plugin for Nagios Core
 
Nagios Log Server - Features
Nagios Log Server - FeaturesNagios Log Server - Features
Nagios Log Server - Features
 
Nagios Network Analyzer - Features
Nagios Network Analyzer - FeaturesNagios Network Analyzer - Features
Nagios Network Analyzer - Features
 
Nagios Conference 2014 - Dorance Martinez Cortes - Customizing Nagios
Nagios Conference 2014 - Dorance Martinez Cortes - Customizing NagiosNagios Conference 2014 - Dorance Martinez Cortes - Customizing Nagios
Nagios Conference 2014 - Dorance Martinez Cortes - Customizing Nagios
 
Nagios Conference 2014 - Mike Weber - Nagios Rapid Deployment Options
Nagios Conference 2014 - Mike Weber - Nagios Rapid Deployment OptionsNagios Conference 2014 - Mike Weber - Nagios Rapid Deployment Options
Nagios Conference 2014 - Mike Weber - Nagios Rapid Deployment Options
 
Nagios Conference 2014 - Eric Mislivec - Getting Started With Nagios Core
Nagios Conference 2014 - Eric Mislivec - Getting Started With Nagios CoreNagios Conference 2014 - Eric Mislivec - Getting Started With Nagios Core
Nagios Conference 2014 - Eric Mislivec - Getting Started With Nagios Core
 
Nagios Conference 2014 - Trevor McDonald - Monitoring The Physical World With...
Nagios Conference 2014 - Trevor McDonald - Monitoring The Physical World With...Nagios Conference 2014 - Trevor McDonald - Monitoring The Physical World With...
Nagios Conference 2014 - Trevor McDonald - Monitoring The Physical World With...
 
Nagios Conference 2014 - Andy Brist - Nagios XI Failover and HA Solutions
Nagios Conference 2014 - Andy Brist - Nagios XI Failover and HA SolutionsNagios Conference 2014 - Andy Brist - Nagios XI Failover and HA Solutions
Nagios Conference 2014 - Andy Brist - Nagios XI Failover and HA Solutions
 
Nagios Conference 2014 - Shamas Demoret - An Overview of Nagios Solutions
Nagios Conference 2014 - Shamas Demoret - An Overview of Nagios SolutionsNagios Conference 2014 - Shamas Demoret - An Overview of Nagios Solutions
Nagios Conference 2014 - Shamas Demoret - An Overview of Nagios Solutions
 

Recently uploaded

Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraDeakin University
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentationphoebematthew05
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 

Recently uploaded (20)

Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Artificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning eraArtificial intelligence in the post-deep learning era
Artificial intelligence in the post-deep learning era
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
costume and set research powerpoint presentation
costume and set research powerpoint presentationcostume and set research powerpoint presentation
costume and set research powerpoint presentation
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 

Nagios Conference 2014 - Janice Singh - Real World Uses for Nagios APIs

  • 1. Real World Uses for Nagios APIs Janice Singh janice.s.singh@nasa.gov
  • 2. Agenda This presentation describes the Nagios 4 APIs and how the NASA Advanced Supercomputing at Ames Research Center is employing them to upgrade its graphical status display (the HUD) and explain why it’s worth trying to use them yourselves. Janice S Singh – janice.s.singh@nasa.gov 2
  • 3. The HUD: Visualization of the Center Status Janice S Singh – janice.s.singh@nasa.gov 3
  • 4. Monitored Resources • Pleiades – 11,176-node SGI ICE supercluster – 184,800 cores (plus 32,768 GPU cores) • Frontend systems • Hyperwall visualization cluster • Tape Storage - pDMF cluster • NFS servers for /home on computing systems • Lustre scratch filesystems with multiple servers • PBS (Portable Batch System) job scheduler Ref: http://www.nas.nasa.gov/hecc/ Janice S Singh – janice.s.singh@nasa.gov 4
  • 5. Nagios 4 Application Programming Interface • No additional setup required • Returns JSON output – multi-language support … • Three kinds of APIs – Archive – Object – Status • Run from the cgi-bin directory • Each of the APIs have a help query – domain.com/nagios/cgi-bin/statusjson.cgi?query=help – Also gives help if there is an error in the query Janice S Singh – janice.s.singh@nasa.gov 5
  • 6. JSON example http://lnxsrv78/nagios4/cgi-bin/objectjson.cgi? query=hostgroup&hostgroup=tools "data": { "hostgroup": { "group_name": "tools", "alias": "Tools Group", "members": [ "lamsdb", "lamsweb", "lnxsrv107", "nasrunner", "remedy", "reports" ], "notes": "", "notes_url": "", "action_url": "" } } Janice S Singh – janice.s.singh@nasa.gov 6
  • 7. Original Data Flow network firewall (The Enclave) nrpe nrpe Dedicated Nagios Node nsca nsca nrpe ssh Cluster Compute Node Remote Node Web Server nsca nagios nagios.cmd HUD format datagg nagios2.cmd nagios nagios nagios web interface HUD orange - pipe file green - text file purple - web site HUD buffer Janice Singh - janice.s.singh@nasa.gov 7 downtime.log
  • 8. Nagios 4 Benefits • Upgrading simplified configuration file – Frequent system configuration changes – Error prone – Time consuming • Was one file: 17,835 lines; now 23 files: 9,121 lines • Majority of the cleanup was using hostgroups • APIs eliminate datagg configuration file Janice S Singh – janice.s.singh@nasa.gov 8
  • 9. Modified Data Flow network firewall (The Enclave) Cluster nrpe nrpe Dedicated Nagios Node nrdp nrdp nrpe ssh Compute Node Remote Node Web Server nagios nagpopd nagios nagios nagios web interface HUD green - flat file purple - web site HUD buffer Janice Singh - janice.s.singh@nasa.gov 9
  • 10. Data Transfer with NRDP vs NSCA • Only using one pipe allows use of nrdp • Removing datagg layer allows using nagios as it was intended • nrdp’s larger file transfer simplifies process –Previously had to split/reassemble –Kernel limit may cause split/reassemble • No longer need to overload the perfdata Janice S Singh – janice.s.singh@nasa.gov 10
  • 11. API Type - Archive • Gives historical information based on var/archives – Availability – Alerts – Notifications • Based on timestamps that you give it http://lnxsrv78/nagios4/cgi-bin/archivejson.cgi? query=availability&availabilityobjecttype=hosts& hostname=pbspl233b&starttime=-604800& endtime=-0 Janice S Singh – janice.s.singh@nasa.gov 11
  • 12. API Type - Object Mirrors what your nagios configuration is •Hosts •Services •Contacts •Commands •Dependencies •etc. http://lnxsrv78/nagios4/cgi-bin/objectjson.cgi? query=hostgroup&hostgroup=tools Janice S Singh – janice.s.singh@nasa.gov 12
  • 13. API Type - Status Gives the current state of nagios checks •Host •Service •Comment •Downtime http://lnxsrv78/nagios4/cgi-bin/statusjson.cgi? query=hostlist&formatoptions=enumerate& hostgroup=tools Janice S Singh – janice.s.singh@nasa.gov 13
  • 14. Status API Post Processing • The API return codes are different than nagios • nagpopd converts for HUD Status Code (From Nagios To Hud): Pending: 1 => 6 Ok: 2 => 0 Warning: 4 => 1 Unknown: 8 => 3 Critical: 16 => 2 Janice S Singh – janice.s.singh@nasa.gov 14
  • 15. API GUI Tool Tool to figure out the variables for the APIs •Display builds the query – Dropdowns provide only relevant variables – Displays and executes the query – Displays the resulting JSON – Hovering over the input gives you help tips •domain.com/nagios/jsonquery.html Janice S Singh – janice.s.singh@nasa.gov 15
  • 16. API GUI Tool Screenshot Janice S Singh – janice.s.singh@nasa.gov 16
  • 17. API GUI Tool Hover Example Janice S Singh – janice.s.singh@nasa.gov 17
  • 18. NAS Use of APIs • nagpopd – datagg replacement – API for object model – API for status • Scheduled downtime handling Janice S Singh – janice.s.singh@nasa.gov 18
  • 19. Using API for nagpopd Uses objectJSON: •Get the structure directly from the API •Eliminates separate HUD config file – Duplicate effort – Human errors – Inertia (resist making changes) •HUD configuration put into nagios config •HUD content uses custom variables Janice S Singh – janice.s.singh@nasa.gov 19
  • 20. NAS Local Process (nagpopd) Prepares HUD interfacing file: •Object Model – Loaded at startup from API queries – Perl, but could be any OO language – Can apply to other processing needs – Specific processing via Service subclassing •Some objects created from custom variables – Some hosts form Domains – MultiServiceGroup for shared filesystem servers Janice S Singh – janice.s.singh@nasa.gov 20
  • 21. Object Model NII System:: Main System:: Config System:: Encode System:: Log System:: Query System:: Service2Object Objects:: Domain Objects:: Host Objects:: HostGroup Objects:: MultiServiceGroup Objects:: Service Objects:: A_Service Objects:: B_Service Objects:: Z_Service …
  • 22. API Queries • Object JSON used on startup to create the layout: – objectjson.cgi?query=hostlist&details=true – objectjson.cgi?query=hostgrouplist&details=true – objectjson.cgi?query=servicelist&details=true – objectjson.cgi?query=servicegrouplist&details=true • Status JSON queried in a loop to get latest data – statusjson.cgi?query=servicelist&details=true Janice S Singh – janice.s.singh@nasa.gov 22
  • 23. Processing Status Information • Generic Service object: – Default process ::setStatus (no changes) – Default output ::writeHUDb (reformat for HUD) – Other output methods easily added • ::writeJSON (planned) • ::writeHTML (later version) • others: MySQL commands, etc • Service Subclass overrides methods: – Handles service unique process or output – One array maps service name to object.pm Janice S Singh – janice.s.singh@nasa.gov 23
  • 24. Scheduled Downtime Handling • Old solution edited downtime.log • When host is down, nagios stops checking it • Used to sync with external program (schedule) … – Previous solution required shadow host • pleiades – actual host could be down • Pleiades – shadow never down – Now able to use APIs… Janice S Singh – janice.s.singh@nasa.gov 24 Host_a host_a
  • 25. External Program Use • External program (command line interface) $ schedule all ALEX 10/06/2014 10:00-10:25 10/06/2014 Raid Maintenance SUSAN 10/06/2014 10:00-10:25 10/06/2014 RAID maintenance REMEDY 10/06/2014 12:30-12:40 10/06/2014 Restart to resolve issue. $ • query=downtimelist&formatoptions=enumerate& details=true • Merges and updates nagios downtimelist … Janice S Singh – janice.s.singh@nasa.gov 25
  • 26. Updating downtimelist • Use nagios external command feature – SCHEDULE_HOST_DOWNTIME;<host_name>; <start_time>;<end_time>;<fixed>;<trigger_id>; <duration>;<author>;<comment> – SCHEDULE_HOST_DOWNTIME;pioneer;1412626315; 1412626233;1;0;7200;janice;just a test • Documentation described in: http://old.nagios.org/developerinfo/externalcommands/commandlist.Janice S Singh – janice.s.singh@nasa.gov 26
  • 27. Hiccups Fixed by Nagios support •Custom variables didn’t show up in JSON output •Percent signs broke the JSON … sometimes fatally •JSON output was limited to 8k •Newlines didn’t show up in output Janice S Singh – janice.s.singh@nasa.gov 27
  • 28. Hiccups • We have one plugin that outputs so much data it can’t be passed on the command line, so nrdp breaks. – Kernel limitation – Will have to send in packets • Having to have nsca and nrdp work at the same time Janice S Singh – janice.s.singh@nasa.gov 28
  • 29. Future Plans • AJAX-style updates to only update the part of the page that needs it • Use the other information we get from the APIs – When a service is acknowledged – Use archive data to display alerts based on trends Janice S Singh – janice.s.singh@nasa.gov 29
  • 30. Conclusion Using nagios 4 APIs has made our process much easier and will do more so in the future •Simplified configurations •Enabled object model •Improved the flow •Can communicate with external processes •Good customer support Janice S Singh – janice.s.singh@nasa.gov 30
  • 31. Questions? Janice S Singh – janice.s.singh@nasa.gov 31
  • 32. Thank You Janice Singh janice.s.singh@nasa.gov

Editor's Notes

  1. Picture with text overlaid?
  2. Point out NSCA and datagg – the subject of the modifications No downtime.log ? call HUD format a config file
  3. Will audience know pipes, perfdata removing datagg layer -&amp;gt; nagios writes nagios web interface
  4. I don’t use this – so all guessing. =)
  5. Not clear on sync? define schedule command
  6. explain the external command (schedule) explain how to use SCHEUDLE_HOST_DOWNTIME