Open source monitoring systems

Forthscale
ForthscaleCo-founder & CEO at Forthscale
Open Source
Monitoring systems
Your presenter
● Married +1
● CEO @ Forthscale systems
● Scalable infrastructure Architect
● Linux from 2.0, first distro is Biltmore
● Linux migration activist (from Unix and M$)
● nagios & forks from 2001
● monitoring expert with patents
Why monitor
● downtime suck
● losing money
● understanding your stack
● planning ahead
● analyzing what went wrong
● keeping inventory
● knowledge is control
What to monitor
● availability (ping, url, content ...)
● use of resources (CPU, RAM, disk … )
● benchmarking (throughput, request count …)
● events (logs, exceptions, dumps …)
● material (files, DBs, outputs … )
● anything else you might need
How to monitor
With some help of the open source
● Active
● Passive
● SNMP
● Scripts
● Frameworks
● Systems
What to do with data
based on time / object / state
● log (locally, externally )
● alert (someone, groupe, list)
● handle (or at least try to)
SNMP
● active and passive
● easy, out of the box support for basics
● widely deployed by default
● structured
● complicated for queries customization
● complicated to show trends
Net-SNMP package @ http://www.net-snmp.
org/
Scripts
● custom
● can do everything you want
● can be executed over ssh
● require programming skills
● need maintenance
● complicated to show trends
● do not show tactical overview
shell / perl / python
Frameworks
● scriptural frameworks such as watchdog
● full frameworks such as sensu
● structured
● easier in deployment
● supported
http://www.sensuapp.org
https://github.com/sebastien/monitoring
Systems
● Structured
● Extendable (plugins)
● Supported
● Complex
● Locking
● Not as scalable as you imagine
nagios / zenoss / zabbix
Monitoring (prv. known as watchdog)
● monitoring and data-collection daemon
● lightweight
● written in python
good for:
● to be notified when incidents happen
● automatic actions to be taken
● to collect statistics for further processing
Sensu
● lightweight
● written in python
consider themselves to be “monitoring router”
basically it is a framework that:
connects “check” scripts run across many
nodes with “handler” scripts run on one or
more Sensu servers
Systems
Everything packaged in one spot
● tactical
nagios / zenoss / zabbix
● acumalative
munin / cacti
● hybrids
nagios+cacti / icinga + munin
nagios / icinga / shinken
● de facto industry standard
● plugins for almost everything
● huge community
● text files for configuration
● clientless / clients
● alerting / handlers
● bad scalability
● trending via 3rd party
http://www.nagios.org/
zenoss
● written in python
● modern
● a very good Ajax gui
● split architecture (portal / process / data layer)
● only gui configuration
● scalable
● needs snmp
only basic core is open sourced
http://zenoss.com/
zabbix
● aimed to beat nagios
● some autodiscovery
● gui customization
● can show trends
● lightweight
● can correlate graphs
● cumbersome in configuration
● week escalation policy
http://www.zabbix.com/
munin
Munin is an open source client /server network
or system monitoring application that presents
output in graphs through a web interface
● stored to rrd
● makes graphs
● shows trends
● can alert
● very very very simple to deploy
● text based configuration
Summary
● nagios is still rock solid solution while
frameworks still are in 0.x version
● different frameworks look promising but require
high level of customization
● hybrids integrate nicely and can provide good
solution
● scalability is an issue for systems designed for
LAN use.
Forthscale contacts
Naor Weissmann
http://www.linkedin.com/in/naorweissmann
Cell: +972 (0)505 980005
Email: naorw@forthscale.com
EMEA office.
Email: info@forthscale.com
Web site: http://www.forthscale.com
Telephone: +972 (0)77 5280015
1 of 19

More Related Content

What's hot(20)

Centralised logging with ELK stackCentralised logging with ELK stack
Centralised logging with ELK stack
Simon Hanmer1.1K views
Presto meetup 2015-03-19 @FacebookPresto meetup 2015-03-19 @Facebook
Presto meetup 2015-03-19 @Facebook
Treasure Data, Inc.2.4K views
Presto@UberPresto@Uber
Presto@Uber
Zhenxiao Luo1.8K views
PrestoPresto
Presto
Chen Chun1.7K views
Presto at Hadoop Summit 2016Presto at Hadoop Summit 2016
Presto at Hadoop Summit 2016
kbajda3.4K views

Similar to Open source monitoring systems(20)

Archivematica Technical Training Diagnostics Guide (September 2018)Archivematica Technical Training Diagnostics Guide (September 2018)
Archivematica Technical Training Diagnostics Guide (September 2018)
Artefactual Systems - Archivematica546 views
Handout: 'Open Source Tools & Resources'Handout: 'Open Source Tools & Resources'
Handout: 'Open Source Tools & Resources'
BDPA Education and Technology Foundation3.7K views
Activity feeds (and more) at mate1Activity feeds (and more) at mate1
Activity feeds (and more) at mate1
Hisham Mardam-Bey236 views
Kick starting Network AutomationKick starting Network Automation
Kick starting Network Automation
Walid Shaari1.2K views
Zabbix Monitoring Platform Zabbix Monitoring Platform
Zabbix Monitoring Platform
Seyedmajid Etehadi3.1K views
Git presentationGit presentation
Git presentation
jordimash403 views
Free the Functions with Fn project!Free the Functions with Fn project!
Free the Functions with Fn project!
J On The Beach161 views

Recently uploaded(20)

The Research Portal of Catalonia: Growing more (information) & more (services)The Research Portal of Catalonia: Growing more (information) & more (services)
The Research Portal of Catalonia: Growing more (information) & more (services)
CSUC - Consorci de Serveis Universitaris de Catalunya51 views
CXL at OCPCXL at OCP
CXL at OCP
CXL Forum183 views
Web Dev - 1 PPT.pdfWeb Dev - 1 PPT.pdf
Web Dev - 1 PPT.pdf
gdsczhcet48 views
Green Leaf Consulting: Capabilities DeckGreen Leaf Consulting: Capabilities Deck
Green Leaf Consulting: Capabilities Deck
GreenLeafConsulting170 views
Java Platform Approach 1.0 - Picnic MeetupJava Platform Approach 1.0 - Picnic Meetup
Java Platform Approach 1.0 - Picnic Meetup
Rick Ossendrijver23 views

Open source monitoring systems

  • 2. Your presenter ● Married +1 ● CEO @ Forthscale systems ● Scalable infrastructure Architect ● Linux from 2.0, first distro is Biltmore ● Linux migration activist (from Unix and M$) ● nagios & forks from 2001 ● monitoring expert with patents
  • 3. Why monitor ● downtime suck ● losing money ● understanding your stack ● planning ahead ● analyzing what went wrong ● keeping inventory ● knowledge is control
  • 4. What to monitor ● availability (ping, url, content ...) ● use of resources (CPU, RAM, disk … ) ● benchmarking (throughput, request count …) ● events (logs, exceptions, dumps …) ● material (files, DBs, outputs … ) ● anything else you might need
  • 5. How to monitor With some help of the open source ● Active ● Passive ● SNMP ● Scripts ● Frameworks ● Systems
  • 6. What to do with data based on time / object / state ● log (locally, externally ) ● alert (someone, groupe, list) ● handle (or at least try to)
  • 7. SNMP ● active and passive ● easy, out of the box support for basics ● widely deployed by default ● structured ● complicated for queries customization ● complicated to show trends Net-SNMP package @ http://www.net-snmp. org/
  • 8. Scripts ● custom ● can do everything you want ● can be executed over ssh ● require programming skills ● need maintenance ● complicated to show trends ● do not show tactical overview shell / perl / python
  • 9. Frameworks ● scriptural frameworks such as watchdog ● full frameworks such as sensu ● structured ● easier in deployment ● supported http://www.sensuapp.org https://github.com/sebastien/monitoring
  • 10. Systems ● Structured ● Extendable (plugins) ● Supported ● Complex ● Locking ● Not as scalable as you imagine nagios / zenoss / zabbix
  • 11. Monitoring (prv. known as watchdog) ● monitoring and data-collection daemon ● lightweight ● written in python good for: ● to be notified when incidents happen ● automatic actions to be taken ● to collect statistics for further processing
  • 12. Sensu ● lightweight ● written in python consider themselves to be “monitoring router” basically it is a framework that: connects “check” scripts run across many nodes with “handler” scripts run on one or more Sensu servers
  • 13. Systems Everything packaged in one spot ● tactical nagios / zenoss / zabbix ● acumalative munin / cacti ● hybrids nagios+cacti / icinga + munin
  • 14. nagios / icinga / shinken ● de facto industry standard ● plugins for almost everything ● huge community ● text files for configuration ● clientless / clients ● alerting / handlers ● bad scalability ● trending via 3rd party http://www.nagios.org/
  • 15. zenoss ● written in python ● modern ● a very good Ajax gui ● split architecture (portal / process / data layer) ● only gui configuration ● scalable ● needs snmp only basic core is open sourced http://zenoss.com/
  • 16. zabbix ● aimed to beat nagios ● some autodiscovery ● gui customization ● can show trends ● lightweight ● can correlate graphs ● cumbersome in configuration ● week escalation policy http://www.zabbix.com/
  • 17. munin Munin is an open source client /server network or system monitoring application that presents output in graphs through a web interface ● stored to rrd ● makes graphs ● shows trends ● can alert ● very very very simple to deploy ● text based configuration
  • 18. Summary ● nagios is still rock solid solution while frameworks still are in 0.x version ● different frameworks look promising but require high level of customization ● hybrids integrate nicely and can provide good solution ● scalability is an issue for systems designed for LAN use.
  • 19. Forthscale contacts Naor Weissmann http://www.linkedin.com/in/naorweissmann Cell: +972 (0)505 980005 Email: naorw@forthscale.com EMEA office. Email: info@forthscale.com Web site: http://www.forthscale.com Telephone: +972 (0)77 5280015