Zabbix users most often monitor Linux hosts with the Zabbix agent, but SNMP is more than just an alternative: it is a very viable option. Andrew Nelson will describe his experience configuring Zabbix to monitor a Linux environment of over 500 systems using only SNMP.
Zabbix Conference 2015
Dimitri Bellini - Monitoring Large Multi-Site Data EnvironmentZabbix
Dimitri Bellini and his customer Stefano Zuliani, a representative of a major Italian bank, will present a talk showcasing how Zabbix is used in a large enterprise infrastructure based on SAN switches and IBM mainframe hosts. The talk focuses on the integration process, using as a case study the integration of Zabbix with the bank's NOC team through the Zabbix API and a web front end specialized for monitoring the mainframe Storage Area Network infrastructure.
Zabbix Conference 2015
Mike Guthrie - Revamping Your 10 Year Old Nagios InstallationNagios
Mike Guthrie - Revamping Your 10 Year Old Nagios Installation - Mike Merideth from VictorOps talks about the challenges of
sharing responsibility for monitoring in the DevOps world. Learn several strategies for keeping your configuration correct,
consistent, and up-to-date when several people are working on it.
Nagios Conference 2014 - Janice Singh - Real World Uses for Nagios APIsNagios
Janice Singh's presentation on Real World Uses for Nagios APIs.
The presentation was given during the Nagios World Conference North America held Oct 13th - Oct 16th, 2014 in Saint Paul, MN. For more information on the conference (including photos and videos), visit: http://go.nagios.com/conference
Nagios Conference 2014 - James Clark - Nagios Cool Tips and TricksNagios
James Clark's presentation on Nagios Cool Tips and Tricks.
The presentation was given during the Nagios World Conference North America held Oct 13th - Oct 16th, 2014 in Saint Paul, MN. For more information on the conference (including photos and videos), visit: http://go.nagios.com/conference
Dave Williams - Nagios Log Server - Practical ExperienceNagios
Dave Williams - Nagios Log Server - Practical Experience. -
This session will detail the green field deployment of Nagios Log Server in a client environment consisting of HP LAN Switches, 3PAR disk storage, HP Blade Chassis with Flex Fabric using
VMware, Hyper-V, Exchange & Citrix.
A presentation on our experience at Ingram Content Group with Grafana and MySQL. In an enterprise environment it is sometimes necessary to keep data in a traditional, general purpose SQL database such as MySQL or PostgreSQL. These slides explore the challenges and benefits of using Grafana with an SQL database in a large enterprise production setting.
Nagios Conference 2014 - Jack Chu - How to Think With Nagios to Solve Monitor...Nagios
Jack Chu's presentation on How to Think With Nagios to Solve Monitoring Problems.
The presentation was given during the Nagios World Conference North America held Oct 13th - Oct 16th, 2014 in Saint Paul, MN. For more information on the conference (including photos and videos), visit: http://go.nagios.com/conference
Matt Bruzek - Monitoring Your Public Cloud With NagiosNagios
Matt Bruzek - Monitor Public Cloud. Use Nagios to monitor your public cloud. No Debian installer for Nagios 4? No problem! Deploy your public cloud with Juju and you can connect Nagios Core services to your Ubuntu instances in the cloud. In this session, Matt will quickly go over the basic concepts of Juju and spend the rest of the time walking through examples of deploying Nagios monitoring solutions.
Nagios Conference 2013 - John Sellens - Monitoring Remote Locations with NagiosNagios
John Sellens's presentation on Monitoring Remote Locations with Nagios.
The presentation was given during the Nagios World Conference North America held Sept 20-Oct 2nd, 2013 in Saint Paul, MN. For more information on the conference (including photos and videos), visit: http://go.nagios.com/nwcna
Nagios Conference 2012 - Mike Weber - FailoverNagios
Mike Weber's presentation on using Nagios and High Availability.
The presentation was given during the Nagios World Conference North America held Sept 25-28th, 2012 in Saint Paul, MN. For more information on the conference (including photos and videos), visit: http://go.nagios.com/nwcna
Monitoring a billion kilometers of monthly ride sharing at BlaBlaCar - Zabbix...Jean Baptiste Favre
How BlaBlaCar designed and operates a Zabbix-based monitoring platform: optimizing the Zabbix configuration, and developing and using python-protobix and jmx-zabbix for more scalability.
Mikhail Serkov - Zabbix for HPC Cluster Support | ZabConf2016Zabbix
For the last two years I've been working in Cambridge (US) at the Novartis Institute for Biomedical Research (NIBR) on a project supporting HPC cluster infrastructure and its users. We use Zabbix for HPC cluster monitoring (more than 1000 nodes, 10000+ cores, GPU cores, etc.). In this presentation we will cover interesting use cases of Zabbix for an HPC cluster, which is not regular infrastructure monitoring. We will talk about some challenges we have in HPC monitoring and how Zabbix helps us work with scientists, and present some solutions that might be interesting for the Zabbix community.
USENIX LISA15: How TubeMogul Handles over One Trillion HTTP Requests a MonthNicolas Brousse
TubeMogul grew from a few servers to over two thousand servers handling over one trillion HTTP requests a month, each processed in less than 50 ms. To keep up with this fast growth, the SRE team had to implement an efficient Continuous Delivery infrastructure that allowed over 10,000 Puppet deployments and 8,500 application deployments in 2014. In this presentation, we will cover the nuts and bolts of the TubeMogul operations engineering team and how they overcome challenges.
OSMC 2014: Interesting use cases of Zabbix improvements in latest versions | ...NETWAYS
Zabbix is used all over the world - in standard IT infrastructure monitoring and also in some not so common environments.
In this talk we will look at some common uses of Zabbix, as well as at some slightly strange environments. A brief update on the latest improvements for Zabbix will be provided as well.
Automation of Hadoop cluster operations in Arm Treasure DataYan Wang
This talk will focus on the journey that we in the Arm Treasure Data Hadoop team are on to simplify and automate how we deploy Hadoop. At Arm Treasure Data, until recently we were running Hadoop clusters in two clouds. As deployments expanded quickly into more sites, the overhead of manual operations started to strain us. Because of this, we started a project last year to automate and simplify how we deploy, using tools like AWS Auto Scaling groups. The steps we have taken so far are: modernizing and standardizing instance types, moving from manually executed deployment scripts to API-triggered workflows, and actively working to deprecate Chef in favor of Debian packages and AWS CodeDeploy. We have also started to automate many operations that until recently were manual, like scaling clusters in and out and routing traffic between clusters. We have also started to simplify health checks and node snapshotting. Our goal for the year is to get close to fully automated cluster operations.
Netflix Open Source Meetup Season 4 Episode 2aspyker
In this episode, we will take a close look at 2 different approaches to high-throughput/low-latency data stores, developed by Netflix.
The first, EVCache, is a battle-tested distributed memcached-backed data store, optimized for the cloud. You will also hear about the road ahead for EVCache as it evolves into an L1/L2 cache over RAM and SSDs.
The second, Dynomite, is a framework to make any non-distributed data-store, distributed. Netflix's first implementation of Dynomite is based on Redis.
Come learn about the products' features and hear from Thomson and Reuters, Diego Pacheco from Ilegra and other third party speakers, internal and external to Netflix, on how these products fit in their stack and roadmap.
Terraforming your Infrastructure on GCPSamuel Chow
A talk I gave at the Google Cloud Platform LA Meetup event at Google Playa Vista on Nov 6, 2019. This is a 1+ hour-long, tutorial-oriented talk on Infrastructure as Code (IaC), Terraform (as a toolset for IaC and modern DevOps), and how to leverage the practice and tools in defining, deploying, and managing your infrastructure in GCP.
PuppetConf 2016: Why Network Automation Matters, and What You Can Do About It...Puppet
Here are the slides from Rick Sherman's PuppetConf 2016 presentation called Why Network Automation Matters, and What You Can Do About It. Watch the videos at https://www.youtube.com/playlist?list=PLV86BgbREluVjwwt-9UL8u2Uy8xnzpIqa
OSMC 2009 | net-snmp: The forgotten classic by Dr. Michael SchwartzkopffNETWAYS
Simple Network Management Protocol (SNMP) is probably the oldest protocol for the management of heterogeneous networks. As twenty years have passed since the first standards in RFC 1157 were published, we can rank SNMP as a classic. However, its advantages have long been forgotten, perhaps in part thanks to its trouble-free application. This presentation will briefly cover the basics of SNMP and deal with its implementation in Linux. The net-snmp agent offers many tricks and configuration options which can be used to solve most management problems easily.
Linux Server Deep Dives (DrupalCon Amsterdam)Amin Astaneh
Over the past few years the Linux kernel has gained features that allow us to learn more about what's really happening on our servers and the applications that run on them.
This talk will explore how these new features, particularly perf_events and ebpf, enable us to answer questions about what a Drupal site is doing in real time beyond what the standard logs, server performance tools, and even strace will reveal. Attendees will be provided a brief introduction to example uses of these tools to diagnose performance problems.
This talk is intended for attendees that are familiar with Linux, the command line, and have used host observability tools in the past (top, netstat, etc).
OSDC 2015: Roland Kammerer | DRBD9: Managing High-Available Storage in Many-N...NETWAYS
Recent publications show an ever-increasing demand in the area of cloud computing, where highly available storage is one important cornerstone.
DRBD (Distributed Replicated Block Device) has been a building block for high-availability clusters for years. Currently, DRBD is basically limited to two-node cluster setups. In this talk we will provide an overview of recent developments in DRBD9 that allow us to make DRBD ready for upcoming challenges like many-node cluster setups and highly automated cloud deployments.
For the upcoming release we added an abstraction layer which is handled by “drbdmanage”. It is a tool that takes over management of logical volumes (LVM) and management of configuration files for DRBD. Features of drbdmanage include creating, resizing, and removing of replicated volumes. Additionally, drbdmanage handles taking snapshots and creating volumes in consistency groups. In order to support cloud deployments, a cinder (OpenStack) driver is in development.
All the mentioned components are currently under active development and this talk will provide an overview about these tools as well as about our vision for DRBD in general.
With the upcoming DRBD9 release we will be able to support a higher number of nodes (up to 30) per replication group, ease the deployment of DRBD setups with drbdmanage (up to 1000 nodes planned) and provide the foundations for cloud integration.
OSMC 2014: Naemon 1, 2, 3, N | Andreas EricssonNETWAYS
How monitoring should be automated without jeopardizing accuracy.
I will present a ready-to-use system that allows system admins to set up their servers to be automagically picked up by Naemon, and also allowing them to tweak their settings without requiring access to the monitoring system. Most notably, without even restarting or reloading the monitoring system.
I will also present a working (I hope) demo of dynamic thresholds in Naemon, using various helpers in a request/response system.
"Lightweight Virtualization with Linux Containers and Docker". Jerome Petazzo...Yandex
"Lightweight virtualization", also called "OS-level virtualization", is not new. On Linux it evolved from VServer to OpenVZ, and, more recently, to Linux Containers (LXC). It is not Linux-specific; on FreeBSD it's called "Jails", while on Solaris it’s "Zones". Some of those have been available for a decade and are widely used to provide VPS (Virtual Private Servers), cheaper alternatives to virtual machines or physical servers. But containers have other purposes and are increasingly popular as the core components of public and private Platform-as-a-Service (PAAS), among others.
Just like a virtual machine, a Linux Container can run (almost) anywhere. But containers have many advantages over VMs: they are lightweight and easier to manage. After operating a large-scale PAAS for a few years, dotCloud realized that with those advantages, containers could become the perfect format for software delivery, since that is how dotCloud delivers from their build system to their hosts. To make it happen everywhere, dotCloud open-sourced Docker, the next generation of the containers engine powering its PAAS. Docker has been extremely successful so far, being adopted by many projects in various fields: PAAS, of course, but also continuous integration, testing, and more.
Wireless network monitoring data are abundant, since it is relevant to store information about connected devices and users, especially in a multicampus environment like Unesp. As a result, the number of records in the database tends to grow rapidly, making it necessary to optimize the periodic cleanup routine for Zabbix data. Here is our way of improving the functioning of the native "housekeeping" process. We will also demonstrate the massive use of the "Zabbix trapper" item type to make the list of Wi-Fi infrastructure information more flexible, and varied techniques for using "low level discovery" to monitor wireless access points.
Zabbix Conference LatAm 2016 - Andre Deo - Zabbix Brazil CommunityZabbix
In 2008 Brazil had no Zabbix community, and the software was unknown to most people. What changed in 8 years? How did a community started by a single man (as in Japan) make the difference? Today this community has more than 3,000 members, many lectures at local events, articles in magazines, books, many blogs, and members involved in building additional functions for Zabbix and translating the official documentation!
The aim of the lecture is to demonstrate the new low-level discovery features that emerged in Zabbix 3.0, and to present and demonstrate the LLD settings for Windows and ODBC services.
Zabbix Conference LatAm 2016 - Andre Deo - SNMP and ZabbixZabbix
The aim of the lecture is to discuss the main questions people have when using SNMP with Zabbix. It will present an overview of SNMP, MIBs, Net-SNMP and the items used in Zabbix templates.
Zabbix Conference LatAm 2016 - Rodrigo Mohr - Challenges on Large Env with Or...Zabbix
Scalability on a large environment can be a challenge on many different aspects involving customization of monitors, performance and reporting. The goal of this presentation is to share the experience we had at Dell, monitoring a big number of servers in an environment with constant changes, lots of custom monitors and new servers configured every week. We will present, from our 3 years of experience with Zabbix and Oracle, which positive/negative aspects we have taken from the configuration parameters we used, involving strong use of User Macros, optimization of Database Queries, Table Partitioning and Automation.
Lojas Renner has always been close to the Open Source movement in Brazil. Back in the 90s, all of the company's POS solutions were migrated to Linux, and in the early 2000s migration started across all of the company's systems, including the main components of critical infrastructure. Since then, much has changed. Open Source has become a worldwide standard for products and companies, making its adoption not only an innovation but a necessity. Learn how, since 2008, Zabbix has helped us monitor the entire IT infrastructure, remote units and our business processes.
Unirede has worked since 2008 on projects of all sizes involving Zabbix. Since then, needs have arisen where we must ensure that Zabbix interacts with a wide variety of messaging forms, methods and tools to notify events (e-mail, SMS, ticket creation, log files, WhatsApp, VoIP, Telegram, etc.). In this talk I will try to show by example how we can interact with Telegram, receiving and sending messages to Zabbix, and in this way make communication between remote users and their servers and equipment in the datacenter more dynamic.
Zabbix Conference LatAm 2016 - Filipe Paternot - Zbx@Globo Automation+Integra...Zabbix
Zabbix API offers us a lot of power and possibilities. We will talk about automation and integrations at scale, at Globo.com. Automating gives us power to clone instances of Zabbix, perform batch operations, manage MANY networks for discovery and more. We will present our layer of abstraction to API, democratizing API access, offering a nice UI and standards for every new service monitored and few cached responses. Also, we will show how we have integrated with CloudStack, to deliver automated private cloud monitoring into Zabbix.
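For readers unfamiliar with the Zabbix API, the minimal sketch below builds the JSON-RPC 2.0 request body for a `host.get` call. It is not Globo.com's abstraction layer; the helper name, the host names and the token value are hypothetical, and whether the token travels in an `auth` field or in an HTTP header depends on the Zabbix version, so that part is an assumption to check against your server's documentation.

```python
import json


def build_request(method, params, request_id=1, auth_token=None):
    """Build a JSON-RPC 2.0 request body for the Zabbix API (hypothetical helper)."""
    payload = {
        "jsonrpc": "2.0",
        "method": method,
        "params": params,
        "id": request_id,
    }
    # Assumption: older Zabbix versions expect the session token in an "auth"
    # field; newer ones accept it as an Authorization header instead.
    if auth_token is not None:
        payload["auth"] = auth_token
    return payload


if __name__ == "__main__":
    # Hypothetical host names and token, for illustration only.
    request = build_request(
        "host.get",
        {"output": ["hostid", "host"], "filter": {"host": ["web-01", "web-02"]}},
        auth_token="example-token",
    )
    print(json.dumps(request, indent=2))
```

The body would be sent as an HTTP POST with a JSON content type to the frontend's `api_jsonrpc.php` endpoint; wrapping request construction in one function is the kind of small convention that makes batch operations easier to standardize.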
Zabbix Conference LatAm 2016 - Douglas Esteves - Zabbix at UNICAMPZabbix
This talk presents the Zabbix use case at the Computer Center of UNICAMP, where it has proved an excellent option for monitoring datacenter environments and the university environment. It covers the use of the tool at UNICAMP for simple monitoring and an IT service monitoring case that measures server and database availability.
Ryan Armstrong - Monitoring More Than 6000 Devices in Zabbix | ZabConf2016Zabbix
Ryan will describe a Skunkworks project executed by Kinetic IT at the Department of Education to deliver an autonomous infrastructure monitoring solution for over 6000 devices distributed across WA. The team were given opportunity to experiment with DevOps practices such as Scrum product development, Infrastructure As Code and Continuous Integration to determine where the value lay and which practices should be adopted at greater scale.
Rafael Martinez Guerrero - Zabbix at the University of Oslo | ZabConf2016Zabbix
A case study showing the problems we have resolved with Zabbix and the challenges we had when we implemented Zabbix as the main monitoring tool at the University of Oslo. The number of challenges is not low in an organization as heterogeneous as ours, with many thousands of servers and clients, all kinds of devices connected to our infrastructure, different operating systems, multiple locations and hundreds of IT staff. Full automation and delegation of privileges are the key words in the work we have done during the past year and a half.
Wolfgang Alper - Zabbix Meets OPS Control / Rundeck | ZabConf2016Zabbix
Zabbix is an excellent tool for network monitoring and for alerting if something bad happens. But Zabbix can do more. An underestimated feature of Zabbix is its ability to perform actions in addition to simple notifications. However, this requires setting up those actions precisely within Zabbix, which is not always an easy task and might duplicate existing work. So what if Zabbix actually worked in concert with an external task runner / job scheduler that is built to do exactly this: run a task or action against a host and report its outcome? Zabbix would perform the same well-defined steps that an ops member would perform in case of certain failures using this kind of tool. A well-known example of this kind of software is "Rundeck", which is licensed under the Apache License Version 2.0.
Sumit Goel - Monitoring Cloud Applications Using Zabbix | ZabConf2016Zabbix
With the global shift towards the flexibility of the cloud, there are different demands on monitoring the availability and performance of applications provided in the cloud. There are obvious limitations in accessing components of an app that is hosted by a third party and runs outside the internal environment. At the same time, there are opportunities to use the vendor's API and status page. At Salesforce, named one of the most innovative companies in the world by Forbes and one of the biggest cloud service providers, we understand the customer's need to see the availability and performance of a cloud application in real time. In this presentation we will list and describe multiple ways of monitoring cloud apps. Some of the methods are: built-in web monitoring using Curl, web browser automation tools like Selenium, external scripts (reading the vendor status dashboard) and API calls to the app.
Rihards Olups - Zabbix at Nokia - Case StudyZabbix
We will explore a fairly complicated Zabbix environment at one division in Nokia. With several different Zabbix versions in use and a lot of custom products monitored, it is a place one can easily get lost in. We'll discuss JMX monitoring, approaches to keep notification configuration simple and notifications useful, different use cases for the Zabbix API and a lot of other topics. The importance of SSL compliance will be covered along with some of the many ways custom solutions are monitored.
Raymond Kuiper - Zen and The Art of Zabbix Template Design | ZabConf2016Zabbix
Zabbix monitoring solution can help bring balance to your organisation's IT landscape. However, the success greatly depends on the templates you use to setup your monitoring system. As any Zabbix veteran will tell you, the default templates don't really suffice for any setup other than a proof-of-concept. How then do you set about creating your own templates? Following practical examples, we'll discuss some of the design decisions that need to be made to achieve template perfection.
Dimitri Bellini and Pietro Antonacci - Manage Zabbix Proxies in Remote Networ...Zabbix
Monitoring multiple server farms spread all around the world is not an easy task, many small problems have to be addressed, but using Zabbix it is all a breeze.
We will talk about our experience on setup of Zabbix proxies in very remote networks, problems we encountered and how we worked on fixing them.
Erik Skytthe - Monitoring Mesos, Docker, Containers with Zabbix | ZabConf2016Zabbix
At DBC we are running docker and other container types in a mesos/marathon cluster environment. I will demonstrate how we collect statistics, logs etc. and monitor this environment, showing configuration examples, data flows and templates.
Some of the covered topics:
- Mesos master and agents
- Marathon Framework
- Docker engine
- Containers
- Zookeeper
- Elasticsearch/ELK
Lukáš Malý - Log management ELISA controlled by Zabbix | ZabConf2016Zabbix
Datasys ELISA log management is a robust, powerful, yet inexpensive solution for the collection, correlation and analysis of logs. The core system consists of the Elasticsearch "NoSQL" database and the Kibana web user interface, which makes it very comfortable to analyze detected security incidents and relevant logs. It is common for the Elasticsearch database to be distributed across multiple servers to achieve load balancing and high availability of indexed data. ELISA heavily utilizes Zabbix for user authentication and role-based access control, notifications and self-monitoring. Elasticsearch indices can be managed right in the Zabbix frontend. Zabbix "trapper" items and monitoring templates are used to centrally manage the configuration of a distributed environment of NXLog agents. Agents can securely auto-register as Zabbix "hosts".
Konstantin Yakovlev - Event Analysis Toolset | ZabConf2016Zabbix
During outages in an environment of 10k+ hosts, NOC and Operations teams may face hundreds of alerts while performing root cause analysis, remediation or escalation, and at the same time logging resolution progress to the Incident Management system for audit purposes.
This presentation will describe RingCentral's approach to Incident and Problem Management in a large Zabbix-monitored cloud.
Co-authors of the presentation: Dmitry Shchemelinin, Ph.D., Sr. Director of Operations, RingCentral, USA.
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Tobias Schneck
As AI technology pushes into IT, I was wondering, as an "infrastructure container Kubernetes guy", how this fancy AI technology gets managed from an infrastructure operations point of view. Is it possible to apply our lovely cloud native principles as well? What benefits could the two technologies bring to each other?
Let me take these questions and offer you a short journey through existing deployment models and use cases for AI software. Using practical examples, we discuss what cloud/on-premise strategy we may need to apply it to our own infrastructure and get it to work from an enterprise perspective. I want to give an overview of infrastructure requirements and technologies, and of what could benefit or limit your AI use cases in an enterprise environment. An interactive demo will give you some insights into which approaches I have already got working for real.
GraphRAG is All You need? LLM & Knowledge GraphGuy Korland
Guy Korland, CEO and Co-founder of FalkorDB, will review two articles on the integration of language models with knowledge graphs.
1. Unifying Large Language Models and Knowledge Graphs: A Roadmap.
https://arxiv.org/abs/2306.08302
2. Microsoft Research's GraphRAG paper and a review paper on various uses of knowledge graphs:
https://www.microsoft.com/en-us/research/blog/graphrag-unlocking-llm-discovery-on-narrative-private-data/
DevOps and Testing slides at DASA ConnectKari Kakkonen
Slides by Rik Marselis and me from the DASA Connect conference on 30.5.2024. We discuss what testing is, then what agile testing is, and finally what testing in DevOps is. We closed with a lovely workshop in which the participants explored different ways to think about quality and testing in different parts of the DevOps infinity loop.
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf91mobiles
91mobiles recently conducted a Smart TV Buyer Insights Survey in which we asked over 3,000 respondents about the TV they own, aspects they look at on a new TV, and their TV buying preferences.
Let's dive deeper into the world of ODC! Ricardo Alves (OutSystems) will join us to tell all about the new Data Fabric. After that, Sezen de Bruijn (OutSystems) will get into the details on how to best design a sturdy architecture within ODC.
PHP Frameworks: I want to break free (IPC Berlin 2024)Ralf Eggert
In this presentation, we examine the challenges and limitations of relying too heavily on PHP frameworks in web development. We discuss the history of PHP and its frameworks to understand how this dependence has evolved. The focus will be on providing concrete tips and strategies to reduce reliance on these frameworks, based on real-world examples and practical considerations. The goal is to equip developers with the skills and knowledge to create more flexible and future-proof web applications. We'll explore the importance of maintaining autonomy in a rapidly changing tech landscape and how to make informed decisions in PHP development.
This talk is aimed at encouraging a more independent approach to using PHP frameworks, moving towards a more flexible and future-proof approach to PHP development.
Search and Society: Reimagining Information Access for Radical FuturesBhaskar Mitra
The field of information retrieval (IR) is currently undergoing a transformative shift, at least partly due to the emerging applications of generative AI to information access. In this talk, we will deliberate on the sociotechnical implications of generative AI for information access. We will argue that there is both a critical necessity and an exciting opportunity for the IR community to re-center our research agendas on societal needs while dismantling the artificial separation between the work on fairness, accountability, transparency, and ethics in IR and the rest of IR research. Instead of adopting a reactionary strategy of trying to mitigate potential social harms from emerging technologies, the community should aim to proactively set the research agenda for the kinds of systems we should build inspired by diverse explicitly stated sociotechnical imaginaries. The sociotechnical imaginaries that underpin the design and development of information access technologies need to be explicitly articulated, and we need to develop theories of change in the context of these diverse perspectives. Our guiding future imaginaries must be informed by other academic fields, such as democratic theory and critical theory, and should be co-developed with social science scholars, legal scholars, civil rights and social justice activists, and artists, among others.
"Impact of front-end architecture on development cost", Viktor TurskyiFwdays
I have heard many times that architecture is not important for the front-end. Also, many times I have seen how developers implement features on the front-end just following the standard rules for a framework and think that this is enough to successfully launch the project, and then the project fails. How to prevent this and what approach to choose? I have launched dozens of complex projects and during the talk we will analyze which approaches have worked for me and which have not.
The Art of the Pitch: WordPress Relationships and SalesLaura Byrne
Clients don’t know what they don’t know. What web solutions are right for them? How does WordPress come into the picture? How do you make sure you understand scope and timeline? What do you do if something changes?
All these questions and more will be explored as we talk about matching clients’ needs with what your agency offers without pulling teeth or pulling your hair out. Practical tips, and strategies for successful relationship building that leads to closing the deal.
Connector Corner: Automate dynamic content and events by pushing a buttonDianaGray10
Here is something new! In our next Connector Corner webinar, we will demonstrate how you can use a single workflow to:
Create a campaign using Mailchimp with merge tags/fields
Send an interactive Slack channel message (using buttons)
Have the message received by managers and peers along with a test email for review
But there’s more:
In a second workflow supporting the same use case, you’ll see:
Your campaign sent to target colleagues for approval
If the “Approve” button is clicked, a Jira/Zendesk ticket is created for the marketing design team
But—if the “Reject” button is pushed, colleagues will be alerted via Slack message
Join us to learn more about this new, human-in-the-loop capability, brought to you by Integration Service connectors.
And...
Speakers:
Akshay Agnihotri, Product Manager
Charlie Greenberg, Host
LF Energy Webinar: Electrical Grid Modelling and Simulation Through PowSyBl -...DanBrown980551
Do you want to learn how to model and simulate an electrical network from scratch in under an hour?
Then welcome to this PowSyBl workshop, hosted by Rte, the French Transmission System Operator (TSO)!
During the webinar, you will discover the PowSyBl ecosystem as well as handle and study an electrical network through an interactive Python notebook.
PowSyBl is an open source project hosted by LF Energy, which offers a comprehensive set of features for electrical grid modelling and simulation. Among other advanced features, PowSyBl provides:
- A fully editable and extendable library for grid component modelling;
- Visualization tools to display your network;
- Grid simulation tools, such as power flows, security analyses (with or without remedial actions) and sensitivity analyses;
The framework is mostly written in Java, with a Python binding so that Python developers can access PowSyBl functionalities as well.
What you will learn during the webinar:
- For beginners: discover PowSyBl's functionalities through a quick general presentation and the notebook, without needing any expert coding skills;
- For advanced developers: master the skills to efficiently apply PowSyBl functionalities to your real-world scenarios.
Andrew Nelson - Zabbix and SNMP on Linux
1. SNMP, for those times you can't
install the Zabbix agent.
Andrew Nelson
Senior Linux Consultant
Sep 10, 2015
Zabconf 2015, Riga, Latvia
2. RED HAT | Andrew Nelson2/27
Overview
● Introduction
● What is SNMP
● Extending the Net-SNMP agent
● My first MIB
● A sub-agent for your MIB
3. RED HAT | Andrew Nelson3/27
$ whoami
● Andrew Nelson
● anelson@redhat.com
● Senior Linux Consultant with Red Hat North America
● Active in the Zabbix community for approximately 10
years
● Known as “nelsonab” in
forums and IRC
● Author of the Zabbix API
Ruby library zbxapi
5. RED HAT | Andrew Nelson5/27
Topics
● SNMP Overview
● OIDs
● Units
● Tables
● Using SNMP With Zabbix
● Extending SNMP
6. RED HAT | Andrew Nelson6/27
What is SNMP
● Simple Network Management Protocol
● Like all standards, it's not simple!
● Defined by numerous RFCs
● Uses a hierarchical data model
● Employs an Agent – Manager design
● Agents can be nested
● Is extensible
7. RED HAT | Andrew Nelson7/27
What is SNMP
● SNMP uses OIDs to identify each item
● OIDs are numeric with text translations provided by
MIB files.
9. RED HAT | Andrew Nelson9/27
Zabbix And SNMP
● Available since version 1.1
● One request per network connection before 2.2.3
● Dynamic index lookups available since version 1.5
● Item["index","base item","row"]
● ifInOctets["index","ifDescr","GigabitEthernet0/1"]
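The dynamic index lookup above can be pictured as a two-step process: find the table row whose ifDescr matches the search string, then read the data OID at that row's index. The following is a minimal shell sketch of that idea, not Zabbix's implementation. The function name `lookup_index` and the sample walk output are hypothetical, and the input is assumed to look like typical `snmpwalk` output lines.

```shell
#!/bin/sh
# lookup_index VALUE
# Reads snmpwalk-style lines ("NAME.INDEX = STRING: VALUE") on stdin and
# prints the INDEX of the first row whose value equals VALUE.
# Hypothetical helper for illustration only.
lookup_index() {
    awk -v want="$1" '
        {
            # Separate "NAME.INDEX" from the value.
            if (split($0, kv, " = STRING: ") == 2 && kv[2] == want) {
                n = split(kv[1], parts, ".")
                print parts[n]   # the last dotted component is the row index
                exit
            }
        }'
}

# Illustrative input; a real run would pipe in something like
#   snmpwalk -v2c -c "$COMMUNITY" "$HOST" IF-MIB::ifDescr
sample='IF-MIB::ifDescr.1 = STRING: lo
IF-MIB::ifDescr.3 = STRING: GigabitEthernet0/1'

idx=$(printf '%s\n' "$sample" | lookup_index 'GigabitEthernet0/1')
echo "IF-MIB::ifInOctets.$idx"
```

Because the index is resolved by name on every lookup, the item keeps pointing at the same interface even if its row number changes.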
10. RED HAT | Andrew Nelson10/27
Zabbix and SNMP
● Configure a global macro for {$SNMP_COMMUNITY}
then override locally for each host as needed.
● Templates make use of this macro
● Some parameters require some “massaging” before
they are useful.
● Disk space is often in the form of N Allocation Units,
not bytes.
● Calculated value: allocation_units[disk]*free_space[disk]
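As a worked example of the calculation above: with a hypothetical allocation unit of 4096 bytes and 250000 free units, free space is 4096 * 250000 = 1024000000 bytes. A minimal shell sketch follows. The function name and the numbers are illustrative and not taken from the slides.

```shell
#!/bin/sh
# units_to_bytes ALLOCATION_UNIT_SIZE UNIT_COUNT
# Converts a count of allocation units into bytes.
# Hypothetical helper for illustration only.
units_to_bytes() {
    echo $(( $1 * $2 ))
}

# Assumed example values: 4096-byte units, 250000 free units.
free_bytes=$(units_to_bytes 4096 250000)
echo "$free_bytes"
```

In Zabbix itself the same multiplication would live in a calculated item that references the two SNMP items.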
11. RED HAT | Andrew Nelson11/27
Zabbix and SNMP
● Because calculated items are needed, timelines may
become very skewed.
● Free disk space is checked every 10 minutes
● Space calculation is performed every 10 minutes
● Graphed value could be time skewed by 20 minutes!
● Zabbix needs the ability for inbound value calculation.
● If only there were a scripting interface for this…..
12. RED HAT | Andrew Nelson12/27
Zabbix and SNMP
● It is possible to derive percent utilization from
accumulated time items.
● Set the item to Delta per second
● Convert the time scale to “centi-seconds” and store as
a float with a unit of “%”.
● Resultant value will be an approximation of the percent
utilization.
● However, it is skewed by the number of threads (CPUs)
available to process the item being counted.
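The arithmetic behind this can be sketched as follows. A counter that accumulates centi-seconds of busy time grows by 100 per wall-clock second when one CPU is fully busy, so its per-second delta reads directly as a percentage. The function name, the sample counter values, and the division by CPU count to compensate for the skew are assumptions for illustration. Counter wrap-around is ignored.

```shell
#!/bin/sh
# pct_util PREV CUR SECONDS [NCPU]
# PREV and CUR are two readings of a centi-second time counter taken
# SECONDS apart. Prints the approximate percent utilization, optionally
# divided by the number of CPUs (defaults to 1).
# Hypothetical helper for illustration only.
pct_util() {
    awk -v prev="$1" -v cur="$2" -v secs="$3" -v ncpu="${4:-1}" \
        'BEGIN { printf "%.1f\n", (cur - prev) / secs / ncpu }'
}

pct_util 1000 4000 60      # 3000 centi-seconds in 60 s on one CPU
pct_util 1000 4000 60 2    # same delta spread over two CPUs
```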
15. RED HAT | Andrew Nelson15/27
Zabbix and SNMP
● When using SNMP for host discovery be careful when
using SNMPv2-MIB::sysDescr.0
● It will change with kernel updates
● Zabbix will then detect the updated machine as a new
system
● Best to override this parameter in snmpd.conf, or
perhaps use an extend script to return the dmi item uuid
# dmidecode -s system-uuid
93737AE6-8A60-2B41-BEA4-B61C7ACD8063
16. RED HAT | Andrew Nelson16/27
Zabbix and SNMP
● When using direct OID numbers be careful.
● OID number may remap over time, especially if the
value in question is part of a table.
● OID table members are not guaranteed to have the
same OID numbers across restarts of the SNMP agent
(or host reboot)
● If MIB text is not translating correctly ensure the MIB
file is in /usr/share/snmp/mibs (for RHEL systems).
● Ensure the OID in question translates correctly using
snmptranslate.
17. RED HAT | Andrew Nelson17/27
Zabbix and SNMP
● The snmp tools are your friends
● net-snmp-utils rpm on RHEL
● snmpwalk will allow you to peek at all of the items
available.
● Adding the parameter -On is useful when MIB translations or
namespaces are hampering things.
● snmptranslate can help ensure MIB files are loading
correctly
$ snmptranslate -IR myCustomOID
Unknown object identifier: myCustomOID
19. RED HAT | Andrew Nelson19/27
Extending Net-SNMP
● Common mechanisms available
● Extend – Raw Shell Scripts
● Perl – Call perl scripts directly
● Pass – Start a subprocess to handle a given OID
subtree
● PassPersist – Similar to pass but keep running
● AgentX – A form of sub-agent where a process listens
for requests on a socket.
● Plus a few more not mentioned above
20. RED HAT | Andrew Nelson20/27
Extending Net-SNMP
● A simple extend: (/usr/local/bin/check_proc.sh)
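#!/bin/sh
# Count the processes whose name matches the pattern given as $1.
PATTERN="$1"
NUMPIDS=$(pgrep "$PATTERN" | wc -l)
echo "There are $NUMPIDS $PATTERN processes."
echo "$NUMPIDS"
# The count is also returned as the exit status.
exit "$NUMPIDS"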
● The statement in the snmpd.conf file
extend httpd_pids /bin/sh /usr/local/bin/check_proc.sh httpd
21. RED HAT | Andrew Nelson21/27
Extending Net-SNMP
● Snmpwalk of the extend script
22. RED HAT | Andrew Nelson22/27
Extending Net-SNMP
● Adding the item is as simple as adding an SNMP item
and assigning it to a key
● No need to worry about OIDs changing
23. RED HAT | Andrew Nelson23/27
Extending Net-SNMP
● Note when using the exit value from a script: its
maximum value is 255; any larger value wraps around modulo 256.
● Ex: An exit value of 5000 will result in an actual value of
136: 5000-(256*19) = 136
● Place the value in question on a separate line and use
NET-SNMP-EXTEND-MIB::nsExtendOutLine."ITEM".n
● Where n is the line in question and ITEM is the extend
item.
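The wrap-around of exit values can be checked with simple arithmetic, since only the low 8 bits of an exit status survive. A minimal sketch, where the function name `wrapped_status` is hypothetical:

```shell
#!/bin/sh
# wrapped_status N
# Prints the exit status the parent process would actually see if a
# script tried to exit with N (N modulo 256).
# Hypothetical helper for illustration only.
wrapped_status() {
    echo $(( $1 % 256 ))
}

wrapped_status 5000   # the example from the slide
wrapped_status 255    # largest value that survives unchanged
wrapped_status 256    # wraps around to zero
```

This is why a count that may exceed 255 is better read from the script's output line than from its exit status.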
24. RED HAT | Andrew Nelson24/27
Extending Net-SNMP
● Adding uuid information to snmp via extend
extend host_uuid /usr/sbin/dmidecode -s system-uuid
● In discovery use the item:
NET-SNMP-EXTEND-MIB::nsExtendOutLine."host_uuid".1
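When querying such an item from a shell, the embedded double quotes in the OID need to be protected from the shell. The sketch below builds the OID string for a given extend name and output line. The function name `extend_line_oid` is hypothetical, and the community string and host in the commented query are placeholders.

```shell
#!/bin/sh
# extend_line_oid NAME LINE
# Prints the textual OID for output line LINE of the extend item NAME.
# Hypothetical helper for illustration only.
extend_line_oid() {
    printf 'NET-SNMP-EXTEND-MIB::nsExtendOutLine."%s".%s\n' "$1" "$2"
}

oid=$(extend_line_oid host_uuid 1)
echo "$oid"

# Example query (community and host are placeholders):
#   snmpget -v2c -c "$COMMUNITY" "$HOST" "$oid"
```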