Alexei Vladishev - Zabbix - Monitoring Solution for EveryoneZabbix
Paris Zabbix User Group Meetup 2016
June 23, 2016
1. Open Source
2. Zabbix Architecture
3. Data Collection
4. Problem Detection
5. Problem Forecasting / Trend Prediction
6. Lifecycle and Support Policy
Introduction to Zabbix - Company, Product, Services and Use CasesZabbix
About Zabbix Software:
Zabbix is an enterprise-class open source distributed monitoring solution designed to monitor and track performance and availability of network servers, devices, services and other IT resources.
Zabbix is an all-in-one monitoring solution that allows users to collect, store, manage and analyze information received from IT infrastructure, as well as display on-screen, and alert by e-mail, SMS or Jabber when thresholds are reached.
Zabbix allows administrators to recognize server and device problems within a short period of time and therefore reduces the system downtime and risk of system failure. The monitoring solution is being actively used by SMBs and large enterprises across all industries and almost in every country of the world.
Zabbix is enterprise open source monitoring software for networks and applications, created by Alexei Vladishev. It is designed to monitor and track the status of various network services, servers, and other network hardware. Zabbix uses MySQL, PostgreSQL, SQLite, Oracle or IBM DB2 to store data.
Monitoring all Elements of Your Database Operations With ZabbixZabbix
In depth look into all aspects of Zabbix, from the history and origins of the software to an overview of the latest features, introduced in Zabbix 3.2 .
Presented by the founder and CEO of Zabbix, Alexei Vladishev at Percona Live 2016 Europe.
Alexei Vladishev - Zabbix - Monitoring Solution for EveryoneZabbix
Paris Zabbix User Group Meetup 2016
June 23, 2016
1. Open Source
2. Zabbix Architecture
3. Data Collection
4. Problem Detection
5. Problem Forecasting / Trend Prediction
6. Lifecycle and Support Policy
Introduction to Zabbix - Company, Product, Services and Use CasesZabbix
About Zabbix Software:
Zabbix is an enterprise-class open source distributed monitoring solution designed to monitor and track performance and availability of network servers, devices, services and other IT resources.
Zabbix is an all-in-one monitoring solution that allows users to collect, store, manage and analyze information received from IT infrastructure, as well as display on-screen, and alert by e-mail, SMS or Jabber when thresholds are reached.
Zabbix allows administrators to recognize server and device problems within a short period of time and therefore reduces the system downtime and risk of system failure. The monitoring solution is being actively used by SMBs and large enterprises across all industries and almost in every country of the world.
Zabbix is enterprise open source monitoring software for networks and applications, created by Alexei Vladishev. It is designed to monitor and track the status of various network services, servers, and other network hardware. Zabbix uses MySQL, PostgreSQL, SQLite, Oracle or IBM DB2 to store data.
Monitoring all Elements of Your Database Operations With ZabbixZabbix
In depth look into all aspects of Zabbix, from the history and origins of the software to an overview of the latest features, introduced in Zabbix 3.2 .
Presented by the founder and CEO of Zabbix, Alexei Vladishev at Percona Live 2016 Europe.
Monitoramento e Gerenciamento de Infraestrutura com Zabbix - Patrícia LadislauPatricia Ladislau Silva
Apresentação da palestra que ministrei para alunos, professores e coordenação, e profissionais da comunidade técnica na Semana de Integração dos cursos de Tecnlogia da Informação da Faculdade Invest em Cuiabá-MT.
----------------------------------------------------------------------------------------------------------------------
Nota:
Esta apresentação contém animações que não executam no modo como o Slideshare realiza a exibição, além de compactar a qualidade. Para visualizar a apresentação com todos os recursos e maior qualidade de imagem, baixe o arquivo da apresentação através deste link https://drive.google.com/file/d/1lo_g4etILLD8jx-Lk4hn79_izbFC0qt6/view?usp=sharing e execute a apresentação para visualizar as animações e as imagens com qualidade.
----------------------------------------------------------------------------------------------------------------------
A palestra é introdutória ao assunto e abordou pontos como conceitos de monitoramento, incidentes, a importância em ter um software para ajudar a gerenciar os recursos dos ambientes e focando em como o Zabbix auxilia enormemente nessa tarefa, suas vantagens e funcionalidades e ainda ponto positivo do custo de licença igual a zero.
Zabbix est un outils permettant d’effectuer de la supervision et de la métrologie en collectant des données à travers son agent, le snmp ou des scripts. Cet exposé expliquera le projet Zabbix, les technologies utilisées puis la mise en place pour ensuite effectuer une démonstration.
Monitoramento Inteligente utilizando o ZABBIXLuiz Andrade
Zabbix é uma poderosa ferramenta para monitoramento de recursos de TI. que fazem parte do organismo vivo que sustenta o negócio de todas as empresas.
O Zabbix oferece monitoramento distribuído em “tempo-real” com interface de administração Web. Ele permite ver a saúde de qualquer host em uma rede IP monitorada por meio de um único ponto de visualização. Entre os diversos itens, vale destacar a utilização de recursos de hardware e software, tais como CPU, memória, utilização de unidades de armazenamento e execução de processos.
Para grandes corporações, disponibilidade da infraestrutura custa caro e frequentes downtimes podem impactar diretamente na continuídade dos negócios, implicando em prejuízos desastrosos e multas astronômicas. Nessa palestra, será apresentada a solução Enterprise e Open Source para monitoramento de toda infraestrutura de TI, combinando o sistema de monitoramento Zabbix e o Red Hat Enterprise Linux.
Google Cloud Platform monitoring with ZabbixMax Kuzkin
This presentation describes how to configure Zabbix (https://zabbix.com/) to configure Google Cloud Platform events through its Monitoring API, using gcpmetrics (https://github.com/odin-public/gcpmetrics/) command line tool.
Zabbix Inventory Indo alem do monitoramento.
Qual a definição de Inventário ?
De uma forma sucinta, um inventário (do latim inventariu) é uma relação dos bens pertencentes a uma empresa, se refere ao bens disponíveis em estoque para uso normal de um negócio e seus colaboradores, Costumam conter a descrição do produto,informações de hardware,software , entre outros, bem como a quantidade existente e o local onde se encontra.
inventário de ativos e o controle patrimonial!
Tendo em vista a definição pontuada do inventário de bens patrimonial de TI, sua ligação ao inventário de ativos é direta.
Os ativos são os bens das empresas, desta forma a associação pode ser feita de forma simples. Mas qual a relação do inventário de ativos e o controle patrimonial de um empreendimento? A resposta parece ser intuitiva, e é mesmo! De forma análoga, pense não somente em pequenas empresas, mas também em grandes empreendimentos que possuam grande quantidade de ativos. É necessário ter o controle de todos os bens da empresa, não somente para fins legais e contábeis, mas também por segurança.
O inventário de ativos tem como papel principal manter registros atualizados para os gestores, sobre todos os bens do empreendimento, além de garantir um controle sobre os mesmos, evitando furtos, depreciações e afins.
Zabbix do Monitoramento ao inventario.
A ideia de utilizar o zabbix para um inventario de rede foi centralizar e otimizar informações alem do monitoramento ao inventario em uma unica solução!
A iniciativa foi utilizar o WMI nativo da solução de monitoramento Zabbix transformando a mesma em uma solução de inventario de custo 0 para o empreendimento, então faça as contas!
whats? How is it ?
Não me pergunte como , veja os resultados.
Do the math!
Licenciamento zabbix: 0800.
Outras soluções ???
Case Santos F.C. |Gerência de TI com ZabbixWagner Morais
Apresentação do case do Santos Futebol Clube. O quanto foi possível melhorar a infraestrutura e a gestão de TI com o Zabbix. Implementando monitoramento centralizado para identificar de forma dinâmica e instantânea falhas na infraestrutura.
Continuous Integration has improved software development process, while deployment of software often does not get as much attention. In this presentation we describe an approach for efficient cloud deployment of highly distributed system which can be quite complex to deal with. DeploymentManager(DM) tool is developed as internal project of Infobip company for managing large number(cca. 250 instances) of internal services in our data centers using automated deployment and controling load balancing software(Apache, HaProxy). With DM tool it is possible to deploy build(jar, war, egg…) on any environment(Windows, Linux) in max 5 minutes, without technical knowledge about deployment process, making Continuous Delivery process fast and simple.
Monitoramento e Gerenciamento de Infraestrutura com Zabbix - Patrícia LadislauPatricia Ladislau Silva
Apresentação da palestra que ministrei para alunos, professores e coordenação, e profissionais da comunidade técnica na Semana de Integração dos cursos de Tecnlogia da Informação da Faculdade Invest em Cuiabá-MT.
----------------------------------------------------------------------------------------------------------------------
Nota:
Esta apresentação contém animações que não executam no modo como o Slideshare realiza a exibição, além de compactar a qualidade. Para visualizar a apresentação com todos os recursos e maior qualidade de imagem, baixe o arquivo da apresentação através deste link https://drive.google.com/file/d/1lo_g4etILLD8jx-Lk4hn79_izbFC0qt6/view?usp=sharing e execute a apresentação para visualizar as animações e as imagens com qualidade.
----------------------------------------------------------------------------------------------------------------------
A palestra é introdutória ao assunto e abordou pontos como conceitos de monitoramento, incidentes, a importância em ter um software para ajudar a gerenciar os recursos dos ambientes e focando em como o Zabbix auxilia enormemente nessa tarefa, suas vantagens e funcionalidades e ainda ponto positivo do custo de licença igual a zero.
Zabbix est un outils permettant d’effectuer de la supervision et de la métrologie en collectant des données à travers son agent, le snmp ou des scripts. Cet exposé expliquera le projet Zabbix, les technologies utilisées puis la mise en place pour ensuite effectuer une démonstration.
Monitoramento Inteligente utilizando o ZABBIXLuiz Andrade
Zabbix é uma poderosa ferramenta para monitoramento de recursos de TI. que fazem parte do organismo vivo que sustenta o negócio de todas as empresas.
O Zabbix oferece monitoramento distribuído em “tempo-real” com interface de administração Web. Ele permite ver a saúde de qualquer host em uma rede IP monitorada por meio de um único ponto de visualização. Entre os diversos itens, vale destacar a utilização de recursos de hardware e software, tais como CPU, memória, utilização de unidades de armazenamento e execução de processos.
Para grandes corporações, disponibilidade da infraestrutura custa caro e frequentes downtimes podem impactar diretamente na continuídade dos negócios, implicando em prejuízos desastrosos e multas astronômicas. Nessa palestra, será apresentada a solução Enterprise e Open Source para monitoramento de toda infraestrutura de TI, combinando o sistema de monitoramento Zabbix e o Red Hat Enterprise Linux.
Google Cloud Platform monitoring with ZabbixMax Kuzkin
This presentation describes how to configure Zabbix (https://zabbix.com/) to configure Google Cloud Platform events through its Monitoring API, using gcpmetrics (https://github.com/odin-public/gcpmetrics/) command line tool.
Zabbix Inventory Indo alem do monitoramento.
Qual a definição de Inventário ?
De uma forma sucinta, um inventário (do latim inventariu) é uma relação dos bens pertencentes a uma empresa, se refere ao bens disponíveis em estoque para uso normal de um negócio e seus colaboradores, Costumam conter a descrição do produto,informações de hardware,software , entre outros, bem como a quantidade existente e o local onde se encontra.
inventário de ativos e o controle patrimonial!
Tendo em vista a definição pontuada do inventário de bens patrimonial de TI, sua ligação ao inventário de ativos é direta.
Os ativos são os bens das empresas, desta forma a associação pode ser feita de forma simples. Mas qual a relação do inventário de ativos e o controle patrimonial de um empreendimento? A resposta parece ser intuitiva, e é mesmo! De forma análoga, pense não somente em pequenas empresas, mas também em grandes empreendimentos que possuam grande quantidade de ativos. É necessário ter o controle de todos os bens da empresa, não somente para fins legais e contábeis, mas também por segurança.
O inventário de ativos tem como papel principal manter registros atualizados para os gestores, sobre todos os bens do empreendimento, além de garantir um controle sobre os mesmos, evitando furtos, depreciações e afins.
Zabbix do Monitoramento ao inventario.
A ideia de utilizar o zabbix para um inventario de rede foi centralizar e otimizar informações alem do monitoramento ao inventario em uma unica solução!
A iniciativa foi utilizar o WMI nativo da solução de monitoramento Zabbix transformando a mesma em uma solução de inventario de custo 0 para o empreendimento, então faça as contas!
whats? How is it ?
Não me pergunte como , veja os resultados.
Do the math!
Licenciamento zabbix: 0800.
Outras soluções ???
Case Santos F.C. |Gerência de TI com ZabbixWagner Morais
Apresentação do case do Santos Futebol Clube. O quanto foi possível melhorar a infraestrutura e a gestão de TI com o Zabbix. Implementando monitoramento centralizado para identificar de forma dinâmica e instantânea falhas na infraestrutura.
Continuous Integration has improved software development process, while deployment of software often does not get as much attention. In this presentation we describe an approach for efficient cloud deployment of highly distributed system which can be quite complex to deal with. DeploymentManager(DM) tool is developed as internal project of Infobip company for managing large number(cca. 250 instances) of internal services in our data centers using automated deployment and controling load balancing software(Apache, HaProxy). With DM tool it is possible to deploy build(jar, war, egg…) on any environment(Windows, Linux) in max 5 minutes, without technical knowledge about deployment process, making Continuous Delivery process fast and simple.
OpenStack Days East -- MySQL Options in OpenStackMatt Lord
In most production OpenStack installations, you want the backing metadata store to be highly available. For this, the de facto standard has become MySQL+Galera. In order to help you meet this basic use case even better, I will introduce you to the brand new native MySQL HA solution called MySQL Group Replication. This allows you to easily go from a single instance of MySQL to a MySQL service that's natively distributed and highly available, while eliminating the need for any third party library and implementations.
If you have an extremely large OpenStack installation in production, then you are likely to eventually run into write scaling issues and the metadata store itself can become a bottleneck. For this use case, MySQL NDB Cluster can allow you to linearly scale the metadata store as your needs grow. I will introduce you to the core features of MySQL NDB Cluster--which include in-memory OLTP, transparent sharding, and support for active/active multi-datacenter clusters--that will allow you to meet even the most demanding of use cases with ease.
Scalable and Reliable Logging at PinterestKrishna Gade
At Pinterest, hundreds of services and third-party tools that are implemented in various programming languages generate billions of events every day. To achieve scalable and reliable low latency logging, there are several challenges: (1) uploading logs that are generated in various formats from tens of thousands of hosts to Kafka in a timely manner; (2) running Kafka reliably on Amazon Web Services where the virtual instances are less reliable than on-premises hardware; (3) moving tens of terabytes data per day from Kafka to cloud storage reliably and efficiently, and guaranteeing exact one time persistence per message.
In this talk, we will present Pinterest’s logging pipeline, and share our experience addressing these challenges. We will dive deep into the three components we developed: data uploading from service hosts to Kafka, data transportation from Kafka to S3, and data sanitization. We will also share our experience in operating Kafka at scale in the cloud.
Sergey Dzyuban "To Build My Own Cloud with Blackjack…"Fwdays
Cloud providers like Amazon or Google have a great user experience to create and manage PaaS. But is it possible to reproduce the same experience and flexibility locally, in the on-premise datacenter? What if your own infrastructure grows to fast and your team can’t deal with it in the old way? What does Jenkins, .NET microservices and TVs for daily meetings have in common?
This talk shares our experience using DC/OS (datacenter operating system) for building flexible and stable infrastructure. I will show the evolution of private cloud from the first steps with Vagrant to the hybrid cloud with instance groups in Google Cloud, the benefits it gives us and the problems we get instead.
Keeping WebSphere under control with free tools - Wannes & Sharon share some tips and experience on the free tools they use daily to monitor Connections environments using FREE tools
Keeping WebSphere under control with free tools - Wannes & Sharon share some tips and experience on the free tools they use daily to monitor Connections environments using FREE tools
Monitoramento de Aplicações Web Modernas com ZabbixAndré Déo
Demonstrar que com os recursos nativos da ferramenta, atrelados à desenvolvedores integrados com a equipe de operações (DevOps) é possível monitorar aplicações web modernas, que utilizam recursos como APIs, REST e JSON.
Unlocking Productivity: Leveraging the Potential of Copilot in Microsoft 365, a presentation by Christoforos Vlachos, Senior Solutions Manager – Modern Workplace, Uni Systems
Removing Uninteresting Bytes in Software FuzzingAftab Hussain
Imagine a world where software fuzzing, the process of mutating bytes in test seeds to uncover hidden and erroneous program behaviors, becomes faster and more effective. A lot depends on the initial seeds, which can significantly dictate the trajectory of a fuzzing campaign, particularly in terms of how long it takes to uncover interesting behaviour in your code. We introduce DIAR, a technique designed to speedup fuzzing campaigns by pinpointing and eliminating those uninteresting bytes in the seeds. Picture this: instead of wasting valuable resources on meaningless mutations in large, bloated seeds, DIAR removes the unnecessary bytes, streamlining the entire process.
In this work, we equipped AFL, a popular fuzzer, with DIAR and examined two critical Linux libraries -- Libxml's xmllint, a tool for parsing xml documents, and Binutil's readelf, an essential debugging and security analysis command-line tool used to display detailed information about ELF (Executable and Linkable Format). Our preliminary results show that AFL+DIAR does not only discover new paths more quickly but also achieves higher coverage overall. This work thus showcases how starting with lean and optimized seeds can lead to faster, more comprehensive fuzzing campaigns -- and DIAR helps you find such seeds.
- These are slides of the talk given at IEEE International Conference on Software Testing Verification and Validation Workshop, ICSTW 2022.
Enchancing adoption of Open Source Libraries. A case study on Albumentations.AIVladimir Iglovikov, Ph.D.
Presented by Vladimir Iglovikov:
- https://www.linkedin.com/in/iglovikov/
- https://x.com/viglovikov
- https://www.instagram.com/ternaus/
This presentation delves into the journey of Albumentations.ai, a highly successful open-source library for data augmentation.
Created out of a necessity for superior performance in Kaggle competitions, Albumentations has grown to become a widely used tool among data scientists and machine learning practitioners.
This case study covers various aspects, including:
People: The contributors and community that have supported Albumentations.
Metrics: The success indicators such as downloads, daily active users, GitHub stars, and financial contributions.
Challenges: The hurdles in monetizing open-source projects and measuring user engagement.
Development Practices: Best practices for creating, maintaining, and scaling open-source libraries, including code hygiene, CI/CD, and fast iteration.
Community Building: Strategies for making adoption easy, iterating quickly, and fostering a vibrant, engaged community.
Marketing: Both online and offline marketing tactics, focusing on real, impactful interactions and collaborations.
Mental Health: Maintaining balance and not feeling pressured by user demands.
Key insights include the importance of automation, making the adoption process seamless, and leveraging offline interactions for marketing. The presentation also emphasizes the need for continuous small improvements and building a friendly, inclusive community that contributes to the project's growth.
Vladimir Iglovikov brings his extensive experience as a Kaggle Grandmaster, ex-Staff ML Engineer at Lyft, sharing valuable lessons and practical advice for anyone looking to enhance the adoption of their open-source projects.
Explore more about Albumentations and join the community at:
GitHub: https://github.com/albumentations-team/albumentations
Website: https://albumentations.ai/
LinkedIn: https://www.linkedin.com/company/100504475
Twitter: https://x.com/albumentations
Maruthi Prithivirajan, Head of ASEAN & IN Solution Architecture, Neo4j
Get an inside look at the latest Neo4j innovations that enable relationship-driven intelligence at scale. Learn more about the newest cloud integrations and product enhancements that make Neo4j an essential choice for developers building apps with interconnected data and generative AI.
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...James Anderson
Effective Application Security in Software Delivery lifecycle using Deployment Firewall and DBOM
The modern software delivery process (or the CI/CD process) includes many tools, distributed teams, open-source code, and cloud platforms. Constant focus on speed to release software to market, along with the traditional slow and manual security checks has caused gaps in continuous security as an important piece in the software supply chain. Today organizations feel more susceptible to external and internal cyber threats due to the vast attack surface in their applications supply chain and the lack of end-to-end governance and risk management.
The software team must secure its software delivery process to avoid vulnerability and security breaches. This needs to be achieved with existing tool chains and without extensive rework of the delivery processes. This talk will present strategies and techniques for providing visibility into the true risk of the existing vulnerabilities, preventing the introduction of security issues in the software, resolving vulnerabilities in production environments quickly, and capturing the deployment bill of materials (DBOM).
Speakers:
Bob Boule
Robert Boule is a technology enthusiast with PASSION for technology and making things work along with a knack for helping others understand how things work. He comes with around 20 years of solution engineering experience in application security, software continuous delivery, and SaaS platforms. He is known for his dynamic presentations in CI/CD and application security integrated in software delivery lifecycle.
Gopinath Rebala
Gopinath Rebala is the CTO of OpsMx, where he has overall responsibility for the machine learning and data processing architectures for Secure Software Delivery. Gopi also has a strong connection with our customers, leading design and architecture for strategic implementations. Gopi is a frequent speaker and well-known leader in continuous delivery and integrating security into software delivery.
DevOps and Testing slides at DASA ConnectKari Kakkonen
My and Rik Marselis slides at 30.5.2024 DASA Connect conference. We discuss about what is testing, then what is agile testing and finally what is Testing in DevOps. Finally we had lovely workshop with the participants trying to find out different ways to think about quality and testing in different parts of the DevOps infinity loop.
Communications Mining Series - Zero to Hero - Session 1DianaGray10
This session provides introduction to UiPath Communication Mining, importance and platform overview. You will acquire a good understand of the phases in Communication Mining as we go over the platform with you. Topics covered:
• Communication Mining Overview
• Why is it important?
• How can it help today’s business and the benefits
• Phases in Communication Mining
• Demo on Platform overview
• Q/A
UiPath Test Automation using UiPath Test Suite series, part 5DianaGray10
Welcome to UiPath Test Automation using UiPath Test Suite series part 5. In this session, we will cover CI/CD with devops.
Topics covered:
CI/CD with in UiPath
End-to-end overview of CI/CD pipeline with Azure devops
Speaker:
Lyndsey Byblow, Test Suite Sales Engineer @ UiPath, Inc.
In his public lecture, Christian Timmerer provides insights into the fascinating history of video streaming, starting from its humble beginnings before YouTube to the groundbreaking technologies that now dominate platforms like Netflix and ORF ON. Timmerer also presents provocative contributions of his own that have significantly influenced the industry. He concludes by looking at future challenges and invites the audience to join in a discussion.
Dr. Sean Tan, Head of Data Science, Changi Airport Group
Discover how Changi Airport Group (CAG) leverages graph technologies and generative AI to revolutionize their search capabilities. This session delves into the unique search needs of CAG’s diverse passengers and customers, showcasing how graph data structures enhance the accuracy and relevance of AI-generated search results, mitigating the risk of “hallucinations” and improving the overall customer journey.
4. Why shall we use monitoring?
Most important reasons:
• Warn and act in case of any problems.
• Downtimes are very expensive!
• To identify and fix problems ASAP before customers start calling.
• More productive work of IT staff
• To automate routine tasks, check of availability of resources
• To plan hardware resources. Capacity planning and trends.
• To measure and analyse quality of provided and used services (SLA)
A good monitoring system makes us confident our business is running!
5. History
Zabbix is celebrating its 8th anniversary!
• Choice of 1998 — HP OpenView, IBM, BMC: expensive to buy and maintain
• How to name it? ABCDE...Zabbix!
• April 2001 — the first public release Zabbix 1.0alpha1
• April 2004 — the first stable release Zabbix 1.0
• April 2005 — the company Zabbix SIA was established: commercial support
Zabbix today. We have made a good progress!
Zabbix 1.6.4, 500 downloads per day, 15.000 forum users
Zabbix company is growing, 20 Zabbix partners (Europe, Japan, the US)
6. What is Zabbix?
Zabbix is an Open Source distributed monitoring system capable of monitoring
availability and performance of servers, network devices, applications.
Zabbix functionality:
• Agent-less/based monitoring
• Auto-discovery
• Escalations and repeated notifications
• Pro-active monitoring, remote actions
• WEB monitoring
• Graphs, maps, screens
• IT Services (SLA), reports
• Distributed monitoring, IPv6 and more!
7. Zabbix: main components
Server:
• Zabbix core, system logic
• Data processing, escalations
WEB front-end:
• Access to historical data
• Configuration
Agent:
• Server data collection, actions
Proxy:
• Remote data collection
8. Technical details
Important technical decisions:
• WEB front-end for data visualisation and configuration
• Written in the C language, PHP front-end. No Java/Python/Perl/Ruby on the
server and agent side! No fork(), native syscalls() are used instead.
• Support of virtually all platforms (Linux, *BSD, Solaris, AIX, HP-UX,
Windows,...)
• Choice of database engines: MySQL, PostgreSQL, Oracle, SQLite
• We do not reuse Nagios, RRD, Cacti
Key principles of Zabbix development:
• Keep things simple (KISS), yet be very flexible
• Maintain low hardware requirements, should not affect production
9. Why would we choose Zabbix?
What makes Zabbix so special?
• All-in-one solution only when it comes to monitoring!
• All historical data, trends and configuration is stored in a database
• Ready for monitoring of small and LARGE distributed environments
• True Open Source (GPLv2) solution, no commercial versions.
• All logic is on the server side, agents are for data collection only
• Extremely flexible! Triggers, escalations, new checks, screens, and more.
• Designed to deal with unstable communications
• Full support of IPv6
10. How to monitor
Service checks: SNMP v1,v2,v3:
• FTP, SSH, HTTP, SMTP, DNS ... • Network devices
• Normally NET-SNMP for servers
Zabbix Agent: • Monitoring of applications (Oracle,
• Аctive and passive checks Weblogic, Websphere, PostgreSQL,
• Monitoring of logs, event logs MySQL, ...)
• Easy to extend • SNMP traps
• Remote command execution
• Extremely efficient! IPMI:
• Monitoring of hardware
Other: • Remote management (reboot, reset,
WMI, JMX, Nagios plugins halt)
11. Use of Zabbix agent
Active checks:
• Highly efficient
• Buffering of collected data
Passive checks:
• Requires polling on the Zabbix
server side
• Additional performance hit
because of polling and network
bandwidth
13. Mmm... Triggers!
Trigger is a flexible logical expression used to define a problem condition.
• Status (value) of a trigger represents system state
• Change of trigger value generates events
• It is one of the ways to deal with flapping
CPU load is too high: {host:cpuload.last(0)}>5
CPU load is too high: {host:cpuload.min(300)}>2
CPU load is too high: {host:cpuload.min(300)}>2 & {host:cpuuser.min(300)}>50
CPU load is too high: {host:cpuload.min(300)}>2 & {host2:backup.last(0)}=0
We decide how to define «CPU load is too high» not Zabbix itself!
14. Dependencies
They are used to:
• Avoid notifications
• Define dependencies between different problems (related to networks,
applications, anything). No host dependencies!
Server is down → Switch1 is down → Switch2 is down
WEB App is down → MySQL is not responsive → No free disk space on /tmp
15. Escalations
Different scenarios: Example (reaction to a failed WEB check):
• Delayed notifications
• Repeated notifications Increase step every 5 minutes
• Execution of commands Step 1-3: Send message to Unix Admins
• Escalation to other users Step 3-5: Send message to Boss if not ACK
• Recovery messages Step 6: Restart Apache if not ACK
• Different actions for Step 7: Reboot server if not ACK
acknowledged and not Step 10: Send message to all of not ACK
acknowledges events
16. Visualisation: Dashboard
Favourite resources:
• Maps
• Graphs
• Screens
High-level view:
• Problems by host group
• Zabbix statistics
• List of the latest issues
• WEB monitoring info
• Auto-discovery
17.
18. Visualisation: Graphs
Immediate access:
• Any period of time
• Easy time-navigation
• Two mouse-click zooming
• Problem conditions displayed
• Non-working time is marked
• Not generated in advance!
Graph types:
• Standard (dots, lines, colors)
• Stacked
• Pie
19.
20. Visualisation: Screens
Different blocks:
• Graphs
• Maps
• Plain text data
• List of problems
• High level stats
Slide shows:
• Combination of screens
• Displayed one after
another
21.
22. WEB monitoring
Goals:
• Monitoring of user experience
• Support of complex scenarios
• Performance monitoring
• Availability monitoring
Example:
Step 1 Access home page
Step 2 Login (POST, GET)
Step 3 Run report
Step 4 Logout
23.
24. IT Services
Goals:
• Business level monitoring
• SLA monitoring
• We care about services
• Escalation of problems
• Root cause of the problem
Tree structure based on:
• Dependencies
• Physical location
• Type of service, etc
25.
26. User management
Authentication:
• Standard: Zabbix database
• LDAP (Active Directory)
• Apache (Kerberos, Unix, etc)
Permissions:
• Depends of user type
• User group level permissions
Also:
• Notifications-only user groups
27. Extending Zabbix
New Zabbix agent-side check:
UserParameter=mysql.qps,mysqladmin –uroot status|cut –f9 –d”:”
UserParameter=sum[*],echo “$1+$2”|bc
Examples: mysql.qps = 456, sum[4,5] = 9
New notification methods:
• Just a matter of writing a shell script (voice generation, Skype call, anything)
New server side checks:
• Just a matter of writing a shell script
29. Our environment
Situation:
• Several thousands of servers and network devices
• Distributed accross 2-100 data centers or branches
• Centralised monitoring is required
30. Zabbix: several approaches
1 Server
1 Server Distributed
Many Proxies
• One Zabbix server • One Zabbix server • One Zabbix server per
does everything • One Proxy per data data center
center or company • More effort to maintain
branch • Can be used with
Proxies
31. What is Proxy?
Proxy is a data collector. It is also used for auto-discovery.
Advantages:
• Makes architecture easier
• Does not require significant resources
• Offloads Zabbix server
32. Proxy: how does it work?
Management: Connection loss processing:
• Data is buferred in the Proxy database
• Data collection only • Will be sent on connection recovery
• Fully managed via WEB front-end • No notifications about local problems!
• Configuration is stored on the
Zabbix server side
• All connections are initiated by
Proxy
• Collection of thousands of values
per second
33. Distributed monitoring
Basic attributes:
• Tree-like structure
• Node is a Zabbix server
• Nodes are platform
independent
Managements:
• Two-way replication of
configuration
• Parent node controls child
nodes
34. Processing of connection loss
What will stop working?
• Data sending to parent node
• Synchronisation of configuration
Everything else will keep working!
35. Thousands of devices: solutions
Problems and solutions:
• Huge data volume: use database partitions for historical data
• Integration with existing systems: LDAP authentication, notifcation
methods to open tickets, XML import/export for configuration
management and inventory
• Maintenance: templates, mass updates
• Upgrades: all Zabbix components are compatible within one major
release 1.6.x
36. Choice of the best schema
Depends on the requirements:
• Local administration
• Full-featured monitoring when no connection between data centers
(branches)
Distributed
1 Server
Many Proxies
1 Server Distributed monitoring
Adding Proxies
Getting used to
Zabbix
Adopt Open Source