SlideShare a Scribd company logo
Running the World’s Internet Servers 
Loving the Database 
By Steve Mushero 
September, 2014 
Running the World’s Internet Servers www.ChinaNetCloud.com 
Zabbix 
Build & Manage Servers  Optimize & Manage Servers  Managed Cloud Servers Copyright © 2014 ChinaNetCloud
Running the World’s Internet Servers www.ChinaNetCloud.com 
Greetings 
I’m Steve 
I’m from Shanghai, China 
We have a big Internet there 
We have a big business 
We have a big monitoring system 
That’s Zabbix 
We love the Database 
Let me tell you more about it . . . 
2 
Hello
Our System & Database 
Zabbix 1.8.3 – Going to 2.2 now 
1250 hosts, 300K items, 550 new values/sec 
MySQL Percona 5.5 w/ very optimized config 
Will go to 5.6 when 5.6 is more stable 
No query cache 
Main DB about 200GB in size 
History is about 2 billion rows of data, 157GB size 
Trends is 500 million, 37GB size 
Backups take 8 hours (including compress/encrypt) 
Has 48GB of Innodb Buffer (will expand to 64+ soon) 
Will go to 96, 128, 256 over time 
Generally runs well, except I/O bottleneck 
Running the World’s Internet Servers www.ChinaNetCloud.com 
3
Running the World’s Internet Servers www.ChinaNetCloud.com 
Zabbix Database 
MySQL for us 
Not Postgres/Oracle – No idea about them 
1.8 & 2.2 Versions 
Very familiar with 1.8.3 
Will try to focus on 2.2 
Overall, very logical 
Easy to use/query 
Very powerful 
Some weird things 
Some complex things 
Key values in defines.inc.php 
I have a 2.2 ERD Model for you ! 
4
Database Overview 
Running the World’s Internet Servers www.ChinaNetCloud.com 
~111 Tables 
Well-structured & logical 
All about foreign keys for joins 
All useful joins on ID 
objectid, e.g. hostid, itemid 
All timestamps are epoch time 
Always called ‘clock’ in any table 
Use MySQL unix_timestamp() and from_unixtime() 
Several billion-row tables 
Careful with these ! 
5
Running the World’s Internet Servers www.ChinaNetCloud.com 
Key Subsystems 
Hosts (& Templates) 
Items (& Templates) 
Functions 
Triggers 
Events 
Users 
Trends & History 
Macros, Profiles, HostGroups 
Graphs, Screens 
Queue 
6
Running the World’s Internet Servers www.ChinaNetCloud.com 
Core Subsystems 
Need to know/understand in detail 
Hosts (& Templates) 
Items (& Templates) 
Triggers (& Functions) 
Events 
Trends & History 
7 
Core
Hosts, Items & Templates I 
Critical to understand the relationships 
Hosts & Items are the core of the system 
Running the World’s Internet Servers www.ChinaNetCloud.com 
Hosts have items 
Items drive everything 
Agent Type, Data Type, Intervals, etc. 
SQL to look at a host & items: 
SELECT host, key_, description, from_unixtime(lastclock), lastvalue 
FROM items i JOIN hosts h ON i.hostid = h.hostid 
WHERE h.status = 0 
AND i.status = 0 
HOST KEY_ DESCRIPTION LASTCLOCK LASTVALUE 
srv-nc-webdav1 agent.ping agent-ping 2014-08-19 02:50:44 1 
srv-nc-webdav1 agent.version Version of zabbix_agent(d) 2014-08-19 02:25:13 1.8.3 
srv-nc-webdav1 apache[busyworkers] Apache Busy Workers 2014-08-19 02:50:16 1.000000 
items.lastvalue removed in 2.2 – Pull from history table 
8
Hosts, Items & Templates II 
Templates are just special hosts (status=3) 
Templated items are special items 
Templateid=0 and hosts.status=3 
A host’s non-template items also templateid=0 so careful 
On a host, an item is from a template if templateid>0 
Templateid is for an ITEM, not the template 
Must join (host’s item to template’s item to template) to get template name 
Running the World’s Internet Servers www.ChinaNetCloud.com 
9
Hosts, Items & Templates III 
Hosts have attached Templates 
Running the World’s Internet Servers www.ChinaNetCloud.com 
One or more per host 
Host’s templated items are copied from Template 
Template Items are COPIED to the host 
VERY important to understand this relationship 
Important to understand what can be changed at host level 
So 10 hosts with a template of 10 items 
110 items total in the system (10 + 10x10) 
HOST HOST_STATUS ITEMID TEMPLATEID KEY_ 
NC_Template_Linux 3 24130 0 agent.version 
srv-nc-webdav1 0 23110 24130 agent.version 
srv-nc-dns1 0 23314 24130 agent.version 
SELECT host, h.status, itemid, templateid, key_ 
FROM items i JOIN hosts h ON i.hostid = h.hostid 
WHERE (h.status = 0 OR h.status = 3) 
AND i.status = 0 
AND (i.itemid = 24130 OR i.templateid = 24130) 
ORDER by templateid 
10
Triggers, Events, Functions, and Items I 
Another key set of relationships 
Events are Trigger status changes 
Basically the alerts you see on dashboard 
Drive actions, emails, dashboard 
Triggers are logic that finds problems 
Contain the logic Expression 
Running the World’s Internet Servers www.ChinaNetCloud.com 
String with fomula 
Based on Functions 
Functions are the Zabbix functions 
11
Triggers, Events, Functions, and Items II 
Functions contain items and the function 
Running the World’s Internet Servers www.ChinaNetCloud.com 
Last, avg, etc. 
Items link to hosts, etc. 
Triggers can be multi-host 
This complicates logic 
Hard to link Trigger to a Host – big SQL 
Example: 
TRIGGERID EXPRESSION DESCRIPTION 
10048 {1003079}/{1003080}*100<10 Lack of free memory on server {HOSTNAME} 
10056 {1003227}>300 Too many processes on {HOSTNAME} 
FUNCTIONID ITEMID TRIGGERID FUNCTION PARAMETER 
1003079 10090 10048 last 0 
12
Now that you 
understand all that 
We’ll talk about some tables 
Running the World’s Internet Servers www.ChinaNetCloud.com 13
Running the World’s Internet Servers www.ChinaNetCloud.com 
Hosts 
Core table 
Hosts are Hosts 
Templates are also Hosts 
hosts.status = 3 
Proxies are also Hosts 
Hosts.status = 5 
Don’t confuse with agent on proxy host 
Hosts are Enabled/Disabled 
hosts.status = 0 or 1 
Hosts can be in unreachable state 
hosts.status = 2 
Not clear this is fully used 
Join items to hosts to get host/template name 
14
Running the World’s Internet Servers www.ChinaNetCloud.com 
Hosts – SQL 
List active hosts 
SELECT hostid, proxy_hostid, host, ip, port, status 
FROM hosts WHERE status = 0 
ORDER BY host 
HOSTID STATUS HOST IP PORT STATUS 
10057 0 srv-nc-web1 60.139.13.43 40067 0 
10058 0 srv-nc-web2 223.173.38.47 13050 0 
10059 0 srv-nc-web3 223.213.91.96 20450 0 
15
Running the World’s Internet Servers www.ChinaNetCloud.com 
Items 
Core table 
Linked to hosts on hostid 
Have Type 
Agent, SNMP, Internal, Simple, IPMI, ec. 
Enabled/Disabled, Error 
items.status = 0 or 1 
Can be in error, items.status = 3 
Lastclock tells last collect time in 1.8 
Very useful for pulling data, stats, issues 
Not clear where this went in 2.2 
LastValue has last value in 1.8 
In 2.2 you must get from Trend table, annoying 
16
Running the World’s Internet Servers www.ChinaNetCloud.com 
Items 
Either Host or Template Level 
If from Template, COPIED from Template 
Some fields can be changed at host level 
- Enabled, Interval, History/Trend Retention, Application, Group 
But OVERWRITTEN if you update the Template 
templateid = itemid on the Template 
SELECT host, h.status, itemid, templateid, key_ 
FROM items i JOIN hosts h ON i.hostid = h.hostid 
WHERE h.status = 0 
AND i.status = 0 
ORDER by host, key_ 
HOST STATUS ITEMID TEMPLATEID KEY_ 
srv-nc-def1 0 227843 22934 agent.ping 
srv-nc-def1 0 216864 44130 agent.version 
srv-nc-def1 0 216864 0 local.thing 
17
Running the World’s Internet Servers www.ChinaNetCloud.com 
Items – Get Data 
Get data from items (Ver 1.8) 
/* Get small swap servers with swap used */ 
SELECT host, i.lastvalue AS Swap_Size, 100-ii.lastvalue as Swap_Used 
FROM items i JOIN hosts h ON i.hostid = h.hostid 
JOIN items ii on h.hostid = ii.hostid 
WHERE i.templateid = 24172 /* Swap size */ 
AND i.lastvalue > (1 * 1024 * 1024 * 1024) 
AND i.lastvalue < (88 * 1024 * 1024 * 1024) 
AND h.status = 0 
AND i.status = 0 
AND ii.status = 0 
AND ii.templateid = 154766 /* swap % free */ 
AND (100-ii.lastvalue) > 10 
ORDER BY Swap_Used DESC 
18
Running the World’s Internet Servers www.ChinaNetCloud.com 
Items – Get Data 
items.lastvalue & lastclock removed in 2.2 – Pull from history 
VERY long query – about 5 pages (many parts removed): 
SELECT (case when (i.value_type = 0) 
then (select history.value from history 
where (history.itemid = i.itemid) 
order by history.clock desc limit 1) 
when 
(i.value_type = 1) … 
(i.value_type = 2) … 
(i.value_type = 3) … 
(i.value_type = 4) … 
end) AS lastvalue, 
(case when (i.value_type = 0) 
then (select history.clock from history 
where (history.itemid = i.itemid) 
order by history.clock desc limit 1) 
when 
(i.value_type = 1) 
(i.value_type = 2) 
(i.value_type = 3) 
(i.value_type = 4) 
end) AS lastclock 
from items i where itemid = 23110 
19
Running the World’s Internet Servers www.ChinaNetCloud.com 
Zabbix Queue 
Items can be in a ‘queue’ 
Seen on Admin|Queue screen 
Not a real queue ! 
Just a list of late items 
May change in Version 2.2 
Now() > (lastclock + interval) 
Max ‘queue’ is # of active items 
Stuff can get stuck if error & host disabled 
Hosts disabled but items enabled 
Lots of good SQL for reports 
Queue size, oldest items, queue by host 
Queue by proxy for graphs / triggers 
20
Running the World’s Internet Servers www.ChinaNetCloud.com 
Triggers I 
Core table 
Linked to functions 
Functions link to items 
Items link to hosts 
Functions 
Last, min, max, sum, nodata, etc. 
Enabled/Disabled, Error 
triggers.status = 0 or 1 
21
Running the World’s Internet Servers www.ChinaNetCloud.com 
Triggers II 
Value – OK or Problem 
Also UNKNOWN in 1.8 
Behavior changed in 2.0/2.2 
Priority is here 
URL is here 
Templateid tells you if came from template 
ID is of parent Trigger in the template 
Dependencies are here, complicated 
Trigger_up is trigger we depend on 
Trigger_down is dependent trigger (this trigger?) 
Trigger Level – 0 for no dependency 
22
Running the World’s Internet Servers www.ChinaNetCloud.com 
Events 
The Alerts you see on Dashboard 
Basically a trigger changing status 
Also include auto-discovery, etc. 
Triggered events 
Source = 0, Object = 0 
Objectid will match the trigger 
Status tells you Trigger status 
We tie Alert Tickets to this 
Note: Server re-creates events on restart 
Copies over ACKs 
Very annoying if you tie things to eventid 
We have special PHP to rebuild this relationship 
23
Events – ACK & Duration 
ACKs set flag in Event DB row 
And ACK data in acknowledges table 
Finding Event duration is HARD 
Basically scan forward for next OK event 
Slow and messy 
Important for metrics 
select distinct SUBSTRING_INDEX(from_unixtime(e.clock),' ',1) AlertDate, 
SUBSTRING_INDEX(from_unixtime(e.clock),' ',-1) AlertTime, (select 
floor((eb.clock-e.clock)/60) from events eb where value = 0 and eb.eventid > 
e.eventid and eb.objectid=e.objectid order by eb.eventid limit 1 ) as Duration, 
h.host, t.description, t.priority, from_unixtime(a.clock) ACK, u.name, 
a.acknowledgeid, a.message, floor((a.clock-e.clock)/60) response_time from 
triggers t join functions f on f.triggerid=t.triggerid right join items i on 
f.itemid=i.itemid join hosts h on h.hostid=i.hostid right join events e on 
t.triggerid=e.objectid left join acknowledges a on a.eventid=e.eventid left join 
users u on u.userid=a.userid where e.value=1 and t.triggerid<>19072 and 
e.clock>unix_timestamp('2010-04-01 09:00:00') and 
e.clock<unix_timestamp('2010-04-01 18:00:00') order by e.clock; 
Running the World’s Internet Servers www.ChinaNetCloud.com 
24
Running the World’s Internet Servers www.ChinaNetCloud.com 
History & Trends 
This is the data we collect 
Drives the graphs 
Data first goes to History tables 
By type – uint, text, double, etc. 
Server moves to Trends tables 
On schedule based on Item config 
Summarizes each hour 
Saves min, max, average 
Trends table purged by Housekeeper 
Can partition in DB for much faster purge 
Special SQL can also purge, but I/O heavy 
SQL in History/Trends painful 
Lots of random I/O 
Ideally data fits in server RAM or have SSD 
25
Running the World’s Internet Servers www.ChinaNetCloud.com 
Web Checks 
A bit complex – 4 tables 
Link into the rest of the system 
Httptest 
Test name, status, and interval, last check, and what seems like response time, and 
error text 
httpstep tables 
Step name, URL, timeout, response code, required text 
Httptestitem 
Links to items, creates two items of type 9 for each test - Download Speed & Failed 
Step (type 2 & 3 in this table) 
httpstepitem 
Links to items, creates 3 items of type 9 for each step - Download Speed, Response 
Time, Response Code 
Other 
Note Item History & Trends set to 30 & 90, but can't seem to be edited anywhere 
26
Graphs, Screens & Slideshows 
Pretty part of the system 
Running the World’s Internet Servers www.ChinaNetCloud.com 
Basic tables 
screens & screen_items 
Use resourcetype to know how to link 
Each type can link to underlying graph, item, etc. 
Complex security 
Based on item/host permissions 
27
Users & Security - I 
Running the World’s Internet Servers www.ChinaNetCloud.com 
users table 
alias field is the actual user name 
usrgrp table 
All roles/permissions tied here, API, GUI, etc. 
rights table 
Links usrgrp to server groups with RO, R/W permission 
profiles table 
Use not clear as thousands of rows per user 
Think its drives dashboard and other modules settings 
sessions table 
Basic user session tied to cookie 
mysql> update users set passwd=MD5('somepassword') where alias='Admin'; 
28
Users & Security - II 
Running the World’s Internet Servers www.ChinaNetCloud.com 
Enable/Disable 
Seems to disable by adding to disabled group 
Passwords 
MD5 has WITH TRAILING CARRIAGE RETURN, so use -n on echo: 
$ echo -n "somepassword" | md5sum 9c42a1346e333a770904b2a2b37fa7d3 
But easier to use MySQL function: 
mysql> update users set passwd=MD5('somepassword') where alias='Admin'; 
Refresh field – For all screens 
Defaults very low, set to 0 or much higher 
Otherwise heavy load on DB 
29
Running the World’s Internet Servers www.ChinaNetCloud.com 
Users & Security 
Hosts a User has rights on 
SELECT ug.usrgrpid, ug.name AS user_group, g.name as host_group, host FROM 
users u JOIN users_groups ugl ON u.userid = ugl.userid JOIN usrgrp ug ON 
ugl.usrgrpid = ug.usrgrpid JOIN rights r ON ug.usrgrpid = r.groupid JOIN groups g 
ON r.id = g.groupid JOIN hosts_groups hg ON g.groupid = hg.groupid JOIN hosts 
h on hg.hostid = h.hostid where u.alias = 'steve.mushero' /* userid 116 */ /* 
group 15 is 24x7 user */ AND r.permission in (2,3); 
30
Config and User Profiles 
Refresh rates and DB load 
Dashboard refresh rates 
UPDATE profiles p JOIN users u ON p.userid = u.userid 
SET p.value_int = 300 
WHERE idx LIKE 'web.dahsboard.rf_rate.%' 
AND u.alias not LIKE 'cust%'; 
Running the World’s Internet Servers www.ChinaNetCloud.com 
31
Running the World’s Internet Servers www.ChinaNetCloud.com 
Audit System 
Great, but useless GUI 
Data very fine-grained / detailed 
audit_log & audit_log_details 
Actions & Resources 
See code for list of values 
General SQL to get info: 
SELECT alias, from_unixtime(a.clock), a.action, a.resourcetype, a.details, 
a.resourceid, a.resourcename, ad.table_name, ad.field_name, CAST(ad.oldvalue 
AS UNSIGNED) AS oldvalue, CAST(ad.newvalue AS UNSIGNED) AS newvalue 
FROM auditlog a LEFT JOIN auditlog_details ad ON a.auditid = ad.auditid JOIN users 
u ON a.userid = u.userid 
WHERE resourceid = 109482 AND field_name = 'status' 
AND from_unixtime(clock) > '2013-05-01' 
32
Running the World’s Internet Servers www.ChinaNetCloud.com 
Audit System 
Big select to get details on hosts 
SELECT alias, from_unixtime(a.clock), CASE a.action WHEN 0 THEN "Added" 
WHEN 1 THEN "Updated" 
WHEN 2 THEN "Deleted" ELSE CAST(a.action AS CHAR) END AS action, CASE 
a.resourcetype 
WHEN 4 THEN "Host" 
WHEN 13 THEN "Item ?" 
WHEN 15 THEN "Item" 
ELSE CAST(a.resourcetype AS CHAR) END AS resource_type, a.details, a.resourceid, 
a.resourcename, ad.table_name, ad.field_name, CAST(ad.oldvalue AS CHAR), 
CASE ad.newvalue 
WHEN 0 THEN "Enable” 
WHEN 1 THEN "Disable" END as newvalue 
FROM auditlog a LEFT JOIN auditlog_details ad ON a.auditid = ad.auditid JOIN users 
u ON a.userid = u.userid 
WHERE from_unixtime(clock) > '2012-01-01' /* AND alias LIKE 'matt%' */ 
AND a.resourcetype = 4 /* Host */ AND (field_name = 'status' OR field_name IS 
NULL) /* AND ad.newvalue = 1 /* 1 = Disable */ /* 
AND resourcename LIKE '%web17%'*/; 
33
Running the World’s Internet Servers www.ChinaNetCloud.com 
Safety Reports 
We have dozens of these, such as: 
Items that differ from template 
Missing templates 
Disable items/hosts, forget to enable 
Alerts with no URL/Wiki 
Hosts missing profile data 
Items disabled conflict with trigger 
Web alerts with no trigger 
Web alerts with long/short timeouts 
Hosts in wrong, duplicate, conflicting groups 
Servers in Zabbix, not core system 
Many are quite complex, big SQL 
We will post these on-line 
After updating for Ver 2.2 
34
Running the World’s Internet Servers www.ChinaNetCloud.com 
Housekeeper 
Done by server every hour 
Very slow – Item by Item 
Thousands per second 
A LOT of I/O (mostly read) 
We have SQL to do in bulk 
But too heavy load on I/O system 
We’ll see on SSD, or all data in RAM 
35
Running the World’s Internet Servers www.ChinaNetCloud.com 
Backups 
Backup, of course, but big 
Ours are 8 hours 
200GB of data 
At 1TB this is not manageable 
Need incrementals 
Maybe recent data only 
Do from slave 
You can backup config only 
Ignore history*, trends*, events, audit*, acknowledges 
36
Running the World’s Internet Servers www.ChinaNetCloud.com 
Summary 
We love the Zabbix database 
So should you 
Learn how everything connects 
Focus on key / core tables 
Have fun 
37
Thanks from ChinaNetCloud 
Pioneers in OaaS – Operations as a Service 
Running the World’s Internet Servers www.ChinaNetCloud.com 38
ChinaNetCloud 
Shanghai Headquarters: 
X2 Space 1-601, 1238 Xietu Lu 
Shanghai, 200032 China 
T: +86-21-6422-1946 F: +86-21-6422-4911 
Beijing Office: 
Lee World Business Building #305 
57 Happiness Village Road, Chaoyang District 
Beijing, 100027 China 
Sales@ChinaNetCloud.com 
Running the World’s Internet Servers www.ChinaNetCloud.com 
www.ChinaNetCloud.com 
Silicon Valley Office: 
California Avenue 
Palo Alto, 94123 USA

More Related Content

What's hot

Lukas Macura - Employing Zabbix to monitor OpenWrt (Beesip) devices with Uciprov
Lukas Macura - Employing Zabbix to monitor OpenWrt (Beesip) devices with UciprovLukas Macura - Employing Zabbix to monitor OpenWrt (Beesip) devices with Uciprov
Lukas Macura - Employing Zabbix to monitor OpenWrt (Beesip) devices with Uciprov
Zabbix
 
Zabbix
ZabbixZabbix
Zabbix
pundir5
 
Mikhail Serkov - Zabbix for HPC Cluster Support | ZabConf2016
Mikhail Serkov - Zabbix for HPC Cluster Support | ZabConf2016Mikhail Serkov - Zabbix for HPC Cluster Support | ZabConf2016
Mikhail Serkov - Zabbix for HPC Cluster Support | ZabConf2016
Zabbix
 
Automating Zabbix with Puppet (Werner Dijkerman / 26-11-2015)
Automating Zabbix with Puppet (Werner Dijkerman / 26-11-2015)Automating Zabbix with Puppet (Werner Dijkerman / 26-11-2015)
Automating Zabbix with Puppet (Werner Dijkerman / 26-11-2015)
Nederlandstalige Zabbix Gebruikersgroep
 
Volker Fröhlich - How to Debug Common Agent Issues
Volker Fröhlich - How to Debug Common Agent IssuesVolker Fröhlich - How to Debug Common Agent Issues
Volker Fröhlich - How to Debug Common Agent Issues
Zabbix
 
Monitoring the ELK stack using Zabbix and Grafana (Dennis Kanbier / 26-11-2015)
Monitoring the ELK stack using Zabbix and Grafana (Dennis Kanbier / 26-11-2015)Monitoring the ELK stack using Zabbix and Grafana (Dennis Kanbier / 26-11-2015)
Monitoring the ELK stack using Zabbix and Grafana (Dennis Kanbier / 26-11-2015)
Nederlandstalige Zabbix Gebruikersgroep
 
Alexander Naydenko - Nagios to Zabbix Migration | ZabConf2016
Alexander Naydenko - Nagios to Zabbix Migration | ZabConf2016Alexander Naydenko - Nagios to Zabbix Migration | ZabConf2016
Alexander Naydenko - Nagios to Zabbix Migration | ZabConf2016
Zabbix
 
Rihards Olups - Zabbix at Nokia - Case Study
Rihards Olups - Zabbix at Nokia - Case StudyRihards Olups - Zabbix at Nokia - Case Study
Rihards Olups - Zabbix at Nokia - Case Study
Zabbix
 
Aaron Mildenstein - Using Logstash with Zabbix
Aaron Mildenstein - Using Logstash with ZabbixAaron Mildenstein - Using Logstash with Zabbix
Aaron Mildenstein - Using Logstash with Zabbix
Zabbix
 
Trouble Ticket Integration with Zabbix in Large Environment
Trouble Ticket Integration with Zabbix in Large EnvironmentTrouble Ticket Integration with Zabbix in Large Environment
Trouble Ticket Integration with Zabbix in Large Environment
Alain Ganuchaud
 
Monitoring a billion kilometers of monthly ride sharing at BlaBlaCar - Zabbix...
Monitoring a billion kilometers of monthly ride sharing at BlaBlaCar - Zabbix...Monitoring a billion kilometers of monthly ride sharing at BlaBlaCar - Zabbix...
Monitoring a billion kilometers of monthly ride sharing at BlaBlaCar - Zabbix...
Jean Baptiste Favre
 
Monitoring infrastructure with prometheus
Monitoring infrastructure with prometheusMonitoring infrastructure with prometheus
Monitoring infrastructure with prometheus
Shahnawaz Saifi
 
Zabbix introduction ( RadixCloud Radix Technologies SA)
Zabbix introduction ( RadixCloud Radix Technologies SA)Zabbix introduction ( RadixCloud Radix Technologies SA)
Zabbix introduction ( RadixCloud Radix Technologies SA)
Martin Markovski
 
Nagios Conference 2014 - Jeff Mendoza - Monitoring Microsoft Azure with Nagios
Nagios Conference 2014 - Jeff Mendoza - Monitoring Microsoft Azure with NagiosNagios Conference 2014 - Jeff Mendoza - Monitoring Microsoft Azure with Nagios
Nagios Conference 2014 - Jeff Mendoza - Monitoring Microsoft Azure with Nagios
Nagios
 
So we're running Apache ZooKeeper. Now What? By Camille Fournier
So we're running Apache ZooKeeper. Now What? By Camille Fournier So we're running Apache ZooKeeper. Now What? By Camille Fournier
So we're running Apache ZooKeeper. Now What? By Camille Fournier
Hakka Labs
 
Monitoring with Prometheus
Monitoring with PrometheusMonitoring with Prometheus
Monitoring with Prometheus
Shiao-An Yuan
 
DSLing your System For Scalability Testing Using Gatling - Dublin Scala User ...
DSLing your System For Scalability Testing Using Gatling - Dublin Scala User ...DSLing your System For Scalability Testing Using Gatling - Dublin Scala User ...
DSLing your System For Scalability Testing Using Gatling - Dublin Scala User ...
Aman Kohli
 
Nagios Conference 2011 - Michael Medin - NSClient++: Whats New
Nagios Conference 2011 - Michael Medin - NSClient++: Whats NewNagios Conference 2011 - Michael Medin - NSClient++: Whats New
Nagios Conference 2011 - Michael Medin - NSClient++: Whats New
Nagios
 
Native container monitoring
Native container monitoringNative container monitoring
Native container monitoring
Rohit Jnagal
 
Nagios Conference 2013 - Eric Stanley and Andy Brist - API and Nagios
Nagios Conference 2013 - Eric Stanley and Andy Brist - API and NagiosNagios Conference 2013 - Eric Stanley and Andy Brist - API and Nagios
Nagios Conference 2013 - Eric Stanley and Andy Brist - API and Nagios
Nagios
 

What's hot (20)

Lukas Macura - Employing Zabbix to monitor OpenWrt (Beesip) devices with Uciprov
Lukas Macura - Employing Zabbix to monitor OpenWrt (Beesip) devices with UciprovLukas Macura - Employing Zabbix to monitor OpenWrt (Beesip) devices with Uciprov
Lukas Macura - Employing Zabbix to monitor OpenWrt (Beesip) devices with Uciprov
 
Zabbix
ZabbixZabbix
Zabbix
 
Mikhail Serkov - Zabbix for HPC Cluster Support | ZabConf2016
Mikhail Serkov - Zabbix for HPC Cluster Support | ZabConf2016Mikhail Serkov - Zabbix for HPC Cluster Support | ZabConf2016
Mikhail Serkov - Zabbix for HPC Cluster Support | ZabConf2016
 
Automating Zabbix with Puppet (Werner Dijkerman / 26-11-2015)
Automating Zabbix with Puppet (Werner Dijkerman / 26-11-2015)Automating Zabbix with Puppet (Werner Dijkerman / 26-11-2015)
Automating Zabbix with Puppet (Werner Dijkerman / 26-11-2015)
 
Volker Fröhlich - How to Debug Common Agent Issues
Volker Fröhlich - How to Debug Common Agent IssuesVolker Fröhlich - How to Debug Common Agent Issues
Volker Fröhlich - How to Debug Common Agent Issues
 
Monitoring the ELK stack using Zabbix and Grafana (Dennis Kanbier / 26-11-2015)
Monitoring the ELK stack using Zabbix and Grafana (Dennis Kanbier / 26-11-2015)Monitoring the ELK stack using Zabbix and Grafana (Dennis Kanbier / 26-11-2015)
Monitoring the ELK stack using Zabbix and Grafana (Dennis Kanbier / 26-11-2015)
 
Alexander Naydenko - Nagios to Zabbix Migration | ZabConf2016
Alexander Naydenko - Nagios to Zabbix Migration | ZabConf2016Alexander Naydenko - Nagios to Zabbix Migration | ZabConf2016
Alexander Naydenko - Nagios to Zabbix Migration | ZabConf2016
 
Rihards Olups - Zabbix at Nokia - Case Study
Rihards Olups - Zabbix at Nokia - Case StudyRihards Olups - Zabbix at Nokia - Case Study
Rihards Olups - Zabbix at Nokia - Case Study
 
Aaron Mildenstein - Using Logstash with Zabbix
Aaron Mildenstein - Using Logstash with ZabbixAaron Mildenstein - Using Logstash with Zabbix
Aaron Mildenstein - Using Logstash with Zabbix
 
Trouble Ticket Integration with Zabbix in Large Environment
Trouble Ticket Integration with Zabbix in Large EnvironmentTrouble Ticket Integration with Zabbix in Large Environment
Trouble Ticket Integration with Zabbix in Large Environment
 
Monitoring a billion kilometers of monthly ride sharing at BlaBlaCar - Zabbix...
Monitoring a billion kilometers of monthly ride sharing at BlaBlaCar - Zabbix...Monitoring a billion kilometers of monthly ride sharing at BlaBlaCar - Zabbix...
Monitoring a billion kilometers of monthly ride sharing at BlaBlaCar - Zabbix...
 
Monitoring infrastructure with prometheus
Monitoring infrastructure with prometheusMonitoring infrastructure with prometheus
Monitoring infrastructure with prometheus
 
Zabbix introduction ( RadixCloud Radix Technologies SA)
Zabbix introduction ( RadixCloud Radix Technologies SA)Zabbix introduction ( RadixCloud Radix Technologies SA)
Zabbix introduction ( RadixCloud Radix Technologies SA)
 
Nagios Conference 2014 - Jeff Mendoza - Monitoring Microsoft Azure with Nagios
Nagios Conference 2014 - Jeff Mendoza - Monitoring Microsoft Azure with NagiosNagios Conference 2014 - Jeff Mendoza - Monitoring Microsoft Azure with Nagios
Nagios Conference 2014 - Jeff Mendoza - Monitoring Microsoft Azure with Nagios
 
So we're running Apache ZooKeeper. Now What? By Camille Fournier
So we're running Apache ZooKeeper. Now What? By Camille Fournier So we're running Apache ZooKeeper. Now What? By Camille Fournier
So we're running Apache ZooKeeper. Now What? By Camille Fournier
 
Monitoring with Prometheus
Monitoring with PrometheusMonitoring with Prometheus
Monitoring with Prometheus
 
DSLing your System For Scalability Testing Using Gatling - Dublin Scala User ...
DSLing your System For Scalability Testing Using Gatling - Dublin Scala User ...DSLing your System For Scalability Testing Using Gatling - Dublin Scala User ...
DSLing your System For Scalability Testing Using Gatling - Dublin Scala User ...
 
Nagios Conference 2011 - Michael Medin - NSClient++: Whats New
Nagios Conference 2011 - Michael Medin - NSClient++: Whats NewNagios Conference 2011 - Michael Medin - NSClient++: Whats New
Nagios Conference 2011 - Michael Medin - NSClient++: Whats New
 
Native container monitoring
Native container monitoringNative container monitoring
Native container monitoring
 
Nagios Conference 2013 - Eric Stanley and Andy Brist - API and Nagios
Nagios Conference 2013 - Eric Stanley and Andy Brist - API and NagiosNagios Conference 2013 - Eric Stanley and Andy Brist - API and Nagios
Nagios Conference 2013 - Eric Stanley and Andy Brist - API and Nagios
 

Viewers also liked

MongoDB Application Monitor
MongoDB Application MonitorMongoDB Application Monitor
MongoDB Application Monitor
Philippe Sfeir
 
ChinaNetCloud - Zabbix Monitoring System Overview
ChinaNetCloud - Zabbix Monitoring System OverviewChinaNetCloud - Zabbix Monitoring System Overview
ChinaNetCloud - Zabbix Monitoring System Overview
ChinaNetCloud
 
The hidden power of network maps on Zabbix
The hidden power of network maps on ZabbixThe hidden power of network maps on Zabbix
The hidden power of network maps on Zabbix
Ricardo Santos
 
Introduction to Zabbix - Company, Product, Services and Use Cases
Introduction to Zabbix - Company, Product, Services and Use CasesIntroduction to Zabbix - Company, Product, Services and Use Cases
Introduction to Zabbix - Company, Product, Services and Use Cases
Zabbix
 
Zabbix Performance Tuning
Zabbix Performance TuningZabbix Performance Tuning
Zabbix Performance Tuning
Ricardo Santos
 
Zabbix 2014 Conference : Implementing Zabbix in large Banking Environment
Zabbix 2014 Conference : Implementing Zabbix in large Banking EnvironmentZabbix 2014 Conference : Implementing Zabbix in large Banking Environment
Zabbix 2014 Conference : Implementing Zabbix in large Banking Environment
Alain Ganuchaud
 
Zabbix Performance Tuning
Zabbix Performance TuningZabbix Performance Tuning
Zabbix Performance Tuning
Alessandro Silva
 
Zabbix 3.0 and beyond - FISL 2015
Zabbix 3.0 and beyond - FISL 2015Zabbix 3.0 and beyond - FISL 2015
Zabbix 3.0 and beyond - FISL 2015
Zabbix
 
Zabbix 2.0 - New Features and Improvements
Zabbix 2.0 - New Features and ImprovementsZabbix 2.0 - New Features and Improvements
Zabbix 2.0 - New Features and ImprovementsZabbix
 
Using Zabbix API from Drupal
Using Zabbix API from DrupalUsing Zabbix API from Drupal
Using Zabbix API from Drupal
Ricardo Santos
 
Reporting Large Environment Zabbix Database
Reporting Large Environment Zabbix DatabaseReporting Large Environment Zabbix Database
Reporting Large Environment Zabbix Database
Alain Ganuchaud
 
Monitoring all Elements of Your Database Operations With Zabbix
Monitoring all Elements of Your Database Operations With ZabbixMonitoring all Elements of Your Database Operations With Zabbix
Monitoring all Elements of Your Database Operations With Zabbix
Zabbix
 
VMware vSphere 5.1 Overview
VMware vSphere 5.1 OverviewVMware vSphere 5.1 Overview
VMware vSphere 5.1 Overview
ESXLab
 
Alexei Vladishev - Zabbix - Monitoring Solution for Everyone
Alexei Vladishev - Zabbix - Monitoring Solution for EveryoneAlexei Vladishev - Zabbix - Monitoring Solution for Everyone
Alexei Vladishev - Zabbix - Monitoring Solution for Everyone
Zabbix
 
VMware vSphere technical presentation
VMware vSphere technical presentationVMware vSphere technical presentation
VMware vSphere technical presentationaleyeldean
 
VMware Presentation
VMware PresentationVMware Presentation
VMware Presentation
Emirates Computers
 
Virtualization 101: Everything You Need To Know To Get Started With VMware
Virtualization 101: Everything You Need To Know To Get Started With VMwareVirtualization 101: Everything You Need To Know To Get Started With VMware
Virtualization 101: Everything You Need To Know To Get Started With VMware
Datapath Consulting
 
RTSP Analysis Wireshark
RTSP Analysis WiresharkRTSP Analysis Wireshark
RTSP Analysis Wireshark
Yoss Cohen
 

Viewers also liked (20)

MongoDB Application Monitor
MongoDB Application MonitorMongoDB Application Monitor
MongoDB Application Monitor
 
ChinaNetCloud - Zabbix Monitoring System Overview
ChinaNetCloud - Zabbix Monitoring System OverviewChinaNetCloud - Zabbix Monitoring System Overview
ChinaNetCloud - Zabbix Monitoring System Overview
 
The hidden power of network maps on Zabbix
The hidden power of network maps on ZabbixThe hidden power of network maps on Zabbix
The hidden power of network maps on Zabbix
 
Introduction to Zabbix - Company, Product, Services and Use Cases
Introduction to Zabbix - Company, Product, Services and Use CasesIntroduction to Zabbix - Company, Product, Services and Use Cases
Introduction to Zabbix - Company, Product, Services and Use Cases
 
Zabbix Performance Tuning
Zabbix Performance TuningZabbix Performance Tuning
Zabbix Performance Tuning
 
Zabbix 2014 Conference : Implementing Zabbix in large Banking Environment
Zabbix 2014 Conference : Implementing Zabbix in large Banking EnvironmentZabbix 2014 Conference : Implementing Zabbix in large Banking Environment
Zabbix 2014 Conference : Implementing Zabbix in large Banking Environment
 
Zabbix
Zabbix Zabbix
Zabbix
 
Zabbix Performance Tuning
Zabbix Performance TuningZabbix Performance Tuning
Zabbix Performance Tuning
 
Zabbix 3.0 and beyond - FISL 2015
Zabbix 3.0 and beyond - FISL 2015Zabbix 3.0 and beyond - FISL 2015
Zabbix 3.0 and beyond - FISL 2015
 
Zabbix 2.0 - New Features and Improvements
Zabbix 2.0 - New Features and ImprovementsZabbix 2.0 - New Features and Improvements
Zabbix 2.0 - New Features and Improvements
 
Using Zabbix API from Drupal
Using Zabbix API from DrupalUsing Zabbix API from Drupal
Using Zabbix API from Drupal
 
Reporting Large Environment Zabbix Database
Reporting Large Environment Zabbix DatabaseReporting Large Environment Zabbix Database
Reporting Large Environment Zabbix Database
 
Monitoring all Elements of Your Database Operations With Zabbix
Monitoring all Elements of Your Database Operations With ZabbixMonitoring all Elements of Your Database Operations With Zabbix
Monitoring all Elements of Your Database Operations With Zabbix
 
VMware vSphere 5.1 Overview
VMware vSphere 5.1 OverviewVMware vSphere 5.1 Overview
VMware vSphere 5.1 Overview
 
Alexei Vladishev - Zabbix - Monitoring Solution for Everyone
Alexei Vladishev - Zabbix - Monitoring Solution for EveryoneAlexei Vladishev - Zabbix - Monitoring Solution for Everyone
Alexei Vladishev - Zabbix - Monitoring Solution for Everyone
 
VMware vSphere technical presentation
VMware vSphere technical presentationVMware vSphere technical presentation
VMware vSphere technical presentation
 
VMware Presentation
VMware PresentationVMware Presentation
VMware Presentation
 
Virtualization 101: Everything You Need To Know To Get Started With VMware
Virtualization 101: Everything You Need To Know To Get Started With VMwareVirtualization 101: Everything You Need To Know To Get Started With VMware
Virtualization 101: Everything You Need To Know To Get Started With VMware
 
Introduction to virtualization
Introduction to virtualizationIntroduction to virtualization
Introduction to virtualization
 
RTSP Analysis Wireshark
RTSP Analysis WiresharkRTSP Analysis Wireshark
RTSP Analysis Wireshark
 

Similar to ChinaNetCloud - The Zabbix Database - Zabbix Conference 2014

ChinaNetCloud - Using Zabbix Monitoring at Scale - Zabbix Conference 2014
ChinaNetCloud - Using Zabbix Monitoring at Scale - Zabbix Conference 2014ChinaNetCloud - Using Zabbix Monitoring at Scale - Zabbix Conference 2014
ChinaNetCloud - Using Zabbix Monitoring at Scale - Zabbix Conference 2014
ChinaNetCloud
 
Scaling asp.net websites to millions of users
Scaling asp.net websites to millions of usersScaling asp.net websites to millions of users
Scaling asp.net websites to millions of users
oazabir
 
06 response-headers
06 response-headers06 response-headers
06 response-headerssnopteck
 
AD102 - Break out of the Box
AD102 - Break out of the BoxAD102 - Break out of the Box
AD102 - Break out of the Box
Karl-Henry Martinsson
 
A language for the Internet: Why JavaScript and Node.js is right for Internet...
A language for the Internet: Why JavaScript and Node.js is right for Internet...A language for the Internet: Why JavaScript and Node.js is right for Internet...
A language for the Internet: Why JavaScript and Node.js is right for Internet...Tom Croucher
 
A language for the Internet: Why JavaScript and Node.js is right for Internet...
A language for the Internet: Why JavaScript and Node.js is right for Internet...A language for the Internet: Why JavaScript and Node.js is right for Internet...
A language for the Internet: Why JavaScript and Node.js is right for Internet...
Tom Croucher
 
Dok Talks #124 - Intro to Druid on Kubernetes
Dok Talks #124 - Intro to Druid on KubernetesDok Talks #124 - Intro to Druid on Kubernetes
Dok Talks #124 - Intro to Druid on Kubernetes
DoKC
 
Finding an unusual cause of max_user_connections in MySQL
Finding an unusual cause of max_user_connections in MySQLFinding an unusual cause of max_user_connections in MySQL
Finding an unusual cause of max_user_connections in MySQL
Olivier Doucet
 
Postgres Vienna DB Meetup 2014
Postgres Vienna DB Meetup 2014Postgres Vienna DB Meetup 2014
Postgres Vienna DB Meetup 2014
Michael Renner
 
Sherlock Homepage - A detective story about running large web services - NDC ...
Sherlock Homepage - A detective story about running large web services - NDC ...Sherlock Homepage - A detective story about running large web services - NDC ...
Sherlock Homepage - A detective story about running large web services - NDC ...
Maarten Balliauw
 
fog or: How I Learned to Stop Worrying and Love the Cloud
fog or: How I Learned to Stop Worrying and Love the Cloudfog or: How I Learned to Stop Worrying and Love the Cloud
fog or: How I Learned to Stop Worrying and Love the Cloud
Wesley Beary
 
Islands: Puppet at Bulletproof Networks
Islands: Puppet at Bulletproof NetworksIslands: Puppet at Bulletproof Networks
Islands: Puppet at Bulletproof Networks
Lindsay Holmwood
 
Cookin’ up Hybrid Clouds: Chef and the Accenture Cloud Platform
Cookin’ up Hybrid Clouds: Chef and the Accenture Cloud PlatformCookin’ up Hybrid Clouds: Chef and the Accenture Cloud Platform
Cookin’ up Hybrid Clouds: Chef and the Accenture Cloud Platform
Chef Software, Inc.
 
Sinatra and JSONQuery Web Service
Sinatra and JSONQuery Web ServiceSinatra and JSONQuery Web Service
Sinatra and JSONQuery Web Service
vvatikiotis
 
Deferred Processing in Ruby - Philly rb - August 2011
Deferred Processing in Ruby - Philly rb - August 2011Deferred Processing in Ruby - Philly rb - August 2011
Deferred Processing in Ruby - Philly rb - August 2011rob_dimarco
 
fog or: How I Learned to Stop Worrying and Love the Cloud (OpenStack Edition)
fog or: How I Learned to Stop Worrying and Love the Cloud (OpenStack Edition)fog or: How I Learned to Stop Worrying and Love the Cloud (OpenStack Edition)
fog or: How I Learned to Stop Worrying and Love the Cloud (OpenStack Edition)
Wesley Beary
 
Introduction to Node.js
Introduction to Node.jsIntroduction to Node.js
Introduction to Node.jsRichard Lee
 
Non-blocking I/O, Event loops and node.js
Non-blocking I/O, Event loops and node.jsNon-blocking I/O, Event loops and node.js
Non-blocking I/O, Event loops and node.js
Marcus Frödin
 
Web Technologies - forms and actions
Web Technologies -  forms and actionsWeb Technologies -  forms and actions
Web Technologies - forms and actionsAren Zomorodian
 
Puppet Camp Melbourne 2014: Node Collaboration with PuppetDB
Puppet Camp Melbourne 2014: Node Collaboration with PuppetDB Puppet Camp Melbourne 2014: Node Collaboration with PuppetDB
Puppet Camp Melbourne 2014: Node Collaboration with PuppetDB Puppet
 

Similar to ChinaNetCloud - The Zabbix Database - Zabbix Conference 2014 (20)

ChinaNetCloud - Using Zabbix Monitoring at Scale - Zabbix Conference 2014
ChinaNetCloud - Using Zabbix Monitoring at Scale - Zabbix Conference 2014ChinaNetCloud - Using Zabbix Monitoring at Scale - Zabbix Conference 2014
ChinaNetCloud - Using Zabbix Monitoring at Scale - Zabbix Conference 2014
 
Scaling asp.net websites to millions of users
Scaling asp.net websites to millions of usersScaling asp.net websites to millions of users
Scaling asp.net websites to millions of users
 
06 response-headers
06 response-headers06 response-headers
06 response-headers
 
AD102 - Break out of the Box
AD102 - Break out of the BoxAD102 - Break out of the Box
AD102 - Break out of the Box
 
A language for the Internet: Why JavaScript and Node.js is right for Internet...
A language for the Internet: Why JavaScript and Node.js is right for Internet...A language for the Internet: Why JavaScript and Node.js is right for Internet...
A language for the Internet: Why JavaScript and Node.js is right for Internet...
 
A language for the Internet: Why JavaScript and Node.js is right for Internet...
A language for the Internet: Why JavaScript and Node.js is right for Internet...A language for the Internet: Why JavaScript and Node.js is right for Internet...
A language for the Internet: Why JavaScript and Node.js is right for Internet...
 
Dok Talks #124 - Intro to Druid on Kubernetes
Dok Talks #124 - Intro to Druid on KubernetesDok Talks #124 - Intro to Druid on Kubernetes
Dok Talks #124 - Intro to Druid on Kubernetes
 
Finding an unusual cause of max_user_connections in MySQL
Finding an unusual cause of max_user_connections in MySQLFinding an unusual cause of max_user_connections in MySQL
Finding an unusual cause of max_user_connections in MySQL
 
Postgres Vienna DB Meetup 2014
Postgres Vienna DB Meetup 2014Postgres Vienna DB Meetup 2014
Postgres Vienna DB Meetup 2014
 
Sherlock Homepage - A detective story about running large web services - NDC ...
Sherlock Homepage - A detective story about running large web services - NDC ...Sherlock Homepage - A detective story about running large web services - NDC ...
Sherlock Homepage - A detective story about running large web services - NDC ...
 
fog or: How I Learned to Stop Worrying and Love the Cloud
fog or: How I Learned to Stop Worrying and Love the Cloudfog or: How I Learned to Stop Worrying and Love the Cloud
fog or: How I Learned to Stop Worrying and Love the Cloud
 
Islands: Puppet at Bulletproof Networks
Islands: Puppet at Bulletproof NetworksIslands: Puppet at Bulletproof Networks
Islands: Puppet at Bulletproof Networks
 
Cookin’ up Hybrid Clouds: Chef and the Accenture Cloud Platform
Cookin’ up Hybrid Clouds: Chef and the Accenture Cloud PlatformCookin’ up Hybrid Clouds: Chef and the Accenture Cloud Platform
Cookin’ up Hybrid Clouds: Chef and the Accenture Cloud Platform
 
Sinatra and JSONQuery Web Service
Sinatra and JSONQuery Web ServiceSinatra and JSONQuery Web Service
Sinatra and JSONQuery Web Service
 
Deferred Processing in Ruby - Philly rb - August 2011
Deferred Processing in Ruby - Philly rb - August 2011Deferred Processing in Ruby - Philly rb - August 2011
Deferred Processing in Ruby - Philly rb - August 2011
 
fog or: How I Learned to Stop Worrying and Love the Cloud (OpenStack Edition)
fog or: How I Learned to Stop Worrying and Love the Cloud (OpenStack Edition)fog or: How I Learned to Stop Worrying and Love the Cloud (OpenStack Edition)
fog or: How I Learned to Stop Worrying and Love the Cloud (OpenStack Edition)
 
Introduction to Node.js
Introduction to Node.jsIntroduction to Node.js
Introduction to Node.js
 
Non-blocking I/O, Event loops and node.js
Non-blocking I/O, Event loops and node.jsNon-blocking I/O, Event loops and node.js
Non-blocking I/O, Event loops and node.js
 
Web Technologies - forms and actions
Web Technologies -  forms and actionsWeb Technologies -  forms and actions
Web Technologies - forms and actions
 
Puppet Camp Melbourne 2014: Node Collaboration with PuppetDB
Puppet Camp Melbourne 2014: Node Collaboration with PuppetDB Puppet Camp Melbourne 2014: Node Collaboration with PuppetDB
Puppet Camp Melbourne 2014: Node Collaboration with PuppetDB
 

More from ChinaNetCloud

AWS ELB Tips & Best Practices
AWS ELB Tips & Best PracticesAWS ELB Tips & Best Practices
AWS ELB Tips & Best Practices
ChinaNetCloud
 
OpsStack--Integrated Operation Platform
OpsStack--Integrated Operation PlatformOpsStack--Integrated Operation Platform
OpsStack--Integrated Operation Platform
ChinaNetCloud
 
ChinaNetCloud Online Lecture:Something About Tshark
ChinaNetCloud Online Lecture:Something About TsharkChinaNetCloud Online Lecture:Something About Tshark
ChinaNetCloud Online Lecture:Something About Tshark
ChinaNetCloud
 
ChinaNetCloud Online Lecture: Fight Against External Attacks From Different L...
ChinaNetCloud Online Lecture: Fight Against External Attacks From Different L...ChinaNetCloud Online Lecture: Fight Against External Attacks From Different L...
ChinaNetCloud Online Lecture: Fight Against External Attacks From Different L...
ChinaNetCloud
 
Steve Mushero on Entrepreneurship - 创业 - 崔牛会
Steve Mushero on Entrepreneurship - 创业 - 崔牛会Steve Mushero on Entrepreneurship - 创业 - 崔牛会
Steve Mushero on Entrepreneurship - 创业 - 崔牛会
ChinaNetCloud
 
Dev-Ops与Docker的最佳实践 QCon2016 北京站演讲
Dev-Ops与Docker的最佳实践 QCon2016 北京站演讲Dev-Ops与Docker的最佳实践 QCon2016 北京站演讲
Dev-Ops与Docker的最佳实践 QCon2016 北京站演讲
ChinaNetCloud
 
云中漫步 颠覆创新_创业邦春季创新峰会主题演讲 Cloud Innovation in China
云中漫步 颠覆创新_创业邦春季创新峰会主题演讲 Cloud Innovation in China云中漫步 颠覆创新_创业邦春季创新峰会主题演讲 Cloud Innovation in China
云中漫步 颠覆创新_创业邦春季创新峰会主题演讲 Cloud Innovation in China
ChinaNetCloud
 
运维安全 抵抗黑客攻击_云络安全沙龙4月上海站主题分享
运维安全 抵抗黑客攻击_云络安全沙龙4月上海站主题分享运维安全 抵抗黑客攻击_云络安全沙龙4月上海站主题分享
运维安全 抵抗黑客攻击_云络安全沙龙4月上海站主题分享
ChinaNetCloud
 
AWS Summit OaaS Talk by ChinaNetCloud
AWS Summit OaaS Talk by ChinaNetCloudAWS Summit OaaS Talk by ChinaNetCloud
AWS Summit OaaS Talk by ChinaNetCloud
ChinaNetCloud
 
Running Internet Systems in China - The Details You Need to Succeed in Chines...
Running Internet Systems in China - The Details You Need to Succeed in Chines...Running Internet Systems in China - The Details You Need to Succeed in Chines...
Running Internet Systems in China - The Details You Need to Succeed in Chines...
ChinaNetCloud
 
Making Internet Operations Easier
Making Internet Operations EasierMaking Internet Operations Easier
Making Internet Operations Easier
ChinaNetCloud
 
Internet Cloud Operations - ChinaNetcloud & AWS Event Beijing
Internet Cloud Operations - ChinaNetcloud & AWS Event BeijingInternet Cloud Operations - ChinaNetcloud & AWS Event Beijing
Internet Cloud Operations - ChinaNetcloud & AWS Event Beijing
ChinaNetCloud
 
Big Data Security (ChinaNetCloud - Guiyang Conference)
Big Data Security (ChinaNetCloud - Guiyang Conference)Big Data Security (ChinaNetCloud - Guiyang Conference)
Big Data Security (ChinaNetCloud - Guiyang Conference)
ChinaNetCloud
 
Internet System Security Overview
Internet System Security OverviewInternet System Security Overview
Internet System Security Overview
ChinaNetCloud
 
Why Work at ChinaNetCloud
Why Work at ChinaNetCloudWhy Work at ChinaNetCloud
Why Work at ChinaNetCloud
ChinaNetCloud
 
Cloud Operations Challenges - Talk by ChinaNetCloud at Joint Cisco event
Cloud Operations Challenges - Talk by ChinaNetCloud at Joint Cisco eventCloud Operations Challenges - Talk by ChinaNetCloud at Joint Cisco event
Cloud Operations Challenges - Talk by ChinaNetCloud at Joint Cisco event
ChinaNetCloud
 
Automatically Managing Internet Operations In The Cloud - 云计算平台的自动化运维
Automatically Managing  Internet Operations  In The Cloud - 云计算平台的自动化运维Automatically Managing  Internet Operations  In The Cloud - 云计算平台的自动化运维
Automatically Managing Internet Operations In The Cloud - 云计算平台的自动化运维
ChinaNetCloud
 
ChinaNetCloud - Aliyun Joint Event on Cloud Operations
ChinaNetCloud - Aliyun Joint Event on Cloud Operations ChinaNetCloud - Aliyun Joint Event on Cloud Operations
ChinaNetCloud - Aliyun Joint Event on Cloud Operations
ChinaNetCloud
 
Clouds in China
Clouds in ChinaClouds in China
Clouds in China
ChinaNetCloud
 
ChinaNetCloud - Public Clouds in China Overview
ChinaNetCloud - Public Clouds in China OverviewChinaNetCloud - Public Clouds in China Overview
ChinaNetCloud - Public Clouds in China Overview
ChinaNetCloud
 

More from ChinaNetCloud (20)

AWS ELB Tips & Best Practices
AWS ELB Tips & Best PracticesAWS ELB Tips & Best Practices
AWS ELB Tips & Best Practices
 
OpsStack--Integrated Operation Platform
OpsStack--Integrated Operation PlatformOpsStack--Integrated Operation Platform
OpsStack--Integrated Operation Platform
 
ChinaNetCloud Online Lecture:Something About Tshark
ChinaNetCloud Online Lecture:Something About TsharkChinaNetCloud Online Lecture:Something About Tshark
ChinaNetCloud Online Lecture:Something About Tshark
 
ChinaNetCloud Online Lecture: Fight Against External Attacks From Different L...
ChinaNetCloud Online Lecture: Fight Against External Attacks From Different L...ChinaNetCloud Online Lecture: Fight Against External Attacks From Different L...
ChinaNetCloud Online Lecture: Fight Against External Attacks From Different L...
 
Steve Mushero on Entrepreneurship - 创业 - 崔牛会
Steve Mushero on Entrepreneurship - 创业 - 崔牛会Steve Mushero on Entrepreneurship - 创业 - 崔牛会
Steve Mushero on Entrepreneurship - 创业 - 崔牛会
 
Dev-Ops与Docker的最佳实践 QCon2016 北京站演讲
Dev-Ops与Docker的最佳实践 QCon2016 北京站演讲Dev-Ops与Docker的最佳实践 QCon2016 北京站演讲
Dev-Ops与Docker的最佳实践 QCon2016 北京站演讲
 
云中漫步 颠覆创新_创业邦春季创新峰会主题演讲 Cloud Innovation in China
云中漫步 颠覆创新_创业邦春季创新峰会主题演讲 Cloud Innovation in China云中漫步 颠覆创新_创业邦春季创新峰会主题演讲 Cloud Innovation in China
云中漫步 颠覆创新_创业邦春季创新峰会主题演讲 Cloud Innovation in China
 
运维安全 抵抗黑客攻击_云络安全沙龙4月上海站主题分享
运维安全 抵抗黑客攻击_云络安全沙龙4月上海站主题分享运维安全 抵抗黑客攻击_云络安全沙龙4月上海站主题分享
运维安全 抵抗黑客攻击_云络安全沙龙4月上海站主题分享
 
AWS Summit OaaS Talk by ChinaNetCloud
AWS Summit OaaS Talk by ChinaNetCloudAWS Summit OaaS Talk by ChinaNetCloud
AWS Summit OaaS Talk by ChinaNetCloud
 
Running Internet Systems in China - The Details You Need to Succeed in Chines...
Running Internet Systems in China - The Details You Need to Succeed in Chines...Running Internet Systems in China - The Details You Need to Succeed in Chines...
Running Internet Systems in China - The Details You Need to Succeed in Chines...
 
Making Internet Operations Easier
Making Internet Operations EasierMaking Internet Operations Easier
Making Internet Operations Easier
 
Internet Cloud Operations - ChinaNetcloud & AWS Event Beijing
Internet Cloud Operations - ChinaNetcloud & AWS Event BeijingInternet Cloud Operations - ChinaNetcloud & AWS Event Beijing
Internet Cloud Operations - ChinaNetcloud & AWS Event Beijing
 
Big Data Security (ChinaNetCloud - Guiyang Conference)
Big Data Security (ChinaNetCloud - Guiyang Conference)Big Data Security (ChinaNetCloud - Guiyang Conference)
Big Data Security (ChinaNetCloud - Guiyang Conference)
 
Internet System Security Overview
Internet System Security OverviewInternet System Security Overview
Internet System Security Overview
 
Why Work at ChinaNetCloud
Why Work at ChinaNetCloudWhy Work at ChinaNetCloud
Why Work at ChinaNetCloud
 
Cloud Operations Challenges - Talk by ChinaNetCloud at Joint Cisco event
Cloud Operations Challenges - Talk by ChinaNetCloud at Joint Cisco eventCloud Operations Challenges - Talk by ChinaNetCloud at Joint Cisco event
Cloud Operations Challenges - Talk by ChinaNetCloud at Joint Cisco event
 
Automatically Managing Internet Operations In The Cloud - 云计算平台的自动化运维
Automatically Managing  Internet Operations  In The Cloud - 云计算平台的自动化运维Automatically Managing  Internet Operations  In The Cloud - 云计算平台的自动化运维
Automatically Managing Internet Operations In The Cloud - 云计算平台的自动化运维
 
ChinaNetCloud - Aliyun Joint Event on Cloud Operations
ChinaNetCloud - Aliyun Joint Event on Cloud Operations ChinaNetCloud - Aliyun Joint Event on Cloud Operations
ChinaNetCloud - Aliyun Joint Event on Cloud Operations
 
Clouds in China
Clouds in ChinaClouds in China
Clouds in China
 
ChinaNetCloud - Public Clouds in China Overview
ChinaNetCloud - Public Clouds in China OverviewChinaNetCloud - Public Clouds in China Overview
ChinaNetCloud - Public Clouds in China Overview
 

Recently uploaded

How to Use Contact Form 7 Like a Pro.pptx
How to Use Contact Form 7 Like a Pro.pptxHow to Use Contact Form 7 Like a Pro.pptx
How to Use Contact Form 7 Like a Pro.pptx
Gal Baras
 
Multi-cluster Kubernetes Networking- Patterns, Projects and Guidelines
Multi-cluster Kubernetes Networking- Patterns, Projects and GuidelinesMulti-cluster Kubernetes Networking- Patterns, Projects and Guidelines
Multi-cluster Kubernetes Networking- Patterns, Projects and Guidelines
Sanjeev Rampal
 
Latest trends in computer networking.pptx
Latest trends in computer networking.pptxLatest trends in computer networking.pptx
Latest trends in computer networking.pptx
JungkooksNonexistent
 
This 7-second Brain Wave Ritual Attracts Money To You.!
This 7-second Brain Wave Ritual Attracts Money To You.!This 7-second Brain Wave Ritual Attracts Money To You.!
This 7-second Brain Wave Ritual Attracts Money To You.!
nirahealhty
 
原版仿制(uob毕业证书)英国伯明翰大学毕业证本科学历证书原版一模一样
原版仿制(uob毕业证书)英国伯明翰大学毕业证本科学历证书原版一模一样原版仿制(uob毕业证书)英国伯明翰大学毕业证本科学历证书原版一模一样
原版仿制(uob毕业证书)英国伯明翰大学毕业证本科学历证书原版一模一样
3ipehhoa
 
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptx
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptxBridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptx
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptx
Brad Spiegel Macon GA
 
APNIC Foundation, presented by Ellisha Heppner at the PNG DNS Forum 2024
APNIC Foundation, presented by Ellisha Heppner at the PNG DNS Forum 2024APNIC Foundation, presented by Ellisha Heppner at the PNG DNS Forum 2024
APNIC Foundation, presented by Ellisha Heppner at the PNG DNS Forum 2024
APNIC
 
guildmasters guide to ravnica Dungeons & Dragons 5...
guildmasters guide to ravnica Dungeons & Dragons 5...guildmasters guide to ravnica Dungeons & Dragons 5...
guildmasters guide to ravnica Dungeons & Dragons 5...
Rogerio Filho
 
Comptia N+ Standard Networking lesson guide
Comptia N+ Standard Networking lesson guideComptia N+ Standard Networking lesson guide
Comptia N+ Standard Networking lesson guide
GTProductions1
 
1.Wireless Communication System_Wireless communication is a broad term that i...
1.Wireless Communication System_Wireless communication is a broad term that i...1.Wireless Communication System_Wireless communication is a broad term that i...
1.Wireless Communication System_Wireless communication is a broad term that i...
JeyaPerumal1
 
一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理
一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理
一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理
ufdana
 
History+of+E-commerce+Development+in+China-www.cfye-commerce.shop
History+of+E-commerce+Development+in+China-www.cfye-commerce.shopHistory+of+E-commerce+Development+in+China-www.cfye-commerce.shop
History+of+E-commerce+Development+in+China-www.cfye-commerce.shop
laozhuseo02
 
一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理
一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理
一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理
eutxy
 
一比一原版(SLU毕业证)圣路易斯大学毕业证成绩单专业办理
一比一原版(SLU毕业证)圣路易斯大学毕业证成绩单专业办理一比一原版(SLU毕业证)圣路易斯大学毕业证成绩单专业办理
一比一原版(SLU毕业证)圣路易斯大学毕业证成绩单专业办理
keoku
 
Internet-Security-Safeguarding-Your-Digital-World (1).pptx
Internet-Security-Safeguarding-Your-Digital-World (1).pptxInternet-Security-Safeguarding-Your-Digital-World (1).pptx
Internet-Security-Safeguarding-Your-Digital-World (1).pptx
VivekSinghShekhawat2
 
BASIC C++ lecture NOTE C++ lecture 3.pptx
BASIC C++ lecture NOTE C++ lecture 3.pptxBASIC C++ lecture NOTE C++ lecture 3.pptx
BASIC C++ lecture NOTE C++ lecture 3.pptx
natyesu
 
1比1复刻(bath毕业证书)英国巴斯大学毕业证学位证原版一模一样
1比1复刻(bath毕业证书)英国巴斯大学毕业证学位证原版一模一样1比1复刻(bath毕业证书)英国巴斯大学毕业证学位证原版一模一样
1比1复刻(bath毕业证书)英国巴斯大学毕业证学位证原版一模一样
3ipehhoa
 
test test test test testtest test testtest test testtest test testtest test ...
test test  test test testtest test testtest test testtest test testtest test ...test test  test test testtest test testtest test testtest test testtest test ...
test test test test testtest test testtest test testtest test testtest test ...
Arif0071
 
The+Prospects+of+E-Commerce+in+China.pptx
The+Prospects+of+E-Commerce+in+China.pptxThe+Prospects+of+E-Commerce+in+China.pptx
The+Prospects+of+E-Commerce+in+China.pptx
laozhuseo02
 
急速办(bedfordhire毕业证书)英国贝德福特大学毕业证成绩单原版一模一样
急速办(bedfordhire毕业证书)英国贝德福特大学毕业证成绩单原版一模一样急速办(bedfordhire毕业证书)英国贝德福特大学毕业证成绩单原版一模一样
急速办(bedfordhire毕业证书)英国贝德福特大学毕业证成绩单原版一模一样
3ipehhoa
 

Recently uploaded (20)

How to Use Contact Form 7 Like a Pro.pptx
How to Use Contact Form 7 Like a Pro.pptxHow to Use Contact Form 7 Like a Pro.pptx
How to Use Contact Form 7 Like a Pro.pptx
 
Multi-cluster Kubernetes Networking- Patterns, Projects and Guidelines
Multi-cluster Kubernetes Networking- Patterns, Projects and GuidelinesMulti-cluster Kubernetes Networking- Patterns, Projects and Guidelines
Multi-cluster Kubernetes Networking- Patterns, Projects and Guidelines
 
Latest trends in computer networking.pptx
Latest trends in computer networking.pptxLatest trends in computer networking.pptx
Latest trends in computer networking.pptx
 
This 7-second Brain Wave Ritual Attracts Money To You.!
This 7-second Brain Wave Ritual Attracts Money To You.!This 7-second Brain Wave Ritual Attracts Money To You.!
This 7-second Brain Wave Ritual Attracts Money To You.!
 
原版仿制(uob毕业证书)英国伯明翰大学毕业证本科学历证书原版一模一样
原版仿制(uob毕业证书)英国伯明翰大学毕业证本科学历证书原版一模一样原版仿制(uob毕业证书)英国伯明翰大学毕业证本科学历证书原版一模一样
原版仿制(uob毕业证书)英国伯明翰大学毕业证本科学历证书原版一模一样
 
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptx
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptxBridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptx
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptx
 
APNIC Foundation, presented by Ellisha Heppner at the PNG DNS Forum 2024
APNIC Foundation, presented by Ellisha Heppner at the PNG DNS Forum 2024APNIC Foundation, presented by Ellisha Heppner at the PNG DNS Forum 2024
APNIC Foundation, presented by Ellisha Heppner at the PNG DNS Forum 2024
 
guildmasters guide to ravnica Dungeons & Dragons 5...
guildmasters guide to ravnica Dungeons & Dragons 5...guildmasters guide to ravnica Dungeons & Dragons 5...
guildmasters guide to ravnica Dungeons & Dragons 5...
 
Comptia N+ Standard Networking lesson guide
Comptia N+ Standard Networking lesson guideComptia N+ Standard Networking lesson guide
Comptia N+ Standard Networking lesson guide
 
1.Wireless Communication System_Wireless communication is a broad term that i...
1.Wireless Communication System_Wireless communication is a broad term that i...1.Wireless Communication System_Wireless communication is a broad term that i...
1.Wireless Communication System_Wireless communication is a broad term that i...
 
一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理
一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理
一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理
 
History+of+E-commerce+Development+in+China-www.cfye-commerce.shop
History+of+E-commerce+Development+in+China-www.cfye-commerce.shopHistory+of+E-commerce+Development+in+China-www.cfye-commerce.shop
History+of+E-commerce+Development+in+China-www.cfye-commerce.shop
 
一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理
一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理
一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理
 
一比一原版(SLU毕业证)圣路易斯大学毕业证成绩单专业办理
一比一原版(SLU毕业证)圣路易斯大学毕业证成绩单专业办理一比一原版(SLU毕业证)圣路易斯大学毕业证成绩单专业办理
一比一原版(SLU毕业证)圣路易斯大学毕业证成绩单专业办理
 
Internet-Security-Safeguarding-Your-Digital-World (1).pptx
Internet-Security-Safeguarding-Your-Digital-World (1).pptxInternet-Security-Safeguarding-Your-Digital-World (1).pptx
Internet-Security-Safeguarding-Your-Digital-World (1).pptx
 
BASIC C++ lecture NOTE C++ lecture 3.pptx
BASIC C++ lecture NOTE C++ lecture 3.pptxBASIC C++ lecture NOTE C++ lecture 3.pptx
BASIC C++ lecture NOTE C++ lecture 3.pptx
 
1比1复刻(bath毕业证书)英国巴斯大学毕业证学位证原版一模一样
1比1复刻(bath毕业证书)英国巴斯大学毕业证学位证原版一模一样1比1复刻(bath毕业证书)英国巴斯大学毕业证学位证原版一模一样
1比1复刻(bath毕业证书)英国巴斯大学毕业证学位证原版一模一样
 
test test test test testtest test testtest test testtest test testtest test ...
test test  test test testtest test testtest test testtest test testtest test ...test test  test test testtest test testtest test testtest test testtest test ...
test test test test testtest test testtest test testtest test testtest test ...
 
The+Prospects+of+E-Commerce+in+China.pptx
The+Prospects+of+E-Commerce+in+China.pptxThe+Prospects+of+E-Commerce+in+China.pptx
The+Prospects+of+E-Commerce+in+China.pptx
 
急速办(bedfordhire毕业证书)英国贝德福特大学毕业证成绩单原版一模一样
急速办(bedfordhire毕业证书)英国贝德福特大学毕业证成绩单原版一模一样急速办(bedfordhire毕业证书)英国贝德福特大学毕业证成绩单原版一模一样
急速办(bedfordhire毕业证书)英国贝德福特大学毕业证成绩单原版一模一样
 

ChinaNetCloud - The Zabbix Database - Zabbix Conference 2014

  • 1. Running the World’s Internet Servers Loving the Database By Steve Mushero September, 2014 Running the World’s Internet Servers www.ChinaNetCloud.com Zabbix Build & Manage Servers  Optimize & Manage Servers  Managed Cloud Servers Copyright © 2014 ChinaNetCloud
  • 2. Running the World’s Internet Servers www.ChinaNetCloud.com Greetings I’m Steve I’m from Shanghai, China We have a big Internet there We have a big business We have a big monitoring system That’s Zabbix We love the Database Let me tell you more about it . . . 2 Hello
  • 3. Our System & Database Zabbix 1.8.3 – Going to 2.2 now 1250 hosts, 300K items, 550 new values/sec MySQL Percona 5.5 w/ very optimized config Will go to 5.6 when 5.6 is more stable No query cache Main DB about 200GB in size History is about 2 billion rows of data, 157GB size Trends is 500 million, 37GB size Backups take 8 hours (including compress/encrypt) Has 48GB of Innodb Buffer (will expand to 64+ soon) Will go to 96, 128, 256 over time Generally runs well, except I/O bottleneck Running the World’s Internet Servers www.ChinaNetCloud.com 3
  • 4. Running the World’s Internet Servers www.ChinaNetCloud.com Zabbix Database MySQL for us Not Postgres/Oracle – No idea about them 1.8 & 2.2 Versions Very familiar with 1.8.3 Will try to focus on 2.2 Overall, very logical Easy to use/query Very powerful Some weird things Some complex things Key values in defines.inc.php I have a 2.2 ERD Model for you ! 4
  • 5. Database Overview Running the World’s Internet Servers www.ChinaNetCloud.com ~111 Tables Well-structured & logical All about foreign keys for joins All useful joins on ID objectid, e.g. hostid, itemid All timestamps are epoch time Always called ‘clock’ in any table Use MySQL unix_timestamp() and from_unixtime() Several billion-row tables Careful with these ! 5
  • 6. Running the World’s Internet Servers www.ChinaNetCloud.com Key Subsystems Hosts (& Templates) Items (& Templates) Functions Triggers Events Users Trends & History Macros, Profiles, HostGroups Graphs, Screens Queue 6
  • 7. Running the World’s Internet Servers www.ChinaNetCloud.com Core Subsystems Need to know/understand in detail Hosts (& Templates) Items (& Templates) Triggers (& Functions) Events Trends & History 7 Core
  • 8. Hosts, Items & Templates I Critical to understand the relationships Hosts & Items are the core of the system Running the World’s Internet Servers www.ChinaNetCloud.com Hosts have items Items drive everything Agent Type, Data Type, Intervals, etc. SQL to look at a host & items: SELECT host, key_, description, from_unixtime(lastclock), lastvalue FROM items i JOIN hosts h ON i.hostid = h.hostid WHERE h.status = 0 AND i.status = 0 HOST KEY_ DESCRIPTION LASTCLOCK LASTVALUE srv-nc-webdav1 agent.ping agent-ping 2014-08-19 02:50:44 1 srv-nc-webdav1 agent.version Version of zabbix_agent(d) 2014-08-19 02:25:13 1.8.3 srv-nc-webdav1 apache[busyworkers] Apache Busy Workers 2014-08-19 02:50:16 1.000000 items.lastvalue removed in 2.2 – Pull from history table 8
  • 9. Hosts, Items & Templates II Templates are just special hosts (status=3) Templated items are special items Templateid=0 and hosts.status=3 A host’s non-template items also templateid=0 so careful On a host, an item is from a template if templateid>0 Templateid is for an ITEM, not the template Must join (host’s item to template’s item to template) to get template name Running the World’s Internet Servers www.ChinaNetCloud.com 9
  • 10. Hosts, Items & Templates III Hosts have attached Templates Running the World’s Internet Servers www.ChinaNetCloud.com One or more per host Host’s templated items are copied from Template Template Items are COPIED to the host VERY important to understand this relationship Important to understand what can be changed at host level So 10 hosts with a template of 10 items 110 items total in the system (10 + 10x10) HOST HOST_STATUS ITEMID TEMPLATEID KEY_ NC_Template_Linux 3 24130 0 agent.version srv-nc-webdav1 0 23110 24130 agent.version srv-nc-dns1 0 23314 24130 agent.version SELECT host, h.status, itemid, templateid, key_ FROM items i JOIN hosts h ON i.hostid = h.hostid WHERE (h.status = 0 OR h.status = 3) AND i.status = 0 AND (i.itemid = 24130 OR i.templateid = 24130) ORDER by templateid 10
  • 11. Triggers, Events, Functions, and Items I Another key set of relationships Events are Trigger status changes Basically the alerts you see on dashboard Drive actions, emails, dashboard Triggers are logic that finds problems Contain the logic Expression Running the World’s Internet Servers www.ChinaNetCloud.com String with fomula Based on Functions Functions are the Zabbix functions 11
  • 12. Triggers, Events, Functions, and Items II Functions contain items and the function Running the World’s Internet Servers www.ChinaNetCloud.com Last, avg, etc. Items link to hosts, etc. Triggers can be multi-host This complicates logic Hard to link Trigger to a Host – big SQL Example: TRIGGERID EXPRESSION DESCRIPTION 10048 {1003079}/{1003080}*100<10 Lack of free memory on server {HOSTNAME} 10056 {1003227}>300 Too many processes on {HOSTNAME} FUNCTIONID ITEMID TRIGGERID FUNCTION PARAMETER 1003079 10090 10048 last 0 12
  • 13. Now that you understand all that We’ll talk about some tables Running the World’s Internet Servers www.ChinaNetCloud.com 13
  • 14. Running the World’s Internet Servers www.ChinaNetCloud.com Hosts Core table Hosts are Hosts Templates are also Hosts hosts.status = 3 Proxies are also Hosts Hosts.status = 5 Don’t confuse with agent on proxy host Hosts are Enabled/Disabled hosts.status = 0 or 1 Hosts can be in unreachable state hosts.status = 2 Not clear this is fully used Join items to hosts to get host/template name 14
  • 15. Running the World’s Internet Servers www.ChinaNetCloud.com Hosts – SQL List active hosts SELECT hostid, proxy_hostid, host, ip, port, status FROM hosts WHERE status = 0 ORDER BY host HOSTID STATUS HOST IP PORT STATUS 10057 0 srv-nc-web1 60.139.13.43 40067 0 10058 0 srv-nc-web2 223.173.38.47 13050 0 10059 0 srv-nc-web3 223.213.91.96 20450 0 15
  • 16. Running the World’s Internet Servers www.ChinaNetCloud.com Items Core table Linked to hosts on hostid Have Type Agent, SNMP, Internal, Simple, IPMI, ec. Enabled/Disabled, Error items.status = 0 or 1 Can be in error, items.status = 3 Lastclock tells last collect time in 1.8 Very useful for pulling data, stats, issues Not clear where this went in 2.2 LastValue has last value in 1.8 In 2.2 you must get from Trend table, annoying 16
  • 17. Running the World’s Internet Servers www.ChinaNetCloud.com Items Either Host or Template Level If from Template, COPIED from Template Some fields can be changed at host level - Enabled, Interval, History/Trend Retention, Application, Group But OVERWRITTEN if you update the Template templateid = itemid on the Template SELECT host, h.status, itemid, templateid, key_ FROM items i JOIN hosts h ON i.hostid = h.hostid WHERE h.status = 0 AND i.status = 0 ORDER by host, key_ HOST STATUS ITEMID TEMPLATEID KEY_ srv-nc-def1 0 227843 22934 agent.ping srv-nc-def1 0 216864 44130 agent.version srv-nc-def1 0 216864 0 local.thing 17
  • 18. Running the World’s Internet Servers www.ChinaNetCloud.com Items – Get Data Get data from items (Ver 1.8) /* Get small swap servers with swap used */ SELECT host, i.lastvalue AS Swap_Size, 100-ii.lastvalue as Swap_Used FROM items i JOIN hosts h ON i.hostid = h.hostid JOIN items ii on h.hostid = ii.hostid WHERE i.templateid = 24172 /* Swap size */ AND i.lastvalue > (1 * 1024 * 1024 * 1024) AND i.lastvalue < (88 * 1024 * 1024 * 1024) AND h.status = 0 AND i.status = 0 AND ii.status = 0 AND ii.templateid = 154766 /* swap % free */ AND (100-ii.lastvalue) > 10 ORDER BY Swap_Used DESC 18
  • 19. Running the World’s Internet Servers www.ChinaNetCloud.com Items – Get Data items.lastvalue & lastclock removed in 2.2 – Pull from history VERY long query – about 5 pages (many parts removed): SELECT (case when (i.value_type = 0) then (select history.value from history where (history.itemid = i.itemid) order by history.clock desc limit 1) when (i.value_type = 1) … (i.value_type = 2) … (i.value_type = 3) … (i.value_type = 4) … end) AS lastvalue, (case when (i.value_type = 0) then (select history.clock from history where (history.itemid = i.itemid) order by history.clock desc limit 1) when (i.value_type = 1) (i.value_type = 2) (i.value_type = 3) (i.value_type = 4) end) AS lastclock from items i where itemid = 23110 19
  • 20. Running the World’s Internet Servers www.ChinaNetCloud.com Zabbix Queue Items can be in a ‘queue’ Seen on Admin|Queue screen Not a real queue ! Just a list of late items May change in Version 2.2 Now() > (lastclock + interval) Max ‘queue’ is # of active items Stuff can get stuck if error & host disabled Hosts disabled but items enabled Lots of good SQL for reports Queue size, oldest items, queue by host Queue by proxy for graphs / triggers 20
  • 21. Running the World’s Internet Servers www.ChinaNetCloud.com Triggers I Core table Linked to functions Functions link to items Items link to hosts Functions Last, min, max, sum, nodata, etc. Enabled/Disabled, Error triggers.status = 0 or 1 21
  • 22. Running the World’s Internet Servers www.ChinaNetCloud.com Triggers II Value – OK or Problem Also UNKNOWN in 1.8 Behavior changed in 2.0/2.2 Priority is here URL is here Templateid tells you if came from template ID is of parent Trigger in the template Dependencies are here, complicated Trigger_up is trigger we depend on Trigger_down is dependent trigger (this trigger?) Trigger Level – 0 for no dependency 22
  • 23. Running the World’s Internet Servers www.ChinaNetCloud.com Events The Alerts you see on Dashboard Basically a trigger changing status Also include auto-discovery, etc. Triggered events Source = 0, Object = 0 Objectid will match the trigger Status tells you Trigger status We tie Alert Tickets to this Note: Server re-creates events on restart Copies over ACKs Very annoying if you tie things to eventid We have special PHP to rebuild this relationship 23
  • 24. Events – ACK & Duration ACKs set flag in Event DB row And ACK data in acknowledges table Finding Event duration is HARD Basically scan forward for next OK event Slow and messy Important for metrics select distinct SUBSTRING_INDEX(from_unixtime(e.clock),' ',1) AlertDate, SUBSTRING_INDEX(from_unixtime(e.clock),' ',-1) AlertTime, (select floor((eb.clock-e.clock)/60) from events eb where value = 0 and eb.eventid > e.eventid and eb.objectid=e.objectid order by eb.eventid limit 1 ) as Duration, h.host, t.description, t.priority, from_unixtime(a.clock) ACK, u.name, a.acknowledgeid, a.message, floor((a.clock-e.clock)/60) response_time from triggers t join functions f on f.triggerid=t.triggerid right join items i on f.itemid=i.itemid join hosts h on h.hostid=i.hostid right join events e on t.triggerid=e.objectid left join acknowledges a on a.eventid=e.eventid left join users u on u.userid=a.userid where e.value=1 and t.triggerid<>19072 and e.clock>unix_timestamp('2010-04-01 09:00:00') and e.clock<unix_timestamp('2010-04-01 18:00:00') order by e.clock; Running the World’s Internet Servers www.ChinaNetCloud.com 24
  • 25. Running the World’s Internet Servers www.ChinaNetCloud.com History & Trends This is the data we collect Drives the graphs Data first goes to History tables By type – uint, text, double, etc. Server moves to Trends tables On schedule based on Item config Summarizes each hour Saves min, max, average Trends table purged by Housekeeper Can partition in DB for much faster purge Special SQL can also purge, but I/O heavy SQL in History/Trends painful Lots of random I/O Ideally data fits in server RAM or have SSD 25
  • 26. Running the World’s Internet Servers www.ChinaNetCloud.com Web Checks A bit complex – 4 tables Link into the rest of the system Httptest Test name, status, and interval, last check, and what seems like response time, and error text httpstep tables Step name, URL, timeout, response code, required text Httptestitem Links to items, creates two items of type 9 for each test - Download Speed & Failed Step (type 2 & 3 in this table) httpstepitem Links to items, creates 3 items of type 9 for each step - Download Speed, Response Time, Response Code Other Note Item History & Trends set to 30 & 90, but can't seem to be edited anywhere 26
  • 27. Graphs, Screens & Slideshows Pretty part of the system Running the World’s Internet Servers www.ChinaNetCloud.com Basic tables screens & screen_items Use resourcetype to know how to link Each type can link to underlying graph, item, etc. Complex security Based on item/host permissions 27
  • 28. Users & Security - I Running the World’s Internet Servers www.ChinaNetCloud.com users table alias field is the actual user name usrgrp table All roles/permissions tied here, API, GUI, etc. rights table Links usrgrp to server groups with RO, R/W permission profiles table Use not clear as thousands of rows per user Think its drives dashboard and other modules settings sessions table Basic user session tied to cookie mysql> update users set passwd=MD5('somepassword') where alias='Admin'; 28
  • 29. Users & Security - II Running the World’s Internet Servers www.ChinaNetCloud.com Enable/Disable Seems to disable by adding to disabled group Passwords MD5 has WITH TRAILING CARRIAGE RETURN, so use -n on echo: $ echo -n "somepassword" | md5sum 9c42a1346e333a770904b2a2b37fa7d3 But easier to use MySQL function: mysql> update users set passwd=MD5('somepassword') where alias='Admin'; Refresh field – For all screens Defaults very low, set to 0 or much higher Otherwise heavy load on DB 29
  • 30. Running the World’s Internet Servers www.ChinaNetCloud.com Users & Security Hosts a User has rights on SELECT ug.usrgrpid, ug.name AS user_group, g.name as host_group, host FROM users u JOIN users_groups ugl ON u.userid = ugl.userid JOIN usrgrp ug ON ugl.usrgrpid = ug.usrgrpid JOIN rights r ON ug.usrgrpid = r.groupid JOIN groups g ON r.id = g.groupid JOIN hosts_groups hg ON g.groupid = hg.groupid JOIN hosts h on hg.hostid = h.hostid where u.alias = 'steve.mushero' /* userid 116 */ /* group 15 is 24x7 user */ AND r.permission in (2,3); 30
  • 31. Config and User Profiles Refresh rates and DB load Dashboard refresh rates UPDATE profiles p JOIN users u ON p.userid = u.userid SET p.value_int = 300 WHERE idx LIKE 'web.dahsboard.rf_rate.%' AND u.alias not LIKE 'cust%'; Running the World’s Internet Servers www.ChinaNetCloud.com 31
  • 32. Running the World’s Internet Servers www.ChinaNetCloud.com Audit System Great, but useless GUI Data very fine-grained / detailed audit_log & audit_log_details Actions & Resources See code for list of values General SQL to get info: SELECT alias, from_unixtime(a.clock), a.action, a.resourcetype, a.details, a.resourceid, a.resourcename, ad.table_name, ad.field_name, CAST(ad.oldvalue AS UNSIGNED) AS oldvalue, CAST(ad.newvalue AS UNSIGNED) AS newvalue FROM auditlog a LEFT JOIN auditlog_details ad ON a.auditid = ad.auditid JOIN users u ON a.userid = u.userid WHERE resourceid = 109482 AND field_name = 'status' AND from_unixtime(clock) > '2013-05-01' 32
  • 33. Running the World’s Internet Servers www.ChinaNetCloud.com Audit System Big select to get details on hosts SELECT alias, from_unixtime(a.clock), CASE a.action WHEN 0 THEN "Added" WHEN 1 THEN "Updated" WHEN 2 THEN "Deleted" ELSE CAST(a.action AS CHAR) END AS action, CASE a.resourcetype WHEN 4 THEN "Host" WHEN 13 THEN "Item ?" WHEN 15 THEN "Item" ELSE CAST(a.resourcetype AS CHAR) END AS resource_type, a.details, a.resourceid, a.resourcename, ad.table_name, ad.field_name, CAST(ad.oldvalue AS CHAR), CASE ad.newvalue WHEN 0 THEN "Enable” WHEN 1 THEN "Disable" END as newvalue FROM auditlog a LEFT JOIN auditlog_details ad ON a.auditid = ad.auditid JOIN users u ON a.userid = u.userid WHERE from_unixtime(clock) > '2012-01-01' /* AND alias LIKE 'matt%' */ AND a.resourcetype = 4 /* Host */ AND (field_name = 'status' OR field_name IS NULL) /* AND ad.newvalue = 1 /* 1 = Disable */ /* AND resourcename LIKE '%web17%'*/; 33
  • 34. Running the World’s Internet Servers www.ChinaNetCloud.com Safety Reports We have dozens of these, such as: Items that differ from template Missing templates Disable items/hosts, forget to enable Alerts with no URL/Wiki Hosts missing profile data Items disabled conflict with trigger Web alerts with no trigger Web alerts with long/short timeouts Hosts in wrong, duplicate, conflicting groups Servers in Zabbix, not core system Many are quite complex, big SQL We will post these on-line After updating for Ver 2.2 34
  • 35. Running the World’s Internet Servers www.ChinaNetCloud.com Housekeeper Done by server every hour Very slow – Item by Item Thousands per second A LOT of I/O (mostly read) We have SQL to do in bulk But too heavy load on I/O system We’ll see on SSD, or all data in RAM 35
  • 36. Running the World’s Internet Servers www.ChinaNetCloud.com Backups Backup, of course, but big Ours are 8 hours 200GB of data At 1TB this is not manageable Need incrementals Maybe recent data only Do from slave You can backup config only Ignore history*, trends*, events, audit*, acknowledges 36
  • 37. Running the World’s Internet Servers www.ChinaNetCloud.com Summary We love the Zabbix database So should you Learn how everything connects Focus on key / core tables Have fun 37
  • 38. Thanks from ChinaNetCloud Pioneers in OaaS – Operations as a Service Running the World’s Internet Servers www.ChinaNetCloud.com 38
  • 39. ChinaNetCloud Shanghai Headquarters: X2 Space 1-601, 1238 Xietu Lu Shanghai, 200032 China T: +86-21-6422-1946 F: +86-21-6422-4911 Beijing Office: Lee World Business Building #305 57 Happiness Village Road, Chaoyang District Beijing, 100027 China Sales@ChinaNetCloud.com Running the World’s Internet Servers www.ChinaNetCloud.com www.ChinaNetCloud.com Silicon Valley Office: California Avenue Palo Alto, 94123 USA