1. Hardware Monitoring
with the new
IPMI Plugin v2
Werner Fischer, Technology Specialist Thomas-Krenn.AG
6. OSMC / Nuremberg / Germany
29th November 2011
2. Introduction
who I am who
I am not
Werner Fischer working for a Linux user Kernel or H/W
Server vendor since 2001 developer
slide 2/37
3. Introduction
who is
Server & accessories based in Freyung, serving all over Europe
"Made in Germany" Bavaria
slide 3/37
16. IPMI main features
Monitoring
(temp, fans, ...)
Recovery Control
(power on/off/reset)
Logging
(System Event Log)
Inventory
(FRU information)
slide 16/37
17. IPMI overview
access req.
username & Remote Mmgt. Card
(KVM over IP, ...)
ICMB
LAN
Connector
Serial
Connector
password Auxillary
IPMB Connector
ICMB
bridge
Chassis
PCI mgmt. bus IPMB mgmt.
NVS Storage (Satellite
SDR
Controller)
Network
LAN SEL
(LAN)
interface FRU
Controller
Baseboard FRU Temp.
Sensors & Controls
Management sensor
access req. Controller
Fan sensor
Temp. sensor
…
(BMC) Power control
root privileges Reset control
…
Chassis board
Serial BMC
Serial/Modem
Port Serial private mgmt. busses FRU
interface
Sharing Controller
FRU FRU
Redundant Power
M/B board
Temp. s.
Serial System
Controller interface Memory Processor
board board
System bus
Motherboard
slide 17/37
18. IPMI Channel Privilege Levels (LAN access)
Privilege Level Allows
User ● query sensors
Operator ● nearly all IPMI commands
● but no changing of out-of-
band interfaces
Administrator • all IPMI commands
use privilege level 'User'
for monitoring purposes
slide 18/37
19. Example: remote control with ipmitool
[user@adminpc ~]$ ipmitool I lan H 192.168.1.211
[user@adminpc ~]$ ipmitool I lan H 192.168.1.211
U admin power status
U admin power status
Password:
Password:
Chassis Power is off
Chassis Power is off
[user@adminpc ~]$
[user@adminpc ~]$
[user@adminpc ~]$ ipmitool I lan H 192.168.1.211
[user@adminpc ~]$ ipmitool I lan H 192.168.1.211
U admin power on
U admin power on
Password:
Password:
Chassis Power Control: Up/On
Chassis Power Control: Up/On
[user@adminpc ~]$
[user@adminpc ~]$
[user@adminpc ~]$ ipmitool I lan H 192.168.1.211
[user@adminpc ~]$ ipmitool I lan H 192.168.1.211
U admin power status
U admin power status
Password:
Password:
Chassis Power is on
Chassis Power is on
[user@adminpc ~]$
[user@adminpc ~]$
slide 19/37
20. IPMI Sensor Classes (1/2)
Discrete Threshold
multiple states possible: changes event status on:
● up to 15 states ● analog reading compared to
● each state is reflected by a bit threshold values
● multiple state bits can active
can provide: provides:
● generic states ● analog reading
● sensor-specific states of the sensor
● discr. threshold
comparison status bit
other class similar to discrete:
● OEM: discrete sensor where the
meaning of the states (offsets) are
OEM defined
slide 20/37
21. IPMI Sensor Classes (2/2)
Discrete Threshold
[root@test ~]# ipmitool sdr get "PS2 Status" [root@test ~]# ipmitool sdr get "Fan 1"
[root@test ~]# ipmitool sdr get "PS2 Status" [root@test ~]# ipmitool sdr get "Fan 1"
Sensor ID : PS2 Status (0x71) Sensor ID : Fan 1 (0x50)
Sensor ID : PS2 Status (0x71) Sensor ID : Fan 1 (0x50)
Entity ID : 10.2 (Power Supply) Entity ID : 29.1 (Fan Device)
Entity ID : 10.2 (Power Supply) Entity ID : 29.1 (Fan Device)
Sensor Type (Discrete): Power Supply Sensor Type (Analog) : Fan
Sensor Type (Discrete): Power Supply Sensor Type (Analog) : Fan
States Asserted : Power Supply Sensor Reading : 5719 (+/ 0) RPM
States Asserted : Power Supply Sensor Reading : 5719 (+/ 0) RPM
[Presence detected] Status : ok
[Presence detected] Status : ok
[Power Supply AC lost] Nominal Reading : 6708.000
[Power Supply AC lost] Nominal Reading : 6708.000
Assertion Events : Power Supply Normal Minimum : 2451.000
Assertion Events : Power Supply Normal Minimum : 2451.000
[Presence detected] Normal Maximum : 10965.000
[Presence detected] Normal Maximum : 10965.000
[Power Supply AC lost] Lower critical : 1720.000
[Power Supply AC lost] Lower critical : 1720.000
Assertions Enabled : Power Supply Lower noncritical : 1978.000
Assertions Enabled : Power Supply Lower noncritical : 1978.000
[Presence detected] Positive Hysteresis : 86.000
[Presence detected] Positive Hysteresis : 86.000
[Failure detected] Negative Hysteresis : 86.000
[Failure detected] Negative Hysteresis : 86.000
[Predictive failure] Minimum sensor range : Unspecified
[Predictive failure] Minimum sensor range : Unspecified
[Power Supply AC lost] Maximum sensor range : Unspecified
[Power Supply AC lost] Maximum sensor range : Unspecified
[...] Event Message Control : Perthreshold
[...] Event Message Control : Perthreshold
Deassertions Enabled : Power Supply Readable Thresholds : lcr lnc
Deassertions Enabled : Power Supply Readable Thresholds : lcr lnc
[...] Settable Thresholds : lcr lnc
[...] Settable Thresholds : lcr lnc
Threshold Read Mask : lcr lnc
Threshold Read Mask : lcr lnc
Assertion Events :
Assertion Events :
Assertions Enabled : lnc lcr
Assertions Enabled : lnc lcr
Deassertions Enabled : lnc lcr
Deassertions Enabled : lnc lcr
slide 21/37
22. IPMI Sensor Types
root@test:~# ipmisensors L
root@test:~# ipmisensors L
Temperature
Temperature
Voltage
Voltage
Current
Current
Fan
Fan
Physical_Security
Physical_Security
Platform_Security_Violation_Attempt
Platform_Security_Violation_Attempt
Processor
Processor
Power_Supply
Power_Supply
Power_Unit
Power_Unit
Cooling_Device
Cooling_Device
[…]
[…]
slide 22/37
23. Example: query sensors with FreeIPMI
[root@testserver ~]# ipmimonitoring
[root@testserver ~]# ipmimonitoring
Record_ID | Sensor Name | Sensor Group | Monitoring Status|
Record_ID | Sensor Name | Sensor Group | Monitoring Status|
Sensor Units | Sensor Reading
Sensor Units | Sensor Reading
[...]
[...]
17 | Fan 5 | Fan | Nominal | RPM | 9052.000000
17 | Fan 5 | Fan | Nominal | RPM | 9052.000000
18 | Fan 6 | Fan | Nominal | RPM | 8060.000000
18 | Fan 6 | Fan | Nominal | RPM | 8060.000000
19 | PS1 AC Current | Current | Nominal | A | 0.124000
19 | PS1 AC Current | Current | Nominal | A | 0.124000
20 | PS2 AC Current | Current | Nominal | A | 0.992000
20 | PS2 AC Current | Current | Nominal | A | 0.992000
[...]
[...]
36 | Physical Scrty | Physical Security | Critical | N/A |
36 | Physical Scrty | Physical Security | Critical | N/A |
'General Chassis Intrusion'
'General Chassis Intrusion'
slide 23/37
25. IPMI System Event Log (SEL)
• stored in non-volatile storage
[root@testserver ~]# ipmitool sel elist
[root@testserver ~]# ipmitool sel elist
40 | 06/21/2010 | 14:29:29 | Power Supply PS1 Status | Power Supply AC lost | Asserted
40 | 06/21/2010 | 14:29:29 | Power Supply PS1 Status | Power Supply AC lost | Asserted
54 | 06/21/2010 | 14:29:29 | Power Unit Power Redundancy | Fully Redundant
54 | 06/21/2010 | 14:29:29 | Power Unit Power Redundancy | Fully Redundant
68 | 06/21/2010 | 14:29:29 | Power Unit Power Redundancy | Redundancy Lost
68 | 06/21/2010 | 14:29:29 | Power Unit Power Redundancy | Redundancy Lost
7c | 06/21/2010 | 14:29:29 | Power Unit Power Redundancy | NonRedundant: Sufficient from Redundant
7c | 06/21/2010 | 14:29:29 | Power Unit Power Redundancy | NonRedundant: Sufficient from Redundant
[...]
[...]
2fc | 06/21/2010 | 15:20:32 | Physical Security Physical Scrty | General Chassis intrusion | Asserted
2fc | 06/21/2010 | 15:20:32 | Physical Security Physical Scrty | General Chassis intrusion | Asserted
[root@testserver ~]# ipmitool sel elist
[root@testserver ~]# ipmitool sel elist
Power Supply PS1 Status | Power Supply AC lost | Asserted
Power Supply PS1 Status | Power Supply AC lost | Asserted
Power Unit Power Redundancy | Fully Redundant
Power Unit Power Redundancy | Fully Redundant
Power Unit Power Redundancy | Redundancy Lost
Power Unit Power Redundancy | Redundancy Lost
Power Unit Power Redundancy | NonRedundant: Sufficient from Redundant
Power Unit Power Redundancy | NonRedundant: Sufficient from Redundant
[...]
[...]
Physical Security Physical Scrty | General Chassis intrusion | Asserted
Physical Security Physical Scrty | General Chassis intrusion | Asserted
slide 25/37
26. Agenda
1) IPMI overview
2) Plugin implementation
3) Live demo
4) Common pitfalls
slide 26/37