SlideShare a Scribd company logo
1 of 37
Download to read offline
Hardware Monitoring
with the new
IPMI Plugin v2
Werner Fischer, Technology Specialist Thomas-Krenn.AG

6. OSMC / Nuremberg / Germany
29th November 2011
Introduction

                  who I am                        who
                                               I am not
 Werner Fischer   working for a   Linux user   Kernel or H/W
                  Server vendor   since 2001     developer




                                                 slide 2/37
Introduction

              who is
 Server & accessories   based in Freyung,   serving all over Europe
  "Made in Germany"         Bavaria




                                                      slide 3/37
Some questions ...

        Should I use monitoring?




                                   slide 4/37
Some questions ...

        Should I use monitoring?
        It depends on what you
      want to do in your free time...
     remember yesterday evening?
                     ✔          ✘
                                    slide 5/37
Some questions ...

      All drives of the RAID 6 O.K.?


          ✔     ✔    ✔     ✔




                                   slide 6/37
Some questions ...

      All network connections O.K.?

                            ✔
                      ✔




                                  slide 7/37
Some questions ...

              All FANs O.K.?




               ?
                               slide 8/37
Some questions ...

         All power supplies O.K.?




               ?
                                    slide 9/37
Some questions ...

   Can we monitor all these servers?




                             any IPMI
                            compatible
                              server


                                   slide 10/37
Some questions ...

   Can we monitor all these servers?
              Easily?




                             any IPMI
                            compatible
                              server


                                   slide 11/37
Some questions ...

   Can we monitor all these servers?
              Easily?
        With one single tool?



                             any IPMI
                            compatible
                              server


                                   slide 12/37
Some questions ...

   Can we monitor all these servers?
              Easily?
        With one single tool?




                                slide 13/37
Agenda

 1) IPMI overview

 2) Plugin implementation

 3) Live demo

 4) Common pitfalls




                            slide 14/37
Intelligent Platform Management Interface

  • IPMI developed by Intel, HP, NEC, Dell
    – 1998: IPMI v1.0
    – 2001: IPMI v1.5
    – 2004: IPMI v2.0




                                       slide 15/37
IPMI main features


                        Monitoring
                      (temp, fans, ...)


                 Recovery Control
                 (power on/off/reset)


                      Logging
                 (System Event Log)


                       Inventory
                     (FRU information)
                                          slide 16/37
IPMI overview
                             access req.
                             username &                        Remote Mmgt. Card
                                                                (KVM over IP, ...)
                                                                                                     ICMB

    LAN
  Connector
                  Serial
                Connector
                              password                              Auxillary
                                                                 IPMB Connector
                                                                                        ICMB
                                                                                        bridge

                                                                                                                  Chassis
                                     PCI mgmt. bus                                     IPMB                        mgmt.
                                                                             NVS Storage                          (Satellite
                                                                                SDR
                                                                                                                 Controller)
  Network
                           LAN                                                  SEL
   (LAN)
                        interface                                               FRU
  Controller
                                             Baseboard                                                         FRU        Temp.
                                                                          Sensors & Controls
                                            Management                                                                    sensor
 access req.                                 Controller
                                                                             Fan sensor
                                                                            Temp. sensor
                                                                                                                            …

                                               (BMC)                        Power control
root privileges                                                             Reset control
                                                                                  …
                                                                                                                Chassis board


                  Serial                      BMC
                             Serial/Modem
                   Port                      Serial                       private mgmt. busses                 FRU
                               interface
                 Sharing                    Controller
                                                                             FRU              FRU
                                                                                                              Redundant Power
                  M/B                                                                                              board
                                                                            Temp. s.
                 Serial                                  System
                Controller                               interface         Memory        Processor
                                                                            board          board
                                        System bus
  Motherboard


                                                                                                            slide 17/37
IPMI Channel Privilege Levels (LAN access)

Privilege Level Allows
User            ● query sensors


Operator        ● nearly all IPMI commands

                ● but no changing of out-of-

                  band interfaces
Administrator   • all IPMI commands

             use privilege level 'User'
             for monitoring purposes


                                          slide 18/37
Example: remote control with ipmitool
    [user@adminpc ~]$ ipmitool ­I lan ­H 192.168.1.211 
     [user@adminpc ~]$ ipmitool ­I lan ­H 192.168.1.211 
                      ­U admin power status
                       ­U admin power status
    Password:
     Password:
    Chassis Power is off
     Chassis Power is off
    [user@adminpc ~]$
     [user@adminpc ~]$
    [user@adminpc ~]$ ipmitool ­I lan ­H 192.168.1.211 
     [user@adminpc ~]$ ipmitool ­I lan ­H 192.168.1.211 
                      ­U admin power on
                       ­U admin power on
    Password:
     Password:
    Chassis Power Control: Up/On
     Chassis Power Control: Up/On
    [user@adminpc ~]$
     [user@adminpc ~]$
    [user@adminpc ~]$ ipmitool ­I lan ­H 192.168.1.211 
     [user@adminpc ~]$ ipmitool ­I lan ­H 192.168.1.211 
                      ­U admin power status
                       ­U admin power status
    Password:
     Password:
    Chassis Power is on
     Chassis Power is on
    [user@adminpc ~]$
     [user@adminpc ~]$




                                                      slide 19/37
IPMI Sensor Classes (1/2)
Discrete                                Threshold
multiple states possible:               changes event status on:
● up to 15 states                       ● analog reading compared to

● each state is reflected by a bit        threshold values
● multiple state bits can active




can provide:                            provides:
● generic states                        ● analog reading

● sensor-specific states                  of the sensor
                                        ● discr. threshold

                                          comparison status bit
other class similar to discrete:
● OEM: discrete sensor where the

  meaning of the states (offsets) are
  OEM defined




                                                                  slide 20/37
IPMI Sensor Classes (2/2)
Discrete                                            Threshold
 [root@test ~]# ipmitool sdr get "PS2 Status"        [root@test ~]# ipmitool sdr get "Fan 1"
  [root@test ~]# ipmitool sdr get "PS2 Status"        [root@test ~]# ipmitool sdr get "Fan 1"
 Sensor ID              : PS2 Status (0x71)          Sensor ID              : Fan 1 (0x50)
  Sensor ID              : PS2 Status (0x71)          Sensor ID              : Fan 1 (0x50)
  Entity ID             : 10.2 (Power Supply)         Entity ID             : 29.1 (Fan Device)
   Entity ID             : 10.2 (Power Supply)         Entity ID             : 29.1 (Fan Device)
  Sensor Type (Discrete): Power Supply                Sensor Type (Analog)  : Fan
   Sensor Type (Discrete): Power Supply                Sensor Type (Analog)  : Fan
  States Asserted       : Power Supply                Sensor Reading        : 5719 (+/­ 0) RPM
   States Asserted       : Power Supply                Sensor Reading        : 5719 (+/­ 0) RPM
                          [Presence detected]         Status                : ok
                           [Presence detected]         Status                : ok
                          [Power Supply AC lost]      Nominal Reading       : 6708.000
                           [Power Supply AC lost]      Nominal Reading       : 6708.000
  Assertion Events      : Power Supply                Normal Minimum        : 2451.000
   Assertion Events      : Power Supply                Normal Minimum        : 2451.000
                          [Presence detected]         Normal Maximum        : 10965.000
                           [Presence detected]         Normal Maximum        : 10965.000
                          [Power Supply AC lost]      Lower critical        : 1720.000
                           [Power Supply AC lost]      Lower critical        : 1720.000
  Assertions Enabled    : Power Supply                Lower non­critical    : 1978.000
   Assertions Enabled    : Power Supply                Lower non­critical    : 1978.000
                          [Presence detected]         Positive Hysteresis   : 86.000
                           [Presence detected]         Positive Hysteresis   : 86.000
                          [Failure detected]          Negative Hysteresis   : 86.000
                           [Failure detected]          Negative Hysteresis   : 86.000
                          [Predictive failure]        Minimum sensor range  : Unspecified
                           [Predictive failure]        Minimum sensor range  : Unspecified
                          [Power Supply AC lost]      Maximum sensor range  : Unspecified
                           [Power Supply AC lost]      Maximum sensor range  : Unspecified
 [...]                                                Event Message Control : Per­threshold
  [...]                                                Event Message Control : Per­threshold
  Deassertions Enabled  : Power Supply                Readable Thresholds   : lcr lnc 
   Deassertions Enabled  : Power Supply                Readable Thresholds   : lcr lnc 
 [...]                                                Settable Thresholds   : lcr lnc 
  [...]                                                Settable Thresholds   : lcr lnc 
                                                      Threshold Read Mask   : lcr lnc 
                                                       Threshold Read Mask   : lcr lnc 
                                                      Assertion Events      : 
                                                       Assertion Events      : 
                                                      Assertions Enabled    : lnc­ lcr­ 
                                                       Assertions Enabled    : lnc­ lcr­ 
                                                      Deassertions Enabled  : lnc­ lcr­ 
                                                       Deassertions Enabled  : lnc­ lcr­ 




                                                                               slide 21/37
IPMI Sensor Types

 root@test:~# ipmi­sensors ­L
 root@test:~# ipmi­sensors ­L
 Temperature
 Temperature
 Voltage
 Voltage
 Current
 Current
 Fan
 Fan
 Physical_Security
 Physical_Security
 Platform_Security_Violation_Attempt
 Platform_Security_Violation_Attempt
 Processor
 Processor
 Power_Supply
 Power_Supply
 Power_Unit
 Power_Unit
 Cooling_Device
 Cooling_Device
 […]
 […]



                                       slide 22/37
Example: query sensors with FreeIPMI

 [root@testserver ~]# ipmimonitoring
  [root@testserver ~]# ipmimonitoring
 Record_ID | Sensor Name | Sensor Group | Monitoring Status| 
  Record_ID | Sensor Name | Sensor Group | Monitoring Status| 
 Sensor Units | Sensor Reading
  Sensor Units | Sensor Reading
 [...]
  [...]
 17 | Fan 5              | Fan     | Nominal | RPM | 9052.000000 
  17 | Fan 5              | Fan     | Nominal | RPM | 9052.000000 
 18 | Fan 6              | Fan     | Nominal | RPM | 8060.000000 
  18 | Fan 6              | Fan     | Nominal | RPM | 8060.000000 
 19 | PS1 AC Current     | Current | Nominal | A   | 0.124000 
  19 | PS1 AC Current     | Current | Nominal | A   | 0.124000 
 20 | PS2 AC Current     | Current | Nominal | A   | 0.992000 
  20 | PS2 AC Current     | Current | Nominal | A   | 0.992000 
 [...]
  [...]
 36 | Physical Scrty     | Physical Security | Critical | N/A | 
  36 | Physical Scrty     | Physical Security | Critical | N/A | 
                                       'General Chassis Intrusion'
                                        'General Chassis Intrusion'




                                                       slide 23/37
Example: interpret discrete sensors( FreeIPMI)

 root@test:~# cat /etc/freeipmi/freeipmi_interpret_sensor.conf
  root@test:~# cat /etc/freeipmi/freeipmi_interpret_sensor.conf
 […]
  […]
 ## IPMI_Physical_Security 
  ## IPMI_Physical_Security 
 ##
 # IPMI_Physical_Security_No_Event                  Nominal
  # IPMI_Physical_Security_No_Event                  Nominal
 # IPMI_Physical_Security_General_Chassis_Intrusion Critical
  # IPMI_Physical_Security_General_Chassis_Intrusion Critical
 # IPMI_Physical_Security_Drive_Bay_Intrusion       Critical
  # IPMI_Physical_Security_Drive_Bay_Intrusion       Critical
 […]
  […]
 # IPMI_Power_Supply_No_Event                       Nominal
  # IPMI_Power_Supply_No_Event                       Nominal
 # IPMI_Power_Supply_Presence_Detected              Nominal
  # IPMI_Power_Supply_Presence_Detected              Nominal
 # IPMI_Power_Supply_Power_Supply_Failure_Detected  Critical
  # IPMI_Power_Supply_Power_Supply_Failure_Detected  Critical
 # IPMI_Power_Supply_Predictive_Failure             Critical
  # IPMI_Power_Supply_Predictive_Failure             Critical
 # IPMI_Power_Supply_Power_Supply_Input_Lost_AC_DC  Critical
  # IPMI_Power_Supply_Power_Supply_Input_Lost_AC_DC  Critical
 […]
  […]
 ## IPMI_Memory
  ## IPMI_Memory
 ##
 # IPMI_Memory_No_Event                             Nominal
  # IPMI_Memory_No_Event                             Nominal
 # IPMI_Memory_Correctable_Memory_Error             Warning
  # IPMI_Memory_Correctable_Memory_Error             Warning
 # IPMI_Memory_Uncorrectable_Memory_Error           Critical
  # IPMI_Memory_Uncorrectable_Memory_Error           Critical


                                                       slide 24/37
IPMI System Event Log (SEL)

    • stored in non-volatile storage
[root@testserver ~]# ipmitool sel elist
 [root@testserver ~]# ipmitool sel elist
  40 | 06/21/2010 | 14:29:29 | Power Supply PS1 Status | Power Supply AC lost | Asserted
   40 | 06/21/2010 | 14:29:29 | Power Supply PS1 Status | Power Supply AC lost | Asserted
  54 | 06/21/2010 | 14:29:29 | Power Unit Power Redundancy | Fully Redundant
   54 | 06/21/2010 | 14:29:29 | Power Unit Power Redundancy | Fully Redundant
  68 | 06/21/2010 | 14:29:29 | Power Unit Power Redundancy | Redundancy Lost
   68 | 06/21/2010 | 14:29:29 | Power Unit Power Redundancy | Redundancy Lost
  7c | 06/21/2010 | 14:29:29 | Power Unit Power Redundancy | Non­Redundant: Sufficient from Redundant
   7c | 06/21/2010 | 14:29:29 | Power Unit Power Redundancy | Non­Redundant: Sufficient from Redundant
[...]
 [...]
 2fc | 06/21/2010 | 15:20:32 | Physical Security Physical Scrty | General Chassis intrusion | Asserted
  2fc | 06/21/2010 | 15:20:32 | Physical Security Physical Scrty | General Chassis intrusion | Asserted


[root@testserver ~]# ipmitool sel elist
 [root@testserver ~]# ipmitool sel elist
Power Supply PS1 Status | Power Supply AC lost | Asserted
 Power Supply PS1 Status | Power Supply AC lost | Asserted
Power Unit Power Redundancy | Fully Redundant
 Power Unit Power Redundancy | Fully Redundant
Power Unit Power Redundancy | Redundancy Lost
 Power Unit Power Redundancy | Redundancy Lost
Power Unit Power Redundancy | Non­Redundant: Sufficient from Redundant
 Power Unit Power Redundancy | Non­Redundant: Sufficient from Redundant
[...]
 [...]
Physical Security Physical Scrty | General Chassis intrusion | Asserted
 Physical Security Physical Scrty | General Chassis intrusion | Asserted




                                                                                         slide 25/37
Agenda

 1) IPMI overview

 2) Plugin implementation

 3) Live demo

 4) Common pitfalls




                            slide 26/37
Plugin implementation

  • Bash script
  • uses FreeIPMI, gawk
# ./check_ipmi_sensor ­H 10.10.10.114 ­f /etc/ipmi­config/ipmi.cfg 
 # ./check_ipmi_sensor ­H 10.10.10.114 ­f /etc/ipmi­config/ipmi.cfg 
IPMI Status: OK | 'System Temp'=29.000000 'FAN 1'=4185.000000 'FAN 
 IPMI Status: OK | 'System Temp'=29.000000 'FAN 1'=4185.000000 'FAN 
2'=4320.000000 'FAN 3'=4590.000000 'FAN 4'=4320.000000 'FAN 
 2'=4320.000000 'FAN 3'=4590.000000 'FAN 4'=4320.000000 'FAN 
A'=4590.000000 'Vcore'=0.712000 '3.3VCC'=3.392000 '12V'=12.190000 
 A'=4590.000000 'Vcore'=0.712000 '3.3VCC'=3.392000 '12V'=12.190000 
'VDIMM'=1.528000 '5VCC'=5.088000 '­12V'=­11.681000 'VBAT'=3.024000 
 'VDIMM'=1.528000 '5VCC'=5.088000 '­12V'=­11.681000 'VBAT'=3.024000 
'VSB'=3.344000 'AVCC'=3.408000 
 'VSB'=3.344000 'AVCC'=3.408000 




                                                        slide 27/37
Plugin implementation

    • Bash script
    • uses FreeIPMI, gawk
# ./check_ipmi_sensor ­H 10.10.10.114 ­f /etc/ipmi­config/ipmi.cfg ­v 2
 # ./check_ipmi_sensor ­H 10.10.10.114 ­f /etc/ipmi­config/ipmi.cfg ­v 2
IPMI Status: OK | 'System Temp'=29.000000 'FAN 1'=4320.000000 'FAN 
 IPMI Status: OK | 'System Temp'=29.000000 'FAN 1'=4320.000000 'FAN 
[…] 
 […] 
System Temp = 29.000000 (Status: Nominal)
 System Temp = 29.000000 (Status: Nominal)
CPU Temp = 'Low' (Status: Nominal)
 CPU Temp = 'Low' (Status: Nominal)
FAN 1 = 4320.000000 (Status: Nominal)
 FAN 1 = 4320.000000 (Status: Nominal)
FAN 2 = 4320.000000 (Status: Nominal)
 FAN 2 = 4320.000000 (Status: Nominal)
FAN 3 = 4590.000000 (Status: Nominal)
 FAN 3 = 4590.000000 (Status: Nominal)
[…]
 […]
AVCC = 3.408000 (Status: Nominal)
 AVCC = 3.408000 (Status: Nominal)
Chassis Intru = 'OK' (Status: Nominal)
 Chassis Intru = 'OK' (Status: Nominal)
PS Status = 'Presence detected' (Status: Nominal) 
 PS Status = 'Presence detected' (Status: Nominal) 




                                                         slide 28/37
Plugin implementation

  • clear illustration in webinterfaces




                                          slide 29/37
Agenda

 1) IPMI overview

 2) Plugin implementation

 3) Live demo

 4) Common pitfalls




                            slide 30/37
Agenda

 1) IPMI overview

 2) Plugin implementation

 3) Live demo

 4) Common pitfalls




                            slide 31/37
Common pitfalls

   • sensors with state N/A
[…]
 […]
12 | CPU1 Temp | OEM Reserved | N/A | N/A | N/A | 'OEM Event = 0000h'
 12 | CPU1 Temp | OEM Reserved | N/A | N/A | N/A | 'OEM Event = 0000h'
13 | CPU2 Temp | OEM Reserved | N/A | N/A | N/A | 'OEM Event = 0000h' 
 13 | CPU2 Temp | OEM Reserved | N/A | N/A | N/A | 'OEM Event = 0000h' 
[…]
 […]


   • solution shortest-term: exclude (-x opt.)
   • solution short-term: FreeIPMI update
     tkwiki.cc/FreeIPMI-NA-Sensor



                                                         slide 32/37
Common pitfalls

  • unrecognized events
[…] 
 […] 
40 | Status | Cable/Interconnect | Nominal | N/A | 
 40 | Status | Cable/Interconnect | Nominal | N/A | 
'Cable/Interconnect is connected'
 'Cable/Interconnect is connected'
41 | RAC Status | Module/Board | N/A | N/A | 'Unrecognized Event = 
 41 | RAC Status | Module/Board | N/A | N/A | 'Unrecognized Event = 
0001h' 'Unrecognized Event = 0002h' 'Unrecognized Event = 0004h'
 0001h' 'Unrecognized Event = 0002h' 'Unrecognized Event = 0004h'
42 | OS Watchdog | Watchdog 2 | Nominal | N/A | 'OK'
 42 | OS Watchdog | Watchdog 2 | Nominal | N/A | 'OK'
[…]  
 […]  

  • solution shortest-term: ignore unrec. e.
    tkwiki.cc/FreeIPMI-Unrec-Event




                                                        slide 33/37
Agenda

 1) IPMI overview

 2) Plugin implementation

 3) Live demo

 4) Common pitfalls

             some conclusions …


                                  slide 34/37
Conclusions (1/2)

  • Download:
    www.thomas-krenn.com/en/oss
  • Mailing List:
    lists.thomas-krenn.com
  • Thanks for your contribution:
     Nikolaus Filus, Timme Katz, Lars
     Meuser, Sebastian Mörchen, Gustav
     Olsson, Holger Paschke, Andy Spiegl,
     Ulrich Zehl
                                     slide 35/37
Conclusions (2/2)


       Monitor hardware
        with Icinga & IPMI


         Problems?
        They will tell you!


          It'll save you
          time & money



                              slide 36/37
Get German article
on the plugin for free at

tkwiki.cc/ipmi-plugin




Thanks for your time!

More Related Content

What's hot

ARM Processor architecture
ARM Processor  architectureARM Processor  architecture
ARM Processor architecturerajkciitr
 
Arm architecture overview
Arm architecture overviewArm architecture overview
Arm architecture overviewSunil Thorat
 
Arm corrected ppt
Arm corrected pptArm corrected ppt
Arm corrected pptanish jagan
 
Programming The Arm Microprocessor For Embedded Systems
Programming The Arm Microprocessor For Embedded SystemsProgramming The Arm Microprocessor For Embedded Systems
Programming The Arm Microprocessor For Embedded Systemsjoshparrish13
 
Mpc5121 econfs
Mpc5121 econfsMpc5121 econfs
Mpc5121 econfsDino, llc
 
Q4.11: ARM Architecture
Q4.11: ARM ArchitectureQ4.11: ARM Architecture
Q4.11: ARM ArchitectureLinaro
 
Introduction to Processor Design and ARM Processor
Introduction to Processor Design and ARM ProcessorIntroduction to Processor Design and ARM Processor
Introduction to Processor Design and ARM ProcessorDarling Jemima
 
Sears Point Racetrack
Sears Point RacetrackSears Point Racetrack
Sears Point RacetrackDino, llc
 
Arm processor architecture awareness session pi technologies
Arm processor architecture awareness session pi technologiesArm processor architecture awareness session pi technologies
Arm processor architecture awareness session pi technologiesPiTechnologies
 
การออกแบบโปรเซสเซอร์
การออกแบบโปรเซสเซอร์การออกแบบโปรเซสเซอร์
การออกแบบโปรเซสเซอร์Nattawooth Mingtong
 
Arm cm3 architecture_and_programmer_model
Arm cm3 architecture_and_programmer_modelArm cm3 architecture_and_programmer_model
Arm cm3 architecture_and_programmer_modelGanesh Naik
 
Xen summit spring2010_tom_woller_amd
Xen summit spring2010_tom_woller_amdXen summit spring2010_tom_woller_amd
Xen summit spring2010_tom_woller_amdThe Linux Foundation
 
ARM - Advance RISC Machine
ARM - Advance RISC MachineARM - Advance RISC Machine
ARM - Advance RISC MachineEdutechLearners
 
ARM7-ARCHITECTURE
ARM7-ARCHITECTURE ARM7-ARCHITECTURE
ARM7-ARCHITECTURE Dr.YNM
 
Unitii armarchitecture-130305014346-phpapp01
Unitii armarchitecture-130305014346-phpapp01Unitii armarchitecture-130305014346-phpapp01
Unitii armarchitecture-130305014346-phpapp01mannepalli Srinivasulu
 
Introduction to arm processor
Introduction to arm processorIntroduction to arm processor
Introduction to arm processorRAMPRAKASHT1
 

What's hot (19)

ARM Processor architecture
ARM Processor  architectureARM Processor  architecture
ARM Processor architecture
 
Arm Processor
Arm ProcessorArm Processor
Arm Processor
 
Arm architecture overview
Arm architecture overviewArm architecture overview
Arm architecture overview
 
Arm corrected ppt
Arm corrected pptArm corrected ppt
Arm corrected ppt
 
Programming The Arm Microprocessor For Embedded Systems
Programming The Arm Microprocessor For Embedded SystemsProgramming The Arm Microprocessor For Embedded Systems
Programming The Arm Microprocessor For Embedded Systems
 
Mpc5121 econfs
Mpc5121 econfsMpc5121 econfs
Mpc5121 econfs
 
Q4.11: ARM Architecture
Q4.11: ARM ArchitectureQ4.11: ARM Architecture
Q4.11: ARM Architecture
 
Introduction to Processor Design and ARM Processor
Introduction to Processor Design and ARM ProcessorIntroduction to Processor Design and ARM Processor
Introduction to Processor Design and ARM Processor
 
Sears Point Racetrack
Sears Point RacetrackSears Point Racetrack
Sears Point Racetrack
 
Ppt
PptPpt
Ppt
 
Arm processor architecture awareness session pi technologies
Arm processor architecture awareness session pi technologiesArm processor architecture awareness session pi technologies
Arm processor architecture awareness session pi technologies
 
การออกแบบโปรเซสเซอร์
การออกแบบโปรเซสเซอร์การออกแบบโปรเซสเซอร์
การออกแบบโปรเซสเซอร์
 
Arm cm3 architecture_and_programmer_model
Arm cm3 architecture_and_programmer_modelArm cm3 architecture_and_programmer_model
Arm cm3 architecture_and_programmer_model
 
Xen summit spring2010_tom_woller_amd
Xen summit spring2010_tom_woller_amdXen summit spring2010_tom_woller_amd
Xen summit spring2010_tom_woller_amd
 
ARM - Advance RISC Machine
ARM - Advance RISC MachineARM - Advance RISC Machine
ARM - Advance RISC Machine
 
ARM7-ARCHITECTURE
ARM7-ARCHITECTURE ARM7-ARCHITECTURE
ARM7-ARCHITECTURE
 
Unitii armarchitecture-130305014346-phpapp01
Unitii armarchitecture-130305014346-phpapp01Unitii armarchitecture-130305014346-phpapp01
Unitii armarchitecture-130305014346-phpapp01
 
Introduction to arm processor
Introduction to arm processorIntroduction to arm processor
Introduction to arm processor
 
OMAP
OMAPOMAP
OMAP
 

Similar to 20111130 hardware-monitoring-with-the-new-ipmi-plugin-v2

OSMC 2011 | Hardware Monitoring with the new IPMI Plugin v2 by Werner Fischer
OSMC 2011 | Hardware Monitoring with the new IPMI Plugin v2 by Werner FischerOSMC 2011 | Hardware Monitoring with the new IPMI Plugin v2 by Werner Fischer
OSMC 2011 | Hardware Monitoring with the new IPMI Plugin v2 by Werner FischerNETWAYS
 
OSMC 2014: Server Hardware Monitoring done right | Werner Fischer
OSMC 2014: Server Hardware Monitoring done right | Werner FischerOSMC 2014: Server Hardware Monitoring done right | Werner Fischer
OSMC 2014: Server Hardware Monitoring done right | Werner FischerNETWAYS
 
Programming And Controlling Puma Arms
Programming And Controlling Puma ArmsProgramming And Controlling Puma Arms
Programming And Controlling Puma Arms블로그코디
 
Open Storage Sun Intel European Business Technology Tour
Open Storage Sun Intel European Business Technology TourOpen Storage Sun Intel European Business Technology Tour
Open Storage Sun Intel European Business Technology TourWalter Moriconi
 
Component Based Distributed System Development
Component Based Distributed System DevelopmentComponent Based Distributed System Development
Component Based Distributed System DevelopmentEmmanuel Fuchs
 
Презентация команды "Обыватели"
Презентация команды "Обыватели"Презентация команды "Обыватели"
Презентация команды "Обыватели"Tatyana Savchyk
 
Adv. FPGA Motor Control--EBV & Univ. of Koln: Embedded World 2010
Adv. FPGA Motor Control--EBV & Univ. of Koln: Embedded World 2010Adv. FPGA Motor Control--EBV & Univ. of Koln: Embedded World 2010
Adv. FPGA Motor Control--EBV & Univ. of Koln: Embedded World 2010Altera Corporation
 
Meeting SEP 2.0 Compliance: Developing Power Aware Embedded Systems for the M...
Meeting SEP 2.0 Compliance: Developing Power Aware Embedded Systems for the M...Meeting SEP 2.0 Compliance: Developing Power Aware Embedded Systems for the M...
Meeting SEP 2.0 Compliance: Developing Power Aware Embedded Systems for the M...mentoresd
 
15.00 hr van Hilten
15.00 hr van Hilten15.00 hr van Hilten
15.00 hr van HiltenThemadagen
 
Flex Stack Rapid Prototyping System
Flex Stack Rapid Prototyping SystemFlex Stack Rapid Prototyping System
Flex Stack Rapid Prototyping Systemfrankp617
 
EASA Part-66 Module 5.6 : Basic Computer Structure
EASA Part-66 Module  5.6 : Basic Computer StructureEASA Part-66 Module  5.6 : Basic Computer Structure
EASA Part-66 Module 5.6 : Basic Computer Structuresoulstalker
 
Zilker Labs Mixed-Signal Verification
Zilker Labs Mixed-Signal VerificationZilker Labs Mixed-Signal Verification
Zilker Labs Mixed-Signal VerificationDVClub
 
Icinga Camp Berlin 2017 - 10 Tips for better Hardware Monitoring
Icinga Camp Berlin 2017 - 10 Tips for better Hardware MonitoringIcinga Camp Berlin 2017 - 10 Tips for better Hardware Monitoring
Icinga Camp Berlin 2017 - 10 Tips for better Hardware MonitoringIcinga
 
Camera camcorder framework overview(ginger bread)
Camera camcorder framework overview(ginger bread)Camera camcorder framework overview(ginger bread)
Camera camcorder framework overview(ginger bread)fefe7270
 
Modeling System Behaviors: A Better Paradigm on Prototyping
Modeling System Behaviors: A Better Paradigm on PrototypingModeling System Behaviors: A Better Paradigm on Prototyping
Modeling System Behaviors: A Better Paradigm on PrototypingDVClub
 
[05][cuda 및 fermi 최적화 기술] hryu optimization
[05][cuda 및 fermi 최적화 기술] hryu optimization[05][cuda 및 fermi 최적화 기술] hryu optimization
[05][cuda 및 fermi 최적화 기술] hryu optimizationlaparuma
 

Similar to 20111130 hardware-monitoring-with-the-new-ipmi-plugin-v2 (20)

OSMC 2011 | Hardware Monitoring with the new IPMI Plugin v2 by Werner Fischer
OSMC 2011 | Hardware Monitoring with the new IPMI Plugin v2 by Werner FischerOSMC 2011 | Hardware Monitoring with the new IPMI Plugin v2 by Werner Fischer
OSMC 2011 | Hardware Monitoring with the new IPMI Plugin v2 by Werner Fischer
 
OSMC 2014: Server Hardware Monitoring done right | Werner Fischer
OSMC 2014: Server Hardware Monitoring done right | Werner FischerOSMC 2014: Server Hardware Monitoring done right | Werner Fischer
OSMC 2014: Server Hardware Monitoring done right | Werner Fischer
 
Programming And Controlling Puma Arms
Programming And Controlling Puma ArmsProgramming And Controlling Puma Arms
Programming And Controlling Puma Arms
 
Open Storage Sun Intel European Business Technology Tour
Open Storage Sun Intel European Business Technology TourOpen Storage Sun Intel European Business Technology Tour
Open Storage Sun Intel European Business Technology Tour
 
Component Based Distributed System Development
Component Based Distributed System DevelopmentComponent Based Distributed System Development
Component Based Distributed System Development
 
Презентация команды "Обыватели"
Презентация команды "Обыватели"Презентация команды "Обыватели"
Презентация команды "Обыватели"
 
Adv. FPGA Motor Control--EBV & Univ. of Koln: Embedded World 2010
Adv. FPGA Motor Control--EBV & Univ. of Koln: Embedded World 2010Adv. FPGA Motor Control--EBV & Univ. of Koln: Embedded World 2010
Adv. FPGA Motor Control--EBV & Univ. of Koln: Embedded World 2010
 
Meeting SEP 2.0 Compliance: Developing Power Aware Embedded Systems for the M...
Meeting SEP 2.0 Compliance: Developing Power Aware Embedded Systems for the M...Meeting SEP 2.0 Compliance: Developing Power Aware Embedded Systems for the M...
Meeting SEP 2.0 Compliance: Developing Power Aware Embedded Systems for the M...
 
15.00 hr van Hilten
15.00 hr van Hilten15.00 hr van Hilten
15.00 hr van Hilten
 
Flex Stack Rapid Prototyping System
Flex Stack Rapid Prototyping SystemFlex Stack Rapid Prototyping System
Flex Stack Rapid Prototyping System
 
EASA Part-66 Module 5.6 : Basic Computer Structure
EASA Part-66 Module  5.6 : Basic Computer StructureEASA Part-66 Module  5.6 : Basic Computer Structure
EASA Part-66 Module 5.6 : Basic Computer Structure
 
Herrington dv club_sept19-1
Herrington dv club_sept19-1Herrington dv club_sept19-1
Herrington dv club_sept19-1
 
Zilker Labs Mixed-Signal Verification
Zilker Labs Mixed-Signal VerificationZilker Labs Mixed-Signal Verification
Zilker Labs Mixed-Signal Verification
 
Icinga Camp Berlin 2017 - 10 Tips for better Hardware Monitoring
Icinga Camp Berlin 2017 - 10 Tips for better Hardware MonitoringIcinga Camp Berlin 2017 - 10 Tips for better Hardware Monitoring
Icinga Camp Berlin 2017 - 10 Tips for better Hardware Monitoring
 
Thesis Donato Slides EN
Thesis Donato Slides ENThesis Donato Slides EN
Thesis Donato Slides EN
 
Camera camcorder framework overview(ginger bread)
Camera camcorder framework overview(ginger bread)Camera camcorder framework overview(ginger bread)
Camera camcorder framework overview(ginger bread)
 
Modeling System Behaviors: A Better Paradigm on Prototyping
Modeling System Behaviors: A Better Paradigm on PrototypingModeling System Behaviors: A Better Paradigm on Prototyping
Modeling System Behaviors: A Better Paradigm on Prototyping
 
[05][cuda 및 fermi 최적화 기술] hryu optimization
[05][cuda 및 fermi 최적화 기술] hryu optimization[05][cuda 및 fermi 최적화 기술] hryu optimization
[05][cuda 및 fermi 최적화 기술] hryu optimization
 
Fpga design flow
Fpga design flowFpga design flow
Fpga design flow
 
Ph.D. Thesis presentation
Ph.D. Thesis presentationPh.D. Thesis presentation
Ph.D. Thesis presentation
 

Recently uploaded

Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfngoud9212
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 

Recently uploaded (20)

Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Bluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdfBluetooth Controlled Car with Arduino.pdf
Bluetooth Controlled Car with Arduino.pdf
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 

20111130 hardware-monitoring-with-the-new-ipmi-plugin-v2

  • 1. Hardware Monitoring with the new IPMI Plugin v2 Werner Fischer, Technology Specialist Thomas-Krenn.AG 6. OSMC / Nuremberg / Germany 29th November 2011
  • 2. Introduction who I am who I am not Werner Fischer working for a Linux user Kernel or H/W Server vendor since 2001 developer slide 2/37
  • 3. Introduction who is Server & accessories based in Freyung, serving all over Europe "Made in Germany" Bavaria slide 3/37
  • 4. Some questions ... Should I use monitoring? slide 4/37
  • 5. Some questions ... Should I use monitoring? It depends on what you want to do in your free time... remember yesterday evening? ✔ ✘ slide 5/37
  • 6. Some questions ... All drives of the RAID 6 O.K.? ✔ ✔ ✔ ✔ slide 6/37
  • 7. Some questions ... All network connections O.K.? ✔ ✔ slide 7/37
  • 8. Some questions ... All FANs O.K.? ? slide 8/37
  • 9. Some questions ... All power supplies O.K.? ? slide 9/37
  • 10. Some questions ... Can we monitor all these servers? any IPMI compatible server slide 10/37
  • 11. Some questions ... Can we monitor all these servers? Easily? any IPMI compatible server slide 11/37
  • 12. Some questions ... Can we monitor all these servers? Easily? With one single tool? any IPMI compatible server slide 12/37
  • 13. Some questions ... Can we monitor all these servers? Easily? With one single tool? slide 13/37
  • 14. Agenda 1) IPMI overview 2) Plugin implementation 3) Live demo 4) Common pitfalls slide 14/37
  • 15. Intelligent Platform Management Interface • IPMI developed by Intel, HP, NEC, Dell – 1998: IPMI v1.0 – 2001: IPMI v1.5 – 2004: IPMI v2.0 slide 15/37
  • 16. IPMI main features  Monitoring (temp, fans, ...)  Recovery Control (power on/off/reset)  Logging (System Event Log)  Inventory (FRU information) slide 16/37
  • 17. IPMI overview access req. username & Remote Mmgt. Card (KVM over IP, ...) ICMB LAN Connector Serial Connector password Auxillary IPMB Connector ICMB bridge Chassis PCI mgmt. bus IPMB mgmt. NVS Storage (Satellite SDR Controller) Network LAN SEL (LAN) interface FRU Controller Baseboard FRU Temp. Sensors & Controls Management sensor access req. Controller Fan sensor Temp. sensor … (BMC) Power control root privileges Reset control … Chassis board Serial BMC Serial/Modem Port Serial private mgmt. busses FRU interface Sharing Controller FRU FRU Redundant Power M/B board Temp. s. Serial System Controller interface Memory Processor board board System bus Motherboard slide 17/37
  • 18. IPMI Channel Privilege Levels (LAN access) Privilege Level Allows User ● query sensors Operator ● nearly all IPMI commands ● but no changing of out-of- band interfaces Administrator • all IPMI commands use privilege level 'User' for monitoring purposes slide 18/37
  • 19. Example: remote control with ipmitool [user@adminpc ~]$ ipmitool ­I lan ­H 192.168.1.211  [user@adminpc ~]$ ipmitool ­I lan ­H 192.168.1.211                    ­U admin power status                   ­U admin power status Password: Password: Chassis Power is off Chassis Power is off [user@adminpc ~]$ [user@adminpc ~]$ [user@adminpc ~]$ ipmitool ­I lan ­H 192.168.1.211  [user@adminpc ~]$ ipmitool ­I lan ­H 192.168.1.211                    ­U admin power on                   ­U admin power on Password: Password: Chassis Power Control: Up/On Chassis Power Control: Up/On [user@adminpc ~]$ [user@adminpc ~]$ [user@adminpc ~]$ ipmitool ­I lan ­H 192.168.1.211  [user@adminpc ~]$ ipmitool ­I lan ­H 192.168.1.211                    ­U admin power status                   ­U admin power status Password: Password: Chassis Power is on Chassis Power is on [user@adminpc ~]$ [user@adminpc ~]$ slide 19/37
  • 20. IPMI Sensor Classes (1/2) Discrete Threshold multiple states possible: changes event status on: ● up to 15 states ● analog reading compared to ● each state is reflected by a bit threshold values ● multiple state bits can active can provide: provides: ● generic states ● analog reading ● sensor-specific states of the sensor ● discr. threshold comparison status bit other class similar to discrete: ● OEM: discrete sensor where the meaning of the states (offsets) are OEM defined slide 20/37
  • 21. IPMI Sensor Classes (2/2) Discrete Threshold [root@test ~]# ipmitool sdr get "PS2 Status" [root@test ~]# ipmitool sdr get "Fan 1" [root@test ~]# ipmitool sdr get "PS2 Status" [root@test ~]# ipmitool sdr get "Fan 1" Sensor ID              : PS2 Status (0x71) Sensor ID              : Fan 1 (0x50) Sensor ID              : PS2 Status (0x71) Sensor ID              : Fan 1 (0x50)  Entity ID             : 10.2 (Power Supply)  Entity ID             : 29.1 (Fan Device)  Entity ID             : 10.2 (Power Supply)  Entity ID             : 29.1 (Fan Device)  Sensor Type (Discrete): Power Supply  Sensor Type (Analog)  : Fan  Sensor Type (Discrete): Power Supply  Sensor Type (Analog)  : Fan  States Asserted       : Power Supply  Sensor Reading        : 5719 (+/­ 0) RPM  States Asserted       : Power Supply  Sensor Reading        : 5719 (+/­ 0) RPM                          [Presence detected]  Status                : ok                          [Presence detected]  Status                : ok                          [Power Supply AC lost]  Nominal Reading       : 6708.000                          [Power Supply AC lost]  Nominal Reading       : 6708.000  Assertion Events      : Power Supply  Normal Minimum        : 2451.000  Assertion Events      : Power Supply  Normal Minimum        : 2451.000                          [Presence detected]  Normal Maximum        : 10965.000                          [Presence detected]  Normal Maximum        : 10965.000                          [Power Supply AC lost]  Lower critical        : 1720.000                          [Power Supply AC lost]  Lower critical        : 1720.000  Assertions Enabled    : Power Supply  Lower non­critical    : 1978.000  Assertions Enabled    : Power Supply  Lower non­critical    : 1978.000                          [Presence detected]  Positive Hysteresis   : 86.000                          [Presence detected]  Positive Hysteresis   : 86.000                          [Failure detected]  Negative Hysteresis   : 86.000                          [Failure detected]  Negative Hysteresis   : 86.000                          [Predictive failure]  Minimum sensor range  : Unspecified                          [Predictive failure]  Minimum sensor range  : Unspecified                          [Power Supply AC lost]  Maximum sensor range  : Unspecified                          [Power Supply AC lost]  Maximum sensor range  : Unspecified [...]  Event Message Control : Per­threshold [...]  Event Message Control : Per­threshold  Deassertions Enabled  : Power Supply  Readable Thresholds   : lcr lnc   Deassertions Enabled  : Power Supply  Readable Thresholds   : lcr lnc  [...]  Settable Thresholds   : lcr lnc  [...]  Settable Thresholds   : lcr lnc   Threshold Read Mask   : lcr lnc   Threshold Read Mask   : lcr lnc   Assertion Events      :   Assertion Events      :   Assertions Enabled    : lnc­ lcr­   Assertions Enabled    : lnc­ lcr­   Deassertions Enabled  : lnc­ lcr­   Deassertions Enabled  : lnc­ lcr­  slide 21/37
  • 22. IPMI Sensor Types root@test:~# ipmi­sensors ­L root@test:~# ipmi­sensors ­L Temperature Temperature Voltage Voltage Current Current Fan Fan Physical_Security Physical_Security Platform_Security_Violation_Attempt Platform_Security_Violation_Attempt Processor Processor Power_Supply Power_Supply Power_Unit Power_Unit Cooling_Device Cooling_Device […] […] slide 22/37
  • 23. Example: query sensors with FreeIPMI [root@testserver ~]# ipmimonitoring [root@testserver ~]# ipmimonitoring Record_ID | Sensor Name | Sensor Group | Monitoring Status|  Record_ID | Sensor Name | Sensor Group | Monitoring Status|  Sensor Units | Sensor Reading Sensor Units | Sensor Reading [...] [...] 17 | Fan 5              | Fan     | Nominal | RPM | 9052.000000  17 | Fan 5              | Fan     | Nominal | RPM | 9052.000000  18 | Fan 6              | Fan     | Nominal | RPM | 8060.000000  18 | Fan 6              | Fan     | Nominal | RPM | 8060.000000  19 | PS1 AC Current     | Current | Nominal | A   | 0.124000  19 | PS1 AC Current     | Current | Nominal | A   | 0.124000  20 | PS2 AC Current     | Current | Nominal | A   | 0.992000  20 | PS2 AC Current     | Current | Nominal | A   | 0.992000  [...] [...] 36 | Physical Scrty     | Physical Security | Critical | N/A |  36 | Physical Scrty     | Physical Security | Critical | N/A |                                        'General Chassis Intrusion'                                       'General Chassis Intrusion' slide 23/37
  • 24. Example: interpret discrete sensors( FreeIPMI) root@test:~# cat /etc/freeipmi/freeipmi_interpret_sensor.conf root@test:~# cat /etc/freeipmi/freeipmi_interpret_sensor.conf […] […] ## IPMI_Physical_Security  ## IPMI_Physical_Security  ## # IPMI_Physical_Security_No_Event                  Nominal # IPMI_Physical_Security_No_Event                  Nominal # IPMI_Physical_Security_General_Chassis_Intrusion Critical # IPMI_Physical_Security_General_Chassis_Intrusion Critical # IPMI_Physical_Security_Drive_Bay_Intrusion       Critical # IPMI_Physical_Security_Drive_Bay_Intrusion       Critical […] […] # IPMI_Power_Supply_No_Event                       Nominal # IPMI_Power_Supply_No_Event                       Nominal # IPMI_Power_Supply_Presence_Detected              Nominal # IPMI_Power_Supply_Presence_Detected              Nominal # IPMI_Power_Supply_Power_Supply_Failure_Detected  Critical # IPMI_Power_Supply_Power_Supply_Failure_Detected  Critical # IPMI_Power_Supply_Predictive_Failure             Critical # IPMI_Power_Supply_Predictive_Failure             Critical # IPMI_Power_Supply_Power_Supply_Input_Lost_AC_DC  Critical # IPMI_Power_Supply_Power_Supply_Input_Lost_AC_DC  Critical […] […] ## IPMI_Memory ## IPMI_Memory ## # IPMI_Memory_No_Event                             Nominal # IPMI_Memory_No_Event                             Nominal # IPMI_Memory_Correctable_Memory_Error             Warning # IPMI_Memory_Correctable_Memory_Error             Warning # IPMI_Memory_Uncorrectable_Memory_Error           Critical # IPMI_Memory_Uncorrectable_Memory_Error           Critical slide 24/37
  • 25. IPMI System Event Log (SEL) • stored in non-volatile storage [root@testserver ~]# ipmitool sel elist [root@testserver ~]# ipmitool sel elist   40 | 06/21/2010 | 14:29:29 | Power Supply PS1 Status | Power Supply AC lost | Asserted   40 | 06/21/2010 | 14:29:29 | Power Supply PS1 Status | Power Supply AC lost | Asserted   54 | 06/21/2010 | 14:29:29 | Power Unit Power Redundancy | Fully Redundant   54 | 06/21/2010 | 14:29:29 | Power Unit Power Redundancy | Fully Redundant   68 | 06/21/2010 | 14:29:29 | Power Unit Power Redundancy | Redundancy Lost   68 | 06/21/2010 | 14:29:29 | Power Unit Power Redundancy | Redundancy Lost   7c | 06/21/2010 | 14:29:29 | Power Unit Power Redundancy | Non­Redundant: Sufficient from Redundant   7c | 06/21/2010 | 14:29:29 | Power Unit Power Redundancy | Non­Redundant: Sufficient from Redundant [...] [...]  2fc | 06/21/2010 | 15:20:32 | Physical Security Physical Scrty | General Chassis intrusion | Asserted  2fc | 06/21/2010 | 15:20:32 | Physical Security Physical Scrty | General Chassis intrusion | Asserted [root@testserver ~]# ipmitool sel elist [root@testserver ~]# ipmitool sel elist Power Supply PS1 Status | Power Supply AC lost | Asserted Power Supply PS1 Status | Power Supply AC lost | Asserted Power Unit Power Redundancy | Fully Redundant Power Unit Power Redundancy | Fully Redundant Power Unit Power Redundancy | Redundancy Lost Power Unit Power Redundancy | Redundancy Lost Power Unit Power Redundancy | Non­Redundant: Sufficient from Redundant Power Unit Power Redundancy | Non­Redundant: Sufficient from Redundant [...] [...] Physical Security Physical Scrty | General Chassis intrusion | Asserted Physical Security Physical Scrty | General Chassis intrusion | Asserted slide 25/37
  • 26. Agenda 1) IPMI overview 2) Plugin implementation 3) Live demo 4) Common pitfalls slide 26/37
  • 27. Plugin implementation • Bash script • uses FreeIPMI, gawk # ./check_ipmi_sensor ­H 10.10.10.114 ­f /etc/ipmi­config/ipmi.cfg  # ./check_ipmi_sensor ­H 10.10.10.114 ­f /etc/ipmi­config/ipmi.cfg  IPMI Status: OK | 'System Temp'=29.000000 'FAN 1'=4185.000000 'FAN  IPMI Status: OK | 'System Temp'=29.000000 'FAN 1'=4185.000000 'FAN  2'=4320.000000 'FAN 3'=4590.000000 'FAN 4'=4320.000000 'FAN  2'=4320.000000 'FAN 3'=4590.000000 'FAN 4'=4320.000000 'FAN  A'=4590.000000 'Vcore'=0.712000 '3.3VCC'=3.392000 '12V'=12.190000  A'=4590.000000 'Vcore'=0.712000 '3.3VCC'=3.392000 '12V'=12.190000  'VDIMM'=1.528000 '5VCC'=5.088000 '­12V'=­11.681000 'VBAT'=3.024000  'VDIMM'=1.528000 '5VCC'=5.088000 '­12V'=­11.681000 'VBAT'=3.024000  'VSB'=3.344000 'AVCC'=3.408000  'VSB'=3.344000 'AVCC'=3.408000  slide 27/37
  • 28. Plugin implementation • Bash script • uses FreeIPMI, gawk # ./check_ipmi_sensor ­H 10.10.10.114 ­f /etc/ipmi­config/ipmi.cfg ­v 2 # ./check_ipmi_sensor ­H 10.10.10.114 ­f /etc/ipmi­config/ipmi.cfg ­v 2 IPMI Status: OK | 'System Temp'=29.000000 'FAN 1'=4320.000000 'FAN  IPMI Status: OK | 'System Temp'=29.000000 'FAN 1'=4320.000000 'FAN  […]  […]  System Temp = 29.000000 (Status: Nominal) System Temp = 29.000000 (Status: Nominal) CPU Temp = 'Low' (Status: Nominal) CPU Temp = 'Low' (Status: Nominal) FAN 1 = 4320.000000 (Status: Nominal) FAN 1 = 4320.000000 (Status: Nominal) FAN 2 = 4320.000000 (Status: Nominal) FAN 2 = 4320.000000 (Status: Nominal) FAN 3 = 4590.000000 (Status: Nominal) FAN 3 = 4590.000000 (Status: Nominal) […] […] AVCC = 3.408000 (Status: Nominal) AVCC = 3.408000 (Status: Nominal) Chassis Intru = 'OK' (Status: Nominal) Chassis Intru = 'OK' (Status: Nominal) PS Status = 'Presence detected' (Status: Nominal)  PS Status = 'Presence detected' (Status: Nominal)  slide 28/37
  • 29. Plugin implementation • clear illustration in webinterfaces slide 29/37
  • 30. Agenda 1) IPMI overview 2) Plugin implementation 3) Live demo 4) Common pitfalls slide 30/37
  • 31. Agenda 1) IPMI overview 2) Plugin implementation 3) Live demo 4) Common pitfalls slide 31/37
  • 32. Common pitfalls • sensors with state N/A […] […] 12 | CPU1 Temp | OEM Reserved | N/A | N/A | N/A | 'OEM Event = 0000h' 12 | CPU1 Temp | OEM Reserved | N/A | N/A | N/A | 'OEM Event = 0000h' 13 | CPU2 Temp | OEM Reserved | N/A | N/A | N/A | 'OEM Event = 0000h'  13 | CPU2 Temp | OEM Reserved | N/A | N/A | N/A | 'OEM Event = 0000h'  […] […] • solution shortest-term: exclude (-x opt.) • solution short-term: FreeIPMI update tkwiki.cc/FreeIPMI-NA-Sensor slide 32/37
  • 33. Common pitfalls • unrecognized events […]  […]  40 | Status | Cable/Interconnect | Nominal | N/A |  40 | Status | Cable/Interconnect | Nominal | N/A |  'Cable/Interconnect is connected' 'Cable/Interconnect is connected' 41 | RAC Status | Module/Board | N/A | N/A | 'Unrecognized Event =  41 | RAC Status | Module/Board | N/A | N/A | 'Unrecognized Event =  0001h' 'Unrecognized Event = 0002h' 'Unrecognized Event = 0004h' 0001h' 'Unrecognized Event = 0002h' 'Unrecognized Event = 0004h' 42 | OS Watchdog | Watchdog 2 | Nominal | N/A | 'OK' 42 | OS Watchdog | Watchdog 2 | Nominal | N/A | 'OK' […]   […]   • solution shortest-term: ignore unrec. e. tkwiki.cc/FreeIPMI-Unrec-Event slide 33/37
  • 34. Agenda 1) IPMI overview 2) Plugin implementation 3) Live demo 4) Common pitfalls some conclusions … slide 34/37
  • 35. Conclusions (1/2) • Download: www.thomas-krenn.com/en/oss • Mailing List: lists.thomas-krenn.com • Thanks for your contribution: Nikolaus Filus, Timme Katz, Lars Meuser, Sebastian Mörchen, Gustav Olsson, Holger Paschke, Andy Spiegl, Ulrich Zehl slide 35/37
  • 36. Conclusions (2/2)  Monitor hardware with Icinga & IPMI  Problems? They will tell you!  It'll save you time & money slide 36/37
  • 37. Get German article on the plugin for free at tkwiki.cc/ipmi-plugin Thanks for your time!