On a CentOS 4 IBM x3650 server, ipmitool -v sensor | grep -A 14 "PECI" gets me this:
Sensor ID : CPU 1 PECI tics (0x7c)
Entity ID : 3.1
Sensor Type (Analog) : Temperature
Sensor Reading : -55 (+/- 0) unspecified
Status : ok
Lower Non-Recoverable : -64.000
Lower Critical : na
Lower Non-Critical : 64.000
Upper Non-Critical : 0.000
Upper Critical : na
Upper Non-Recoverable : na
Event Status : Event Messages Disabled
Assertion Events :
Event Enable : Event Messages Disabled
Assertions Enabled : unc-
--
Sensor ID : CPU 2 PECI tics (0x7d)
Entity ID : 3.2
Sensor Type (Analog) : Temperature
Sensor Reading : -56 (+/- 0) unspecified
Status : ok
Lower Non-Recoverable : -64.000
Lower Critical : na
Lower Non-Critical : 64.000
Upper Non-Critical : 0.000
Upper Critical : na
Upper Non-Recoverable : na
Event Status : Event Messages Disabled
Assertion Events :
Event Enable : Event Messages Disabled
Assertions Enabled : unc-
2 things stick out to me. The lower non-critical threshold is higher than the upper non-critical threshold, so technically the Sensor Reading will always be out of range.
But according to ipmitool sdr list | grep PECI, the reading seems to be fine:
CPU 1 PECI tics | -49 unspecified | ok
CPU 2 PECI tics | -52 unspecified | ok
But I do see that the only Assertion Enabled is unc(Upper Non-Critical). So I guess that means that it's ignoring the Lower Non-Critical threshold? The second thing that sticks out to me is that the Assertions Enabled section says unc- as opposed to unc+. On all other sensors, I only ever see lnr-, lcr-, lnc- and/or, unc+, ucr+, unr+. So although I couldn't find any documentation to support my assumption I assumed they could be interpreted as "assertions are enabled in the event that the reading goes below(-) a lower threshold and/or above(+) an upper threshold," as the case may be. However, wouldn't that mean that unc- would enable assertions when the reading was below(-) the upper threshold? If so, why is the reading of -55 which is below the Upper Non-Critical threshold of 0.000 ok?
I have a similar problem on a CentOS 4 IBM eServer xSeries 346 server. ipmitool -v sensor for the CPU1 VCore shows:
Sensor ID : CPU 1 VCore (0xb8)
Entity ID : 3.1
Sensor Type (Analog) : Voltage
Sensor Reading : 1.330 (+/- 0) Volts
Status : ok
Lower Non-Recoverable : 1.920
Lower Critical : na
Lower Non-Critical : 1.920
Upper Non-Critical : 0.000
Upper Critical : na
Upper Non-Recoverable : na
Assertion Events :
Assertions Enabled :
So here the Upper Non-Critical threshold is lower than the Lower Non-Critical threshold, rendering any reading out of range. However, ipmitool sdr list shows:
CPU 1 VCore | 1.32 Volts | ok
In this case, however, there are no assertions enabled. Is that why the status is "ok"? What am I to make of these seemingly weird thresholds? Is there any documentation to explain what unc- could mean? Is it a problem specific to CentOS 4 IBMs or maybe these sensors in particular?