From our side the same problem, we have around 50 of these sensors, all of them did go into error after installing update 14.4.12.3510+ (overal status failed)
For the sensors which were monitoring linux systems it was sufficient to clear the iml log via the ilo console, this cleared the down status for these sensors.
For the sensors which are installed on windows systems, clearing the IML logs doesn't seem to solve the issue, neither does restarting the HP Systems Management Agents.
On all affected sensors installed on windows systems the CPU Fan has a status of "other", I don't know however if this is causing the "down status" of the sensor.
Paessler mentioned that they have reworked the sensor to update the behaviour with the latest HP update, so we have done a test case, installing the latest proliant support pack on an HP Proliant ML 110, the installed version of the HP systems management agents is 10.0.0.0
Installing the updated proliant support pack, including the updated HP Systems management Agents did not resolve the issue.
We performed a second test by removing the sensor on the server and re-adding it, and then the situation gets even worse.
Before removal we had the following channels with status:
CPU Fan Status 3 Other Other Other
Disk Controller Status 41 OK OK OK
Downtime -4
Fans Broken 5 0 # 0 # 0 #
Fans Running 4 0 # 0 # 0 #
Fault Tolerant Fans Broken 7 0 # 0 # 0 #
Fault Tolerant Fans Running 6 6 # 6 # 6 #
Overall Status 0 Failed OK Failed
Power Consumption 1 33 55 W 50 W 90 W
Power Consumption 1 (%) 34 12 % 11 % 20 %
Power Consumption 2 37 65 W 60 W 100 W
Power Consumption 2 (%) 38 14 % 13 % 22 %
Power Supply 1 Condition 36 OK OK OK
Power Supply 1 Status 35 No Error No Error No Error
Power Supply 2 Condition 40 OK OK Failed
Power Supply 2 Status 39 No Error No Error General Failure
System Fan Status 2 OK OK OK
Temperature 01(ambient) 8 24 °C 18 °C 27 °C
Temperature 02(cpu) 9 40 °C 40 °C 40 °C
Temperature 03(cpu) 10 40 °C 40 °C 40 °C
Temperature 04(memory) 11 27 °C 23 °C 36 °C
Temperature 05(memory) 12 27 °C 22 °C 36 °C
Temperature 06(memory) 13 31 °C 26 °C 38 °C
Temperature 07(memory) 14 32 °C 27 °C 40 °C
Temperature 08(powerSupply) 15 43 °C 39 °C 45 °C
Temperature 09(powerSupply) 16 35 °C 31 °C 37 °C
Temperature 10(system) 17 43 °C 38 °C 46 °C
Temperature 11(system) 18 32 °C 28 °C 36 °C
Temperature 12(system) 19 39 °C 35 °C 44 °C
Temperature 13(ioBoard) 20 32 °C 27 °C 35 °C
Temperature 14(ioBoard) 21 33 °C 29 °C 36 °C
Temperature 15(ioBoard) 22 32 °C 28 °C 35 °C
Temperature 19(system) 23 22 °C 18 °C 26 °C
Temperature 20(system) 24 28 °C 24 °C 31 °C
Temperature 21(system) 25 28 °C 24 °C 31 °C
Temperature 22(system) 26 29 °C 24 °C 31 °C
Temperature 23(system) 27 38 °C 33 °C 40 °C
Temperature 24(system) 28 33 °C 28 °C 35 °C
Temperature 25(system) 29 32 °C 28 °C 34 °C
Temperature 26(system) 30 32 °C 28 °C 34 °C
Temperature 29(storage) 31 35 °C 35 °C 35 °C
Temperature 30(system) 32 67 °C 59 °C 70 °C
Thermal Status
After the removal and re-addition of the sensor, only the following channels appeared again, all without status:
CPU Fan Status 3 Other Other Other
Disk Controller Status 8 OK OK OK
Downtime -4
Fans Broken 5 0 # 0 # 0 #
Fans Running 4 0 # 0 # 0 #
Fault Tolerant Fans Broken 7 0 # 0 # 0 #
Fault Tolerant Fans Running 6 0 # 0 # 0 #
Overall Status 0 Failed Failed Failed
System Fan Status 2 OK OK OK
Thermal Status
We hope that paessler can repair this sensor, as in our environment this was one of the key sensors to justify the investment on this product.
Add comment