After Upgrade, all Windows 2008/Windows 2008 R2 HP Health monitoring went down only with the message "Failed". A local upgrade of HP Service Pack (2014.09) did not resolve the issue. All servers up-to-date in term of Windows Upates. Please advise.
HP Health monitoring after 14.4.12.3284+
Votes:
0
Best Answer
Votes:
1
Hi Hermanbrood,
the OID we used in the previous version did not work anymore in our labs after updating to the latest software version from HP. So we worked on the sensor and realized that we actually did not use the proper OID before. Now PRTG displays the same status as the HP software does (at least in the versions that we have in our labs).
The overall status is not something that PRTG is making up, but it is directly read and displayed from the target system.
If you don't like the status as it is you can edit your sensor to ignore the "red" status by changing the channels settings as described below. But we strongly recommend to not do so because you will be ignoring the status information that HP gives you.
To redefine the way the sensor displays the status you can edit the "lookup file" that is used for the respective sensor channel:
- Go to the \lookups folder in your PRTG program directory and copy the file prtg.standardlookups.hp.status.ovl into the \lookups\custom folder.
- Open your copy in an editor.
- In the lookup definition, change state="Error" to state="Warning" and save your settings.
- This will cause the sensor to show a yellow warning status instead of a red down status.
- As long as your custom file exists your customizations will be handled preferably by PRTG instead of the original lookup settings.
- For details, see PRTG Manual: Define Lookups, section "Customizing Lookups".
Hope this helps!
Dirk
Created on Oct 29, 2014 8:58:38 AM by
Dirk Paessler [Founder Paessler AG]
(11,025)
●3
●6
Last change on Oct 29, 2014 10:35:55 AM by
Daniel Zobel [Product Manager]
10 Replies
Votes:
0
I have the same issue. Everything else works fine, but "SNMP HP System Health" for three different HP servers is now status "Failed" but it looks like all of the channels are good status (so no reason to show failed)
Votes:
0
Same for us, all "SNMP HP Proliant System Health" sensors showing as failed after the update to 14.4.12.3284.
Votes:
0
Same for us as well. Most are working, some don't (even some probes checking 3 server, 2 are working, 1 not). Doesn't seem to be a consistency on the servers affected. Mix of 2003 and 2008. Gen5 through to Gen8.
Votes:
0
For issues on SNMP HP System Health Sensors after the update to version 14.4.12, please be aware that the sensor checks a different OID as of this version. Please see the details in this KB-Article: Why is my SNMP HP Proliant System Health sensor in error status after updating PRTG?
You can also change the settings for this sensor, see below.
Created on Oct 24, 2014 7:20:15 AM by
Torsten Lindner [Paessler Support]
Last change on Oct 29, 2014 9:00:21 AM by
Dirk Paessler [Founder Paessler AG]
(11,025)
●3
●6
Votes:
0
After following some the aforementionned advices, I did a SNMP GET on the OID .1.3.6.1.4.1.232.6.2.11.2.0 with a result of "4" which apparently points to the IML logs.
I did clear that log and the HP Health alarm in PRTG also cleared.
Follow-up question:
1. Why this new behavior after release 14.4.12.3284+ since the IML log entries were pre-dating my original PRTG installation (By a couple of years!!!). There was nothing "recent" in that log.
Unless I'm mistaken, these alarms should have shown themselves the very first time I would have polled these devices? No ? Anyhow, all is good now. BTW, there is usually a link to "Cpqimlv.exe" called "HP ProLiant Integrated MAnagement Log Viewer" that gives access to clearing the log without the CLI.
Votes:
0
We implemented this based on the HP recommendation though and we hope that you will understand, that we had to decide it like this.
HP provides several different "state variables" and also one for the "overall state". This overall state shows the "worst" of the other states and does not take into account which one it is. The Overall State would switch to "failed" if for example a fan or power supply failed or a temperature was too high.
You can edit the status in case by changing the respective entry in the lookup file used by the sensor. For detailed information, please have a look at https://www.paessler.com/manuals/prtg/define_lookups.htm
Votes:
0
Ok the change in functionality is good, i checked a few of our servers and sure enough there were uncleared events in the IML. After clearning the events from IML the status returned to normal. Perhaps the error message could be updated to be more descriptive. The current message "Error in Overall Status: 'Failed'" has no clues.
Votes:
0
We're sorry but changing the error message is not possible because it is generated by the value lookup system of PRTG.
Created on Oct 27, 2014 11:55:53 AM by
Johannes Herrmann [Paessler Support]
(1,360)
●2
●2
Last change on Oct 27, 2014 4:06:32 PM by
Johannes Herrmann [Paessler Support]
(1,360)
●2
●2
Votes:
0
The noise this sensor now generates, not based on the current state of the system, but based on OLD information in de Integrated Management Logs made us decide to not monitor the HP ProLiant Health using PRTG any more.
What's next, overall failed status for Windows machines because I did not cleanup the system event logs??
I totally do not understand why Paessler made this move, a sensor based on logs? No!! A sensor needs to be based on live status information of a system. That's exactly why we chose to use PRTG, because of the live info.
This is a bad decision, it's forcing us to use other monitoring software to do the trick PRTG did until now.
Bottom line: If the overall status in HP Integrated Management is ok, then PRTG should be also, and that is currently not the case!
Votes:
1
Hi Hermanbrood,
the OID we used in the previous version did not work anymore in our labs after updating to the latest software version from HP. So we worked on the sensor and realized that we actually did not use the proper OID before. Now PRTG displays the same status as the HP software does (at least in the versions that we have in our labs).
The overall status is not something that PRTG is making up, but it is directly read and displayed from the target system.
If you don't like the status as it is you can edit your sensor to ignore the "red" status by changing the channels settings as described below. But we strongly recommend to not do so because you will be ignoring the status information that HP gives you.
To redefine the way the sensor displays the status you can edit the "lookup file" that is used for the respective sensor channel:
- Go to the \lookups folder in your PRTG program directory and copy the file prtg.standardlookups.hp.status.ovl into the \lookups\custom folder.
- Open your copy in an editor.
- In the lookup definition, change state="Error" to state="Warning" and save your settings.
- This will cause the sensor to show a yellow warning status instead of a red down status.
- As long as your custom file exists your customizations will be handled preferably by PRTG instead of the original lookup settings.
- For details, see PRTG Manual: Define Lookups, section "Customizing Lookups".
Hope this helps!
Dirk
Created on Oct 29, 2014 8:58:38 AM by
Dirk Paessler [Founder Paessler AG]
(11,025)
●3
●6
Last change on Oct 29, 2014 10:35:55 AM by
Daniel Zobel [Product Manager]
Add comment