The other day I stopped started the PRTG services on the Primary node of my PRTG cluster. The Secondary node became active, starting to do the alerting, nice an smoothly.
When I started the PRTG services on the Primary node again, PRTG immediately failed back to this node. However, it took the node over 30 minutes to check all the sensors, resulting in a lot of Business Service sensors to go red.
Is there a way to disable the manual fail-back of the cluster? Because if I can manually fail back, I can do this when the Primary node has checked all sensors again.
I know I can manually fail-over to the Secondary node if I need to do maintenance on the Primary node. But I'm now talking about the Primary node for example crashing (BSOD) and automatically restarting. I really don't want this to cause an alert storm due to not all sensors checked yet on the Primary node in case of automatic fail-back...
Kind regards,
Corné van den Bosch
Add comment