Recently we had a problem with the failover node of our PRTG cluster, which caused it to be disconnected from the master node for about a week. After we turned the failover node on again, the cluster appeared to be back to normal (status was OK). After some time, though, I realized that it was no longer synchronizing all my changes. For example, I unpaused a sensor on the master node, but it stayed paused on the failover node, so the sensor's overall state remained paused.
I decided to disconnect the cluster completely, delete the configuration on the failover node, and reconnect it with a new cluster ID, which solved the problem. My question is about future situations like this one. Is there a maximum time a cluster can be in an error state before I should expect synchronization problems? How do I handle a cluster outage that lasts for more than just a few hours? Do you have any best practices regarding this topic?