New Question
 
 
PRTG Network Monitor

Intuitive to Use.
Easy to manage.

200.000 administrators have chosen PRTG to monitor their network. Find out how you can reduce cost, increase QoS and ease planning, as well.

Free PRTG
Download >>

 

What is this?

This knowledgebase contains questions and answers about PRTG Network Monitor and network monitoring in general. You are invited to get involved by asking and answering questions!

Learn more

 

Top Tags


View all Tags


Are there alternatives to the PRTG cluster when running a large installation?

Votes:

0

Your Vote:

Up

Down

I run a large PRTG installation with thousands of sensors in my network and want to set up a PRTG high availability cluster for failsafe monitoring. However, according to the PRTG system recommendations, more than 2,500 sensors in a cluster are not recommended and more than 5,000 sensors in a cluster are not officially supported.

How can I set up failsafe monitoring nevertheless in case my PRTG installation is too large for the PRTG cluster feature?

cluster large-installation prtg recommendations requirements

Created on Aug 9, 2017 1:04:05 PM by  Gerald Schoch [Paessler Support]



3 Replies

Accepted Answer

Votes:

0

Your Vote:

Up

Down

This article applies to PRTG Network Monitor 17 or later

PRTG High Availability Cluster for Large Installations

The PRTG cluster feature does not officially support more than 5,000 sensors, and we also do not recommend that you set up a cluster for more than 2,500 sensors. Please always keep in mind that monitoring traffic and load will be multiplied for each cluster node that you add. You might encounter performance issues in this case, but this also depends on your individual setup.

So, before creating a PRTG cluster with such a high number of sensors, please contact your presales team. Together we can discuss your options. You can find some alternatives to a cluster below.


Alternatives to a PRTG Cluster

The following alternatives neither replace nor provide equivalent features of a PRTG cluster. The goal is to give you some ideas that you can implement to help you quickly get your PRTG installation up and running (see also the Knowledge Base article My PRTG has crashed and I can't restart it anymore. What can I do?).

We distinguish two cases here:

  • PRTG is running on a real hardware server.
  • PRTG is running on a virtual machine.

PRTG on Real Server Hardware

If your PRTG installation is too large for a properly working cluster setup, you can alternatively implement the following approach to recover PRTG as fast as possible if it fails.

You will need to have two real servers, both must have PRTG installed. One will act as a "master node" and the second as a standby node. Keep the standby server up-to-date by regularly updating it to the same PRTG version as the master node.

The master node runs PRTG and monitors your infrastructure. The standby server will have PRTG installed, but the PRTG core server and its local probe services need to be stopped. Copy or synchronize all PRTG data like configuration files, monitoring data, and templates on the master node with the standby server on a regular basis. You can do this by using a custom script that only copies data that has changed since the last synchronization.

Note: Copying the files will require your master PRTG core server and its local probe services to be stopped.

To keep the offline time short, your script can proceed as follows:

  1. Stop the Windows services of the master PRTG core server and its local probe.
  2. Copy all relevant data to a specific location where the copy time is short.
  3. Start the services of the core server and its local probe.
  4. Compress the copied data, transfer it to the standby server, and decompress it in the correct PRTG folders.

You can use a freeware version of PRTG that monitors the status of the master core server. When the latter fails, you will be notified to trigger the standby PRTG server to start monitoring your infrastructure.

Some manual configuration will be necessary to configure your remote probes to send monitoring data to your new PRTG core server. You will also need to migrate your PRTG license from the old server to the new server.


PRTG on a Virtual Machine

When running PRTG in a virtual environment, you have two options to keep monitoring downtimes as low as possible.

Both approaches require actions from the PRTG administrator to recover the PRTG installation once it is down. Moreover, there will be a gap in the monitoring data due to the downtime.

1. Use Snapshots

The idea is to make VMware or Hyper-V snapshots of the virtual machine where the PRTG core server is running. The snapshot will contain the status of the virtual machine, disk data, and configuration at a given point in time. Take snapshots regularly and carefully because performance may degrade as more snapshots are taken.

If the virtual machine where PRTG is running crashes or fails, then you can restore it quickly from the latest snapshot.

2. Use a VM Backup

Hyper-V and VMware make it possible to have backups of virtual machines. The backup should contain the configuration, VM snapshots, and virtual hard disks used by the virtual machine.

If the virtual machine where PRTG is running crashes, then you can restore it from a backup copy.


More

Created on Aug 9, 2017 2:31:42 PM by  Gerald Schoch [Paessler Support]

Last change on Aug 10, 2017 9:38:48 AM by  Gerald Schoch [Paessler Support]



Votes:

0

Your Vote:

Up

Down

Hello @Gerald,

we are going into this procedure but our monitoring environment doesn't allow stopping the services on the "master node" because it will take 10-15 minutes and we just can't have 10k+ sensors without monitoring. We REALLY need to stop the services? Is there any impacts if we try to copy files to the "slave server" with both services active on master?

We need to make 2-4 copies a day, so stopping services 2-4 days is out of question.

Thanks!

Created on Jun 28, 2018 6:18:26 PM by  Mariows (0) 2

Last change on Jun 28, 2018 7:21:31 PM by  Luciano Lingnau [Paessler Support]



Votes:

0

Your Vote:

Up

Down

Hey Mariows,

there is no need to stop the Core Server service on the Master node. Instead, you can work with snapshots of the configuration file as well. This snapshot can be created under Setup >> System Administration >> Administrative Tools on the Master node.

Once you have done this, please copy the configuration file from the snapshot-zip and the monitoring data to the Failover node (here, the Core Server service must be stopped).

Best regards,
Sven

Created on Jun 29, 2018 12:59:25 PM by  Sven Roggenhofer [Paessler Technical Support]



Please log in or register to enter your reply.


Disclaimer: The information in the Paessler Knowledge Base comes without warranty of any kind. Use at your own risk. Before applying any instructions please exercise proper system administrator housekeeping. You must make sure that a proper backup of all your data is available.