High Availability Overview
Moogsoft Enterprise supports high availability (HA) architectures to improve the fault tolerance of Moogsoft Enterprise. Each component supports a multi-node architecture to enable redundancy, failover, or both to minimize the risk of data loss, for example, in the case of a hardware failure
This topic covers the architectures you can use to achieve HA with Moogsoft Enterprise. For an example of how to set up a single site HA system, see Fully Distributed HA Installation. See HA Reference Architecture for a detailed diagram of the components in a single site HA configuration.
Distributed HA architectures
Moogsoft Enterprise supports high availability in distributed architectures where different machines host a subset of the stack. You can run one or more of the server roles on its own machine.
See Server Roles for details of the HA architecture server roles in Moogsoft Enterprise.
If you run more than one server role on a machine, choose a primary role for the server. The primary role dictates which additional roles are supported on the machine as follows:
Primary Role | Supported Secondary Roles |
---|---|
Core | UI, Data ingestion and Database |
UI | Data ingestion |
Data Ingestion | UI |
Database | Redundancy |
Redundancy | Database |
See Scale Your Moogsoft Enterprise Implementation for information on how to increase capacity within the HA architecture, you can.
Contact your Moogsoft technical representative to discuss scaling your deployment.
See Sizing Recommendations for more information on hardware sizes and capacity.
After you decide on the best HA architecture for your environment, you can plan your implementation.
Resilience and failover
Moogsoft Enterprise provides support for automatic failover between the two nodes within an HA pair. For example from one instance of Moogfarmd to another, or from one instance of a LAM to another. When an active instance in an HA pair fails, Moogsoft Enterprise persists any affected messages. The passive instance of the HA pair automatically takes over, and processes those messages without any interruption or loss of data. See Message Persistence for more information.
There is no automatic failover between multiple HA pairs. For example, there is no failover from a primary site to a second site, such as a disaster recovery replica.
Moogsoft Enterprise does not support automated fail-back for any architecture. For example, consider an HA pair of Moogfarmd instances. When the instance of Moogfarmd in cluster 1 becomes unavailable, the instance in cluster 2 enters an active state. When the instance from cluster 1 recovers and becomes available, the instance in cluster 2 remains active.