When you upgrade FailSafe software in an active cluster, you upgrade one node at a time in the cluster.
The following procedure upgrades FailSafe on node web-node3.
If resource groups are online on the node, use a cluster_mgr command to move them another node in the cluster. To move the resource group to another node in the cluster, there should be another node available in the failover policy domain of the resource group.
The following cluster_mgr command moves resource group web-rg to another node in the cluster web-cluster:
cmgr> admin move resource_group web-rg in cluster web-cluster |
To stop HA services on the node web-node3, use the following cluster_mgr command. This command will move all the resource groups online on this node to other nodes in the cluster if possible.
cmgr> stop ha_services on node web-node3 for cluster web-cluster |
If it is not possible to move resource groups that are online on node web-node3, the above command will fail. You can use the force option to stop HA services in a node even in the case of an error.
Stop all cluster processes running on the node.
# /etc/rc.dinit.d/failsafe stop |
Perform the FailSafe upgrade in the node web-node3.
After the FailSafe upgrade, check whether cluster processes (cmond, cad, crsd) are running. If not, restart cluster processes:
# fsconfig failsafe on; /etc/rc.d/init.d/failsafe start |
Restart HA services on the node. The following cluster_mgr command restarts HA services on the node:
cmgr> start ha_services on node web-node3 for cluster web-cluster |
Make sure the resource groups are running on the most appropriate node after restarting HA services.