Use the following procedure to add a node to an active cluster. This procedure assumes that the cluster_admin, cluster_control, cluster_ha, and failsafe2 products are already installed on this node.
Check the control network connections from the node to the rest of the cluster using the ping command. Note the list of control network IP addresses.
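The connectivity check can be sketched as a small shell function. This is a minimal sketch, not part of FailSafe: the function name check_control_net is hypothetical, the addresses in the usage line are placeholders, and the -c option (send a fixed number of echo requests) is assumed to be supported by the local ping.

```shell
# Ping each control-network address once and report the result.
# Returns non-zero if any address did not answer.
check_control_net() {
    failed=0
    for addr in "$@"; do
        # -c 1 sends a single echo request (assumed to be supported by ping here).
        if ping -c 1 "$addr" >/dev/null 2>&1; then
            echo "ok:   $addr"
        else
            echo "FAIL: $addr"
            failed=1
        fi
    done
    return "$failed"
}
```

For example, `check_control_net 192.0.2.11 192.0.2.12` would print one line per address; replace the placeholder addresses with the control network IP addresses you noted.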
Check the serial connections to reset this node. Note the name of the node that can reset this node.
Run node diagnostics. For information on FailSafe diagnostic commands, see Chapter 8.
Make sure the sgi-cad, sgi-crsd, sgi-cmsd, and sgi-gcd entries are present in the /etc/services file. The port numbers for these processes must match the port numbers used on the other nodes in the cluster.
Example entries:
    sgi-cad     7200/tcp    # SGI cluster admin daemon
    sgi-crsd    7500/udp    # SGI cluster reset services daemon
    sgi-cmsd    7000/udp    # SGI cluster membership Daemon
    sgi-gcd     8000/udp    # SGI group communication Daemon
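The presence check can be automated with a short script. The following is a sketch under stated assumptions: the function names check_failsafe_services and list_failsafe_ports are hypothetical helpers, not FailSafe commands, and they only inspect the file you point them at.

```shell
# Report which of the four FailSafe service names are missing from a
# services file. Defaults to /etc/services; pass a path to check a copy.
check_failsafe_services() {
    file=${1:-/etc/services}
    missing=0
    for svc in sgi-cad sgi-crsd sgi-cmsd sgi-gcd; do
        # Match the service name at the start of a line, followed by whitespace.
        if ! grep "^${svc}[[:space:]]" "$file" >/dev/null 2>&1; then
            echo "missing: $svc"
            missing=1
        fi
    done
    return "$missing"
}

# Print "name port/protocol" for the four services, sorted, so that the
# output from different nodes can be compared line by line.
list_failsafe_ports() {
    awk '$1 ~ /^sgi-(cad|crsd|cmsd|gcd)$/ { print $1, $2 }' "${1:-/etc/services}" | sort
}
```

Running list_failsafe_ports on this node and on an existing cluster node, then comparing the two outputs, is one way to confirm that the port numbers match.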
Check if cluster processes (cad, cmond, crsd) are running.
    # ps -ef | grep cad
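The ps command above filters only for cad. To check all three processes named in this step, something like the following could be used. It is a sketch: the function name missing_cluster_procs is hypothetical, and it assumes each daemon appears in the ps -ef listing under its own name as the last component of the command path.

```shell
# Read `ps -ef` output on standard input and report which of the
# cluster daemons (cad, cmond, crsd) do not appear in it.
missing_cluster_procs() {
    ps_out=$(cat)
    status=0
    for proc in cad cmond crsd; do
        # Match the daemon name as a whole word: preceded by start of line,
        # a space, or a slash, and followed by a space or end of line.
        if ! printf '%s\n' "$ps_out" | grep -E "(^|[ /])${proc}( |\$)" >/dev/null; then
            echo "not running: $proc"
            status=1
        fi
    done
    return "$status"
}
```

Usage: `ps -ef | missing_cluster_procs`. No output and a zero exit status mean that all three names were found.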
If cluster processes are not running, run the cdbreinit command.
    # /usr/lib/failsafe/bin/cdbreinit /var/lib/failsafe/cdb/cdb.db
    Killing cdbd...
    Removing database header file /var/lib/failsafe/cdb/cdb.db...
    Preparing to delete database directory /var/lib/failsafe/cdb/cdb.db# !!
    Continue[y/n]y
    Removing database directory /var/lib/failsafe/cdb/cdb.db#...
    Deleted CDB database at /var/lib/failsafe/cdb/cdb.db
    Recreating new CDB database at /var/lib/failsafe/cdb/cdb.db with cdb-exitop...
    cdbd
    Created standard CDB database in /var/lib/failsafe/cdb/cdb.db
    Please make sure that "sgi-cad" service is added to /etc/services file
    If not, add the entry and restart cluster processes.
    Please refer to FailSafe administration manual for more information.
    Modifying CDB database at /var/lib/failsafe/cdb/cdb.db with cluster_ha-exitop...
    Modified standard CDB database in /var/lib/failsafe/cdb/cdb.db
    Please make sure that "sgi-cmsd" and "sgi-gcd" services are added to /etc/services file
    before starting HA services.
    Please refer to FailSafe administration manual for more information.
    Starting cluster control processes with cluster_control-exitop...
    Please make sure that "sgi-crsd" service is added to /etc/services file
    If not, add the entry and restart cluster processes.
    Please refer to FailSafe administration manual for more information.
    Started cluster control processes
    Restarting cluster admin processes with failsafe-exitop...
Use the cluster_mgr template (/usr/lib/failsafe/cmgr-templates/cmgr-create-node) or the cluster_mgr command to define the node.
Note: This node must be defined from one of the nodes that is already in the cluster.
Use the cluster_mgr command to add the node to the cluster.
For example, the following cluster_mgr commands add the node web-node3 to the cluster web-cluster:
    cmgr> modify cluster web-cluster
    Enter commands, when finished enter either "done" or "cancel"

    web-cluster ? add node web-node3
    web-cluster ? done
You can start HA services on this node using the cluster_mgr command. For example, the following cluster_mgr command starts HA services on node web-node3 in cluster web-cluster:
    cmgr> start ha_services on node web-node3 in cluster web-cluster
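When the same steps are repeated for several nodes, it can help to generate the cluster_mgr command text from the node and cluster names. The sketch below only prints the commands shown in the examples above; the function name print_add_node_cmds is hypothetical, and the script does not invoke cluster_mgr itself.

```shell
# Print the cluster_mgr commands that add a node to a cluster and then
# start HA services on it. Arguments: node name, cluster name.
# This prints text only; it does not run cluster_mgr.
print_add_node_cmds() {
    node=$1
    cluster=$2
    printf 'modify cluster %s\n' "$cluster"
    printf 'add node %s\n' "$node"
    printf 'done\n'
    printf 'start ha_services on node %s in cluster %s\n' "$node" "$cluster"
}
```

For example, `print_add_node_cmds web-node3 web-cluster` prints the four commands from the examples above, which can be reviewed before being entered at the cmgr> prompt.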
Remember to add this node to the failure domain of the relevant failover policy. To do this, you must redefine the entire failover policy, including the additional node in the failure domain.