Testing failover


BMC recommends that the Cluster Administrator simulate a failure of the active node and issue the commands needed to perform a 'switchover' and a 'switch back', to validate the solution.

Validation should include the following steps:

  1. Shut down the active node using a normal or forced power-off.
  2. Mount all file systems on the passive node.
  3. Configure the virtual IP address.
  4. Start the database and BDA services.
  5. Verify that BDA works correctly by using the GUI.
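
Steps 2 to 4 above could be sketched as an ordered command plan to run on the passive node. This is only an illustration, not a BMC-supplied procedure: the mount points, virtual IP address, interface name, and service names are hypothetical placeholders, and it assumes that the file systems have /etc/fstab entries and that the database and BDA services are managed through systemctl, which may not match your installation.

```python
import shlex
import subprocess

# All values below are hypothetical placeholders; substitute your site's own.
MOUNT_POINTS = ["/app/bda", "/app/bda-db"]      # assumed to have /etc/fstab entries
VIP_CIDR = "192.0.2.10/24"                      # documentation-range address
INTERFACE = "eth0"
SERVICES = ["example-database", "example-bda"]  # placeholder unit names, database first


def switchover_plan(mount_points, vip_cidr, interface, services):
    """Return the passive-node commands in the order the steps list them."""
    plan = [["mount", mp] for mp in mount_points]                   # step 2
    plan.append(["ip", "addr", "add", vip_cidr, "dev", interface])  # step 3
    plan += [["systemctl", "start", svc] for svc in services]       # step 4
    return plan


def run_plan(plan, dry_run=True):
    """Print each command; execute it only when dry_run is False."""
    for cmd in plan:
        print(shlex.join(cmd))
        if not dry_run:
            subprocess.run(cmd, check=True)  # stop at the first failure


if __name__ == "__main__":
    # Dry run by default: the commands are printed, not executed.
    run_plan(switchover_plan(MOUNT_POINTS, VIP_CIDR, INTERFACE, SERVICES))
```

Keeping the plan as data and defaulting to a dry run lets the Cluster Administrator review the exact commands before running them as root during the test.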

Validation should always include the following tests to ensure that switchover will be successful:

  1. Validate the HA installation resources and configuration.
  2. Test the storage-level resources. Does the mount succeed? Do file permissions map to existing UIDs and GIDs on the passive node?
  3. Verify the daemon resources. Check the logs and ensure that all appropriate processes are running.
  4. Complete the GUI-level validation last. Can the end user connect through the VIP? Are Agents coming back online in a timely fashion? This should not take more than a few minutes, as discussed in Overview of HA solution.
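
Some of the questions in tests 2 and 4 can be checked with a short script on the passive node. The following is a minimal sketch that uses only the Python standard library; the function names are illustrative, and the mount point, host name, and port you pass in are site-specific values that this document does not define.

```python
import grp
import os
import pwd
import socket


def _uid_known(uid):
    """True if uid resolves to a user on this node."""
    try:
        pwd.getpwuid(uid)
        return True
    except KeyError:
        return False


def _gid_known(gid):
    """True if gid resolves to a group on this node."""
    try:
        grp.getgrgid(gid)
        return True
    except KeyError:
        return False


def ownership_maps(path, uid_known=_uid_known, gid_known=_gid_known):
    """True if the owner UID and GID of path both exist on this node."""
    st = os.stat(path)
    return uid_known(st.st_uid) and gid_known(st.st_gid)


def storage_ok(mount_point):
    """True if mount_point is mounted and its ownership maps locally."""
    return os.path.ismount(mount_point) and ownership_maps(mount_point)


def vip_reachable(host, port, timeout=5.0):
    """True if a TCP connection to host:port succeeds within timeout seconds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False
```

For example, `storage_ok("/app/bda")` and `vip_reachable("bda-vip.example.com", 443)` would cover one mount point and the GUI endpoint, where both the path and the host name are hypothetical. A successful TCP connection shows only that the VIP answers; logging in through the GUI and confirming that Agents are back online still has to be done by hand.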

Switching back to the primary node from the secondary node

The switch-back procedure is the same as the switchover, with the roles of the two nodes reversed. Perform the same validations afterward.
