TrueSight Operations Management Suite offers many features which will help ensure availability in your environment. One of the main features is high availability options. Does TrueSight offer high availability within its components? It sure does.
Many of the components used within the TrueSight Operations Management suite are HA capable. There have been improvements in the overall HA configuration and functionality and we continue to strive to make it easier for customers to setup an HA environment. When deciding whether to implement HA you would need to determine which components will be able to support the HA setup. The TrueSight Operations Management Suite has HA options for TrueSight Presentation Server, TrueSight Infrastructure Management, App Visibility Manager, Real End User Experience Monitoring Software Edition, and Atrium Single Sign On.
Are you familiar with HA? - High availability (HA) is a redundancy operation that automatically switches to a standby server if the primary server fails or is temporarily shut down for maintenance. HA enables the TrueSight component to continuously function so that monitoring of your mission-critical systems is continuously available. HA utilizes a load balancer software component as an HA proxy to switch operations between the primary and standby server.
What does the TSPS HA architecture consist of?
The HA architecture for the Presentation Server as well as Infrastructure require a primary server, secondary server, and a load balancer. It’s that simple. See the image below for the Presentation Server HA architecture
- Primary server
- Secondary server
- Load balancer (HAProxy)
Why is a load balancer required?
In a successful HA deployment, the secondary server must take over when the primary server is not working, or the primary server is ready to take over, a load balancer is required to direct the client requests to the active server. The load balancer will act as a unified point of access for the Presentation Server.
When do I have to configure the HA setup? What if I installed all components already?
Customers have two options for configuring an HA environment:
- Deploying in HA mode during the installation process
- Deploying in HA mode post-installation
Both options are user friendly and both options allow for easy configuration of primary and secondary presentation servers. See this link for more details.
How is the transfer of the service between the active and standby Presentation Server handled?
In the Presentation Server HA mode, the secondary server becomes the active server if the primary server stops operating, due to an event that triggers a server shutdown. Once the primary server is up and running, it does not become the active server by default. The primary server is still in a standby mode. The service can be transferred back to the primary server, or the primary server can remain in standby mode.
Note: Do not stop the active server immediately after restarting the standby server. The cache synchronization takes time to complete and is dependent on your environment.
Is there a way to tell which server is the primary/active server on TSPS?
tssh ha status command provides information about the primary and secondary servers and the current status.
From the <Presentation Server installation directory>\truesightpserver\bin directory, run the
tssh ha status command. An example command output for the primary server is provided below:
What are some important limitations to note regarding the Presentation Server HA environment?
- If the host name of the Presentation Server is mapped to the loopback address in the etc/hosts file, HA will not work. Ensure that the hosts file is not updated with the loopback address
- If the primary and secondary servers are both disconnected from the network, you must restart both servers, and set one server to the active mode and the other server to standby mode.
- The primary and secondary server must have identical setups, including the hardware platform, operating system, and post-installation configuration
- The Infrastructure Management PATROL Repository is not replicated. Import the repository on both servers. For more information, see Importing the Infrastructure Management repository
TrueSight Infrastructure Management contains lots of moving parts, what can you tell me about the HA structure and requirements?
Consider the following requirements before you configure HA for Infrastructure Management.
TrueSight Operations Management component
Third-party load balancer
What does the Infrastructure Management HA architecture look like?
The TSIM HA setup is similar in design to the TSPS setup in that there is a primary server, secondary server, and a required load balancer. BMC does not provide a load balancer so this would be a third party load balancer.
HA deployment for Infrastructure Management
In an HA deployment of Infrastructure Management, the load balancer is installed on a separate server and redirects requests to the active node. The load balancer provides a single point of access to the HA-enabled Infrastructure Management server. For details on installing and configuring the Infrastructure Management server for application high-availability review the details here
When we have TSIM running in ha mode how do we configure the publishing server and how does CMDB manage the cell communication?
You configure TSIM in HA mode without consideration of CMDB. When you configure the CMDB integration in the TSIM Ops Console you just point to the CMDB server hostname or load balancer name if in AR server group. This updates the pserver.conf file in %MCELL_HOME%\etc. This is also done initially during installation of TSIM, see "Preparing for BMC Atrium CMDB integration".
Is it possible to install TrueSight Infrastructure Management server in an HA OS cluster mode?
Yes, it is possible, please see this link for more details.
I am planning to upgrade TSIM which is currently in HA mode, what do I need to know to upgrade?
Upgrading in HA cluster mode requires a two-part upgrade: one on the primary node and the other on the secondary node, which serves as the secondary node in the event of a failover. Click here for more details on the TSIM upgrade information. Click here for the TSPS upgrade information
How do I deploy other Infrastructure Management components in HA mode? For example, Data collection Integration Services, Staging Integration Service, Event management cell, PATROL Agent, etc?
The details for each component are listed here.
Note: It is important to review each component and its active/standby configuration as some components are active/passive and some are not.
If I have setup an HA TrueSight Operations Management environment and want to integrate other components, are there special instructions to consider?
Yes, if you have configured TrueSight Infrastructure Management for high availability, and are integrating it with the following products, you will need to perform the procedures detailed here. There are details for each of the components such as BMC Service Resolution, Atrium CMDB, Impact Integration Web Services server, My IT, and TrueSight IT Data Analytics.
Can the App Visibility Manager be configured for HA?
Yes it can. An App Visibility Manager HA deployment comprises the following systems, each on a separate computer:
- Active node
- Standby node
- Third-party load balancer
The following diagram shows more than one load balancing server, but you can configure one load balancer to manage the failover for all the different components. All communication to the App Visibility server components goes through that component's load balancer. The diagram shows only one App Visibility collector cluster; your system can have many.
HA deployment for App Visibility Manager
In an HA deployment of App Visibility Manager, the load balancer is on a separate computer and redirects requests to the active server. In this way, the load balancer can provide a single point of access to the App Visibility server components.
- When you install the App Visibility server, you can enable high availability for the App Visibility portal, App Visibility collector, or both.
- To deploy an HA component, install the component twice to create a failover cluster, which is a pair of servers that work together to maintain high availability.
- When you install a component, enable HA and provide details for its alternate node. Then run the installation utility on the second computer, enable HA, and provide details for its alternate node (that is, the first component).
- The computers on which you install the component pairs must have the same hardware and operating system configuration.
- The first server that joins the failover cluster is the active node, and its pair becomes the standby node.
- When you install the App Visibility collector, provide details of a third-party load balancer for the collector pair. The load balancer information is required to establish communication between the App Visibility agents and the collector.
Where can I learn more about the high availability options for Real End User Experience Software Edition?
The details of HA configuration for this integration are located here. For the Real User Analyzer, do not configure the isolation response to switch power off. Also, BMC recommends that you test all failover targets to ensure that any Real User Collector feeds are available to the Real User Analyzer.
Is there a best practice document for TrueSight HA?
Yes, please see “Using ASSO with TrueSight FAQ including High Availability Best Practice (V1.0)” - Using ASSO with TrueSight FAQ including High Availability Best Practice (V1.0)
Is it possible to install BMC TrueSight 10 SSO High availability components on the same servers where TSPS is also installed in an HA fashion?
For details, please visit our Community Link - https://communities.bmc.com/docs/DOC-80597
What is one issue that you have seen repeatedly about High Availability setups?
One issue we do see periodically is a failure of the database service to startup after a failover. This issue is mentioned under our Troubleshooting page found here.
This blog post cannot possibly address all questions surrounding HA configuration in TSOM, but if you do have any questions or need help with HA setup, please contact Support for assistance as we will be glad to help you with your HA concerns.
New Content for our Knowledge Base
Here are a few of the newly created Knowledge Articles over the last month:
We’re back with our monthly blog and we’re open to hearing your thoughts on topics and areas of concern which may be of benefit to you! Post your comments regarding what you want to see!