Business Service Monitoring

Overview Copied

Business Service Monitoring, or BSM, allows you a much-enhanced view into your IT infrastructure – as opposed to looking at your infrastructure on a Host by Host basis. Your monitoring solution will understand resiliency, service / operational availability (SLA/OLA), and more.

This is the advanced monitoring of Hosts and Services in Opsview Monitor — allowing you to group similar Hosts into Components, set resiliency on these Components, and add multiple Components together in order to form a top-level business service.

You can also undertake actions against every layer of the business service, from the user interface. such as setting downtime, acknowledging problems, adding comments against business service’s, etc.

The purpose of BSM is to give real-world views where Hosts are grouped together into Components (i.e. h-scaled clusters), and Components are then grouped together to form the overall Business Service unit (i.e. all host and service that need to run so that, for example, my website is operational).

BSM diagram

Using BSM, you can have multiple Hosts of the same type grouped together in Components, and if you have configured the resiliency level correctly, you can allow one Host to fail but the Component to still be operational.

A Business Service can then be comprised of multiple Components, each with their own resiliency levels ’ giving a true end-to-end view of the Business Service, as shown below.

You can then undertake operations at a Component and a BSM level, such as:

Business Service Monitoring view Copied

In an Opsview Monitor system, you can have a website called ‘Website.com’ with six Components including an ‘Apache Servers’ cluster, ‘Linux cluster’, etc.

BSM Investigate tab

With BSM, you can now monitor and display your entire stack in a single view, so you can see ‘one Host has failed in the Linux cluster; it hasn’t affected my website yet but I will need to fix that soon.

Business Service monitoring is a terrific tool that will take existing Hosts, Services and Host templates and allow the creation of a hierarchy of Components and Business Services showing the relationship between Hosts and the Business Services they support, availability (SLA/OLA) at each layer, reporting, notifications, access control and more.

Component Hosts Copied

Note that for the purposes of BSM, the Host consists of the Service Checks related to the Host template used by the Component — the Host state is not taken into account.

The Host can be one of three calculated states:

Additionally, there is one calculated flag:

Note

The soft or hard state of the Service Check is not considered — the latest state is always used.

If you set DOWNTIME and there are no failed services, then an operational state is used. This is to cover scenarios where downtime of two hours is recorded, but only 15 minutes is used. This allows the Host to be marked as DOWNTIME only during the time there were actual failures.

It is possible that for a Host, the Service Checks are UNKNOWN yet the Host is DOWN. From a BSM perspective, the Host is considered to be OPERATIONAL because there are no CRITICAL Service Checks. This would be an error in the configuration as the Service Check should be CRITICAL to show a severe error.

Components Copied

This is calculated from the Component Host states and can be one of three calculated states:

Additionally, there are two calculated flags:

The operational zone percent is calculated as (hosts_required_online) / (hosts_total) x 100. If there are not enough operational Hosts, then the Component is failed. Hosts in DOWNTIME are not counted, but have the effect of making failed Hosts more important.

Note

Due to the operational zone percentage, it is possible that a Component is in an operational state with failed Hosts. If those failed Hosts are acknowledged, then the Component will also be acknowledged, so you could have an acknowledged icon on an operational Component.

Business Services Copied

This can be one of three calculated states:

Additionally, there are two calculated flags:

BSM Notifications Copied

The below diagram provides with details of when a user will or will not receive a notification based on status changes:

BSM Notifications

["Opsview"] ["User Guide"]

Was this topic helpful?