Host Template: Infrastructure - Cisco UCS - Rack
Overview Copied
Monitor your Cisco UCS Rack server with metrics for overall health, usage information and a detailed summary of faults. This Host Template only supports Rack Servers discovered and managed by a UCS Manager.
This Host Template includes the following Service Checks:
Service Check Name | Description | Default Thresholds (Metric: Warning, Critical) | UOM |
---|---|---|---|
UCS - Rack - Admin Power | Admin power for the Rack | N/A | N/A |
UCS - Rack - Admin State | Admin state for the Rack | N/A | N/A |
UCS - Rack - Association State | Association state of the Rack | N/A | N/A |
UCS - Rack - Checkpoint | Latest checkpoint of the Rack | N/A | N/A |
UCS - Rack - Connection Path | Connection path of the Rack | N/A | N/A |
UCS - Rack - Connection State | Connection state of the Rack | N/A | N/A |
UCS - Rack - Discovery State | Discovery state of the Rack | N/A | N/A |
UCS - Rack - Faults | Fault information for the Rack server | rack_critical_faults: N/A, rack_major_faults: N/A, rack_minor_faults: rack_warning_faults: rack_info_faults: N/A,N/A rack_cleared_faults: N/A,N/A |
N/A |
UCS - Rack - HBAs | Number of HBAs for the Rack | N/A | N/A |
UCS - Rack - Local Storage Health | Health information for the Local Storage on the Rack | N/A | N/A |
UCS - Rack - Local Storage Usage | Usage information for the local storage on the Rack | N/A | N/A |
UCS - Rack - Managing Instance | Managing instance of the Rack | N/A | N/A |
UCS - Rack - Memory Unit Health | Information on the memory units on the Rack | N/A | N/A |
UCS - Rack - NICs | Number of NICs for the Rack | N/A | N/A |
UCS - Rack - Operability | Operability of the Rack | N/A | N/A |
UCS - Rack - Operational State | Operational State of the Rack | N/A | N/A |
UCS - Rack - Operational State Qualifier | Operational State Qualifier of the Rack | N/A | N/A |
UCS - Rack - Power State | Power state of the Rack | N/A | N/A |
UCS - Rack - Presence | Presence of the Rack | N/A | N/A |
UCS - Rack - Processor Health | Information on the processors on the Rack | N/A | N/A |
UCS - Rack - Processor Usage | Processor usage of the Rack | cpu_usage: N/A,N/A cores_enabled: N/A, |
N/A |
UCS - Rack - PSUs | Information on the PSUs on the Rack | N/A | N/A |
UCS - Rack - Threads | Number of threads on the Rack | N/A | N/A |
Note
See details about Nagios thresholds here.
Usage Instructions Copied
Please review the overall Infrastructure - Cisco UCS Opspack documentation for details on prerequisites, known issues, and other necessary configurations that apply to all Host Templates in the Opspack.
Add this Host Template Copied
Add the Infrastructure - Cisco UCS - Rack Host Template to your Opsview Monitor host. If the resource you’re monitoring has no hostname or public IP, then open the Advanced settings pane and change Host Check Command to Always assumed to be UP.
Note
For more information, refer to the documentation on Adding Host Templates to Hosts.
Add and configure Variables Copied
The Service Checks in this Host Template use the following variables, and they will be added to your Opsview Monitor instance alongside the Host Template:
Note
For more information, refer to the documentation on Adding Variables to Hosts.
Apply Changes Copied
Apply Changes and the system will now be monitored:
Available Variables Copied
UCS_CREDENTIALS Copied
The Value is not used and therefore can be set to anything. Override the arguments with your credentials.
Plugin Parameter | Position in Variable | Name | Description |
---|---|---|---|
N/A (Opsview UI Only) | Value | UCS Credentials | The credentials for your Cisco UCS |
--api-server-url |
Arg1 | UCS API Server URL | The server URL for your Cisco UCS Manager or CIMC Supervisor |
--username |
Arg2 | UCS Username | The username for your Cisco UCS user |
--password |
Arg3 | UCS Password | The password for your Cisco UCS user |
UCS_RACK_NAME Copied
Plugin Parameter | Position in Variable | Name | Description |
---|---|---|---|
--rack-name |
Value | UCS Rack Name | Name of the Rack to monitor |
UCS_FAULT_SETTINGS Copied
The Value is not used and therefore can be set to anything. Override the arguments with your settings.
The fault severity must be one of; ‘critical’, ‘major’, ‘minor’, ‘warning’, ‘info’, ‘cleared’ (default: ‘major’).
The fault limit, limits the number of faults displayed (default: ‘10’). Leave this blank to display all faults.
Plugin Parameter | Position in Variable | Name | Description |
---|---|---|---|
N/A (Opsview UI Only) | Value | UCS Fault Settings | Optional configuration options for fault service checks |
--fault-severity |
Arg1 | UCS Fault Severity | The minimum severity level of faults to display |
--fault-limit |
Arg2 | UCS Fault Limit | The maximum number of faults to display |
SSL Certificates: UCS_CERTIFICATES Copied
Opsview Cloud
For Opsview Cloud customers, please contact ITRS Support for assistance with setting up certificate files, as they require Orchestrator access.
Additionally, you can use the UCS_CERTIFICATES variable to pass in your SSL certificates for use when connecting to the service provider to gather metrics:
The Value is not used and therefore can be set to anything. Override the arguments with your certificates.
Parameter | Position in Variable | Name | Description |
---|---|---|---|
--ca-path |
Arg1 | CA Certificate Path | Path to the CA Certificate |
--client-cert |
Arg2 | Client Certificate Path | Path to the Client Certificate |
--client-key |
Arg3 | Client Key Path | Path to the Client Key |
Plugin help Copied
Plugin: check_cisco_ucs
Secure Args
This plugin uses the Secure Args feature for argument passing.
For mode-specific help, run the plugin with the -h -m <mode>
flags. This will list the required and optional Variable Arguments for that mode.
The appropriate mode for each Service Check is listed here:
Service Check Name | Mode |
---|---|
UCS - Rack - Admin Power | UCS.Rack.Admin.Power |
UCS - Rack - Admin State | UCS.Rack.Admin.State |
UCS - Rack - Association State | UCS.Rack.Association |
UCS - Rack - Checkpoint | UCS.Rack.Checkpoint |
UCS - Rack - Connection Path | UCS.Rack.Conn.Path |
UCS - Rack - Connection State | UCS.Rack.Conn.State |
UCS - Rack - Discovery State | UCS.Rack.Discovery |
UCS - Rack - Faults | UCS.Rack.Faults |
UCS - Rack - HBAs | UCS.Rack.HBA |
UCS - Rack - Local Storage Health | UCS.Rack.Storage.Health |
UCS - Rack - Local Storage Usage | UCS.Rack.Storage.Usage |
UCS - Rack - Managing Instance | UCS.Rack.Managing.Inst |
UCS - Rack - Memory Unit Health | UCS.Rack.Memory.Units |
UCS - Rack - NICs | UCS.Rack.NIC |
UCS - Rack - Operability | UCS.Rack.Operability |
UCS - Rack - Operational State | UCS.Rack.Oper.State |
UCS - Rack - Operational State Qualifier | UCS.Rack.Oper.Qual |
UCS - Rack - Power State | UCS.Rack.Power |
UCS - Rack - Presence | UCS.Rack.Presence |
UCS - Rack - Processor Health | UCS.Rack.Processor.Health |
UCS - Rack - Processor Usage | UCS.Rack.Processor.Usage |
UCS - Rack - PSUs | UCS.Rack.PSU |
UCS - Rack - Threads | UCS.Rack.Threads |