Parameters for Application Check Metrics

You can specify the following parameters for Application Check metrics:

Collection Frequency
For Application Check metrics of type HTTP Health API and Prometheus, the collection frequency is the time interval (in minutes, hours, or days) at which a Simple Diagnostics Agent measures the metric. For example, if you set a collection frequency of one minute, the Simple Diagnostics Agent measures the metric once each minute.

Thresholds
A threshold defines when a metric will receive a certain rating, such as Info (blue), Ok (green), Warning (yellow), Critical (red), or Fatal (red). If no threshold is defined for a metric, Health Monitoring uses the rating for the metric reported by the cloud service.

The types of threshold available depend on the type of metric: 

  • Metric type: Status
    • Available thresholds: Info Only, Status
  • Metric type: Performance 
    • Available thresholds: Info Only, Status, Numeric, Numeric (Critical)
  • Metric type: Quota
    • Available thresholds: Info Only, Status, Numeric, Numeric (Critical)

The following threshold types are available:

  • Info Only: When you assign a threshold of the type Info Only to a metric, the rating of the metric that is reported by the cloud service is overwritten by the rating Info Only.
  • Status: When you assign a threshold of the type Status to a metric, the rating of the metric that is reported by the cloud service can be overwritten by a specified rating. For example, if the cloud service reports a Warning rating, you can specify in the threshold that the reported rating is overwritten by Critical.
    You can also switch on Use delay and specify a delay in minutes before the rating of the metric is set to Critical. This means the rating of the metric is not set to Critical as soon as the threshold is reached, but only after the specified delay.
  • Numeric: When you assign a threshold of the type Numeric to a metric, you specify a limit at or above which the rating of the metric is set to Warning and a second limit at or above which the rating of the metric is set to Critical. For each limit, you must also specify the condition for setting the rating – for example, if the value of the metric is below or above the limit. Additionally, you can switch on Use delay and specify a delay in minutes before setting the rating of the metric to Critical. This means the rating of the metric is not set to Critical as soon as the limit is reached, but only after the specified delay.
    When you specify a limit for a metric of the type Performance, you have to use the metric value when defining a limit.
    When you specify a limit for a metric of the type Quota, you can use its metric value, metric limit, or metric usage to define the limit.

Example of a Performance metric
The rating of the metric cpu.usage is set to Warning when the value of the metric is above 50%. The rating of the metric cpu.usage is set to Critical when the value of the metric is above 80%.

Example of a metric of the type Quota
The rating of the metric disk.usage is set to Warning when the usage is above 50%. The rating of the metric disk.usage is set to Critical when the usage is above 90%.

  • Numeric (Critical)
    This threshold type offers similar functionality to the threshold of the type Numeric, with one exception: You can specify only one limit at or above which the rating of the metric is to be set to Critical.

Alerts (optional)
When you don't maintain a threshold, an alert is generated when the cloud service reports a metric status red (critical or fatal). When you maintain a threshold for a metric, an alert is generated when the defined threshold is reached. Alerts are displayed in the Health Monitoring and Alert Management applications.

The following Alert settings are available:

  • Alert Active
    Set the switch to On to generate an alert when the metric rating is set to red. Set to Off to prevent alerts being generated.
  • Alert Name
    Specify a name for your alert. Be sure to use a descriptive name that others can easily understand.
  • Notification Variant
    Select an alert consumer variant of the type Notification to be informed when the metric turns red (for example, by e‑mail).
  • Outbound Variant
    Select an alert consumer variant of the type Outbound Integration to create a support ticket in a ticketing system, for example.
  • Alert Category
    Choose a category to categorize an alert so that you can display alerts by category in the Alert Management app.
  • Severity
    Select a severity value from 0 (Very Low) to 9 (Critical).
  • Separate by Label
    Create an alert for metrics with the specified label.
  •  Alert Documentation
    Describe the alert and provide any additional information that the user may need.

Alert Rules
An alert rule selects metrics of the destination. Select metrics by either their name or their label.
Use the Include table to select metrics for which alerts are created.
Use the Exclude table to select metrics for which no alerts are created.

 

Availability Alerts Generated from  Application Check Metrics

Availability alerts can be generated from  Application Check metrics (HTTP Health API and Prometheus). A metric is reported to Service Availability Management as an unplanned outage if the following prerequisites are met:

  • The metric has been assigned to a technical system.

An active service definition has been created for the technical system in Service Availability Management as explained here.