The purpose of Job & Automation Monitoring is to provide transparency about the current status of automation processes regarding execution status, application status, start delay and run time. It supports the monitoring of automation processes (for simplification called “jobs” in the later text) that run on the different platforms with a unified user experience using a common look-and-feel and handling pattern.
The basic concept is to collect individual job execution data into the central monitoring application, to correlate it to the related job definition and evaluate every execution using historical data. The rating of the last execution is propagated to the job level and finally to the service level so that you can understand easily the current status of the jobs executed. In case of issues with job executions, context-based navigation is offered from the aggregate to the individual job execution and finally a direct navigation to the corresponding job instance in the executing service to support a more detailed analysis, e.g. in order to stop a not ending execution or to restart a failed execution.
Job & Automation Monitoring shall help IT and Business users to understand the success of the automation processes they are responsible for, cross all involved cloud services and systems.
On top of showing the current status and indicate exceptional situations, job analytics is offered for the different monitored aspects (e.g. runtime and exception trend reporting) so that critical trends can be discovered and downtimes avoided.
Job & Automation Monitoring in SAP Cloud ALM collects data for these job types:
Job & Automation Monitoring in SAP Cloud ALM collects data for these solutions and products.
The Overview provides a summary status regarding job execution for the services in scope. It shows the status of the latest execution of every job with regards to the Execution Status, Application Status and Run Time. On top it displays the number of alert situations, i.e. jobs for which an alert had been created, that is not yet confirmed.
Clicking on card Automations with Technical Issues you are navigated to the list of jobs sorted by Execution Status. If you click on Automations with Application Issues or Automations with Performance Issues you are forwarded to the same list, sorted accordingly. On clicking on Automations with Open Alerts you are navigated to the list of open alerts.
If you directly navigate to the Monitoring view, then you get a status summary per cloud service, i.e. the status of the latest execution of every job with regards to the Execution Status, Application Status, Start Delay and Run Time. On top you see the number of monitored jobs and the number of alert situations.
By clicking on the Service Name, you are navigated to the list of jobs for a service sorted by Execution Status rating in a descending order. You see also ratings regarding Application Status, Start Delay, and Run Time. The summary rating of a job is the latest non-grey rating of the respective aspect. If you click on any of the rating icons, you are navigated to the list sorted by the corresponding aspect.
By clicking on a name, you can view the list of executions of the selected job sorted by planned start time.
By clicking on the information icon next to each execution, you can view further details and navigate directly to the job in the managed cloud service or on-premise system* by clicking on the Run ID value.
*Note: If you use a SAP Web Dispatcher please maintain the URL of the Web Dispatcher as Logon URL for the respective System in SAP Cloud ALM Landscape Management.
In Monitoring in the job list view you can filter the list for specific jobs and have the option to define this list as favorite. After you have created the favorite by pressing the * icon, go to the home page and set a speaking title and subtitle (via the star icon) for the card.
From the favorite card you can forward navigate to monitoring, namely to the list of jobs that you have defined as favorite.
Use the Analysis view to identify those jobs that have the longest response times and highest number of exceptions. A trend icon indicates, if the failure rate or response time is up or down in the time frame selected taking the mid point of the time frame as comparison point.
For any job it offers aggregated information on total and average run time, number of executions and number of failed executions (regarding Execution Status or Application Status) and the resulting failure rate. By clicking on the arrow at the end of every line, you can drill down into various trend charts for the metrics.
Use the Exceptions view to check for exceptions of the services and systems in scope.
To get exception data collected in general please follow the instructions provided in Integration & Exception Monitoring. In case of issue with the view, please report it via component.
To get exception data collected related job exceptions, no additional configuration is needed as those are collected together with the job execution data. Currently the following job types are supported:
On click on the rating icon you are forward navigated to the list of exceptions related to the job execution.
*Prerequisite: ST-PI 7.40 SP20 or higher is implemented in the managed on-premise system and in /n/sdf/alm_setup use case Exception Monitoring is activated.
Use the Alerting view to check the job execution related alerts for the services in scope. You can configure to get alerted, if a job has a red or yellow Executions Status, a red or yellow Application Status, if its Run Time or Start Delay exceeds a defined threshold.
As you can adjust the Alert Name you can indicate what you have configured, e.g. the filters used. In the alert Object Details the name of the job is contained. In the alert list UI, you can filter for the content of the object details, i.e. the job name.
Alerts are raised per job and alert type and contain all failed executions of a specific job. From the Run Id value, you can navigate to the corresponding job execution in the managed cloud service, just like from the job execution popup in the monitoring application.
Please note, that a downstream action (like email) is triggered when there is a change in the event rating (green to yellow / red, yellow to red / green or red to yellow / green). This means also, that no new notification is sent on consecutive failures of the same job as the rating does not change.
Below we list the most important use cases that Job & Automation Monitoring offers: