KBI 311444 Issue Addressed: When The Argent Job Scheduler Backup Node Goes Offline, The Primary Server Gets Hung
Version
Argent Job Scheduler 10.0A-1603-A and earlier
Date
Monday, 1 August 2016
Summary
Addressed the issue that Argent Job Scheduler Primary Server gets hung when the Argent Job Scheduler backup Node goes offline
Technical Background
Argent Job Scheduler Backup Node updates a heartbeat file in a network shared path to notify the Primary Engine about its service status
This path (Failure-proof Path Of Backup) and the interval at which the path is updated (Service Status Update Frequency) are configured as shown in the figure below
Argent Job Scheduler Main Server checks for the updated heartbeat file from Backup Node at an interval specified in ‘Service Status Checking Frequency’
If Failure-proof Path Of Backup is configured in the Backup Node, Primary Server gets hung when the Argent Job Scheduler backup Node goes offline
The shared path is accessed using Microsoft APIs, which goes unresponsive when the server is inaccessible, which in turn leads the Argent Job Scheduler main process to be unresponsive and Argent Job Scheduler Service restarts
This issue has been addressed by adding bullet proof to check if the shared path server is online before checking if the file has been updated by the Backup Node
Resolution
Upgrade to Argent Job Scheduler 10.0A-1607-A or later