KBI 311452 Argent Jobs Suddenly Fail Or Appear To Fail

Version

Argent Job Scheduler – Any Version

Date

Monday, 5 September 2016

Summary

Everything in the world is interconnected, and this is especially true of any distributed Job Scheduler

If Jobs suddenly start to fail after running happily for months or years, the first place to look is the network

Argent has seen a London bank apply an emergency Microsoft patch at 11:23 p.m., and a few minutes later all Jobs started to fail

Of course, the patching team did not think it worth telling the operations team, who were running 20,000 batch Jobs on 17 Argent Queue Engines

Results?

Chaos

Technical Background

The best and fastest way to test?

One of the built-in Windows troubleshooting tools, Perfmon, is perfect for this test

Performance Monitor, or Perfmon, is a simple tool for viewing performance data on local and remote Windows machines

Log on as the Argent Service Account on the Central Job Scheduling machine, start Perfmon, and check the CPU time on each of the remote Argent Queue Engines

Argent and Perfmon use the same technology – if Perfmon fails to connect, so will Argent
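
With many Queue Engines, the same check can be scripted with typeperf, the command-line counterpart to Perfmon that ships with Windows. The Python sketch below is only an illustration: the host names are hypothetical, and it assumes typeperf returns a non-zero exit code when it cannot read the counter, so read the printed result with that in mind. Run it as the Argent Service Account on the central Job Scheduling machine

```python
import subprocess

# Hypothetical host names; replace with your own Argent Queue Engines.
QUEUE_ENGINES = ["QE-LON-01", "QE-LON-02"]


def counter_path(host):
    """Return the counter path for total CPU time on a remote host."""
    return rf"\\{host}\Processor(_Total)\% Processor Time"


def typeperf_command(host, samples=1):
    """Build a typeperf command line that reads the counter `samples` times."""
    return ["typeperf", counter_path(host), "-sc", str(samples)]


def check_engine(host, timeout=30):
    """Return True if typeperf ran and exited with code 0 for this host.

    Assumption: a non-zero exit code means the counter could not be read.
    """
    try:
        result = subprocess.run(
            typeperf_command(host),
            capture_output=True,
            text=True,
            timeout=timeout,
        )
    except (OSError, subprocess.TimeoutExpired):
        # typeperf is missing (for example, not on Windows) or it hung.
        return False
    return result.returncode == 0


if __name__ == "__main__":
    for engine in QUEUE_ENGINES:
        status = "OK" if check_engine(engine) else "FAILED"
        print(f"{engine}: {status}")
```

Any Queue Engine reported as FAILED is worth checking by hand in Perfmon before looking at Argent itself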

Resolution

To Connect To A Remote Computer With Perfmon

  1. On the central Job Scheduling Engine click Start, click in the Start Search box, type Perfmon, and press ENTER
  2. In the navigation tree, right-click Performance and choose Connect to another computer
  3. In the Select Computer dialog box, type the name of the Remote Queue Engine you want to connect to, or click Browse to select it from a list
  4. Click OK
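
If the connection in step 4 fails, a plain TCP test from the same machine can help separate a name resolution or firewall problem from a permissions problem. The Python sketch below is a rough aid, not an Argent tool. The port is an assumption: remote performance counters are commonly read through the Remote Registry service over SMB (TCP 445), but confirm the ports in use with your network team

```python
import socket

# Assumption: TCP 445 (SMB), commonly used for remote performance counter access.
DEFAULT_PORT = 445


def can_connect(host, port=DEFAULT_PORT, timeout=5.0):
    """Return (True, "") if a TCP connection opens, else (False, reason)."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True, ""
    except OSError as exc:
        # OSError covers DNS failures, refused connections, and timeouts.
        return False, str(exc)
```

For example, can_connect("QE-LON-01") (a hypothetical host name) returns a success flag and, on failure, the operating system's reason. A successful TCP connection only shows that the network path is open, so Perfmon can still fail because of permissions or a stopped service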