KBI 310896 Issue Addressed: Argent Console Motor Does Not Fire Alerts While Another Motor Is Offline

Version

Argent Advanced Technology 3.1A-1401-E or below

Date

Wednesday, 26 Mar 2014

Summary

In rare cases, an offline Argent Console Motor can cause other running Motors to fail to fire alerts

When inspecting the Argent Console service log, the customer can see lines similar to the following repeated over and over:

23 Mar 2014 06:40:51.208 ISMON1 CORP\admin_sched Potential DeadLock Detected
Thread (Id=0X1870) has waited for more than 310 seconds trying to lock the resource
                        Lock Name: AAC_IncJobNumber
                        Acquiring: AAC_ProcessAlert.cpp#127
                        Used by: AAC_ProcessAlert.cpp#127
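
To check how often this warning appears, the service log text can be scanned with a short script. The following is a minimal sketch only: it assumes the log entries have the same layout as the excerpt above, and the function name find_deadlock_warnings and the regular expressions are illustrative, not part of any Argent tool.

```python
import re

# Assumption: every log entry starts with a timestamp like "23 Mar 2014 06:40:51.208".
STAMP = re.compile(r"^\d{1,2} \w{3} \d{4} \d{2}:\d{2}:\d{2}\.\d{3}\s")
# Assumption: the warning header is "<timestamp> <host> <account> Potential DeadLock Detected".
HEADER = re.compile(
    r"^(?P<stamp>\d{1,2} \w{3} \d{4} \d{2}:\d{2}:\d{2}\.\d{3})\s+"
    r"(?P<host>\S+)\s+(?P<account>\S+)\s+Potential DeadLock Detected"
)
WAIT = re.compile(r"waited for more than (\d+) seconds")
LOCK = re.compile(r"Lock Name:\s*(\S+)")


def find_deadlock_warnings(log_text):
    """Return one dict per 'Potential DeadLock Detected' entry found in log_text."""
    warnings = []
    current = None
    for raw_line in log_text.splitlines():
        line = raw_line.strip()
        header = HEADER.match(line)
        if header:
            # Start collecting details for a new warning entry.
            current = {
                "stamp": header.group("stamp"),
                "host": header.group("host"),
                "wait_seconds": None,
                "lock_name": None,
            }
            warnings.append(current)
            continue
        if STAMP.match(line):
            # A different, unrelated log entry begins here.
            current = None
            continue
        if current is None:
            continue
        wait = WAIT.search(line)
        if wait:
            current["wait_seconds"] = int(wait.group(1))
        lock = LOCK.search(line)
        if lock:
            current["lock_name"] = lock.group(1)
    return warnings
```

Passing the contents of the service log to find_deadlock_warnings returns a list that can be counted or filtered by lock name, for example to confirm that the repeated warnings all refer to AAC_IncJobNumber.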

Technical Background

If the offline Motor shut down unexpectedly, for example because of a power outage, while it was holding the cluster lock, the lock is not released until that Motor is back online

In this rare situation, a running Motor experiences a long delay every time it checks the lock status

Combined with a slow network connection and slow SQL operations during this period, the total time to acquire the lock can exceed the deadlock detection threshold (300 seconds by default). When that happens, the running Argent Console recycles and fails to fire alerts
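
The effect of a stale lock combined with slow status checks can be illustrated with a small simulation. This is a hedged sketch and not Argent's implementation: the function wait_for_lock, its parameters, and the per-check costs are hypothetical. Only the 300-second default threshold comes from the text above. Time is simulated, so nothing actually sleeps.

```python
DEADLOCK_THRESHOLD_SECONDS = 300  # default threshold stated in this article


def wait_for_lock(lock_free_after, check_cost, threshold=DEADLOCK_THRESHOLD_SECONDS):
    """Simulate polling a cluster lock until it is free or the threshold is passed.

    lock_free_after: simulated seconds until the holder releases the lock,
                     or None if the holder is offline and never releases it.
    check_cost:      simulated seconds each status check takes
                     (network plus SQL latency; a made-up value).
    Returns (acquired, waited_seconds).
    """
    waited = 0.0
    while waited <= threshold:
        waited += check_cost  # every status check adds its own latency
        if lock_free_after is not None and waited >= lock_free_after:
            return True, waited
    # The accumulated wait passed the threshold without getting the lock.
    return False, waited


# Healthy cluster: the lock is released after 2 simulated seconds.
print(wait_for_lock(lock_free_after=2, check_cost=0.5))
# Holder is offline and each check is slow: the wait passes the threshold.
print(wait_for_lock(lock_free_after=None, check_cost=31))
```

In the second call the lock is never released, so the accumulated check time passes the threshold, which corresponds to the condition under which the article says the running Argent Console recycles.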

The issue is addressed in Argent AT 3.1A-1401-T5

Resolution

Upgrade to Argent Advanced Technology 3.1A-1401-T5 or later