Issue:
Error Message :
No Error.
Patch level detected:Dollar Universe 6.0.00
Product Version: Dollar.Universe 6.0.0
Description :When checking a job launched from a logical queue to a remote physical queue, sometimes jobs can turn to aborted on logical queue's node while still running on remote node.
Meanwhile, Job is seen as "aborted" on mother node even if the process is still running on remote node.
Cause:
Cause type:
Defect
Root Cause: Any network issue when mother node with the logical queue was checking the status of a job on a remote DQM physical queue led to the job taking status "aborted".
Resolution:
A retry has been added, with number of other improvements in DQM logical-physical queue management. (reducing to number of needed check, etc.)
Fix Status: Released
Fix Version(s):
Component: Application.Server
Version: Dollar.Universe 6.0.0