Internode sessions experiencing communication delays after network issues

Document ID : KB000086122
Last Modified Date : 14/04/2018
Show Technical Document Details
Affects Release version(s): 5

Error Message :

2009-06-14 16:06:56 0005897/uxech /CALL_UXIOSRV /000000000 - u_io_callsrv(on node1,COMPANY1,A) returns error -1
br/> 2009-06-14 16:06:56 0005897/uxech /GAI90A32 /134455874 - %UNI_-E-U_EGAI90A3225, Network down, can't set packet N°
br/> 2009-06-14 16:06:56 0005897/uxech /u_io_callsrv /000000000 - u_connect error : Errno syserror 239: Connection refused (host [node1])

Patch level detected:Dollar Universe 5.6
Product Version: Dollar.Universe 5.6.0 FX25010

Description :During a network outage that lasts some time, the universe.log is flooded with exchanger error messages. After the network has recovered, cross node sessions start to run.However, the jobs that should run on remote nodes only start 20 minutes later.
OS: All
Root Cause: The reason for this delay is due to the exchanger data files are filled with pending requests. During the network outage, the DUAS is still working, and trying to send out several requests to remote nodes. However, it could not so more and more requests are accumulated in the exchanger data files.
This kind of delay should dissipate once the network is stable.

Fix Status: No Fix

Additional Information:
Workaround :