Job runs Abort or IO Server is unresponsive for a limited time

Document ID : KB000084633
Last Modified Date : 14/04/2018
Show Technical Document Details
Issue:
Affects Release version(s): 5

Error Message :
The following kind errors have been found in the universe.log:
/uxord /u_io_callsrv_connect/000000000 - u_connect error : Errno syserror 145: Connection timed out (host [IP_SERVER])
/uxcal /u_io_callsrv_connect/000000000 - u_connect error : Errno syserror 145: Connection timed out (host [IP_SERVER])
/uxord /GASNDJ33 /134458554 - %UNI_-E-U_EGASNDJ3302, Error-Job submission
/uxjobinit /GAI08A33 /134452482 - %UNI_-E-U_EGAI08A3301, Exec Unknown
/uxjobinit /GAFIAP32 /134451498 - %UNI_-E-U_EGAFIAP3201, Management Unit unknown :
/uxjobend /uxjobend /000000000 - getenv S_PROCEXE Failed

Patch level detected:Dollar Universe 5.6
Product Version: Dollar.Universe 5

Description :During the execution of a particularly resource intensive maintenance job, all / some of the job runs abort for a limited amount of time. 
Some Dollar Universe commands may also abort during this period.
After the afore mentioned period, the execution of jobs resumes without error.
Environment:
OS: All
Cause:
Cause type:
Configuration
Root Cause: The afore mentioned errors are associated with the IO server being overloaded.
The variables Number of retries if a connection to the IO server fails with the U_CONNECT_ITER_NBMAX and the interval between the connection retries to the IO server. (seconds on UNIX and milliseconds on Windows) U_CONNECT_ITER_INTERVAL should be increased to improve IO availability.
This means that the clients of the IO server will try U_CONNECT_ITER_NBMAX times to connect itself to the server with a break of U_CONNECT_ITER_INTERVAL seconds between them.
Resolution:
Please modify the following parameters:

-In the $UXMGR/uxsetenv and $UXMGR/uxsetenv_ksh:
U_CONNECT_ITER_INTERVAL 2
export U_CONNECT_ITER_INTERVAL 2
U_CONNECT_ITER_NBMAX 6
export U_CONNECT_ITER_NBMAX 6

-In the $UXMGR/uxsetenv_csh:

setenv U_CONNECT_ITER_INTERVAL 2
setenv U_CONNECT_ITER_NBMAX 6

Fix Status: No Fix

Additional Information:
Workaround :
N/A