The customer experienced a problem where uncontrolled application work caused an overflow of the OSF command Queue set at 1700 commands.
They currently have a max of 15 OSF class servers. We discussed that they could increase the max number OSF servers to 30 for each class of server (TSL and TSP) where by they could have a total of 90 available servers. We also observed that some of the offending exec's ran up to 12 minutes long, which they explained was the nature of the application which uses a common dataset that every exec needs access to at some point in the code.
Since they can not 'throttle' when the exec's are invoked, the next best item would be to increase the max number of servers possible to 30 AND start making use of the TSL strictly for this application.
We also discussed increasing the OSFQUE from 1700 to 3400, this does not really address the real problem but have the effect of 'pushing' the problem further ahead in time. This is a small hit of 1 meg additional storage consumption in the OPSMAIN asid with doubling this Queue.