Re: [Condor-users] Computers missing from Condor pool



On Tue, Feb 26, 2008 at 03:34:39PM +0100, Rob de Graaf wrote:
> 
> The suggested fix, adding a delay by setting the D_NETWORK debug flag, 
> has been applied on all computers and has had some effect; the average 
> pool size has gone up, but not by as much as we had hoped, and ping 
> sweeps still reveal many more live machines not appearing in the pool, 
> leading us to believe there is still some other problem.
> 
> We've looked at master and startd log files but we haven't been able to 
> find anything seriously wrong, and we're running out of ideas.
> 
> What could be causing computers to sometimes be missing from our pool, 
> and what else can we do to find them?
> 

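Turn on TCP updates to the collector, instead of UDP. UDP updates can
be dropped silently on a busy network or a busy collector, and a machine
whose updates get lost never shows up in the pool even though it is up.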
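
A minimal sketch of the settings involved, assuming the knob names
UPDATE_COLLECTOR_WITH_TCP and COLLECTOR_SOCKET_CACHE_SIZE as described in
the Condor manual; check them against the version you are running. The
cache size of 1000 is only a placeholder, not a tuned value.

```
# On every machine that sends updates to the collector:
UPDATE_COLLECTOR_WITH_TCP = True

# On the central manager, so the collector can keep the TCP sockets open.
# Assumed value; it should be at least the number of daemons that update.
COLLECTOR_SOCKET_CACHE_SIZE = 1000
```

After changing the config, run condor_reconfig on the affected machines.
The collector may need a restart before the socket cache setting takes
effect.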

-Erik