[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] Startd on workers dies just after claiming job "error opening watchdog pipe"





Ian Stokes-Rees wrote:

I think there are multiple compounding problems here, and the disappearance of lock files is just one part (this may be due to tmpwatch).


Condor's LOCK configuration should not be pointed at a directory that is cleaned by tmpwatch. Condor should probably outright refuse to operate if LOCK=/tmp. Either that, or the procd should not use the lock directory as the location of the named pipe.

--Dan