[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Condor-users] Strange hold event



Hi,

Once in a while I find jobs in the queue in held state, with the following hold reason in the job classad:

lastHoldReason = "Error from computerName.domain.local: STARTER at 192.168.0.149 failed to send file(s) to <192.168.0.96:39609>; SHADOW at 192.168.0.96 failed to write to file /opt/condor/spool/8901/0/cluster7198901.proc0.subproc0.tmp/_condor_stderr: (errno 2) No such file or directory"

What might cause this? Where to look for some more information about it?

Using condor 7.5.5 on Ubuntu on both the starter, the scheduler and the central manager.

Cheers,
Szabolcs