
[HTCondor-users] Shadow Exception after 3 hours of running!



Hi All,

Recently, the jobs that I submit to Condor have been getting held after 3 hours (or more) of running.

I looked at the log file and it says:

...
007 (46992.000.000) 01/30 20:07:10 Shadow exception!
Error from slot1@xxxxxxxx: STARTER at xxx.xx.xxx.xx failed to send file(s) to <xxx.xx.xxx.xxx:xxxxx>; SHADOW at xxx.xx.xxx.xxx failed to write to file C:\condor/spool\6992\0\cluster46992.proc0.subproc0.tmp\_condor_stdout: (errno 2) No such file or directory
72922408  -  Run Bytes Sent By Job
232107  -  Run Bytes Received By Job
...
012 (46992.000.000) 01/30 20:07:10 Job was held.
Error from slot1@xxxxxxxx: STARTER at xxx.xx.xxx.xx failed to send file(s) to <xxx.xx.xxx.xxx:xxxxx>; SHADOW at xxx.xx.xxx.xxx failed to write to file C:\condor/spool\6992\0\cluster46992.proc0.subproc0.tmp\_condor_stdout: (errno 2) No such file or directory
Code 12 Subcode 2
...

Any ideas why this happens, and how to solve it?

The jobs ran fine under Condor until 2 days ago, and they still run fine when I run them manually on my PC.
By the way, I am the admin user of the Windows-based PC that submits the jobs to Condor.

Regards
Mosy