[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] Shadow Exception after 3 hours of running!



Did your spool directory run out of disk space?

- Ian


On Thu, Jan 31, 2013 at 1:47 AM, Mostafa.B <bakhtvar@xxxxxxxxx> wrote:
Hi All,

Recently, the jobs that I send to condor are held after 3 hours (or more) of run,

I looked at the log file and it says:

...
007 (46992.000.000) 01/30 20:07:10 Shadow exception!
Error from slot1@xxxxxxxx: STARTER at xxx.xx.xxx.xx failed to send file(s) to <xxx.xx.xxx.xxx:xxxxx>; SHADOW at xxx.xx.xxx.xxx failed to write to file C:\condor/spool\6992\0\cluster46992.proc0.subproc0.tmp\_condor_stdout: (errno 2) No such file or directory
72922408  -  Run Bytes Sent By Job
232107  -  Run Bytes Received By Job
...
012 (46992.000.000) 01/30 20:07:10 Job was held.
Error from slot1@xxxxxxxx: STARTER at xxx.xx.xxx.xx failed to send file(s) to <xxx.xx.xxx.xxx:xxxxx>; SHADOW at xxx.xx.xxx.xxx failed to write to file C:\condor/spool\6992\0\cluster46992.proc0.subproc0.tmp\_condor_stdout: (errno 2) No such file or directory
Code 12 Subcode 2
...

any ideas why this happens? and how to solve it?

The jobs were OK with condor until 2 days ago, even they are still OK when I run them manually on my PC.
by the way I am the admin user of the Windows based PC that is sending jobs to Condor.

Regards
Mosy


_______________________________________________
HTCondor-users mailing list
To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/htcondor-users/