[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] Shadow exception errors



On Monday 13 February 2006 12:01 am, Greg.Hitchen@xxxxxxxx wrote:
> Hi
>
> We have been setting up and experimenting with condor for a while
> and now have some "real" users onboard using the system.
>
> This user has submitted a number of jobs that keep trying to start,
> fail and start again. There are shadow execption problems and eviction
> problems. Just concentrating on the shadow exception problems for now
> I have including logs from the submitting machine and from 2 different
> execute machines.
>
> What problem is likely to cause these type of error messages?

The relevant lines are most likely these:

> 2/13 10:54:32 (7.0) (1268): Request to run on <130.116.147.52:9590> was
> ACCEPTED
> 2/13 10:54:45 (7.0) (1268): ReliSock: put_file: Failed to open file
> C:\Documents and Settings\odw010\.condorqueue\D78aUAA.egs, errno = 2.
> 2/13 10:54:45 (7.0) (1268): ERROR "DoUpload: Failed to send file
> C:\Documents and Settings\odw010\.condorqueue\D78aUAA.egs, exiting at
> 1398

The shadow get errno 2 "File not found" when trying to send the file "C:
\Documents and Settings\odw010\.condorqueue\D78aUAA.egs".  I'd start looking 
there...

Hope this helps

-Nick

-- 
           <<< Why, oh, why, didn't I take the blue pill? >>>
 /`-_    Nicholas R. LeRoy               The Condor Project
{     }/ http://www.cs.wisc.edu/~nleroy  http://www.cs.wisc.edu/condor
 \    /  nleroy@xxxxxxxxxxx              The University of Wisconsin
 |_*_|   608-265-5761                    Department of Computer Sciences