[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [condor-users] why jobs are always evicted on remotes machines?



Are you sure you want Standard universe (with checkpointing) as opposed to Vanilla universe? Have you compiled and linked your job so that it can run in Standard universe?

Regards,
Alexander Klyubin

On 10/28/2003 04:19 PM, Dan Bradley wrote:
Habib,

********************************************************
my sub file have a simple structure, like:
********************************************************
universe       = standard
Executable     = /home/condor/test
initialdir     = /home/condor

transfer_executable = TRUE

ould_transfer_files = YES
when_to_transfer_output = ON_EXIT

Output        = out.$(process)
Log            = log.$(process)
Queue 15


*******************************************************



This is unrelated to your problem, but all of this transfer-files stuff is unnecessary in standard universe.


10/28 14:19:50 Process 13236 exited with status 129
10/28 14:19:50 EXEC of user process failed, probably
insufficient swap



This looks like the problem to me. As soon as the process starts up, it is exiting abnormally. Can you run the same executable by hand on that machine?


--Dan


Condor Support Information: http://www.cs.wisc.edu/condor/condor-support/ To Unsubscribe, send mail to majordomo@xxxxxxxxxxx with unsubscribe condor-users <your_email_address>

Attachment: pgp00011.pgp
Description: PGP signature