[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] Managing evictions and reruns



On 2/5/2013 7:53 AM, Brian Pipa wrote:
ok, more info...

I just ran the master job and it was suspended. It eventually
restarted and 4 of the worker jobs were suspended/evicted. The only
thing this my condor setup is being used for is for my code, all jobs
are of equal rank, submitted by the same person, and all running in
the java universe. I don't know what could be causing all of these
suspensions and evictions.

On a dedicated node you'd normally use

SUSPEND = FALSE
PREEMPT = FALSE
KILL = FALSE

in condor_config.local.

HTH,
Dimitri