[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] rebooting a submission node






> if you do a graceful shutdown (SIGTERM, condor_off, etc), condor tries
> to evict jobs, and in the case of vanilla jobs, that means killing
> them and restarting them.

Why are vanilla jobs killed if the submit machine is down? I thought that
vanilla jobs don't communicate (no remote system calls) with the submit
machine during execution. The only critical time I can think of is when a
job ends and the result should be transfered. But why kill a job which is
running for days because the submit machine is only down for a few minutes?

regards,
Matthias