[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] rebooting a submission node



On Tue, May 17, 2005 at 02:54:30PM -0400, Daniel Loughlin wrote:
> I'd like to take down a submission node that currently has a bunch of 
> condor_shadow processes. I am running 6.7.1 (Yes, I'm upgrading soon.). 
> If I condor_off this node, make my change, and reboot the machine, will 
> the condor_shadow processes respawn after the condor_master and schedd 
> startup after the reboot?
> 
> I also want to avoid having the condor_starter processes on the 
> execution nodes that communicate to condor_shadow on the submission node 
> die off. 
> 
> Basically I want to know if it possible to reboot a submission node 
> without causing headaches for the user in a production environment?
> 

Jobs that have been submitted with a JobLeaseDuration set will survive
a reboot of the submit machine. Anything jobs without the job lease duration
will be checkpointed (if possible) and then resumed or restarted when
the condor_schedd starts running again, but will not be running while 
the submit machine is down.

-Erik