
Re: [condor-users] What happens when a machine is removed from the grid?




If a machine is removed from the grid while a job is running on it (say, due
to a hardware failure), when and how is that job moved to another machine?

While one of your jobs is running, it is monitored by a condor_shadow process. When the shadow detects a failure like this, it puts your job back into the queue on your machine to be run again. From that point on it looks mostly like any other queued job, so it will be matched and re-run as soon as a machine is available.
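
To make the sequence concrete, here is a rough toy model in Python. It is only a sketch of the behaviour described above, not Condor's actual code: the function name run_with_requeue, the machine names, and the idea of passing in a set of "failed" machines are all made up for illustration, and real matchmaking is far more involved than taking the next machine from a list.

```python
from collections import deque


def run_with_requeue(jobs, machines, failed_machines):
    """Toy model: match queued jobs to machines, requeueing on machine failure.

    Hypothetical helper for illustration only; it does not call any Condor API.
    """
    queue = deque(jobs)          # jobs waiting to be matched
    available = deque(machines)  # machines that can still accept a job
    placements = {}              # job -> machine on which it completed
    history = []                 # (job, machine, outcome) in the order it happened

    while queue:
        if not available:
            raise RuntimeError("no machines left to match against")
        job = queue.popleft()
        machine = available.popleft()
        if machine in failed_machines:
            # The monitoring process (the shadow, in Condor's case) notices the
            # failure. The job returns to the queue and the machine is not reused.
            history.append((job, machine, "requeued"))
            queue.append(job)
        else:
            history.append((job, machine, "completed"))
            placements[job] = machine
            available.append(machine)  # machine is free for the next match

    return placements, history


placements, history = run_with_requeue(["job1"], ["nodeA", "nodeB"], {"nodeA"})
for job, machine, outcome in history:
    print(job, machine, outcome)
```

In this run job1 is first matched to nodeA, which is marked as failed, so it goes back into the queue and is then matched to nodeB like any other waiting job.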


-alain



Condor Support Information:
http://www.cs.wisc.edu/condor/condor-support/
To Unsubscribe, send mail to majordomo@xxxxxxxxxxx with
unsubscribe condor-users <your_email_address>