[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] Job matched with a machine and not able to execute is Idle forever



On 03/09/2010 08:39 AM, Johnson koil Raj wrote:
> Hi,
> 
>    I submitted a job it got matched with 2 machines(A,B), the job is
> about to execute in machine A but due to some user issue it is not able
> to continue on that machine A and job back into idle state and jobAds
> JobRunCount is increasing. so I stopped condor in machine A. Now the job
> matched machine B and get execute there.
> 
> why condor can't do it automatically this after certain number of
> failures to execute in a machine.
> 
> some configuration is already there to do that.

Are you already using LastRemoteHost in your requirements to avoid machine A?

Best,


matt


> by
> Johnson
> 
> 
> 
> 
> Please do not print this email unless it is absolutely necessary.
> The information contained in this electronic message and any attachments
> to this message are intended for the exclusive use of the addressee(s)
> and may contain proprietary, confidential or privileged information. If
> you are not the intended recipient, you should not disseminate,
> distribute or copy this e-mail. Please notify the sender immediately and
> destroy all copies of this message and any attachments.
> WARNING: Computer viruses can be transmitted via email. The recipient
> should check this email and any attachments for the presence of viruses.
> The company accepts no liability for any damage caused by any virus
> transmitted by this email.
> www.wipro.com
> _______________________________________________
> Condor-users mailing list
> To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx with a
> subject: Unsubscribe
> You can also unsubscribe by visiting
> https://lists.cs.wisc.edu/mailman/listinfo/condor-users
> 
> The archives can be found at:
> https://lists.cs.wisc.edu/archive/condor-users/