[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] requirements question and job run count



Any thoughts? 



On Thu, Jan 23, 2014 at 7:09 PM, Rita <rmorgan466@xxxxxxxxx> wrote:

I am trying to setup a policy in my submit file where I want to avoid a host which I recently ran on (blackhole problem, https://htcondor-wiki.cs.wisc.edu/index.cgi/wiki?p=AvoidingBlackHoles). I am running 7.6


So, I have something like this


job_machine_attrs = Machine

job_machine_attrs_history_length = 3


Requirements = \

       ( (target.machine =!= MachineAttrMachine0) && ( (JobRunCount =?= Undefined) || (JobRunCount <=2 ) )) \

         && \

       ( (target.machine =!= MachineAttrMachine1) && ( (JobRunCount =?= Undefined) || (JobRunCount <=2 ) )) \

         && \

       $(Requirements)


periodic_hold = JobRunCount > 5



I would like the job to: run on the bad host no more than 2 times and it select a different host and if the total run count goes greater than 5 HOLD the job.  

For whatever reason the jobs runs on the host more than 2 times. Is there something I am doing wrong?


--
--- Get your facts first, then you can distort them as you please.--



--
--- Get your facts first, then you can distort them as you please.--