[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Condor-users] FEATURE QUESTION: Re-submitting using 'on_exit_remove' but for a limited number of re-tries



In the documentation there is:

As another example, if your job should only leave the queue if it exited on its own with status 0, you would use this on_exit_remove _expression_:

         == False) && (ExitCode == 0)

If the job was killed by a signal or exited with a non-zero exit status, Condor would leave the job in the queue to run again.

 

I have a job with both real and intermittent failure modes. I would like to have a counter on the resubmission – e.g. re-submit up to a maximum of 5 times. This would allow working-through the intermittent failures but would not cause an infinite loop with the real failures.

 

Does such a feature exist?

 

Thanks,

 

Etan

 

 

-----------------------------------------------

Etan G. Cohen

etan.cohen@xxxxxxxxxxxx

(949) 433 1811