On 3/31/21 1:55 PM, templon@xxxxxxxxx wrote:
What is the corresponding simplest way to achieve exactly this in HTCondor?
Note the word âexactlyâ :)
The answer was the
condor_draincommand, but it does not achieve exactly this, without a bit more.
condor_drainalso evicts running jobs from slots, depending on what the value of
MaxJobRetirementTimeis. I did not know about this variable so we did not have it set, and aside from nodes not accepting new jobs (the question), they stopped running the already-running jobs - not the desired behavior.