
[HTCondor-users] Limiting max number of running jobs for a group



Dear all,

This is my first message to the list, so I'll start by introducing myself :-) I am writing from the CIEMAT institute in Madrid, Spain, where we have recently installed an HTCondor cluster (with an HTCondor-CE in front of it). We're still in the testing phase, but should be moving to production fairly soon. We'll be serving mostly (but not exclusively) the LHC CMS experiment.

So, moving on to my question... we've defined hierarchical dynamic group quotas with surplus allowed, which is nice because we want the minor groups to be able to use the farm if CMS is not running for some reason. However, we would also like to limit their expansion, so that they cannot occupy the whole farm (this would let CMS take the farm back more quickly when its jobs come back).
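
For reference, our quota setup looks roughly like the following (group names and numbers are simplified placeholders, not our actual values):

    GROUP_NAMES = group_cms, group_other
    # Dynamic (fractional) quotas: soft, fair-share-like targets
    GROUP_QUOTA_DYNAMIC_group_cms   = 0.9
    GROUP_QUOTA_DYNAMIC_group_other = 0.1
    # Let groups expand beyond their quota when there is idle capacity
    GROUP_ACCEPT_SURPLUS = True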

Naively, this would be like having both dynamic (soft, fair share-like) quotas and static (hard) quotas for some groups. But the manual says that if you define both dynamic and static quotas, the dynamic one is ignored.
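
Concretely, what I would naively like is something like this, with the dynamic quota acting as the fair-share target and the static quota as a hard cap (numbers just for illustration); but if I read the manual correctly, the static quota simply takes precedence and the dynamic one is dropped:

    GROUP_QUOTA_DYNAMIC_group_other = 0.1   # soft, fair-share-like target
    GROUP_QUOTA_group_other         = 200   # intended hard cap; per the manual, this makes the line above be ignored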

I have looked for a parameter along the lines of 'MAX_RUNNING_JOBS_PER_GROUP', but haven't found anything like that. I have also tried to encode some logic in the START expression using 'SubmitterGroupResourcesInUse', but it didn't work (I think that attribute is only available in the preemption policy expressions... and we don't allow preemption).
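
For completeness, what I tried in the START expression was roughly the following (a sketch only; the 200-slot cap and the SubmitterGroup check are just illustrative). It doesn't work, presumably because these attributes are only defined during negotiation (e.g. in PREEMPTION_REQUIREMENTS), not when the startd evaluates START:

    START = ($(START)) && ( (SubmitterGroup =?= "group_cms") || (SubmitterGroupResourcesInUse < 200) )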

We have worked around this by reserving some named nodes for CMS, but I am still curious whether there might be a less static solution to the problem, i.e. one not tied to a fixed set of nodes, but simply stating a maximum number of simultaneously running jobs for a group.
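
To be concrete, by "reserving named nodes" I mean a startd-side restriction on those machines, roughly of this form (group name simplified):

    # Local config of the nodes reserved for CMS
    START = ($(START)) && (TARGET.AcctGroup =?= "group_cms")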

Thanks for any hints. (And sorry if this question has already been answered on the list... I couldn't find it.)

Cheers,

    Antonio