Mailing List Archives
Public Access
|
|
|
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[HTCondor-users] CLAIM_WORKLIFE - a couple of questions
- Date: Tue, 05 Nov 2019 11:44:35 +0100 (CET)
- From: "Beyer, Christoph" <christoph.beyer@xxxxxxx>
- Subject: [HTCondor-users] CLAIM_WORKLIFE - a couple of questions
Hi,
I do have a coupls - OK 2 questions about the CLAIM_WORKLIFE:
- Is it possible to define the CLAIM_WORKLIFe per slot as in: 'SLOT_TYPE_1_CLAIM_WORKLIFE = 60' ?
- second = more complicated ;)
We do have occasionally jobs that fail within a second or two on a specific workernode without being recognized as a failed job by htcondor. In this case the slot attracts hundreds more of these jobs of the user over the 20 minute period time that all fail.
Hence it would be useful to zero the CLAIM_WORKLIFE in case of job-failure (maybe this is the case already ?) and it would be equally useful to compute the CLAIM_WORKLIFE more individually/dynamic like '10 times the runtime of the first job- max 1200 sec' etc. I had a short look into the condor-toolbox but it seemed difficult to realize with ship's ressources ?
Best
Christoph
--
Christoph Beyer
DESY Hamburg
IT-Department
Notkestr. 85
Building 02b, Room 009
22607 Hamburg
phone:+49-(0)40-8998-2317
mail: christoph.beyer@xxxxxxx