[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[HTCondor-users] startd statistics
- Date: Wed, 18 Mar 2020 12:10:51 +0100 (CET)
- From: "Beyer, Christoph" <christoph.beyer@xxxxxxx>
- Subject: [HTCondor-users] startd statistics
I hope you are all well and work from home, help flattening the curve of new infections - I bet someone is using HTC somewhere to fight corona by the way :)
Anyway - something completley different, I think for a while about establishing a kind of error counter for workernodes that come with the host-classadd as a ratio of successful/unsuccessful jobstarts/jobfinishes.
I would like to use the startd-cron feature and the local startd statisitics to calculate that number. Therefore I did set
STATISTICS_TO_PUBLISH = STARTER:2
But that is currently not leading to any helpful numbers using 'condor_status -l -startd' maybe I am on the wrong track here and someone did something similar using different tools ?
I think I could come up with something by going through the job history on the sched but that sounds a bit over-engineered as I suppose the startd should have some numbers that I could use ?
Building 02b, Room 009