[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] RANK question; slots unclaimed on a single computer



On 8/8/2014 1:02 PM, Ralph Finch wrote:
Ben and Todd, thanks much for the helpful replies.

A little more explanation:

We have a single Windows7x64 HTC pool of about 90 cores among about 20
deskside machines. Most have 4 cores, but two have 16 cores, and those two
have clock speeds of 2.2 and 3.3 GHz, enough to make a difference in job
speed. Typical jobs are 1000+ at a time, and each one takes 5-10 mins to
finish. Of the 90 cores, up to 75-80 are usable, the remaining don't have
enough disk space or their HTC install is boogered and I need to fix that.

We've noticed that as the last core is claimed and run on the 4-core
machines, interactive use becomes fairly dismal.


Some additional random thoughts re using the last core problem, besides using RANK -

+ Your start expression on your execute machine could only allow Slot1 to be claimed during the night and/or weekends, when interactive use is unlikely, and/or your preempt expression could kick off the job on Slot1 when keyboard activity is detected.

+ You could permanently reserve one core for interactive use by the owner with
    NUM_CPUS = $(DETECTED_CPUS) - 1

regards,
Todd