[Condor-users] How can a user bypass the negotiator?



Hi all,

One of our "power" users just discovered something I don't
understand. Our cluster has many nodes and two submit machines, A and B,
with almost identical setups (HA). Box A is currently running the
negotiator. We only preempt backfill jobs, never user jobs;
negotiator_consider_preemption is even set to false.

Power user 1 (U1) submitted 10000 jobs on A; power user 2 (U2) submitted
a DAG on machine B that spawns many jobs. Currently the situation is as
follows:

U2
effective prio:   1623673.62
running jobs:     4114
idle:             13000

U1
effective prio:   33489760.00
running jobs:     2410
idle:             0
held:             12000
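
For readers less familiar with Condor priorities: a numerically smaller
effective priority is the better one, so U2 is well ahead of U1 here. A
minimal sketch of that comparison, assuming the negotiator simply
considers users in ascending order of this value. The function name
negotiation_order is hypothetical and not part of any Condor API, and the
real negotiator takes more into account than this single number.

```python
# Effective priorities copied from the listing above.
effective_prio = {"U1": 33489760.00, "U2": 1623673.62}


def negotiation_order(prios):
    """Return user names best-first (smallest effective priority value first).

    Hypothetical helper for illustration only; not a Condor function.
    """
    return sorted(prios, key=prios.get)


order = negotiation_order(effective_prio)
ratio = effective_prio["U1"] / effective_prio["U2"]

print("best-first order:", order)
print(f"U1's value is {ratio:.1f}x larger (worse) than U2's")
```

Under that assumption, U2's idle jobs should be offered resources before
any newly submitted job from U1.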

Now the fun part.

U1 submits 10 short-running jobs on both A and B. Since his priority is
much worse than U2's and CLAIM_WORKLIFE is set to zero, I would expect
none of his jobs to run. However:

on B:
3/10 17:06:13 (pid:24448) Negotiating for owner: U1
3/10 17:06:13 (pid:24448) Lost priority - 0 jobs matched

Looks good.

on A:
3/10 17:13:05 (pid:26999) Starting add_shadow_birthdate(541094.0)
3/10 17:13:05 (pid:26999) Started shadow for job 541094.0 on slot2@xxxxxxxxxxxxxxxxx <10.10.13.65:35449> for U2, (shadow pid = 28027)

[...]

However, user U1 never shows up in the negotiator.

Now my question: how can U1 bypass the negotiator and get ahead of U2's
jobs? Or maybe my understanding of Condor needs to be improved again ;)


Cheers

Carsten