[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[HTCondor-users] How to reserve a slot or machine for special jobs



Dear All,

i am following this receipe: https://htcondor-wiki.cs.wisc.edu/index.cgi/wiki?p=HowToReserveSlotForSpecialJobs

but for some reason the jobs are not getting through. Can you please help, below are debug logs

Regards,
Prem

Here is the negotiator logs showing the matching step. 

08/20/13 08:45:19   Negotiating with prem@xxxxxxxxxxx at <10.1.9.1:50738>
08/20/13 08:45:19 0 seconds so far
08/20/13 08:45:19   Calculating submitter limit with the following parameters
08/20/13 08:45:19     SubmitterPrio       = 0.500000
08/20/13 08:45:19     SubmitterPrioFactor = 1.000000
08/20/13 08:45:19     submitterShare      = 0.328903
08/20/13 08:45:19     submitterAbsShare   = 0.166667
08/20/13 08:45:19     submitterLimit    = 846.267970
08/20/13 08:45:19     submitterUsage    = 0.000000
08/20/13 08:45:19 Socket to prem@xxxxxxxxxxx (<10.1.9.1:50738>) already in cache, reusing
08/20/13 08:45:19     Sending SEND_JOB_INFO/eom
08/20/13 08:45:19     Getting reply from schedd ...
08/20/13 08:45:19     Got JOB_INFO command; getting classad/eom
08/20/13 08:45:19     Request 2649337.00000:
08/20/13 08:45:19 matchmakingAlgorithm: limit 846.267970 used 0.000000 pieLeft 2541.306023
08/20/13 08:45:19       Rejected 2649337.0 prem@xxxxxxxxxxx <10.1.9.1:50738>: no match found
08/20/13 08:45:19     Sending SEND_JOB_INFO/eom
08/20/13 08:45:19     Getting reply from schedd ...
08/20/13 08:45:19     Got NO_MORE_JOBS;  done negotiating
08/20/13 08:45:19   Submitter prem@xxxxxxxxxxx got all it wants; removing it.
08/20/13 08:45:19  resources used by prem@xxxxxxxxxxx are 0.000000


Here is the condor_q analyze report: I have set aside three machines for this purpose but all three reject.

-- Submitter: login1.xxx.xxx.xxx : <10.1.9.1:50738> : login1.xxx.xxx.xxx
---
2649337.000:  Run analysis summary.  Of 1597 machines,
   1594 are rejected by your job's requirements
      3 reject your job because of their own requirements
      0 match but are serving users with a better priority in the pool
      0 match but reject the job for unknown reasons
      0 match but will not currently preempt their existing job
      0 match but are currently offline
      0 are available to run your job
        No successful match recorded.
        Last failed match: Tue Aug 20 09:01:00 2013
        Reason for last match failure: no match found

The Requirements _expression_ for your job is:

( target.CAN_RUN_WHOLE_MACHINE ) && TARGET.IsChemSlot

    Condition                         Machines Matched    Suggestion
    ---------                         ----------------    ----------
1   TARGET.IsChemSlot                 39
2   ( target.CAN_RUN_WHOLE_MACHINE )  149