[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] Negotiator fails to match the jobs even when -better-analyze succeeds



Hi,

it seems the job conditions are not met by any slot, try:

condor_q -better-analyze 112848.0 -reverse -machine <fqdn>

to get an idea of what is going 'wrong' ...

Best
christoph


--
Christoph Beyer
DESY Hamburg
IT-Department

Notkestr. 85
Building 02b, Room 009
22607 Hamburg

phone:+49-(0)40-8998-2317
mail: christoph.beyer@xxxxxxx


Von: wangzqe@xxxxxxxxx
An: "htcondor-users" <htcondor-users@xxxxxxxxxxx>
Gesendet: Dienstag, 7. September 2021 22:08:10
Betreff: [HTCondor-users] Negotiator fails to match the jobs even when        -better-analyze succeeds

Dear condor experts,

Dear condor experts,

 

   I have encountered a condor error, which the submitted job finds running slots according to "condor_q -better-analyze" but Negotiator fails to find the match.  This happens when I tried to set up the bosco, but the error seems to come from the condor instead of bosco. The info from the condor_q -better-analyze, and a cycle of the negotiator is shown below. 


   Do you know what is the problem? The condor version is 8.8.1 


Sincerely,

Zhangqier Wang


condor_q -better-analyze 112848.0 -name t3home000.mit.edu

The Requirements _expression_ for job 112848.000 reduces to these conditions:

         Slots
Step    Matched  Condition
-----  --------  ---------
[0]           5  BOSCOCluster == "bosco_tier3@xxxxxxxxxxxxxxxxx/condor"
[2]           5  Arch == "X86_64"
[5]           5  TARGET.OpSys == "LINUX"
[7]           5  TARGET.Disk >= RequestDisk
[9]           5  TARGET.Memory >= RequestMemory
[11]          5  TARGET.HasFileTransfer


No successful match recorded.
Last failed match: Wed Aug 25 18:43:43 2021

Reason for last match failure: no match found 

112848.000:  Run analysis summary ignoring user priority.  Of 4 machines,
      0 are rejected by your job's requirements
      0 reject your job because of their own requirements
      0 match and are already running your jobs
      0 match but are serving other users
      4 are able to run your job


NegotiatorLog:

08/25/21 18:44:43 ---------- Started Negotiation Cycle ----------
08/25/21 18:44:43 Phase 1:  Obtaining ads from collector ...
08/25/21 18:44:43   Getting startd private ads ...
08/25/21 18:44:43   Getting Scheduler, Submitter and Machine ads ...
08/25/21 18:44:43   Sorting 11 ads ...
08/25/21 18:44:43 Got ads: 11 public and 0 private
08/25/21 18:44:43 Public ads include 1 submitter, 5 startd
08/25/21 18:44:43 Phase 2:  Performing accounting ...
08/25/21 18:44:43 Phase 3:  Sorting submitter ads by priority ...
08/25/21 18:44:43 Starting prefetch round; 1 potential prefetches to do.
08/25/21 18:44:43 Starting prefetch negotiation for analysis.wangzqe@xxxxxxx.
08/25/21 18:44:43     Got NO_MORE_JOBS;  schedd has no more requests
08/25/21 18:44:43 Prefetch summary: 1 attempted, 1 successful.
08/25/21 18:44:43 Phase 4.1:  Negotiating with schedds ...
08/25/21 18:44:43   Negotiating with analysis.wangzqe@xxxxxxx at <18.4.134.252:9620?addrs=18.4.134.252-9620+[2603-4000-486-1-1a66-daff-feee-c1dc]-9620&noUDP&sock=39910_84d6_3>
08/25/21 18:44:43 0 seconds so far for this submitter
08/25/21 18:44:43 0 seconds so far for this schedd
08/25/21 18:44:43     Request 112848.00000: autocluster 413 (request count 1 of 50)
08/25/21 18:44:43       Rejected 112848.0 analysis.wangzqe@xxxxxxx <18.4.134.252:9620?addrs=18.4.134.252-9620+[2603-4000-486-1-1a66-daff-feee-c1dc]-9620&noUDP&sock=39910_84d6_3>: no match found
08/25/21 18:44:43  negotiateWithGroup resources used submitterAds length 0
08/25/21 18:44:43 ---------- Finished Negotiation Cycle ----------

_______________________________________________
HTCondor-users mailing list
To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/htcondor-users/