
Re: [Condor-users] "Over submitter resource limit (0) ... only consider startd ranks"



(A followup for the archives.) 

The problem disappeared after restarting the condor daemons on the
submitting machine.

P
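
For anyone who finds this thread in the archives: a minimal Python sketch for checking a negotiator log for the symptom quoted in the original message. It only looks for the three message shapes that appear in that excerpt ("Negotiating with ...", "Over submitter resource limit (N)", "Rejected ..."). The function name and the regular expressions are illustrative assumptions, not part of Condor, and other Condor versions may word these lines differently.

```python
import re
from collections import Counter

# Patterns based only on the log excerpt in this thread (an assumption:
# other Condor versions may format these messages differently).
SUBMITTER_RE = re.compile(r"Negotiating with (\S+) at")
LIMIT_RE = re.compile(r"Over submitter resource limit \((\d+(?:\.\d+)?)\)")
REJECT_RE = re.compile(r"Rejected (\d+\.\d+) (\S+)")


def summarize_negotiator_log(lines):
    """Scan negotiator log lines.

    Returns (limits, rejections):
      limits     -- dict mapping submitter name to the list of resource
                    limits reported while negotiating with that submitter
      rejections -- Counter of rejected requests per submitter
    """
    limits = {}
    rejections = Counter()
    current = None  # submitter named by the latest "Negotiating with" line
    for line in lines:
        m = SUBMITTER_RE.search(line)
        if m:
            current = m.group(1)
            continue
        m = LIMIT_RE.search(line)
        if m and current is not None:
            limits.setdefault(current, []).append(float(m.group(1)))
            continue
        m = REJECT_RE.search(line)
        if m:
            rejections[m.group(2)] += 1
    return limits, rejections
```

Pass it any iterable of lines, for example an open file handle on the negotiator log (the path is site-specific). A submitter that shows a limit of 0 together with a run of rejections matches the pattern reported in this thread.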

> -----Original Message-----
> From: condor-users-bounces@xxxxxxxxxxx 
> [mailto:condor-users-bounces@xxxxxxxxxxx] On Behalf Of DeVoil, Peter
> Sent: Tuesday, 14 February 2006 11:15 AM
> To: Condor-Users Mail List
> Subject: [Condor-users] "Over submitter resource limit (0) 
> ... only consider startd ranks"
> 
> Hi,
> 
> I have a problem with an 80-node Windows pool. 
> 
> I have a "bulk user" that has submitted tens of thousands of 
> jobs, and also ordinary users - hundreds of jobs. 
> 
> Since last week, only about 50% of the pool has been active 
> at any time. There are about 100 nice_user (bulk) jobs in the 
> queue, but only 40 execute at any time; it should be 80.
> 
> I've read the manual pages and can't find a setting that mentions 
> this restriction. Any suggestions?
> 
> Yours,
> pdev.
> 
> There is a strange message in the negotiator log:
> 2/13 11:41:24 ---------- Started Negotiation Cycle ----------
> 2/13 11:41:24 Phase 1:  Obtaining ads from collector ...
> 2/13 11:41:24   Getting all public ads ...
> 2/13 11:41:24   Sorting 134 ads ...
> 2/13 11:41:24   Getting startd private ads ...
> 2/13 11:41:25 Got ads: 134 public and 85 private
> 2/13 11:41:25 Public ads include 2 submitter, 85 startd
> 2/13 11:41:25 Phase 2:  Performing accounting ...
> 2/13 11:41:25 Phase 3:  Sorting submitter ads by priority ...
> 2/13 11:41:25 Phase 4.1:  Negotiating with schedds ...
> 2/13 11:41:25   Negotiating with nice-user.Reds@* at <192.168.0.98:1868>
> 2/13 11:41:25     Over submitter resource limit (0) ... only consider startd ranks
> 2/13 11:41:36     Request 91073.00000:
> 2/13 11:41:36       Rejected 91073.0 nice-user.Reds@* <192.168.0.98:1868>: no match found
> 2/13 11:41:36     Request 91074.00000:
> 2/13 11:41:37       Rejected 91074.0 nice-user.Reds@* <192.168.0.98:1868>: no match found
> 2/13 11:41:37     Request 91075.00000:
> 2/13 11:41:37       Rejected 91075.0 nice-user.Reds@* <192.168.0.98:1868>: no match found
> .............
> 
> I have reset the user priorities to no avail. Any ideas?
> 
> Yours,
> pdev. 
> 