[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] -better-analyze doesn't tell me details (7.0.4)



At one time the Condor staff told me that you will
only get the list of requirements that your job has got,
if you have a non-zero value of machines that are rejected
by your jobs requirements.

The ways to get at the "reject the job for unknown reasons"
are to do condor_q -ana -l 227322

That will tell you the last machine that rejected your match, and why.
NegotiatorLog can sometimes tell you something too if you are running
at high enough debug.

Two "unknown reasons" I've hit before are (a) the negotiation cycle
just hasn't happened yet since this job was submitted and (b)
the user in question has exceeded his group quota.

Steve Timm


On Thu, 6 Nov 2008, Steffen Grunewald wrote:

Hi,

trying to find out why a particular job cluster remains in Idle state,
I tried:

$ condor_q -be 227322.0


-- Submitter: deepthought.$domain : <10.100.200.92:44421> : deepthought.$domain
---
227322.000:  Run analysis summary.  Of 1200 machines,
     0 are rejected by your job's requirements
     8 reject your job because of their own requirements
     0 match but are serving users with a better priority in the pool
    64 match but reject the job for unknown reasons
  1128 match but will not currently preempt their existing job
     0 are available to run your job

The following attributes are missing from the job ClassAd:

CheckpointPlatform


I had expected to get a detailed listing of Requirements (and I'm sure I have
seen them before). Any chance to get the old behaviour back?
OS is Debian Etch/amd64, running RHEL3 Condor 7.0.4 binaries.

Thanks,
Steffen



--
------------------------------------------------------------------
Steven C. Timm, Ph.D  (630) 840-8525
timm@xxxxxxxx  http://home.fnal.gov/~timm/
Fermilab Computing Division, Scientific Computing Facilities,
Grid Facilities Department, FermiGrid Services Group, Assistant Group Leader.