[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Condor-users] Problem: machine match but reject the job for unknown reasons



Hi,

I know this issue of a machine rejecting a job for unknown reason was discussed before, but to my knowledge there was never a clear statement on how to resolve this issue.
So my question: How to find out what is going on/wrong if there is a message like follows:

> condor_q -analyze 1076700.0

-- Submitter: xxx : <192.168.101.214:32776> : xxx
 ID      OWNER            SUBMITTED     RUN_TIME ST PRI SIZE CMD
---
1076700.000:  Run analysis summary.  Of 152 machines,
      2 are rejected by your job's requirements
      0 reject your job because of their own requirements
      1 match but are serving users with a better priority in the pool
    108 match but reject the job for unknown reasons
     41 match but will not currently preempt their existing job
      0 are available to run your job

Especially, "108" is a very large number, so this issue is quite important to be resolved. Any ideas welcome.

Cheers
  Alex