[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Condor-users] Jobs are rejected for unknown reasons although requirements are fulfilled



Dear all,

I have posted this message last week. But I couldn't get any reply. Solving this problem is important and urgent for me, so I am posting again.
I have been using condor for a while.
I added new computers to our pool whose have "WINDOWS" as OpSys and INTEL as Arch and 503 as Mem.
I designed requirements in my submit files as: 

requirements = OpSys == "WINDOWS" && Arch == "INTEL"

However, what I see by calling condor_q -better -analyze is:

173.000:  Run analysis summary.  Of 84 machines,
     33 are rejected by your job's requirements 
     41 reject your job because of their own requirements 
      0 match but are serving users with a better priority in the pool 
     10 match but reject the job for unknown reasons 
      0 match but will not currently preempt their existing job 
      0 match but are currently offline 
      0 are available to run your job
No successful match recorded.
Last failed match: Mon May 21 09:59:03 2012
Reason for last match failure: no match found

The Requirements _expression_ for your job is:

( target.Memory >= 32 && target.OpSys == "WINDOWS" && target.Arch == "INTEL" ) &&
( TARGET.Disk >= DiskUsage ) && ( ( RequestMemory * 1024 ) >= ImageSize ) &&
( TARGET.HasFileTransfer )

    Condition                         Machines Matched    Suggestion
    ---------                         ----------------    ----------
1   target.OpSys == "WINDOWS"         53                   
2   target.Arch == "INTEL"            61                   
3   target.Memory >= 32               84                   
4   ( TARGET.Disk >= 12500 )          84                   
5   ( ( 1024 * ceiling(ifThenElse(JobVMMemory isnt undefined,JobVMMemory,2.929687500000000E+000)) ) >= 3000 )
                                      84                   
6   ( TARGET.HasFileTransfer )        84                   

The following attributes are missing from the job ClassAd:

CheckpointPlatform
---
Can anyone tell me why I get this error? Is it related to CheckPoint platform which is used in Unix systems as far as I know?

Thanks in advance,
Canan Has