[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] Jobs are rejected for unknown reasons although requirements are fulfilled



It looks more like your machine is matching but not landing on machines in your pool possibly due to the start _expression_ on the machines: 

----------------- Key'd bit:
 41 reject your job because of their own requirements 

To diagnose entirely you can condor_q -long (a single job) 
then condor_status -long (a single slot which it should have matched) 

and that would help to definitively diagnose. 

Cheers,
Tim


From: "Canan Has" <cananhas@gmail.com>
To: condor-users@cs.wisc.edu
Sent: Monday, May 21, 2012 2:38:31 AM
Subject: [Condor-users] Jobs are rejected for unknown reasons although        requirements are fulfilled

Dear all,

I have posted this message last week. But I couldn't get any reply. Solving this problem is important and urgent for me, so I am posting again.
I have been using condor for a while.
I added new computers to our pool whose have "WINDOWS" as OpSys and INTEL as Arch and 503 as Mem.
I designed requirements in my submit files as: 

requirements = OpSys == "WINDOWS" && Arch == "INTEL"

However, what I see by calling condor_q -better -analyze is:

173.000:  Run analysis summary.  Of 84 machines,
     33 are rejected by your job's requirements 
     41 reject your job because of their own requirements 
      0 match but are serving users with a better priority in the pool 
     10 match but reject the job for unknown reasons 
      0 match but will not currently preempt their existing job 
      0 match but are currently offline 
      0 are available to run your job
No successful match recorded.
Last failed match: Mon May 21 09:59:03 2012
Reason for last match failure: no match found

The Requirements _expression_ for your job is:

( target.Memory >= 32 && target.OpSys == "WINDOWS" && target.Arch == "INTEL" ) &&
( TARGET.Disk >= DiskUsage ) && ( ( RequestMemory * 1024 ) >= ImageSize ) &&
( TARGET.HasFileTransfer )

    Condition                         Machines Matched    Suggestion
    ---------                         ----------------    ----------
1   target.OpSys == "WINDOWS"         53                   
2   target.Arch == "INTEL"            61                   
3   target.Memory >= 32               84                   
4   ( TARGET.Disk >= 12500 )          84                   
5   ( ( 1024 * ceiling(ifThenElse(JobVMMemory isnt undefined,JobVMMemory,2.929687500000000E+000)) ) >= 3000 )
                                      84                   
6   ( TARGET.HasFileTransfer )        84                   

The following attributes are missing from the job ClassAd:

CheckpointPlatform
---
Can anyone tell me why I get this error? Is it related to CheckPoint platform which is used in Unix systems as far as I know?

Thanks in advance,
Canan Has

_______________________________________________
Condor-users mailing list
To unsubscribe, send a message to condor-users-request@cs.wisc.edu with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/condor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/condor-users/