[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Condor-users] No resources matched request's constraints



Hi Group,
 
Can some please help me with an issue I'm having?
I have installed condor version 7.5.2 on 2 Windows 7 machine.
All of them are part of the same Condor Pool.
 
1 machine name Condor_Master_01 is the submitter and credd machine (http://www.cs.wisc.edu/condor/manual/v7.2/6_2Microsoft_Windows.html).
The other machine name Condor_Node_01,
 
here is the condor_status:
 
Name               OpSys      Arch   State     Activity LoadAv Mem   ActvtyTime
slot1@Condor_Maste WINNT61    INTEL  Unclaimed Idle     0.000  4054  0+00:30:32
slot2@Condor_Maste WINNT61    INTEL  Unclaimed Idle     0.000  4054  0+00:35:05
slot3@Condor_Maste WINNT61    INTEL  Unclaimed Idle     0.000  4054  0+00:35:06
slot4@Condor_Maste WINNT61    INTEL  Unclaimed Idle     0.940  4054  0+00:35:07
slot1@Condor_Node_ WINNT61    INTEL  Unclaimed Idle     0.000  2006  0+00:20:04
slot2@Condor_Node_ WINNT61    INTEL  Unclaimed Idle     0.000  2006  0+00:20:05
slot3@Condor_Node_ WINNT61    INTEL  Unclaimed Idle     0.000  2006  0+00:20:06
slot4@Condor_Node_ WINNT61    INTEL  Unclaimed Idle     0.010  2006  0+00:20:07
                     Total Owner Claimed Unclaimed Matched Preempting Backfill
       INTEL/WINNT61     8     0       0         8       0          0        0
               Total     8     0       0         8       0          0        0
 
Now I have setup a JOB to run Matlab on my system.
When I submit the job from Condor_Master_01 (queue is set to 8) the job run OK but it only run locally and not runing on the other node.
 
So I checked to see if I can run to job on the system so I change the JOB config file and adding the following: "Requirements = Machine=="Condor_Node_01.domain.com". Then the job failed to run.
 
Here is the output from the condor_q  -analyze

-- Submitter: Condor_Master_01.Domain.com : <172.17.0.61:59857> : Condor_Master_01.Domain.com
---
257.000:  Run analysis summary.  Of 8 machines,
      8 are rejected by your job's requirements
      0 reject your job because of their own requirements
      0 match but are serving users with a better priority in the pool
      0 match but reject the job for unknown reasons
      0 match but will not currently preempt their existing job
      0 match but are currently offline
      0 are available to run your job
        No successful match recorded.
        Last failed match: Mon Jun 28 19:30:25 2010
        Reason for last match failure: no match found
WARNING:  Be advised:
   No resources matched request's constraints
The Requirements _expression_ for your job is:
( target.Machine == "Condor_Node_01.Domain.com" ) &&
( TARGET.Arch == "INTEL" ) && ( TARGET.OpSys == "WINNT61" ) &&
( TARGET.Disk >= DiskUsage ) && ( ( ( TARGET.Memory * 1024 ) >= ImageSize ) &&
( ( RequestMemory * 1024 ) >= ImageSize ) ) && ( TARGET.HasFileTransfer ) &&
( TARGET.HasWindowsRunAsOwner &&
( TARGET.LocalCredd is "Condor_Master_01.Domain.com" ) )
    Condition                         Machines Matched    Suggestion
    ---------                         ----------------    ----------
1   ( target.Machine == "Condor_Node_01.PrimeSense.com" )
                                      4
2   ( TARGET.HasWindowsRunAsOwner && ( TARGET.LocalCredd is "Condor_Master_01.Domain.com")
                               4                   REMOVE
3   ( TARGET.Arch == "INTEL" )        8
4   ( TARGET.OpSys == "WINNT61" )     8
5   ( TARGET.Disk >= 1 )              8
6   ( ( ( 1024 * TARGET.Memory ) >= 1 ) && ( ( 1024 * ceiling(ifThenElse(JobVMMemory isnt undefined,JobVMMemory,9.76562500000
                                      8
7   ( TARGET.HasFileTransfer )        8
Conflicts:
  conditions: 1, 2
 
 
And here is the job definition file :
 
universe = vanilla
transfer_files=always
Requirements = Machine=="Condor_Node_01.Domain.com"
executable = K:\scripts\run_matlab.bat
Arguments  = matlab_init($(PROCESS))
output =1.out
log = 1.log
error = 1.err
notification = Error
initialdir = K:\scripts
run_as_owner = True
load_profile = True
queue 8
 
Now u can see I have set run_as_owner = True because I want to use CIFS windows Shared drive. This however make conflict with the job submitting.
 
Can someone please help me understand what I'm doing wrong?
Why the Job doesn't run on the node machine?
 
 
Thanks
Sassy