[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] Failing to Match requirements



On Thu, Sep 01, 2005 at 01:53:17PM -0400, Haria, Ratilal wrote:
> Hi
> 
> I am having difficulty understanding why condor fails to match a host in
> the pool
> 
> I have a host (a laptop for test purpose) whose machine name is
> iitm50129-220 and the condor_status shows that all the values required
> to satisfy the requirements are present and yet it fails.
> 

For a job to run on a machine, the 'Requirements' attribute of both the
machine and the job ad must be true. The requirements for the job ad (the
output of condor_q -l) are met by the machine, but the requirements for the
machine ad (condor_status -l) are not true.  Your machine is in the "Owner" 
state, and so it won't match with any jobs.
Try the TESTINGMODE settings in the config file.

-Erik

> I have also tried  
> 	requirements = ( (Machine=="iitm50129-220") && (Arch=="INTEL")
> && ((OpSys == "WINNT50") )
> And that fails too
> 
> Thanks for any leads or pointers
> 
> Thanks ... Ratilal
> 
> 
> ********* Output of roger.sub *******************
> universe = vanilla
> executable = fuzme3_5.exe
> requirements = Machine=="iitm50129-220"
> environment = path=c:\winnt\system32
> transfer_input_files = irises.dat , fuzme3_5.cfg_1
> arguments            = fuzme3_5.cfg_1
> log                  = roger.log_1
> output                  = roger.out_1
> error   = roger.err_1
> queue
> 
> ********* Output of condor_q -long *******************
> -- Schedd: iitm50129-220 : <132.246.129.220:1136>
> MyType = "Job"
> TargetType = "Machine"
> ClusterId = 2
> QDate = 1125595939
> CompletionDate = 0
> Owner = "Hariar"
> NTDomain = "IITM50129-220"
> RemoteWallClockTime = 0.000000
> LocalUserCpu = 0.000000
> LocalSysCpu = 0.000000
> RemoteUserCpu = 0.000000
> RemoteSysCpu = 0.000000
> ExitStatus = 0
> NumCkpts = 0
> NumRestarts = 0
> NumSystemHolds = 0
> CommittedTime = 0
> TotalSuspensions = 0
> LastSuspensionTime = 0
> CumulativeSuspensionTime = 0
> ExitBySignal = FALSE
> CondorVersion = "$CondorVersion: 6.6.10 Jun 22 2005 $"
> CondorPlatform = "$CondorPlatform: INTEL-WINNT50 $"
> Iwd = "C:\Condor\examples\Fuzzme-test"
> JobUniverse = 5
> Cmd = "C:\Condor\examples\Fuzzme-test\fuzme3_5.exe"
> MinHosts = 1
> MaxHosts = 1
> CurrentHosts = 0
> WantRemoteSyscalls = FALSE
> WantCheckpoint = FALSE
> JobStatus = 1
> EnteredCurrentStatus = 1125595939
> JobPrio = 0
> User = "Hariar@iitm50129-220"
> NiceUser = FALSE
> Env = "path=c:\winnt\system32"
> JobNotification = 2
> UserLog = "C:\Condor\examples\Fuzzme-test\roger.log_1"
> CoreSize = 4564920
> Rank = 0.000000
> In = "/dev/null"
> TransferIn = FALSE
> Out = "roger.out_1"
> Err = "roger.err_1"
> BufferSize = 524288
> BufferBlockSize = 32768
> ShouldTransferFiles = "YES"
> WhenToTransferOutput = "ON_EXIT"
> TransferFiles = "ONEXIT"
> TransferInput = "irises.dat ,fuzme3_5.cfg_1"
> ImageSize = 616
> ExecutableSize = 616
> DiskUsage = 623
> Requirements = (Machine == "iitm50129-220") && (Arch == "INTEL") &&
> (OpSys == "WINNT50") && (Disk >= DiskUsage) && ((Memory * 1024) >=
> ImageSize) && (HasFileTransfer)
> PeriodicHold = FALSE
> PeriodicRelease = FALSE
> PeriodicRemove = FALSE
> OnExitHold = FALSE
> OnExitRemove = TRUE
> LeaveJobInQueue = FALSE
> Args = "fuzme3_5.cfg_1"
> ProcId = 0
> WantMatchDiagnostics = TRUE
> LastRejMatchReason = "no match found"
> LastRejMatchTime = 1125595940
> ServerTime = 1125596166
> 
> ********* Selected Output of condor_status -long *******************
> 
> MyType = "Machine"
> TargetType = "Job"
> Name = "iitm50129-220"
> Machine = "iitm50129-220"
> Rank = 0.000000
> CpuBusy = ((LoadAvg - CondorLoadAvg) >= 0.500000)
> CondorVersion = "$CondorVersion: 6.6.10 Jun 22 2005 $"
> CondorPlatform = "$CondorPlatform: INTEL-WINNT50 $"
> VirtualMachineID = 1
> VirtualMemory = 1018684
> Disk = 13645036
> CondorLoadAvg = 0.000000
> LoadAvg = 0.220000
> KeyboardIdle = 0
> ConsoleIdle = 0
> Memory = 511
> Cpus = 1
> StartdIpAddr = "<132.246.129.220:1135>"
> Arch = "INTEL"
> OpSys = "WINNT50"
> UidDomain = "iitm50129-220"
> FileSystemDomain = "iitm50129-220"
> Subnet = "132.246.129"
> HasIOProxy = TRUE
> TotalVirtualMemory = 1018684
> TotalDisk = 13645036
> KFlops = 351956
> Mips = 1494
> LastBenchmark = 1125595551
> TotalLoadAvg = 0.220000
> TotalCondorLoadAvg = 0.000000
> ClockMin = 815
> ClockDay = 4
> TotalVirtualMachines = 1
> HasFileTransfer = TRUE
> HasMPI = TRUE
> HasJICLocalConfig = TRUE
> HasJICLocalStdin = TRUE
> StarterAbilityList =
> "HasFileTransfer,HasMPI,HasJICLocalConfig,HasJICLocalStdin"
> CpuBusyTime = 0
> CpuIsBusy = FALSE
> State = "Owner"
> EnteredCurrentState = 1125595541
> Activity = "Idle"
> EnteredCurrentActivity = 1125595541
> Start = ((KeyboardIdle > 15 * 60) && (((LoadAvg - CondorLoadAvg) <=
> 0.300000) || (State != "Unclaimed" && State != "Owner")))
> Requirements = START
> CurrentRank = 0.000000
> DaemonStartTime = 1125595541
> UpdateSequenceNumber = 2
> MyAddress = "<132.246.129.220:1135>"
> LastHeardFrom = 1125596336
> UpdatesTotal = 3
> UpdatesSequenced = 2
> UpdatesLost = 0
> UpdatesHistory = "0x00000000000000000000000000000000"
> 
> _______________________________________________
> Condor-users mailing list
> Condor-users@xxxxxxxxxxx
> https://lists.cs.wisc.edu/mailman/listinfo/condor-users