
Re: [Condor-users] Condor and Matlab



Erik,

The condor_status -l output for the submit node is included below.

I've tried:

Requirements = (MATLAB =?= TRUE)

And also

Requirements = MATLAB

And I still get the following output from condor_q -analyze for the
submit node:

---
034.002:  Run analysis summary.  Of 4 machines,
      3 are rejected by your job's requirements
      1 reject your job because of their own requirements
      0 match, but are serving users with a better priority in the pool
      0 match, match, but reject the job for unknown reasons
      0 match, but will not currently preempt their existing job
      0 are available to run your job
---

I would expect the "3 are rejected by your job's requirements" line,
because this is a 4-node test pool and the custom STARTD_EXPRS for MATLAB
are set on only one machine, the submit node itself.  The "1 reject your
job because of their own requirements" line must refer to that machine,
but I'm unsure exactly why it rejects the job, since I've set the custom
attribute as shown in the condor_status -l output below.  Any pointers on
this would be great.

condor_status -l output for the submit node:

---
MyType = "Machine"
TargetType = "Job"
Name = "submitmachine.xxx.xxx.xxx"
Machine = "submitmachine.xxx.xxx.xxx"
Rank = 0.000000
CpuBusy = ((LoadAvg - CondorLoadAvg) >= 0.500000)
COLLECTOR_HOST_STRING = "xxx.xxx.xxx.xxx"
MATLAB = TRUE
CondorVersion = "$CondorVersion: 6.6.11 Mar 23 2006 $"
CondorPlatform = "$CondorPlatform: I386-LINUX_RH9 $"
VirtualMachineID = 1
VirtualMemory = 0
Disk = 11447128
CondorLoadAvg = 0.000000
LoadAvg = 0.000000
KeyboardIdle = 141953
ConsoleIdle = 142249
Memory = 768
Cpus = 1
StartdIpAddr = "<xxx.xxx.xxx.xxx:33997>"
Arch = "INTEL"
OpSys = "LINUX"
UidDomain = "domain"
FileSystemDomain = "domain"
Subnet = "xxx.xxx.xxx"
HasIOProxy = TRUE
TotalVirtualMemory = 0
TotalDisk = 11447128
KFlops = 972891
Mips = 2328
LastBenchmark = 1153279798
TotalLoadAvg = 0.000000
TotalCondorLoadAvg = 0.000000
ClockMin = 479
ClockDay = 3
TotalVirtualMachines = 1
HasFileTransfer = TRUE
HasMPI = TRUE
HasJICLocalConfig = TRUE
HasJICLocalStdin = TRUE
JavaVendor = "Free Software Foundation, Inc."
JavaVersion = "1.4.2"
JavaMFlops = 5.293770
HasJava = TRUE
HasPVM = TRUE
HasRemoteSyscalls = TRUE
HasCheckpointing = TRUE
StarterAbilityList = "HasFileTransfer,HasMPI,HasJICLocalConfig,HasJICLocalStdin,HasJava,HasPVM,HasRemoteSyscalls,HasCheckpointing"
CpuBusyTime = 0
CpuIsBusy = FALSE
State = "Unclaimed"
EnteredCurrentState = 1153150154
Activity = "Idle"
EnteredCurrentActivity = 1153279798
Start = Owner == "root" || Owner == "condor" || Owner == "my_local_user_account"
Requirements = START
CurrentRank = 0.000000
DaemonStartTime = 1153150148
UpdateSequenceNumber = 474
MyAddress = "<xxx.xxx.xxx.xxx:33997>"
LastHeardFrom = 1153295951
UpdatesTotal = 540
UpdatesSequenced = 532
UpdatesLost = 0
UpdatesHistory = "0x00000000000000000000000000000000"

---
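
As a rough aid to reading the ad above, here is a minimal Python sketch of
the two-sided check that condor_q -analyze summarises.  It is only an
illustration under assumptions, not Condor's real ClassAd evaluator: the
function names are made up, the job side is taken to be
Requirements = (MATLAB =?= TRUE), the machine side is taken to be the
Start expression shown above, and "some_other_user" is a hypothetical
job owner.

```python
# Simplified illustration of two-sided matchmaking; NOT Condor's evaluator.
UNDEFINED = None  # stands in for a ClassAd attribute that is not present


def meta_equals(value, expected):
    """Mimic =?=: never undefined, true only when the value is present and equal."""
    return value is not UNDEFINED and value == expected


def job_accepts_machine(machine):
    # Job side, assumed to be: Requirements = (MATLAB =?= TRUE)
    return meta_equals(machine.get("MATLAB", UNDEFINED), True)


def machine_accepts_job(job, allowed_owners):
    # Machine side: Requirements = START, where Start compares the job's
    # Owner against a fixed list of account names.  ClassAd == on strings
    # ignores case, so the comparison here is lower-cased.
    owner = job.get("Owner", UNDEFINED)
    if owner is UNDEFINED:
        return False
    return owner.lower() in {name.lower() for name in allowed_owners}


def analyze(job, machine, allowed_owners):
    """Return the analysis bucket a single machine would fall into."""
    if not job_accepts_machine(machine):
        return "rejected by your job's requirements"
    if not machine_accepts_job(job, allowed_owners):
        return "reject your job because of their own requirements"
    return "match"


# Owners taken from the Start expression in the ad above.
allowed = {"root", "condor", "my_local_user_account"}
submit_machine = {"MATLAB": True}   # advertises the custom attribute
other_machine = {}                  # no MATLAB attribute at all

# "some_other_user" is a hypothetical owner used only for illustration.
print(analyze({"Owner": "some_other_user"}, submit_machine, allowed))
print(analyze({"Owner": "root"}, other_machine, allowed))
print(analyze({"Owner": "root"}, submit_machine, allowed))
```

In this sketch a machine that advertises MATLAB can still land in the
"their own requirements" bucket when the job's Owner is not one of the
names listed in Start, so the Owner on the submitted job is one thing
worth checking.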



-----Original Message-----
From: condor-users-bounces@xxxxxxxxxxx
[mailto:condor-users-bounces@xxxxxxxxxxx] On Behalf Of Erik Paulson
Sent: 17 July 2006 17:44
To: Condor-Users Mail List
Subject: Re: [Condor-users] Condor and Matlab

On Mon, Jul 17, 2006 at 05:31:07PM +0100, Shaun J. O'Callaghan wrote:
> 
> I'm guessing the "1 reject your job because of their own requirements"
> line is due to the submit node, which is the only node I've used custom
> STARTD_EXPRS on.
> 
>  

Could be. What does 'condor_status -l' look like for the machine
where you've set the startd_exprs on?

-Erik