
[Condor-users] Args not found error



I'm getting an error when I submit a job.  The program runs fine
from a local console; however, when it is run through Condor (in the
vanilla universe), the starter log shows the following:

4/5 15:54:08 ******************************************************
4/5 15:54:08 ** condor_starter (CONDOR_STARTER) STARTING UP
4/5 15:54:08 ** /home/condor/6.7.14/sbin/condor_starter
4/5 15:54:08 ** $CondorVersion: 6.7.14 Dec 13 2005 $
4/5 15:54:08 ** $CondorPlatform: I386-LINUX_RH9 $
4/5 15:54:08 ** PID = 28678
4/5 15:54:08 ******************************************************
4/5 15:54:08 Using config file: /home/condor/condor_config
4/5 15:54:08 Using local config files: /home/condor/hosts/sei/condor_config.local
4/5 15:54:08 DaemonCore: Command Socket at <128.111.45.22:45276>
4/5 15:54:08 Done setting resource limits
4/5 15:54:08 Communicating with shadow <128.111.45.35:51873>
4/5 15:54:08 Submitting machine is "pompone.cs.ucsb.edu"
4/5 15:54:08 Starting a VANILLA universe job with ID: 1956.0
4/5 15:54:08 Args not found in JobAd.  Aborting OsProc::StartJob.
4/5 15:54:08 Failed to start job, exiting
4/5 15:54:08 ShutdownFast all jobs.
4/5 15:54:08 **** condor_starter (condor_STARTER) EXITING WITH STATUS 0

This is odd, because my JobAd does have an Arguments value set, and
the binary that ends up in the spool directory runs as expected:

$ condor_q -long
-- Submitter: pompone.cs.ucsb.edu : <128.111.45.35:34041> : pompone.cs.ucsb.edu
MyType = "Job"
TargetType = "Machine"
GlobalJobId = "pompone.cs.ucsb.edu#1144276112#1956.0"
RootDir = "/"
MinHosts = 1
WantRemoteSyscalls = FALSE
WantCheckpoint = FALSE
RemoteSpoolDir = "/tmp/home/rgarver/dynamic_condor/localcondor/conf.noir/spool/cluster2.proc0.subproc0"
JobPrio = 0
NiceUser = FALSE
WantRemoteIO = TRUE
CoreSize = 0
KillSig = "SIGTERM"
Rank = 0.000000
In = "/dev/null"
TransferIn = FALSE
Out = "out.0"
StreamOut = FALSE
Err = "/dev/null"
TransferErr = FALSE
BufferSize = 524288
BufferBlockSize = 32768
ShouldTransferFiles = "NO"
TransferFiles = "NEVER"
ImageSize = 12
ExecutableSize = 12
DiskUsage = 12
Requirements = TRUE
GlobusResubmit = FALSE
GlobusStatus = 32
NumGlobusSubmits = 0
JobUniverse = 5
QDate = 1144276072
CompletionDate = 0
LocalUserCpu = 0.000000
LocalSysCpu = 0.000000
RemoteUserCpu = 0.000000
RemoteSysCpu = 0.000000
ExitStatus = 0
NumCkpts = 0
NumRestarts = 0
NumSystemHolds = 0
CommittedTime = 0
TotalSuspensions = 0
CumulativeSuspensionTime = 0
ExitBySignal = FALSE
JobNotification = 0
LeaveJobInQueue = JobStatus == 4
User = "rgarver@xxxxxxxxxxx"
Owner = "rgarver"
PeriodicRemove = (StageInFinish > 0) =!= TRUE && CurrentTime > QDate + 28800
SubmitterId = "rgarver@xxxxxxxxxxxxxxxxxxx"
Arguments = "5000000"
Environment = ""
ClusterId = 1956
ProcId = 0
StageInStart = 1144276132
SUBMIT_Iwd = "/tmp/home/rgarver/condor_install/daisy/conf.pompone/spool/cluster2.proc0.subproc0"
Iwd = "/home/condor/hosts/pompone/spool/cluster1956.proc0.subproc0"
SUBMIT_Cmd = "/tmp/home/rgarver/condor_install/daisy/conf.pompone/spool/cluster2.proc0.subproc0/pi-compute"
Cmd = "/home/condor/hosts/pompone/spool/cluster1956.proc0.subproc0/pi-compute"
StageInFinish = 1144276133
ReleaseReason = "Data files spooled"
LastHoldReason = "Spooling input data files"
JobStartDate = 1144276138
PeriodicHold = FALSE
PeriodicRelease = FALSE
OnExitHold = FALSE
OnExitRemove = TRUE
WantMatchDiagnostics = TRUE
LastMatchTime = 1144277635
NumJobMatches = 7
OrigMaxHosts = 1
LastJobLeaseRenewal = 1144277648
JobLastStartDate = 1144277645
JobCurrentStartDate = 1144277648
JobRunCount = 30
RemoteWallClockTime = 14.000000
LastRemoteHost = "sei.cs.ucsb.edu"
LastClaimId = "<128.111.45.22:34762>#1140125522#550"
CurrentHosts = 0
JobStatus = 1
EnteredCurrentStatus = 1144277648
LastSuspensionTime = 0
MaxHosts = 1
ServerTime = 1144277881
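
One detail that stands out when comparing the two dumps: the starter log complains about an attribute named Args, while the ad above carries one named Arguments. Whether that mismatch is the actual cause is only a guess, but checking which attribute names are literally present in a saved `condor_q -long` dump is easy to script. The following is a minimal Python sketch; `parse_long_ad` and `missing_attrs` are hypothetical helpers written for this illustration (they are not part of Condor), and the sample text is a short excerpt of the ad above, including one line wrapped the way a mail client might wrap it.

```python
import re

# "Name = value" lines; the (?!=) guard keeps "==" inside a value from
# being mistaken for the assignment.
ATTR_LINE = re.compile(r"^([A-Za-z_]\w*)\s*=(?!=)\s*(.*)$")


def parse_long_ad(text):
    """Parse `condor_q -long` style text into {attribute: raw value string}.

    Lines that do not look like "Name = value" are treated as the
    continuation of the previous attribute (e.g. a value wrapped by email).
    """
    ad = {}
    last = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("--"):
            last = None  # blank line or "-- Submitter:" header
            continue
        match = ATTR_LINE.match(line)
        if match:
            last = match.group(1)
            ad[last] = match.group(2)
        elif last is not None:
            ad[last] = (ad[last] + " " + line).strip()
    return ad


def missing_attrs(ad, wanted):
    """Return the names from `wanted` that are absent from the ad."""
    return [name for name in wanted if name not in ad]


# Excerpt of the ad shown above (not the full dump).
sample = '''
-- Submitter: pompone.cs.ucsb.edu : <128.111.45.35:34041> :
pompone.cs.ucsb.edu
MyType = "Job"
JobUniverse = 5
LeaveJobInQueue = JobStatus == 4
Arguments = "5000000"
Environment = ""
Cmd =
"/home/condor/hosts/pompone/spool/cluster1956.proc0.subproc0/pi-compute"
'''

ad = parse_long_ad(sample)
print(missing_attrs(ad, ["Args", "Arguments"]))  # prints ['Args']
```

For this excerpt the check reports that Args is absent while Arguments is present, which matches the wording of the starter's error message.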

Any suggestions?

-- 
Ryan Garver
<rgarver@xxxxxxxxxxx>