[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Condor-users] condor_q -better -analyze shows some error.



Hi,
When I analysing idle job using condor_q -better -analyze it below copied output. But after some times the job got matched to a machine and start runing normally.

This errors need to be ignored or my job ads have any incorrect entries

1)
560.000:  Run analysis summary.  Of 18 machines,
     0 are rejected by your job's requirements
    11 reject your job because of their own requirements
     6 match but are serving users with a better priority in the pool
     1 match but reject the job for unknown reasons
     0 match but will not currently preempt their existing job
     0 are available to run your job
   Last successful match: Sun Aug  9 19:13:43 2009
   Last failed match: Wed Aug 12 15:17:01 2009
   Reason for last match failure: no match found

error: no operator/attribute found
error: found NULL ptr in expr
error: problem with ExprToProfile
error in ExprToMultiProfile

error in AnalyzeAttributes

2)
560.000:  Request is being serviced

error: no operator/attribute found
error: found NULL ptr in expr
error: problem with ExprToProfile
error in ExprToMultiProfile

error in AnalyzeAttributes


The job Ad of the job is..

MyType = "Job"
TargetType = "Machine"
GlobalJobId = "gridprime.cloud.wipro.com#560.0#1249751668"
TransferInput = "/vmfs/volumes/nfs1/Rhel5_64/01/Rhel5_64.vmx"
VMPARAM_VMware_SnapshotDisk = TRUE
VMPARAM_VMware_Transfer = FALSE
VMPARAM_VMware_VMX_File = "Rhel5_64-esx.vmx"
JobVMType = "vmware"
VMPARAM_VMware_Dir = "/vmfs/volumes/nfs1/Rhel5_64/01/"
JobLeaseDuration = 7200
TransferExecutable = FALSE
ExecutableSize_RAW = 0
ExecutableSize = 0
UserLog = "/condor/log/VM_560_0.log"
RequestDisk = 1000000
DiskUsage_RAW = 1000000
DiskUsage = 1000000
RequestMemory = 16384
RequestCpus = 4
JobVM_VCPUS = 4
JobVMCheckpoint = TRUE
JobVMNetworkingType = "custom"
JobVMNetworking = TRUE
JobVMMemory = 16384
Owner = "idealgrid"
JobUniverse = 13
Cmd = "obiee-75_OBIEETest_obieedb"
QDate = 1249751668
CompletionDate = 0
LocalUserCpu = 0.000000
LocalSysCpu = 0.000000
CoreSize = -1
ExitStatus = 0
ExitBySignal = FALSE
NumCkpts_RAW = 0
NumCkpts = 0
NumRestarts = 0
NumSystemHolds = 0
CommittedTime = 0
TotalSuspensions = 0
CumulativeSuspensionTime = 0
RootDir = "/"
MinHosts = 1
WantRemoteSyscalls = FALSE
WantCheckpoint = FALSE
WantRemoteIO = TRUE
JobPrio = 0
User = "idealgrid@xxxxxxxxx"
NiceUser = FALSE
Env = ""
JobNotification = 0
KillSig = "SIGTERM"
In = "/dev/null"
Out = "/dev/null"
Err = "/dev/null"
BufferSize = 524288
BufferBlockSize = 32768
ShouldTransferFiles = "YES"
TransferFiles = "ONEXIT"
WhenToTransferOutput = "ON_EXIT_OR_EVICT"
PeriodicHold = FALSE
PeriodicRemove = FALSE
PeriodicRelease = FALSE
OnExitHold = TRUE
OnExitRemove = FALSE
CondorVersion = "$CondorVersion: 7.2.3 May 11 2009 BuildID: 151729 $"
CondorPlatform = "$CondorPlatform: I386-LINUX_RHEL5 $"
ClusterId = 560
ProcId = 0
Requirements = ((Arch == "INTEL") && (HasVM) && (VM_AvailNum > 0) && (TotalDisk >= DiskUsage) && (HasFileTransfer) && (VM_Networking) && (VM_Type == "vmware"))
StageInStart = 1
StageInFinish = 1
FilesRetrieved = FALSE
LeaveJobInQueue = FilesRetrieved =?= FALSE
Arguments = ""
Iwd = "/spool/cluster560.proc0.subproc0"
JobStartDate = 1249751707
ImageSize_RAW = 2016
ImageSize = 2250
LastHoldReasonSubCode = 0
NumJobReconnects = 6289
ScheddBday = 1250069462
isProductionVM = TRUE
AutoClusterId = 122
AutoClusterAttrs = "isControlJob,JobUniverse,LastCheckpointPlatform,NumCkpts,RequestCpus,RequestMemory,RequestDisk,isProductionVM,isTestJob,DiskUsage,Requirements,NiceUser,ConcurrencyLimits"
LastVacateTime = 1250072905
BytesSent = 0.000000
BytesRecvd = 4580.000000
RemoteWallClockTime = 284678.000000
LastRemoteHost = "slot1@xxxxxxxxxxxxxxxxxxx"
LastPublicClaimId = "<192.168.10.94:10310>#1250069006#36#..."
LastPublicClaimIds = ""
MaxHosts = 1
LastReleaseReason = "via condor_release (by user daemon)"
ReleaseReason = "via condor_release (by user daemon)"
LastHoldReason = "via condor_hold (by user daemon)"
LastHoldReasonCode = 1
LastRejMatchReason = "no match found"
LastRejMatchTime = 1250074865
WantMatchDiagnostics = TRUE
LastMatchTime = 1250074897
NumJobMatches = 10
OrigMaxHosts = 1
JobStatus = 2
EnteredCurrentStatus = 1250074975
LastSuspensionTime = 0
CurrentHosts = 1
PublicClaimId = "<192.168.10.96:10126>#1250074583#13#..."
StartdIpAddr = "<192.168.10.96:10126>"
RemoteHost = "slot1@xxxxxxxxxxxxxxxxxxx"
RemoteSlotID = 1
StartdPrincipal = "192.168.10.96"
ShadowBday = 1250074975
JobLastStartDate = 1250070551
JobCurrentStartDate = 1250074975
NumShadowStarts = 11
JobRunCount = 11
NumJobStarts = 6
RemoteSysCpu = 0.000000
RemoteUserCpu = 390.000000
LastJobLeaseRenewal = 1250077273
ServerTime = 1250077337

by
Johnson







Please do not print this email unless it is absolutely necessary. The information contained in this electronic message and any attachments to this message are intended for the exclusive use of the addressee(s) and may contain proprietary, confidential or privileged information. If you are not the intended recipient, you should not disseminate, distribute or copy this e-mail. Please notify the sender immediately and destroy all copies of this message and any attachments. WARNING: Computer viruses can be transmitted via email. The recipient should check this email and any attachments for the presence of viruses. The company accepts no liability for any damage caused by any virus transmitted by this email.
www.wipro.com