[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[condor-users] condor_starter ERROR



Hi,

I'm getting this error in the starter logfiles:

<SNIP>
10/31 19:33:06 ******************************************************
10/31 19:33:06 ** condor_starter (CONDOR_STARTER) STARTING UP
10/31 19:33:06 ** $CondorVersion: 6.5.3 Jul  2 2003 $
10/31 19:33:06 ** $CondorPlatform: INTEL-LINUX-GLIBC23 $
10/31 19:33:06 ** PID = 9060
10/31 19:33:06 ******************************************************
10/31 19:33:07 Using config file: /users/condor/condor_config
10/31 19:33:07 Using local config files: 
/users/condor/hosts/leo3/condor_config.local
10/31 19:33:07 DaemonCore: Command Socket at <192.168.0.12:35075>
10/31 19:33:07 Done setting resource limits
10/31 19:33:07 ERROR "Assertion ERROR on (result)" at line 148 in file 
NTsenders.C
10/31 19:33:07 ShutdownFast all jobs.
<SNIP>


The respective StartLog ...  

<SNIP>
10/31 19:03:10 DaemonCore: Command received via TCP from host 
<192.168.0.39:59408>
10/31 19:03:10 DaemonCore: received command 444 (ACTIVATE_CLAIM), calling 
handler (command_activate_claim)
10/31 19:03:10 vm1: Got activate_claim request from shadow 
(<192.168.0.39:59408>)
10/31 19:03:10 vm1: Remote job ID is 427.0
10/31 19:03:10 vm1: Got universe "VANILLA" (5) from request classad
10/31 19:03:10 vm1: State change: claim-activation protocol successful
10/31 19:03:10 vm1: Changing activity: Idle -> Busy
10/31 19:03:10 Starter pid 24666 exited with status 4
10/31 19:03:10 vm1: State change: starter exited
10/31 19:03:10 vm1: Changing activity: Busy -> Idle
<SNIP>




The shadow logfiles in the submission machine looks OK to me :

<SNIP> 
10/31 20:03:21 ******************************************************
10/31 20:03:21 ** condor_shadow (CONDOR_SHADOW) STARTING UP
10/31 20:03:21 ** $CondorVersion: 6.5.3 Jul  2 2003 $
10/31 20:03:21 ** $CondorPlatform: INTEL-LINUX-GLIBC23 $
10/31 20:03:21 ** PID = 21575
10/31 20:03:21 ******************************************************
10/31 20:03:21 Using config file: /users/condor/condor_config
10/31 20:03:21 Using local config files: 
/users/condor/hosts/convoluta/condor_config.local
10/31 20:03:21 DaemonCore: Command Socket at <192.168.0.39:59092>
10/31 20:03:22 Initializing a VANILLA shadow
10/31 20:03:22 (427.5) (21575): Request to run on <192.168.0.13:32773> 
was ACCEPTED
<SNIP>
 
so the nodes are claimed, but stay idle. Any ideas what is going wrong?


Thanks in advance,

Ralf



=======================================
Dr. Ralf Schmid
ICAPB
School of Biological Sciences
The University of Edinburgh
King's Buildings
Ashworth Building 
Edinburgh EH9 3JW

Condor Support Information:
http://www.cs.wisc.edu/condor/condor-support/
To Unsubscribe, send mail to majordomo@xxxxxxxxxxx with
unsubscribe condor-users <your_email_address>