[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] Condor on Windows



Hi,

agreeing Thomas, and, is your executable properly running under the windows cmd line interface (dos emulation)? Try to implement it in C and compile it using gcc (use e.g. dev_cpp). If that's not working, just try, uninstall condor, DELETE DIRECTORY, restart, reinstall. But I think it's a problem with your executable (Log files -> not valid Windows exe ...), since the other programs are working.

Tobias

Thomas Bauer schrieb:
Hi,

The Starterlog says:

8/24 14:16:17 About to exec C:\condor\execute\dir_3684\condor_exec.exe 8/24 14:16:17 ERROR: C:\condor\execute\dir_3684\condor_exec.exe is not a
valid Windows executable

Are you sure, that you used a proper Windows-compiler for your
pascal-program?

Thomas

________________________________

	Von: condor-users-bounces@xxxxxxxxxxx
[mailto:condor-users-bounces@xxxxxxxxxxx] Im Auftrag von Ivailo Penev
	Gesendet: Montag, 27. August 2007 09:24
	An: condor-users@xxxxxxxxxxx
	Betreff: [Condor-users] Condor on Windows
	
	

Hi again!
	Here are some of the log files for Condor. If you consider that the
rest files contain some useful information, I could send them all.

	MasterLog.log

	8/24 10:50:10 ******************************************************

	8/24 10:50:10 ** Condor (CONDOR_MASTER) STARTING UP

	8/24 10:50:10 ** C:\condor\bin\condor_master.exe

	8/24 10:50:10 ** $CondorVersion: 6.8.4 Feb  1 2007 $

	8/24 10:50:10 ** $CondorPlatform: INTEL-WINNT50 $

	8/24 10:50:10 ** PID = 2380

	8/24 10:50:10 ** Log last touched time unavailable (No such file or
directory)

	8/24 10:50:10 ******************************************************

	8/24 10:50:10 Using config source: C:\condor\condor_config

8/24 10:50:10 Using local config sources:
	8/24 10:50:10    C:\condor/condor_config.local

	8/24 10:50:10 DaemonCore: Command Socket at <10.254.254.219:1065>

	8/24 10:50:10 Started DaemonCore process
"C:\condor/bin/condor_collector.exe", pid and pgroup = 2436

	8/24 10:50:10 Started DaemonCore process
"C:\condor/bin/condor_negotiator.exe", pid and pgroup = 2468

	8/24 10:50:10 Started DaemonCore process
"C:\condor/bin/condor_schedd.exe", pid and pgroup = 2520

	8/24 10:50:10 Started DaemonCore process
"C:\condor/bin/condor_startd.exe", pid and pgroup = 3052

	8/24 11:50:10 Preen pid is 1540

	8/24 11:50:10 DaemonCore: Command received via UDP from host
<10.254.254.219:1559>

	8/24 11:50:10 DaemonCore: received command 60011 (DC_NOP), calling
handler (handle_nop())

	8/24 11:50:10 Child 1540 died, but not a daemon -- Ignored

	8/24 15:57:59 DaemonCore: Command received via UDP from host
<10.254.254.219:3726>

	8/24 15:57:59 DaemonCore: received command 60000 (DC_RAISESIGNAL),
calling handler (HandleSigCommand())

	8/24 15:57:59 Got SIGQUIT.  Performing fast shutdown.

	8/24 15:57:59 Sent signal 3 to COLLECTOR (pid 2436)

	8/24 15:57:59 Sent signal 3 to NEGOTIATOR (pid 2468)

	8/24 15:57:59 Sent signal 3 to SCHEDD (pid 2520)

	8/24 15:57:59 Sent signal 3 to STARTD (pid 3052)

	8/24 15:58:00 DaemonCore: Command received via UDP from host
<10.254.254.219:3743>

	8/24 15:58:00 DaemonCore: received command 60011 (DC_NOP), calling
handler (handle_nop())

	8/24 15:58:00 The COLLECTOR (pid 2436) exited with status 0

	8/24 15:58:00 DaemonCore: Command received via UDP from host
<10.254.254.219:3744>

	8/24 15:58:00 DaemonCore: received command 60011 (DC_NOP), calling
handler (handle_nop())

	8/24 15:58:00 The NEGOTIATOR (pid 2468) exited with status 0

	8/24 15:58:00 The SCHEDD (pid 2520) exited with status 0

	8/24 15:58:00 The STARTD (pid 3052) exited with status 0

	8/24 15:58:00 All daemons are gone.  Exiting.

	8/24 15:58:00 **** Condor (condor_MASTER) EXITING WITH STATUS 0

	StarterLog.log

	8/24 14:16:15 ******************************************************

	8/24 14:16:15 ** condor_starter (CONDOR_STARTER) STARTING UP

	8/24 14:16:15 ** C:\condor\bin\condor_starter.exe

	8/24 14:16:15 ** $CondorVersion: 6.8.4 Feb  1 2007 $

	8/24 14:16:15 ** $CondorPlatform: INTEL-WINNT50 $

	8/24 14:16:15 ** PID = 3684

	8/24 14:16:15 ** Log last touched 8/24 14:15:44

	8/24 14:16:15 ******************************************************

	8/24 14:16:15 Using config source: C:\condor\condor_config

8/24 14:16:15 Using local config sources:
	8/24 14:16:15    C:\condor/condor_config.local

	8/24 14:16:15 DaemonCore: Command Socket at <10.254.254.219:2823>

	8/24 14:16:15 Setting resource limits not implemented!

	8/24 14:16:15 Communicating with shadow <10.254.254.219:2821>

	8/24 14:16:15 Submitting machine is "comp12"

	8/24 14:16:16 File transfer completed successfully.

	8/24 14:16:17 Starting a VANILLA universe job with ID: 8.0

	8/24 14:16:17 IWD: C:\condor/execute\dir_3684

	8/24 14:16:17 Output file: C:\condor/execute\dir_3684\test.out

	8/24 14:16:17 Error file: C:\condor/execute\dir_3684\test.err

	8/24 14:16:17 Renice expr "10" evaluated to 10

	8/24 14:16:17 About to exec
C:\condor\execute\dir_3684\condor_exec.exe
	8/24 14:16:17 ERROR: C:\condor\execute\dir_3684\condor_exec.exe is
not a valid Windows executable

	8/24 14:16:17 ERROR
"Create_Process(C:\condor\execute\dir_3684\condor_exec.exe,, ...) failed" at
line 393 in file ..\src\condor_starter.V6.1\os_proc.C

	8/24 14:16:17 ShutdownFast all jobs.

	8/24 14:33:27 ******************************************************

	ShadowLog.log

	8/24 10:52:47 ******************************************************

	8/24 10:52:47 ** condor_shadow (CONDOR_SHADOW) STARTING UP

	8/24 10:52:47 ** C:\condor\bin\condor_shadow.exe

	8/24 10:52:47 ** $CondorVersion: 6.8.4 Feb  1 2007 $

	8/24 10:52:47 ** $CondorPlatform: INTEL-WINNT50 $

	8/24 10:52:47 ** PID = 2144

	8/24 10:52:47 ** Log last touched time unavailable (No such file or
directory)

	8/24 10:52:47 ******************************************************

	8/24 10:52:47 Using config source: C:\condor\condor_config

8/24 10:52:47 Using local config sources:
	8/24 10:52:47    C:\condor/condor_config.local

	8/24 10:52:47 DaemonCore: Command Socket at <10.254.254.219:1112>

	8/24 10:52:47 Initializing a VANILLA shadow for job 1.0

	8/24 10:52:47 (1.0) (2144): Request to run on <10.254.254.219:1068>
was ACCEPTED

	8/24 10:52:48 (1.0) (2144): Job 1.0 terminated: exited with status 0

	8/24 10:52:48 (1.0) (2144): **** condor_shadow (condor_SHADOW)
EXITING WITH STATUS 100

	8/24 10:54:24 ******************************************************

	8/24 10:54:24 ** condor_shadow (CONDOR_SHADOW) STARTING UP

	8/24 10:54:24 ** C:\condor\bin\condor_shadow.exe

	8/24 10:54:24 ** $CondorVersion: 6.8.4 Feb  1 2007 $

	8/24 10:54:24 ** $CondorPlatform: INTEL-WINNT50 $

	8/24 10:54:24 ** PID = 2316

	8/24 10:54:24 ** Log last touched 8/24 10:52:48

	8/24 10:54:24 ******************************************************

	8/24 10:54:24 Using config source: C:\condor\condor_config

8/24 10:54:24 Using local config sources:
	8/24 10:54:24    C:\condor/condor_config.local

	8/24 10:54:24 DaemonCore: Command Socket at <10.254.254.219:1160>

	8/24 10:54:24 Initializing a VANILLA shadow for job 2.0

	8/24 10:54:25 (2.0) (2316): Request to run on <10.254.254.219:1068>
was ACCEPTED

	8/24 10:54:26 (2.0) (2316): ERROR "Error from starter on vm1@comp12:
Create_Process(C:\condor\execute\dir_2336\condor_exec.exe,, ...) failed" at
line 643 in file ..\src\condor_shadow.V6.1\pseudo_ops.C

	8/24 10:54:28 ******************************************************

	8/24 10:54:28 ** condor_shadow (CONDOR_SHADOW) STARTING UP

	8/24 10:54:28 ** C:\condor\bin\condor_shadow.exe

	8/24 10:54:28 ** $CondorVersion: 6.8.4 Feb  1 2007 $

	8/24 10:54:28 ** $CondorPlatform: INTEL-WINNT50 $

	8/24 10:54:28 ** PID = 2752

	8/24 10:54:28 ** Log last touched 8/24 10:54:26

	8/24 10:54:28 ******************************************************

	8/24 10:54:28 Using config source: C:\condor\condor_config

8/24 10:54:28 Using local config sources:
	8/24 10:54:28    C:\condor/condor_config.local

	8/24 10:54:28 DaemonCore: Command Socket at <10.254.254.219:1177>

	8/24 10:54:28 Initializing a VANILLA shadow for job 2.0

	8/24 10:54:28 (2.0) (2752): Request to run on <10.254.254.219:1068>
was ACCEPTED

	8/24 10:54:30 (2.0) (2752): condor_read(): recv() returned -1, errno
= 10054, assuming failure reading 5 bytes from <10.254.254.219:1068>.

	8/24 10:54:30 (2.0) (2752): Can no longer talk to condor_starter
<10.254.254.219:1068>

	8/24 10:54:30 (2.0) (2752): Trying to reconnect to disconnected job

	8/24 10:54:30 (2.0) (2752): LastJobLeaseRenewal: 1187942070 Fri Aug
24 10:54:30 2007

	8/24 10:54:30 (2.0) (2752): JobLeaseDuration: 1200 seconds

	8/24 10:54:30 (2.0) (2752): JobLeaseDuration remaining: 1200

	8/24 10:54:30 (2.0) (2752): Attempting to locate disconnected
starter

	8/24 10:54:30 (2.0) (2752): Found starter: <10.254.254.219:1179>

	8/24 10:54:30 (2.0) (2752): Attempting to reconnect to starter
<10.254.254.219:1179>

	8/24 10:54:31 (2.0) (2752): attempt to connect to
<10.254.254.219:1179> failed: connect errno = 10061 connection refused.

	8/24 10:54:31 (2.0) (2752): Attempt to reconnect failed: Failed to
connect to starter <10.254.254.219:1179>

	8/24 10:54:31 (2.0) (2752): JobLeaseDuration remaining: 1199

	8/24 10:54:31 (2.0) (2752): Scheduling another attempt to reconnect
in 8 seconds

	8/24 10:54:39 (2.0) (2752): Attempting to locate disconnected
starter

	8/24 10:54:39 (2.0) (2752): locateStarter(): ClaimId
(<10.254.254.219:1068>#1187941812#4) and GlobalJobId ( comp12#1187942060#2.0
) not found

	8/24 10:54:39 (2.0) (2752): Reconnect FAILED: Job not found at
execution machine

	8/24 10:54:39 (2.0) (2752): **** condor_shadow (condor_SHADOW)
EXITING WITH STATUS 107

	8/24 14:15:42 ******************************************************

	8/24 14:15:42 ** condor_shadow (CONDOR_SHADOW) STARTING UP

	8/24 14:15:42 ** C:\condor\bin\condor_shadow.exe

	8/24 14:15:42 ** $CondorVersion: 6.8.4 Feb  1 2007 $

	8/24 14:15:42 ** $CondorPlatform: INTEL-WINNT50 $

	8/24 14:15:42 ** PID = 2252

	8/24 14:15:42 ** Log last touched 8/24 10:54:39

	8/24 14:15:42 ******************************************************

	8/24 14:15:42 Using config source: C:\condor\condor_config

8/24 14:15:42 Using local config sources:
	8/24 14:15:42    C:\condor/condor_config.local

	8/24 14:15:42 DaemonCore: Command Socket at <10.254.254.219:2756>

	8/24 14:15:42 Initializing a VANILLA shadow for job 6.0

	8/24 14:15:42 (6.0) (2252): Request to run on <10.254.254.219:1068>
was ACCEPTED

	8/24 14:15:44 ******************************************************

	8/24 14:15:44 ** condor_shadow (CONDOR_SHADOW) STARTING UP

	8/24 14:15:44 ** C:\condor\bin\condor_shadow.exe

	8/24 14:15:44 ** $CondorVersion: 6.8.4 Feb  1 2007 $

	8/24 14:15:44 ** $CondorPlatform: INTEL-WINNT50 $

	8/24 14:15:44 ** PID = 3252

	8/24 14:15:44 ** Log last touched 8/24 14:15:42

	8/24 14:15:44 ******************************************************

	8/24 14:15:44 Using config source: C:\condor\condor_config

8/24 14:15:44 Using local config sources:
	8/24 14:15:44    C:\condor/condor_config.local

	8/24 14:15:44 DaemonCore: Command Socket at <10.254.254.219:2767>

	8/24 14:15:44 Initializing a VANILLA shadow for job 7.0

	8/24 14:15:44 (7.0) (3252): Request to run on <10.254.254.219:1068>
was ACCEPTED

	8/24 14:15:44 (6.0) (2252): ERROR "Error from starter on vm1@comp12:
Create_Process(C:\condor\execute\dir_3500\condor_exec.exe,, ...) failed" at
line 643 in file ..\src\condor_shadow.V6.1\pseudo_ops.C

	8/24 14:15:46 (7.0) (3252): ERROR "Error from starter on vm2@comp12:
Create_Process(C:\condor\execute\dir_3288\condor_exec.exe,, ...) failed" at
line 643 in file ..\src\condor_shadow.V6.1\pseudo_ops.C

	8/24 14:16:15 ******************************************************

	8/24 14:16:15 ** condor_shadow (CONDOR_SHADOW) STARTING UP

	8/24 14:16:15 ** C:\condor\bin\condor_shadow.exe

	8/24 14:16:15 ** $CondorVersion: 6.8.4 Feb  1 2007 $

	8/24 14:16:15 ** $CondorPlatform: INTEL-WINNT50 $

	8/24 14:16:15 ** PID = 1648

	8/24 14:16:15 ** Log last touched 8/24 14:15:46

	8/24 14:16:15 ******************************************************

	8/24 14:16:15 Using config source: C:\condor\condor_config

8/24 14:16:15 Using local config sources:
	8/24 14:16:15    C:\condor/condor_config.local

	8/24 14:16:15 DaemonCore: Command Socket at <10.254.254.219:2821>

	8/24 14:16:15 Initializing a VANILLA shadow for job 8.0

	8/24 14:16:15 (8.0) (1648): Request to run on <10.254.254.219:1068>
was ACCEPTED

	8/24 14:16:17 (8.0) (1648): condor_read(): recv() returned -1, errno
= 10054, assuming failure reading 5 bytes from <10.254.254.219:1068>.

	8/24 14:16:17 (8.0) (1648): Can no longer talk to condor_starter
<10.254.254.219:1068>

	8/24 14:16:17 (8.0) (1648): Trying to reconnect to disconnected job

	8/24 14:16:17 (8.0) (1648): LastJobLeaseRenewal: 1187954177 Fri Aug
24 14:16:17 2007

	8/24 14:16:17 (8.0) (1648): JobLeaseDuration: 1200 seconds

	8/24 14:16:17 (8.0) (1648): JobLeaseDuration remaining: 1200

	8/24 14:16:17 (8.0) (1648): Attempting to locate disconnected
starter

	8/24 14:16:17 (8.0) (1648): Found starter: <10.254.254.219:2823>

	8/24 14:16:17 (8.0) (1648): Attempting to reconnect to starter
<10.254.254.219:2823>

	8/24 14:16:18 (8.0) (1648): attempt to connect to
<10.254.254.219:2823> failed: connect errno = 10061 connection refused.

	8/24 14:16:18 (8.0) (1648): Attempt to reconnect failed: Failed to
connect to starter <10.254.254.219:2823>

	8/24 14:16:18 (8.0) (1648): JobLeaseDuration remaining: 1199

	8/24 14:16:18 (8.0) (1648): Scheduling another attempt to reconnect
in 8 seconds

	8/24 14:16:26 (8.0) (1648): Attempting to locate disconnected
starter

	8/24 14:16:26 (8.0) (1648): locateStarter(): ClaimId
(<10.254.254.219:1068>#1187941812#11) and GlobalJobId (
comp12#1187954170#8.0 ) not found

	8/24 14:16:26 (8.0) (1648): Reconnect FAILED: Job not found at
execution machine

	8/24 14:16:26 (8.0) (1648): **** condor_shadow (condor_SHADOW)
EXITING WITH STATUS 107

	8/24 14:33:27 ******************************************************

	8/24 14:33:27 ** condor_shadow (CONDOR_SHADOW) STARTING UP

	8/24 14:33:27 ** C:\condor\bin\condor_shadow.exe

	8/24 14:33:27 ** $CondorVersion: 6.8.4 Feb  1 2007 $

	8/24 14:33:27 ** $CondorPlatform: INTEL-WINNT50 $

	8/24 14:33:27 ** PID = 2752

	8/24 14:33:27 ** Log last touched 8/24 14:16:26

	8/24 14:33:27 ******************************************************

	8/24 14:33:27 Using config source: C:\condor\condor_config

8/24 14:33:27 Using local config sources:
	8/24 14:33:27    C:\condor/condor_config.local

	8/24 14:33:27 DaemonCore: Command Socket at <10.254.254.219:2959>

	8/24 14:33:27 Initializing a VANILLA shadow for job 9.0

	8/24 14:33:27 (9.0) (2752): Request to run on <10.254.254.219:1068>
was ACCEPTED

	8/24 14:33:28 (9.0) (2752): ERROR "Error from starter on vm1@comp12:
Create_Process(C:\condor\execute\dir_2880\condor_exec.exe,, ...) failed" at
line 643 in file ..\src\condor_shadow.V6.1\pseudo_ops.C

	8/24 14:33:29 ******************************************************

	8/24 14:33:29 ** condor_shadow (CONDOR_SHADOW) STARTING UP

	8/24 14:33:29 ** C:\condor\bin\condor_shadow.exe

	8/24 14:33:29 ** $CondorVersion: 6.8.4 Feb  1 2007 $

	8/24 14:33:29 ** $CondorPlatform: INTEL-WINNT50 $

	8/24 14:33:29 ** PID = 684

	8/24 14:33:29 ** Log last touched 8/24 14:33:28

	8/24 14:33:29 ******************************************************

	8/24 14:33:29 Using config source: C:\condor\condor_config

8/24 14:33:29 Using local config sources:
	8/24 14:33:29    C:\condor/condor_config.local

	8/24 14:33:29 DaemonCore: Command Socket at <10.254.254.219:2976>

	8/24 14:33:29 Initializing a VANILLA shadow for job 10.0

	8/24 14:33:29 (10.0) (684): Request to run on <10.254.254.219:1068>
was ACCEPTED

	8/24 14:33:30 (10.0) (684): ERROR "Error from starter on vm2@comp12:
Create_Process(C:\condor\execute\dir_3120\condor_exec.exe,, ...) failed" at
line 643 in file ..\src\condor_shadow.V6.1\pseudo_ops.C

	8/24 14:33:31 ******************************************************

	8/24 14:33:31 ** condor_shadow (CONDOR_SHADOW) STARTING UP

	8/24 14:33:31 ** C:\condor\bin\condor_shadow.exe

	8/24 14:33:31 ** $CondorVersion: 6.8.4 Feb  1 2007 $

	8/24 14:33:31 ** $CondorPlatform: INTEL-WINNT50 $

	8/24 14:33:31 ** PID = 1936

	8/24 14:33:31 ** Log last touched 8/24 14:33:30

	8/24 14:33:31 ******************************************************

	8/24 14:33:31 Using config source: C:\condor\condor_config

8/24 14:33:31 Using local config sources:
	8/24 14:33:31    C:\condor/condor_config.local

	8/24 14:33:31 DaemonCore: Command Socket at <10.254.254.219:2991>

	8/24 14:33:31 Initializing a VANILLA shadow for job 9.0

	8/24 14:33:31 (9.0) (1936): Request to run on <10.254.254.219:1068>
was ACCEPTED

	8/24 14:33:32 (9.0) (1936): condor_read(): recv() returned -1, errno
= 10054, assuming failure reading 5 bytes from <10.254.254.219:1068>.

	8/24 14:33:32 (9.0) (1936): Can no longer talk to condor_starter
<10.254.254.219:1068>

	8/24 14:33:33 (9.0) (1936): Trying to reconnect to disconnected job

	8/24 14:33:33 (9.0) (1936): LastJobLeaseRenewal: 1187955212 Fri Aug
24 14:33:32 2007

	8/24 14:33:33 (9.0) (1936): JobLeaseDuration: 1200 seconds

	8/24 14:33:33 (9.0) (1936): JobLeaseDuration remaining: 1199

	8/24 14:33:33 (9.0) (1936): Attempting to locate disconnected
starter

	8/24 14:33:33 (9.0) (1936): Found starter: <10.254.254.219:2993>

	8/24 14:33:33 (9.0) (1936): Attempting to reconnect to starter
<10.254.254.219:2993>

	8/24 14:33:33 ******************************************************

	8/24 14:33:33 ** condor_shadow (CONDOR_SHADOW) STARTING UP

	8/24 14:33:33 ** C:\condor\bin\condor_shadow.exe

	8/24 14:33:33 ** $CondorVersion: 6.8.4 Feb  1 2007 $

	8/24 14:33:33 ** $CondorPlatform: INTEL-WINNT50 $

	8/24 14:33:33 ** PID = 3220

	8/24 14:33:33 ** Log last touched 8/24 14:33:33

	8/24 14:33:33 ******************************************************

	8/24 14:33:33 Using config source: C:\condor\condor_config

8/24 14:33:33 Using local config sources:
	8/24 14:33:33    C:\condor/condor_config.local

	8/24 14:33:33 DaemonCore: Command Socket at <10.254.254.219:3003>

	8/24 14:33:33 Initializing a VANILLA shadow for job 10.0

	8/24 14:33:34 (10.0) (3220): Request to run on <10.254.254.219:1068>
was ACCEPTED

	8/24 14:33:34 (9.0) (1936): attempt to connect to
<10.254.254.219:2993> failed: connect errno = 10061 connection refused.

	8/24 14:33:34 (9.0) (1936): Attempt to reconnect failed: Failed to
connect to starter <10.254.254.219:2993>

	8/24 14:33:34 (9.0) (1936): JobLeaseDuration remaining: 1199

	8/24 14:33:34 (9.0) (1936): Scheduling another attempt to reconnect
in 8 seconds

	8/24 14:33:35 (10.0) (3220): ERROR "Error from starter on
vm2@comp12: Create_Process(C:\condor\execute\dir_3692\condor_exec.exe,, ...)
failed" at line 643 in file ..\src\condor_shadow.V6.1\pseudo_ops.C

	8/24 14:33:38 ******************************************************

	8/24 14:33:38 ** condor_shadow (CONDOR_SHADOW) STARTING UP

	8/24 14:33:38 ** C:\condor\bin\condor_shadow.exe

	8/24 14:33:38 ** $CondorVersion: 6.8.4 Feb  1 2007 $

	8/24 14:33:38 ** $CondorPlatform: INTEL-WINNT50 $

	8/24 14:33:38 ** PID = 208

	8/24 14:33:38 ** Log last touched 8/24 14:33:35

	8/24 14:33:38 ******************************************************

	8/24 14:33:38 Using config source: C:\condor\condor_config

8/24 14:33:38 Using local config sources:
	8/24 14:33:38    C:\condor/condor_config.local

	8/24 14:33:38 DaemonCore: Command Socket at <10.254.254.219:3022>

	8/24 14:33:38 Initializing a VANILLA shadow for job 10.0

	8/24 14:33:38 (10.0) (208): Request to run on <10.254.254.219:1068>
was ACCEPTED

	8/24 14:33:39 (10.0) (208): ERROR "Error from starter on vm2@comp12:
Create_Process(C:\condor\execute\dir_3668\condor_exec.exe,, ...) failed" at
line 643 in file ..\src\condor_shadow.V6.1\pseudo_ops.C

	8/24 14:33:42 (9.0) (1936): Attempting to locate disconnected
starter

	8/24 14:33:42 (9.0) (1936): locateStarter(): ClaimId
(<10.254.254.219:1068>#1187941812#14) and GlobalJobId (
comp12#1187955110#9.0 ) not found

	8/24 14:33:42 (9.0) (1936): Reconnect FAILED: Job not found at
execution machine

	8/24 14:33:42 (9.0) (1936): **** condor_shadow (condor_SHADOW)
EXITING WITH STATUS 107

	8/24 14:33:42 ******************************************************

	8/24 14:33:42 ** condor_shadow (CONDOR_SHADOW) STARTING UP

	8/24 14:33:42 ** C:\condor\bin\condor_shadow.exe

	8/24 14:33:42 ** $CondorVersion: 6.8.4 Feb  1 2007 $

	8/24 14:33:42 ** $CondorPlatform: INTEL-WINNT50 $

	8/24 14:33:42 ** PID = 2316

	8/24 14:33:42 ** Log last touched 8/24 14:33:42

	8/24 14:33:42 ******************************************************

	8/24 14:33:42 Using config source: C:\condor\condor_config

8/24 14:33:42 Using local config sources:
	8/24 14:33:42    C:\condor/condor_config.local

	8/24 14:33:42 DaemonCore: Command Socket at <10.254.254.219:3044>

	8/24 14:33:42 Initializing a VANILLA shadow for job 10.0

	8/24 14:33:42 (10.0) (2316): Request to run on <10.254.254.219:1068>
was ACCEPTED

	8/24 14:33:43 (10.0) (2316): condor_read(): recv() returned -1,
errno = 10054, assuming failure reading 5 bytes from <10.254.254.219:1068>.

	8/24 14:33:43 (10.0) (2316): Can no longer talk to condor_starter
<10.254.254.219:1068>

	8/24 14:33:44 (10.0) (2316): Trying to reconnect to disconnected job

	8/24 14:33:44 (10.0) (2316): LastJobLeaseRenewal: 1187955223 Fri Aug
24 14:33:43 2007

	8/24 14:33:44 (10.0) (2316): JobLeaseDuration: 1200 seconds

	8/24 14:33:44 (10.0) (2316): JobLeaseDuration remaining: 1199

	8/24 14:33:44 (10.0) (2316): Attempting to locate disconnected
starter

	8/24 14:33:44 (10.0) (2316): Found starter: <10.254.254.219:3046>

	8/24 14:33:44 (10.0) (2316): Attempting to reconnect to starter
<10.254.254.219:3046>

	8/24 14:33:44 (10.0) (2316): attempt to connect to
<10.254.254.219:3046> failed: connect errno = 10061 connection refused.

	8/24 14:33:44 (10.0) (2316): Attempt to reconnect failed: Failed to
connect to starter <10.254.254.219:3046>

	8/24 14:33:44 (10.0) (2316): JobLeaseDuration remaining: 1200

	8/24 14:33:44 (10.0) (2316): Scheduling another attempt to reconnect
in 8 seconds

	8/24 14:33:52 (10.0) (2316): Attempting to locate disconnected
starter

	8/24 14:33:53 (10.0) (2316): locateStarter(): ClaimId
(<10.254.254.219:1068>#1187941812#12) and GlobalJobId (
comp12#1187955203#10.0 ) not found

	8/24 14:33:53 (10.0) (2316): Reconnect FAILED: Job not found at
execution machine

	8/24 14:33:53 (10.0) (2316): **** condor_shadow (condor_SHADOW)
EXITING WITH STATUS 107

	8/24 14:50:48 ******************************************************

	8/24 14:50:48 ** condor_shadow (CONDOR_SHADOW) STARTING UP

	8/24 14:50:48 ** C:\condor\bin\condor_shadow.exe

	8/24 14:50:48 ** $CondorVersion: 6.8.4 Feb  1 2007 $

	8/24 14:50:48 ** $CondorPlatform: INTEL-WINNT50 $

	8/24 14:50:48 ** PID = 1452

	8/24 14:50:48 ** Log last touched 8/24 14:33:53

	8/24 14:50:48 ******************************************************

	8/24 14:50:48 Using config source: C:\condor\condor_config

8/24 14:50:48 Using local config sources:
	8/24 14:50:48    C:\condor/condor_config.local

	8/24 14:50:48 DaemonCore: Command Socket at <10.254.254.219:3211>

	8/24 14:50:48 Initializing a VANILLA shadow for job 11.0

	8/24 14:50:48 (11.0) (1452): Request to run on <10.254.254.219:1068>
was ACCEPTED

	8/24 14:50:49 (11.0) (1452): condor_read(): recv() returned -1,
errno = 10054, assuming failure reading 5 bytes from <10.254.254.219:1068>.

	8/24 14:50:49 (11.0) (1452): Can no longer talk to condor_starter
<10.254.254.219:1068>

	8/24 14:50:49 (11.0) (1452): Trying to reconnect to disconnected job

	8/24 14:50:49 (11.0) (1452): LastJobLeaseRenewal: 1187956249 Fri Aug
24 14:50:49 2007

	8/24 14:50:49 (11.0) (1452): JobLeaseDuration: 1200 seconds

	8/24 14:50:49 (11.0) (1452): JobLeaseDuration remaining: 1200

	8/24 14:50:49 (11.0) (1452): Attempting to locate disconnected
starter

	8/24 14:50:49 (11.0) (1452): Found starter: <10.254.254.219:3216>

	8/24 14:50:49 (11.0) (1452): Attempting to reconnect to starter
<10.254.254.219:3216>

	8/24 14:50:50 (11.0) (1452): attempt to connect to
<10.254.254.219:3216> failed: connect errno = 10061 connection refused.

	8/24 14:50:50 (11.0) (1452): Attempt to reconnect failed: Failed to
connect to starter <10.254.254.219:3216>

	8/24 14:50:50 (11.0) (1452): JobLeaseDuration remaining: 1199

	8/24 14:50:50 (11.0) (1452): Scheduling another attempt to reconnect
in 8 seconds

	8/24 14:50:58 (11.0) (1452): Attempting to locate disconnected
starter

	8/24 14:50:58 (11.0) (1452): locateStarter(): ClaimId
(<10.254.254.219:1068>#1187941812#17) and GlobalJobId (
comp12#1187956244#11.0 ) not found

	8/24 14:50:58 (11.0) (1452): Reconnect FAILED: Job not found at
execution machine

	8/24 14:50:58 (11.0) (1452): **** condor_shadow (condor_SHADOW)
EXITING WITH STATUS 107

	8/24 14:52:23 ******************************************************

	8/24 14:52:23 ** condor_shadow (CONDOR_SHADOW) STARTING UP

	8/24 14:52:23 ** C:\condor\bin\condor_shadow.exe

	8/24 14:52:23 ** $CondorVersion: 6.8.4 Feb  1 2007 $

	8/24 14:52:23 ** $CondorPlatform: INTEL-WINNT50 $

	8/24 14:52:23 ** PID = 612

	8/24 14:52:23 ** Log last touched 8/24 14:50:58

	8/24 14:52:23 ******************************************************

	8/24 14:52:23 Using config source: C:\condor\condor_config

8/24 14:52:23 Using local config sources:
	8/24 14:52:23    C:\condor/condor_config.local

	8/24 14:52:23 DaemonCore: Command Socket at <10.254.254.219:3268>

	8/24 14:52:23 Initializing a VANILLA shadow for job 12.0

	8/24 14:52:23 (12.0) (612): Request to run on <10.254.254.219:1068>
was ACCEPTED

	8/24 14:52:25 (12.0) (612): ERROR "Error from starter on vm1@comp12:
Create_Process(C:\condor\execute\dir_3812\condor_exec.exe,, ...) failed" at
line 643 in file ..\src\condor_shadow.V6.1\pseudo_ops.C

	8/24 14:52:27 ******************************************************

	8/24 14:52:27 ** condor_shadow (CONDOR_SHADOW) STARTING UP

	8/24 14:52:27 ** C:\condor\bin\condor_shadow.exe

	8/24 14:52:27 ** $CondorVersion: 6.8.4 Feb  1 2007 $

	8/24 14:52:27 ** $CondorPlatform: INTEL-WINNT50 $

	8/24 14:52:27 ** PID = 2648

	8/24 14:52:27 ** Log last touched 8/24 14:52:25

	8/24 14:52:27 ******************************************************

	8/24 14:52:27 Using config source: C:\condor\condor_config

8/24 14:52:27 Using local config sources:
	8/24 14:52:27    C:\condor/condor_config.local

	8/24 14:52:27 DaemonCore: Command Socket at <10.254.254.219:3286>

	8/24 14:52:27 Initializing a VANILLA shadow for job 12.0

	8/24 14:52:27 (12.0) (2648): Request to run on <10.254.254.219:1068>
was ACCEPTED

	8/24 14:52:29 (12.0) (2648): ERROR "Error from starter on
vm1@comp12: Create_Process(C:\condor\execute\dir_2744\condor_exec.exe,, ...)
failed" at line 643 in file ..\src\condor_shadow.V6.1\pseudo_ops.C

	8/24 14:52:31 ******************************************************

	8/24 14:52:31 ** condor_shadow (CONDOR_SHADOW) STARTING UP

	8/24 14:52:31 ** C:\condor\bin\condor_shadow.exe

	8/24 14:52:31 ** $CondorVersion: 6.8.4 Feb  1 2007 $

	8/24 14:52:31 ** $CondorPlatform: INTEL-WINNT50 $

	8/24 14:52:31 ** PID = 1360

	8/24 14:52:31 ** Log last touched 8/24 14:52:29

	8/24 14:52:31 ******************************************************

	8/24 14:52:31 Using config source: C:\condor\condor_config

8/24 14:52:31 Using local config sources:
	8/24 14:52:31    C:\condor/condor_config.local

	8/24 14:52:31 DaemonCore: Command Socket at <10.254.254.219:3305>

	8/24 14:52:31 Initializing a VANILLA shadow for job 12.0

	8/24 14:52:31 (12.0) (1360): Request to run on <10.254.254.219:1068>
was ACCEPTED

	8/24 14:52:33 (12.0) (1360): ERROR "Error from starter on
vm1@comp12: Create_Process(C:\condor\execute\dir_3264\condor_exec.exe,, ...)
failed" at line 643 in file ..\src\condor_shadow.V6.1\pseudo_ops.C

	8/24 14:52:35 ******************************************************

	8/24 14:52:35 ** condor_shadow (CONDOR_SHADOW) STARTING UP

	8/24 14:52:35 ** C:\condor\bin\condor_shadow.exe

	8/24 14:52:35 ** $CondorVersion: 6.8.4 Feb  1 2007 $

	8/24 14:52:35 ** $CondorPlatform: INTEL-WINNT50 $

	8/24 14:52:35 ** PID = 3804

	8/24 14:52:35 ** Log last touched 8/24 14:52:33

	8/24 14:52:35 ******************************************************

	8/24 14:52:35 Using config source: C:\condor\condor_config

8/24 14:52:35 Using local config sources:
	8/24 14:52:35    C:\condor/condor_config.local

	8/24 14:52:35 DaemonCore: Command Socket at <10.254.254.219:3323>

	8/24 14:52:35 Initializing a VANILLA shadow for job 12.0

	8/24 14:52:35 (12.0) (3804): Request to run on <10.254.254.219:1068>
was ACCEPTED

	8/24 14:52:36 (12.0) (3804): ERROR "Error from starter on
vm1@comp12: Create_Process(C:\condor\execute\dir_528\condor_exec.exe,, ...)
failed" at line 643 in file ..\src\condor_shadow.V6.1\pseudo_ops.C

	8/24 14:52:39 ******************************************************

	8/24 14:52:39 ** condor_shadow (CONDOR_SHADOW) STARTING UP

	8/24 14:52:39 ** C:\condor\bin\condor_shadow.exe

	8/24 14:52:39 ** $CondorVersion: 6.8.4 Feb  1 2007 $

	8/24 14:52:39 ** $CondorPlatform: INTEL-WINNT50 $

	8/24 14:52:39 ** PID = 1220

	8/24 14:52:39 ** Log last touched 8/24 14:52:36

	8/24 14:52:39 ******************************************************

	8/24 14:52:39 Using config source: C:\condor\condor_config

8/24 14:52:39 Using local config sources:
	8/24 14:52:39    C:\condor/condor_config.local

	8/24 14:52:39 DaemonCore: Command Socket at <10.254.254.219:3345>

	8/24 14:52:39 Initializing a VANILLA shadow for job 12.0

	8/24 14:52:40 (12.0) (1220): Request to run on <10.254.254.219:1068>
was ACCEPTED

	8/24 14:52:41 (12.0) (1220): ERROR "Error from starter on
vm1@comp12: Create_Process(C:\condor\execute\dir_2860\condor_exec.exe,, ...)
failed" at line 643 in file ..\src\condor_shadow.V6.1\pseudo_ops.C

	I had a doubdt that the problem could be in the compiler (in this
case Turbo Pascal 7). I tried to compile and link an executable with Borland
C. The result was exactly the same.

	I would be happy to understand where to seek the solution. Thanks in
advance!

	Ivailo Penev


_______________________________________________
Condor-users mailing list
To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/condor-users

The archives can be found at: https://lists.cs.wisc.edu/archive/condor-users/