[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Condor-users] shawdow exception :create_process failed



Hi Everyone,
 
I have 6 windows machine seup to run condor when i submit a R job to the condor its is running for few seconds and going back to the idle state and again running when i checked my log file i got a message saying Error from slot2@xxxxxxxx: Create_Process(C:\condor\execute\dir_3168\condor_exec.R,, ...) failed:
here is my submit file:
********************************************************
Universe = vanilla
Executable = C:\Users\test_case\Simulate_Normal_Data.R
getenv = TRUE
should_transfer_files = YES
transfer_executable = NO
when_to_transfer_output = ON_EXIT
#Requirements = Arch == "INTEl" && Opsys == "WINNT50"
Output = test_r_out.out
Log = test_r_log.log
error = error_r.error
Queue
*********************************************************
 
here is my log file test_r_log.log
 
*********************************************************
001 (127.000.000) 08/13 13:22:00 Job executing on host:
...
007 (127.000.000) 08/13 13:22:01 Shadow exception!
 Error from slot2@xxxxxxxxxxxx: Create_Process(C:\condor\execute\dir_1688\condor_exec.R,, ...) failed:
 0  -  Run Bytes Sent By Job
 5025  -  Run Bytes Received By Job
...
001 (128.000.000) 08/13 13:22:04 Job executing on host:
...
007 (128.000.000) 08/13 13:22:05 Shadow exception!
 Error from slot1@xxxxxxxxxxxxx Create_Process(C:\condor\execute\dir_1752\condor_exec.R,, ...) failed:
 0  -  Run Bytes Sent By Job
 5025  -  Run Bytes Received By Job
...
001 (127.000.000) 08/13 13:22:08 Job executing on host:
...
007 (127.000.000) 08/13 13:22:09 Shadow exception!
 Error from slot1@xxxxxxxxxxxxx Create_Process(C:\condor\execute\dir_2396\condor_exec.R,, ...) failed:
 0  -  Run Bytes Sent By Job
 5025  -  Run Bytes Received By Job
...
001 (127.000.000) 08/13 13:22:13 Job executing on host:
...
007 (127.000.000) 08/13 13:22:13 Shadow exception!
 Error from slot1@xxxxxxxxxxxxx: Create_Process(C:\condor\execute\dir_2664\condor_exec.R,, ...) failed:
 0  -  Run Bytes Sent By Job
 5025  -  Run Bytes Received By Job
...
001 (127.000.000) 08/13 13:22:20 Job executing on host:
...
007 (127.000.000) 08/13 13:22:21 Shadow exception!
 Error from slot1@xxxxxxxxxxxxx: Create_Process(C:\condor\execute\dir_3764\condor_exec.R,, ...) failed:
 0  -  Run Bytes Sent By Job
 5025  -  Run Bytes Received By Job
...
001 (128.000.000) 08/13 13:22:21 Job executing on host:
...
007 (128.000.000) 08/13 13:22:21 Shadow exception!
 Error from slot2@xxxxxxxxxxxx: Create_Process(C:\condor\execute\dir_3724\condor_exec.R,, ...) failed:
 0  -  Run Bytes Sent By Job
 5025  -  Run Bytes Received By Job
...
001 (127.000.000) 08/13 13:22:25 Job executing on host:
...
007 (127.000.000) 08/13 13:22:26 Shadow exception!
 Error from slot2@xxxxxxxxxxxxxx: Create_Process(C:\condor\execute\dir_3112\condor_exec.R,, ...) failed:
 0  -  Run Bytes Sent By Job
 5025  -  Run Bytes Received By Job
...
001 (127.000.000) 08/13 13:22:30 Job executing on host:
...
007 (127.000.000) 08/13 13:22:30 Shadow exception!
 Error from slot2@xxxxxxxxxxxxCreate_Process(C:\condor\execute\dir_2508\condor_exec.R,, ...) failed:
 0  -  Run Bytes Sent By Job
 5025  -  Run Bytes Received By Job
...
001 (127.000.000) 08/13 13:22:34 Job executing on host:
...
007 (127.000.000) 08/13 13:22:35 Shadow exception!
 Error from slot2@xxxxxxxxxxxxxxxx: Create_Process(C:\condor\execute\dir_3168\condor_exec.R,, ...) failed:
 0  -  Run Bytes Sent By Job
 5025  -  Run Bytes Received By Job
...
****************************************************************************************
 
here is my starter log on executing machine
 
****************************************************************************************
Now in new log file C:\condor/log/StarterLog.slot1
08/13 13:03:05 ** Configuration: subsystem:STARTER local:<NONE> class:DAEMON
08/13 13:03:05 ** $CondorVersion: 7.4.2 Mar 30 2010 BuildID: 227044 $
08/13 13:03:05 ** $CondorPlatform: INTEL-WINNT50 $
08/13 13:03:05 ** PID = 3612
08/13 13:03:05 ** Log last touched 8/13 12:03:03
08/13 13:03:05 ******************************************************
08/13 13:03:05 Using config source: C:\condor\condor_config
08/13 13:03:05 Using local config sources:
08/13 13:03:05    C:\condor\condor_config.local
08/13 13:03:05 DaemonCore: Command Socket at <192.168.0.103:53075>
08/13 13:03:05 GLEXEC_JOB not supported on this platform; ignoring
08/13 13:03:05 Setting resource limits not implemented!
08/13 13:03:05 Communicating with shadow <192.168.0.106:52457>
08/13 13:03:05 Submitting machine is "machine_name"
08/13 13:03:05 setting the orig job name in starter
08/13 13:03:05 setting the orig job iwd in starter
08/13 13:03:08 File transfer completed successfully.
08/13 13:03:09 Job 127.0 set to execute immediately
08/13 13:03:09 Starting a VANILLA universe job with ID: 127.0
08/13 13:03:09 Tracking process family by login "condor-reuse-slot1"
08/13 13:03:09 IWD: C:\condor\execute\dir_3612
08/13 13:03:09 Output file: C:\condor\execute\dir_3612\test_r_out.out
08/13 13:03:09 Error file: C:\condor\execute\dir_3612\error_r.error
08/13 13:03:10 Renice expr "10" evaluated to 10
08/13 13:03:10 About to exec C:\condor\execute\dir_3612\condor_exec.R
08/13 13:03:10 GetExecutableAndArgumentsByExtention: failed to find extension *.R in the registry (last-error = 2).
08/13 13:03:10 Create_Process(): Failed to find an executable for extension *.R
08/13 13:03:10 ERROR: C:\condor\execute\dir_3612\condor_exec.R is not a valid Windows executable
08/13 13:03:10 ERROR "Create_Process(C:\condor\execute\dir_3612\condor_exec.R,, ...) failed: " at line 530 in file ..\src\condor_starter.V6.1\os_proc.cpp
08/13 13:03:10 ShutdownFast all jobs.
08/13 13:03:29 Locale: English_United States.1252
********************************************************************