[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Condor-users] Condor-C Create_Process failed!




Hello everyone!
 
When i use Condor-C to submit one job from a condor pool to another, i encountered a problem like this:
 
The job submitting file is as follows:
 
universe = grid
CMD = test.bat
Requirements = OpSys == "WINNT51"
grid_resource = condor node3.nova.cn node3.nova.cn
+remote_jobuniverse = 5
+remote_requirements = True
+remote_WhenToTransferOutput = "ON_EXIT"
queue
 
And ths bat file is as follows:
 
hostname
 
When i submit the job to node3.nova.cn, the shadow log on node3.nova.cn is as follows:

12/10 23:59:27 ******************************************************
12/10 23:59:27 ** condor_shadow (CONDOR_SHADOW) STARTING UP
12/10 23:59:27 ** /opt/condor-7.4.4/sbin/condor_shadow
12/10 23:59:27 ** SubsystemInfo: name=SHADOW type=SHADOW(6) class=DAEMON(1)
12/10 23:59:27 ** Configuration: subsystem:SHADOW local:<NONE> class:DAEMON
12/10 23:59:27 ** $CondorVersion: 7.4.4 Oct 14 2010 BuildID: 279383 $
12/10 23:59:27 ** $CondorPlatform: I386-LINUX_RHEL5 $
12/10 23:59:27 ** PID = 15273
12/10 23:59:27 ** Log last touched 12/10 23:58:31
12/10 23:59:27 ******************************************************
12/10 23:59:27 Using config source: /opt/condor-7.4.4/etc/condor_config
12/10 23:59:27 Using local config sources:
12/10 23:59:27    /opt/condor-7.4.4/local.node3/condor_config.local
12/10 23:59:27 DaemonCore: Command Socket at <10.1.1.28:52181>
12/10 23:59:27 Initializing a VANILLA shadow for job 368.0
12/10 23:59:27 (368.0) (15273): Request to run on jsi21-1 <10.1.1.11:1033> was ACCEPTED
12/10 23:59:28 (368.0) (15273): ERROR "Error from jsi21-1: Create_Process(C:\WINDOWS\system32\cmd.exe,/Q /C /opt/condor-7.4.4/local.node3/spool/cluster368.proc0.subproc0/test.ba
t, ...) failed" at line 687 in file pseudo_ops.cpp
12/10 23:59:28 ******************************************************
At the node node3.nova.cn ,we execute the command "condor_status", its output is like this:
 
Name               OpSys      Arch   State     Activity LoadAv Mem   ActvtyTime
slot1@xxxxxxxxxxxx LINUX      INTEL  Unclaimed Idle     0.870  1635  0+02:31:39
slot2@xxxxxxxxxxxx LINUX      INTEL  Unclaimed Idle     0.000  1635  0+06:08:19
slot3@xxxxxxxxxxxx LINUX      INTEL  Unclaimed Idle     0.000  1635  0+06:08:15
slot4@xxxxxxxxxxxx LINUX      INTEL  Unclaimed Idle     0.000  1635  0+06:08:17
jsi21-1            WINNT51    INTEL  Unclaimed Idle     0.050   511  0+00:00:04
                     Total Owner Claimed Unclaimed Matched Preempting Backfill
         INTEL/LINUX     4     0       0         4       0          0        0
       INTEL/WINNT51     1     0       0         1       0          0        0
               Total     5     0       0         5       0          0        0
As you can, there is an error. The node jsi21-1 is a windows machine and it's ip is 10.1.1.11.
 
So anybody encouter the same problem? Have any solutions?
 
Thanks!