[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Condor-users] FW: condor job has exited normally with status-1073741515



I ran the batch file ‘echo “hello World” and it ran just fine. It says it exit out with code 0;

The logs are the following

a) Execute node’s Starterlog:

11/17 15:48:16 ******************************************************

11/17 15:48:16 ** condor_starter (CONDOR_STARTER) STARTING UP

11/17 15:48:16 ** C:\condor\bin\condor_starter.exe

11/17 15:48:16 ** $CondorVersion: 6.8.7 Nov 29 2007 $

11/17 15:48:16 ** $CondorPlatform: INTEL-WINNT50 $

11/17 15:48:16 ** PID = 516

11/17 15:48:16 ** Log last touched 11/17 15:48:16

11/17 15:48:16 ******************************************************

11/17 15:48:16 Using config source: C:\condor\condor_config

11/17 15:48:16 Using local config sources:

11/17 15:48:16    C:\condor/condor_config.local

11/17 15:48:16 DaemonCore: Command Socket at <1x.xx.xx.x9:3953>

11/17 15:48:16 Setting resource limits not implemented!

11/17 15:48:16 Communicating with shadow <1x.xx.xx.x4:3345>

11/17 15:48:16 Submitting machine is "theisman.earthdata.com"

11/17 15:48:17 File transfer completed successfully.

11/17 15:48:18 Starting a VANILLA universe job with ID: 26.14

11/17 15:48:18 IWD: C:\condor/execute\dir_516

11/17 15:48:18 Output file: C:\condor/execute\dir_516\Helloworld.out.14

11/17 15:48:18 Error file: C:\condor/execute\dir_516\Helloworld.err.14

11/17 15:48:18 Renice expr "10" evaluated to 10

11/17 15:48:18 About to exec C:\WINDOWS\system32\cmd.exe /Q /C condor_exec.bat

11/17 15:48:18 Create_Process succeeded, pid=1376

11/17 15:48:18 Process exited, pid=1376, status=0

11/17 15:48:18 Got SIGQUIT.  Performing fast shutdown.

11/17 15:48:18 ShutdownFast all jobs.

11/17 15:48:18 **** condor_starter (condor_STARTER) EXITING WITH STATUS 0

 

b) Scheduler’s Shadowlog is:

11/17 15:47:46 ******************************************************

11/17 15:47:46 ** condor_shadow (CONDOR_SHADOW) STARTING UP

11/17 15:47:46 ** C:\Condor\bin\condor_shadow.exe

11/17 15:47:46 ** $CondorVersion: 7.0.5 Sep 20 2008 BuildID: 105846 $

11/17 15:47:46 ** $CondorPlatform: INTEL-WINNT50 $

11/17 15:47:46 ** PID = 3556

11/17 15:47:46 ** Log last touched 11/17 15:47:46

11/17 15:47:46 ******************************************************

11/17 15:47:46 Using config source: C:\Condor\condor_config

11/17 15:47:46 Using local config sources:

11/17 15:47:46    C:\Condor/condor_config.local

11/17 15:47:46 DaemonCore: Command Socket at <1x.xx.xx.x4:3345>

11/17 15:47:47 Initializing a VANILLA shadow for job 26.14

11/17 15:47:47 (26.14) (3556): Request to run on <1x.xx.xx.x9:1104> was ACCEPTED

11/17 15:47:48 (26.14) (3556): ZKM: setting default map to (null)

11/17 15:47:48 (26.14) (3556): Job 26.14 terminated: exited with status 0

11/17 15:47:48 (26.14) (3556): **** condor_shadow (condor_SHADOW) EXITING WITH STATUS 100

 

From: condor-users-bounces@xxxxxxxxxxx [mailto:condor-users-bounces@xxxxxxxxxxx] On Behalf Of David Watrous
Sent: Monday, November 17, 2008 2:55 PM
To: Condor-Users Mail List
Subject: Re: [Condor-users] condor job has exited normally with status-1073741515

 

Alex, 

 

What does your StarterLog on the execute node and the ShadowLog on the scheduler say about these jobs as they run? Do you get an "ERROR: Provider load failure" message in the output/error file from systeminfo?

 

If you don't see anything interesting in those logs, I'm assuming that your scheduler and execute nodes all have the referenced directories, so what happens when your batch file just does an 'echo "Hello World"'?

 

I hope this helps!

 

Good luck,

 

- Dave

 

-- 
===================================
David Watrous
main: 888.292.5320

 

Cycle Computing, LLC

Leader in Condor Grid Solutions

Enterprise Condor Support and Management Tools 

 

 

 

On Nov 17, 2008, at 2:34 PM, Alas, Alex [FEDI] wrote:



I am trying to run a job but it fails exiting with the code 107374515. I know if the job succeeds it will exit out with code 0 and anything else non-zero code means it failed but I don’t know if this code means anything or if it is a generic error code.

 

My description file is the following:

#########################################################################################

# Description file for Batch File for TESTING purposes

# Prepared by Alex Alas

##########################################################################################

universe = vanilla

requirements = (Arch == "INTEL" && OpSys == "WINNT51")

initialdir = c:\condor\execute_bk

should_transfer_files = YES

when_to_transfer_output = ON_EXIT

transfer_input_files = c:\windows\system32\systeminfo.exe

run_as_owner = true

executable = Batch4testv2.bat

output = Batch4testv3.out.$(Process)

error = Batch4testv3.err.$(Process)

log = Batch4testv3.log

queue 1

 

The batch file I am running is as follow:

> systeminfo.exe

 

Any input is much appreciated,




Respectfully,

Alex Alas

Systems Administrator
Fugro EarthData Inc.

 

_______________________________________________
Condor-users mailing list
To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/condor-users

The archives can be found at: 
https://lists.cs.wisc.edu/archive/condor-users/

 

 

 

 

 

 

 

_______________________________________________
Condor-users mailing list
To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/condor-users

The archives can be found at: 
https://lists.cs.wisc.edu/archive/condor-users/