[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [condor-users] Dagman script problem.

On Fri, 28 Nov 2003, Paul Wilson wrote:

> I'm trying out Dagman to shorten some jobs, and having problems getting
> condor to run any of my pre/post scripts.
> The first job in the DAG runs fine, then the whole thing stops at the
> first post script.
> Whether it's post A or pre B doesnt' make anydifference.
> The following is an example of ome of the post scripts to run between
> each job.
> -rwxr-xr-x    1 zhimei   users         178 Nov 27 17:04 copyPreStep2.sh
> #!/bin/sh
> cp /home/step1/REVCON /home/step2/CONFIG
> cp /home/step1/REVIVE /home/step2/REVOLD
> cp /home/step1/FIELD /home/step2/FIELD
> cp /home/step1/CONTROL /home/step2/CONTROL
> /usr/bin/perl preStep1.pl
> chmod a+w /home/zhimei/test-step/step2/*
> exit 1

Here's your problem. By convention, unix programs exit with value 0 to
indicate success and non-zero to indicate failure. Since your script is
exitting with value 1, DAGMan thinks it failed. You can look in the DAGMan
Applications section of the Condor manual to find out more about how
DAGMan reacts to program exit codes.

> This script works just fine when run manually from the command line. The
> perl line executes a perl script which edits the CONTROL file.
> Condor however cannot run it. Here is the error from the dagman.out
> file. Job A runs fine, then there's an error at the job A post script:
> I think I'm missing something fundamental here- about Condor, dagman and
> how to get my scripts to work from Condor.

|             Jaime Frey             |There are 10 types of people in|
|         jfrey@xxxxxxxxxxx          |the world: Those who understand|
|   http://www.cs.wisc.edu/~jfrey/   |  binary, and those who don't  |
Condor Support Information:
To Unsubscribe, send mail to majordomo@xxxxxxxxxxx with
unsubscribe condor-users <your_email_address>