[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] Program not running, but condor is fine with it :(



----------------
On Thu, 12 Jan 2006 11:40:48 +0000
Matt Hope <matthew.hope@xxxxxxxxx> wrote:

> On 1/12/06, Nicolas GUIOT <nicolas.guiot@xxxxxxx> wrote:
> > Hi
> >
> > I have this program that runs perfectly when started from the command line, but when submitted through condor, it lasts just 1 second, and "exits normally with status 1", but doesn't give any result.
> > Here is the submit file :
> >
> > Universe = vanilla
> >
> > Executable      = /condor/bin/dy
> >
> > environment     = ICMDy=/condor/ICMD/
> > arguments       = 'longtj(,,150);*bdl001Ea33;q'
> >
> > error           = Alexey.err
> > Log             = Alexey.log
> >
> > notify_user     = my-email@xxxxxxxxxxx
> > notification    = always
> >
> > queue
> > ______
> >
> > And I found this in the StarterLog file of the execute machine :
> >
> > 1/12 10:57:29 Submitting machine is "chagall.galaxy.ibpc.fr"
> > 1/12 10:57:29 Starting a VANILLA universe job with ID: 99.0
> > 1/12 10:57:29 IWD: /condor/test
> > 1/12 10:57:29 Error file: /condor/test/myprog.err
> > 1/12 10:57:29 About to exec /condor/bin/dy 'longtj(,,150);*bdl001Ea33;q'
> > 1/12 10:57:29 Create_Process succeeded, pid=8913
> > 1/12 10:57:29 Process exited, pid=8913, status=1
> 
> what are the contents of the Alexey.err and Alexey.out files afterwards
> 

the log files says it ran fine, and the error file is just empty...

guiot@chagall:/condor/test#ll
total 1269
-rwxr--r--  1 guiot users    561 Jan 10 16:08 Alexey.cmd
-rw-r--r--  1 guiot users      0 Jan 12 13:38 Alexey.err
-rw-r--r--  1 guiot users    604 Jan 12 13:38 Alexey.log
-rwxr--r--  1 guiot users    147 Dec  7 16:02 EraseDone
-rw-r--r--  1 guiot users 194228 Jan 12 13:37 fin.cy
-rw-r--r--  1 guiot users 790278 Dec  2 15:50 ii.bi
-rw-r--r--  1 guiot users  10368 Dec  2 15:49 mi0.bi
-rw-r--r--  1 guiot users  11515 Jan 12 13:37 out
-rw-r--r--  1 guiot users 190010 Jan 12 13:37 rehydr.tmp.cy
-rw-r--r--  1 guiot users   3270 Dec  2 15:49 sa.dy
-rw-------  1 guiot users   1141 Dec  2 15:49 sequence.dy
-rw-r--r--  1 guiot users  61282 Jan 12 13:37 tj.nwsa.bi
-rw-r--r--  1 guiot users  20600 Jan 12 13:37 tj.qlcvp.bi
guiot@chagall:/condor/test#more Alexey.log
000 (100.000.000) 01/12 13:43:46 Job submitted from host: <193.49.27.24:38964>
...
001 (100.000.000) 01/12 13:43:51 Job executing on host: <193.49.27.54:39967>
...
005 (100.000.000) 01/12 13:43:51 Job terminated.
        (1) Normal termination (return value 1)
                Usr 0 00:00:00, Sys 0 00:00:00  -  Run Remote Usage
                Usr 0 00:00:00, Sys 0 00:00:00  -  Run Local Usage
                Usr 0 00:00:00, Sys 0 00:00:00  -  Total Remote Usage
                Usr 0 00:00:00, Sys 0 00:00:00  -  Total Local Usage
        0  -  Run Bytes Sent By Job
        0  -  Run Bytes Received By Job
        0  -  Total Bytes Sent By Job
        0  -  Total Bytes Received By Job
...
guiot@chagall:/condor/test#more Alexey.err
guiot@chagall:/condor/test#

-----------------------------------------------
CNRS - UPR 9080 : Laboratoire de Biochimie Theorique
Institut de Biologie Physico-Chimique
13 rue Pierre et Marie Curie
75005 PARIS - FRANCE

Tel : +33 158 41 51 70
Fax : +33 158 41 50 26
------------------------------------------------