
[HTCondor-users] submitted jobs are not running



Dear condor users,
I have submitted jobs, but they remain held and never start running. The condor_status output looks OK:



labounek@emperor:~$ condor_status
Name               OpSys Arch   State     Activity LoadAv Mem  ActvtyTime

slot10@xxxxxxxxxxx LINUX X86_64 Unclaimed Idle     1.000  2682 0+00:00:23
slot11@xxxxxxxxxxx LINUX X86_64 Unclaimed Idle     1.000  2682 0+00:00:24
slot12@xxxxxxxxxxx LINUX X86_64 Unclaimed Idle     6.320  2682 0+00:00:25
slot1@xxxxxxxxxxxx LINUX X86_64 Unclaimed Idle     1.000  2682 0+00:00:04
slot2@xxxxxxxxxxxx LINUX X86_64 Unclaimed Idle     1.000  2682 0+00:00:23
slot3@xxxxxxxxxxxx LINUX X86_64 Unclaimed Idle     1.000  2682 0+00:00:24
slot4@xxxxxxxxxxxx LINUX X86_64 Unclaimed Idle     1.000  2682 0+00:00:25
slot5@xxxxxxxxxxxx LINUX X86_64 Unclaimed Idle     1.000  2682 0+00:00:26
slot6@xxxxxxxxxxxx LINUX X86_64 Unclaimed Idle     1.000  2682 0+00:00:27
slot7@xxxxxxxxxxxx LINUX X86_64 Unclaimed Idle     1.000  2682 0+00:00:28
slot8@xxxxxxxxxxxx LINUX X86_64 Unclaimed Idle     1.000  2682 0+00:00:21
slot9@xxxxxxxxxxxx LINUX X86_64 Unclaimed Idle     1.000  2682 0+00:00:22
                     Total Owner Claimed Unclaimed Matched Preempting Backfill

        X86_64/LINUX    12     0       0        12       0          0        0

               Total    12     0       0        12       0          0        0
labounek@emperor:~$



The condor_submit command looked like this:

condor_submit slice_0007.condor

The file contains this text:

Executable = /home/labounek/test/dti.bedpostX/condor_logs/slice_0007.sh
Universe = vanilla
output = /home/labounek/test/dti.bedpostX/condor_logs/slice_0007.out
error = /home/labounek/test/dti.bedpostX/condor_logs/slice_0007.error
Log   = /home/labounek/test/dti.bedpostX/condor_logs/slice_0007.log
Queue

The file slice_0007.sh contains a single command:

/usr/share/fsl/5.0/bin/bedpostx_single_slice.sh /home/labounek/test/dti 7 --nf=3 --fudge=1 --bi=1000 --nj=1250 --se=25 --model=2 --cnonlinear

I think everything should be OK, but the job is stuck. Here is the condor_q output: