
Re: [HTCondor-users] submitted jobs are not running



No condor_q output was included.

Make sure that both slice_0007.sh and bedpostx_single_slice.sh have their execute bit set for your id. If they do, run "condor_q -analyze" on the stuck job.
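
A minimal sketch of that permission check, assuming a POSIX shell. The helper name check_exec is hypothetical and is not part of HTCondor; it only reports whether a file exists and is executable by the id running it:

```shell
# check_exec is a hypothetical helper, not an HTCondor tool.
# It reports whether the given file is executable by the current user.
check_exec() {
    if [ ! -e "$1" ]; then
        echo "missing: $1"
        return 2
    elif [ -x "$1" ]; then
        echo "ok: $1"
    else
        echo "not executable: $1"
        return 1
    fi
}

# Intended use, with the paths from the original message:
#   check_exec /home/labounek/test/dti.bedpostX/condor_logs/slice_0007.sh
#   check_exec /usr/share/fsl/5.0/bin/bedpostx_single_slice.sh
# A file reported as "not executable" can usually be fixed with:
#   chmod u+x <file>
# If both report "ok", ask the scheduler why the job is not running:
#   condor_q -analyze
```

Since the jobs are described as held, "condor_q -hold" may also be worth running, as it lists held jobs together with the reason for the hold.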

bob

On 3/10/2016 1:45 PM, Labounek René wrote:
Dear condor users,
I have submitted jobs, but they are still held and not running. The condor_status output looks OK:



labounek@emperor:~$ condor_status
Name               OpSys Arch   State     Activity LoadAv Mem  ActvtyTime

slot10@xxxxxxxxxxx LINUX X86_64 Unclaimed Idle     1.000  2682 0+00:00:23
slot11@xxxxxxxxxxx LINUX X86_64 Unclaimed Idle     1.000  2682 0+00:00:24
slot12@xxxxxxxxxxx LINUX X86_64 Unclaimed Idle     6.320  2682 0+00:00:25
slot1@xxxxxxxxxxxx LINUX X86_64 Unclaimed Idle     1.000  2682 0+00:00:04
slot2@xxxxxxxxxxxx LINUX X86_64 Unclaimed Idle     1.000  2682 0+00:00:23
slot3@xxxxxxxxxxxx LINUX X86_64 Unclaimed Idle     1.000  2682 0+00:00:24
slot4@xxxxxxxxxxxx LINUX X86_64 Unclaimed Idle     1.000  2682 0+00:00:25
slot5@xxxxxxxxxxxx LINUX X86_64 Unclaimed Idle     1.000  2682 0+00:00:26
slot6@xxxxxxxxxxxx LINUX X86_64 Unclaimed Idle     1.000  2682 0+00:00:27
slot7@xxxxxxxxxxxx LINUX X86_64 Unclaimed Idle     1.000  2682 0+00:00:28
slot8@xxxxxxxxxxxx LINUX X86_64 Unclaimed Idle     1.000  2682 0+00:00:21
slot9@xxxxxxxxxxxx LINUX X86_64 Unclaimed Idle     1.000  2682 0+00:00:22

                     Total Owner Claimed Unclaimed Matched Preempting Backfill

        X86_64/LINUX    12     0       0        12       0          0        0

               Total    12     0       0        12       0          0        0
labounek@emperor:~$



The condor_submit command looked like this:

condor_submit slice_0007.condor

The file contains this text:

Executable = /home/labounek/test/dti.bedpostX/condor_logs/slice_0007.sh
Universe = vanilla
output = /home/labounek/test/dti.bedpostX/condor_logs/slice_0007.out
error = /home/labounek/test/dti.bedpostX/condor_logs/slice_0007.error
Log   = /home/labounek/test/dti.bedpostX/condor_logs/slice_0007.log
Queue

The file slice_0007.sh contains one command:

/usr/share/fsl/5.0/bin/bedpostx_single_slice.sh /home/labounek/test/dti 7 --nf=3 --fudge=1 --bi=1000 --nj=1250 --se=25 --model=2 --cnonlinear

I think everything should be OK, but it is stuck. Here is the condor_q output: