
Re: [Condor-users] Orphaned jobs?

The problem has been fixed by rebooting the submitting PC. All the jobs have
been automatically requeued by Condor and are now sitting idle. We'll see if they
run OK overnight (our policy only lets jobs run outside of working hours).
-----Original Message-----
From: condor-users-bounces@xxxxxxxxxxx [mailto:condor-users-bounces@xxxxxxxxxxx] On Behalf Of Hitchen, Greg (E&M, Kensington)
Sent: Thursday, 25 May 2006 9:53 AM
To: condor-users@xxxxxxxxxxx
Subject: [Condor-users] Orphaned jobs?

We have a situation where jobs are apparently running but are also not running!?

condor_q -global

shows a bunch of jobs running on different machines, as does

condor_q -global -run

all with increasing run times each time the command is used.

However the command

condor_status -run

shows NO jobs running, as does the condor_status command with no arguments.

I have even stopped the Condor service on some PCs, and their jobs STILL show up
in the condor_q output with increasing run times, although those PCs no longer
appear in condor_status (as you would expect).

We are running a couple of Windows pools (with flocking enabled) using Linux central
managers, all running Condor 6.6.10.

Any ideas? Thanks.



Greg Hitchen                                    greg.hitchen@xxxxxxxx
CSIRO Exploration and Mining                    phone:  +61 8 6436 8663
Australian Resources Research Centre (ARRC)     fax:    +61 8 6436 8555
Postal address:                                 mob:    0407 952 748
PO Box 1130, Bentley WA 6102, Australia
Street Address:
26 Dick Perry Avenue, Kensington WA 6151