[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] DAGMan jobs stuck, but "Running"



On Feb 20, 2008, at 2:43 AM, Jan Ploski wrote:

I have several DAGMan jobs (in Condor 7.0.0) that just won't leave the
status "Running".
In their .dagman.out I can see repeating messages such as

2/20 09:34:18 Pending DAG nodes:
2/20 09:34:18   Node wrf, Condor ID 16589, status STATUS_SUBMITTED

However, the subjob in question (16589) is no longer in the queue. How can
this happen?


DAGMan reads the user logs of its jobs to detect when they've completed. Check the contents of the user log of job 16589.0.

Thanks and regards,
Jaime Frey
UW-Madison Condor Team