[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] Bad event error in condor DAG



Condor 6.7.3 fixed a bug (a double-run bug) that could cause this symptom. What version of Condor are you using? Is this bug repeatable?

-alain

At 11:03 AM 3/7/2005 -0700, Alexander Dietz wrote:
Hi,

while running a condor DAG on a cluster I discovered a strange error. At some time the following message appeared in the out-file:

3/4 12:04:01 EVENT ERROR: job 135872.0.0 executing; submit count != 1 (0)
3/4 12:04:01 WARNING: bad event here may indicate a serious bug in Condor
-- beware!
3/4 12:04:01 Aborting DAG because of bad event

And shortly after that the DAG stopped. I started then the rescue-DAG, it runs fine for a while but then the same error appeared again.

Maybe it is REALLY a Condor bug? But why are the same jobs run without problems on an other cluster with the same condor version installed?