Re: [Condor-users] Bad event error in condor DAG
- Date: Mon, 14 Mar 2005 10:46:08 -0600
- From: Alain Roy <roy@xxxxxxxxxxx>
- Subject: Re: [Condor-users] Bad event error in condor DAG
Condor 6.7.3 fixed a bug (a double-run bug) that could cause this symptom.
What version of Condor are you using? Is this bug repeatable?
At 11:03 AM 3/7/2005 -0700, Alexander Dietz wrote:
While running a Condor DAG on a cluster, I discovered a strange error. At
some point the following message appeared in the out-file:
3/4 12:04:01 EVENT ERROR: job 135872.0.0 executing; submit count != 1 (0)
3/4 12:04:01 WARNING: bad event here may indicate a serious bug in Condor
3/4 12:04:01 Aborting DAG because of bad event
Shortly after that the DAG stopped. I then started the rescue DAG; it
ran fine for a while, but then the same error appeared again.
Maybe it REALLY is a Condor bug? But why do the same jobs run without
problems on another cluster with the same Condor version installed?
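
To check whether the bug is repeatable, it may help to collect every such EVENT ERROR line from the out-file and see which jobs are affected. The following is only a minimal sketch, assuming the lines look exactly like the three quoted above; the regular expression and the function name find_bad_events are illustrative and not part of Condor. The number in parentheses appears to be the submit count DAGMan had recorded for the job when it saw the event.

```python
import re

# Assumed line format, taken from the message quoted in this thread:
#   3/4 12:04:01 EVENT ERROR: job 135872.0.0 executing; submit count != 1 (0)
EVENT_ERROR = re.compile(
    r"^(?P<date>\d+/\d+) (?P<time>\d+:\d+:\d+) EVENT ERROR: "
    r"job (?P<job>\d+\.\d+\.\d+) (?P<event>\w+); "
    r"submit count != 1 \((?P<count>\d+)\)"
)

def find_bad_events(lines):
    """Return one dict per EVENT ERROR line found in `lines` (hypothetical helper)."""
    hits = []
    for line in lines:
        match = EVENT_ERROR.match(line.strip())
        if match:
            hit = match.groupdict()
            hit["count"] = int(hit["count"])  # recorded submit count
            hits.append(hit)
    return hits

# The three lines from the original report.
sample = [
    "3/4 12:04:01 EVENT ERROR: job 135872.0.0 executing; submit count != 1 (0)",
    "3/4 12:04:01 WARNING: bad event here may indicate a serious bug in Condor",
    "3/4 12:04:01 Aborting DAG because of bad event",
]

for hit in find_bad_events(sample):
    print(hit["job"], hit["event"], hit["count"])
```

Running the same function over the out-files of the original DAG and the rescue DAG would show whether the same job IDs trigger the error each time.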