[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Condor-users] Bad event error in condor DAG


while running a condor DAG on a cluster I discovered a strange error. At some time the following message appeared in the out-file:

3/4 12:04:01 EVENT ERROR: job 135872.0.0 executing; submit count != 1 (0)
3/4 12:04:01 WARNING: bad event here may indicate a serious bug in Condor
-- beware!
3/4 12:04:01 Aborting DAG because of bad event

And shortly after that the DAG stopped. I started then the rescue-DAG, it runs fine for a while but then the same error appeared again.

Maybe it is REALLY a Condor bug? But why are the same jobs run without problems on an other cluster with the same condor version installed?

I would really appreciate some help in that. And let me know if you need some more information on that problem.