[Condor-users] masters are dead, leaving orphaned daemons



Hi, I'm wondering if anyone else is seeing messages like the one below.
If so, how often?  Is this something I should be worried about?  Why is
Condor dying so regularly?

Here are some of the details:

NOV 2004 - APRIL 12, 2005 

* Pool running Condor v6.6.7.

* The Condor master reported orphans every 2 to 10 days.

* Only one machine at a time ever appeared in a report.  

* Only non-dedicated execute nodes (end-user workstations) ever
  appeared in a report.


APRIL 12, 2005 - PRESENT 

* Pool upgraded to Condor v6.6.9.

* The Condor master sends orphan reports every three hours!

* Multiple machines appear in every report.

* ALL execute nodes routinely appear in the messages.

Thanks,

Bryan Maher
Carnegie Mellon University

---------------------
This is an automated email from the Condor system on machine
"BEHEMOTH.xxx.xxx.edu".  Do not reply.

The following masters are dead, leaving orphaned daemons

		< IQ05.xxx.xxx.edu >
		< FINALFANTASYXI.xxx.xxx.edu >
		< IQ01.xxx.xxx.edu >
		< COLOSSUS.xxx.xxx.edu >
---------------------
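
In case it helps anyone compare notes, here is a rough sketch of how the
reports could be tallied to see which machines show up most often.  It only
assumes the "< hostname >" line format shown in the message above; the
function names (orphaned_hosts, tally) are made up for illustration and are
not part of Condor.

```python
import re
from collections import Counter

# Matches host lines of the form "< host.example.edu >", as in the report above.
HOST_LINE = re.compile(r"^\s*<\s*(\S+)\s*>\s*$")


def orphaned_hosts(report_text):
    """Return the hostnames listed in one orphan report, in order of appearance."""
    hosts = []
    for line in report_text.splitlines():
        match = HOST_LINE.match(line)
        if match:
            hosts.append(match.group(1))
    return hosts


def tally(reports):
    """Count how many reports each hostname appears in (hypothetical helper)."""
    counts = Counter()
    for report in reports:
        counts.update(orphaned_hosts(report))
    return counts


# Sample text copied from the report format above (hostnames already masked).
sample = """The following masters are dead, leaving orphaned daemons

\t\t< IQ05.xxx.xxx.edu >
\t\t< FINALFANTASYXI.xxx.xxx.edu >
"""

if __name__ == "__main__":
    for host, count in tally([sample]).most_common():
        print(host, count)
```

Feeding it the saved text of each report email should give a per-machine
count, which would make it easier to tell whether particular nodes are
over-represented.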