[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

RE: [condor-users] Condor sleeping



I have many dual proc servers, and at most only one of the VMs go
offline, mostly vm1. Single procs look like they disappear. My user's
situation is different as I run the jobs for them, or our web system
does. All they care about are the results coming back and not really the
reporting of the nodes being online all the time. I can understand your
user's point of view though. We have an all-windows cluster for our
dedicated resources. Looking back, I think Ive also seen the same thing
on earlier versions, even earlier than 6.4. (XP, W2K, W2K3)

I easily run a script across the pool from time to time that restarts
the condor service. I have lived with that mechanism.

Ron

> -----Original Message-----
> From: condor-users-bounces@xxxxxxxxxxx 
> [mailto:condor-users-bounces@xxxxxxxxxxx] On Behalf Of Mark 
> Silberstein
> Sent: Wednesday, June 16, 2004 12:17 AM
> To: Condor-Users Mail List
> Subject: RE: [condor-users] Condor sleeping
> 
> It never happened for me with 6.4.7. It started with 6.6 
> series, and is not only annoying, but makes my users feel 
> that the system is unreliable, which unfortunately is true in 
> these circumstances. I wish I had more time to debug it, but 
> maybe if someone has at least some time to try moving 
> collector and negotiator to another machine (with another 
> IP/Name - maybe some problem with DNS resolution), and more 
> likely - Linux or at least not Windows. From all mails on the 
> list it feels like Windows causes some problems here. By the 
> way, I don't experience any problems with the pools working 
> with Linux-based matchmaker.
> On Wed, 2004-06-16 at 12:53, Ron Viloria wrote:
> > Ive always seen it happen, as early as 6.4.7, again its more of an 
> > annoyance. Ive always assumed its because of the CPU being 
> busy doing 
> > non-condor stuff in the background or something like antivirus or 
> > backups.
> > 
> > ron
> > 
> > > -----Original Message-----
> > > From: condor-users-bounces@xxxxxxxxxxx 
> > > [mailto:condor-users-bounces@xxxxxxxxxxx] On Behalf Of Kewley, J 
> > > (John)
> > > Sent: Tuesday, June 15, 2004 11:19 PM
> > > To: 'Condor-Users Mail List'
> > > Subject: RE: [condor-users] Condor sleeping
> > > 
> > > > Hi All
> > > > I haven't seen any updates on this thread. Does anyone have any 
> > > > solution
> > > > - we are struggling with the same problem and I have no
> > > idea where it
> > > > comes from Mark
> > > 
> > > I have seen no solution yet. They do come back after a while, but 
> > > then just as suddenly go away. Of course, there aren't 
> many jobs as 
> > > it is a test pool, but it doesn't look very impressive.
> > > 
> > > Has anyone found out whether the problem started in 
> 6.6.5, or was it 
> > > earlier in the 6.6.X series.
> > > 
> > > More importantly, will it be fixed in 6.6.6  !!
> > > 
> > > Cheers
> > > 
> > > JK
> > > _______________________________________________
> > > Condor-users mailing list
> > > Condor-users@xxxxxxxxxxx
> > > http://lists.cs.wisc.edu/mailman/listinfo/condor-users
> > > 
> > > 
> > _______________________________________________
> > Condor-users mailing list
> > Condor-users@xxxxxxxxxxx
> > http://lists.cs.wisc.edu/mailman/listinfo/condor-users
> 
> _______________________________________________
> Condor-users mailing list
> Condor-users@xxxxxxxxxxx
> http://lists.cs.wisc.edu/mailman/listinfo/condor-users
> 
>