Mailing List Archives Public Access	UW Madison Computer Sciences Department Computer Systems Lab

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] machine for central-manager

Date: Wed, 21 Apr 2010 11:34:23 -0500
From: Dan Bradley <dan@xxxxxxxxxxxx>
Subject: Re: [Condor-users] machine for central-manager


The following page has some rough estimates of system requirements:

http://condor-wiki.cs.wisc.edu/index.cgi/wiki?p=HowToManageLargeCondorPools

--Dan

Ian Chesal wrote:

On Wed, Apr 21, 2010 at 11:50 AM, Santanu Das<santanu@xxxxxxxxxxxxxxxxx <mailto:santanu@xxxxxxxxxxxxxxxxx>> wrote:
    How powerful a machine (CPU-core, memory etc.) should be to run
    condor central-manager for a ~400-core cluster? I'm interested in
    vitalization, so I wanna get a rough idea about the hardware we
    should go for to accommodate all the services/virtual machines to
    run nicely.
Is this machine running condor_collector and condor_negotiator? Or isit also running a condor_schedd daemon? If it's just condor_collectorand condor_negotiator: not very powerful at all really. I kept a 500slot farm alive and well with an old 2-CPU (single core) Xeon box. Ithad 4GB of RAM IIRC. Ran 32-bit CentOS4 on it. It did have Gbfiber-based ethernet. It kept up just fine. The only reason it gotswapped out for new hardware was the hardware was well out ofwarranty, which apparently makes IT department heads nervous.
Obviously this older hardware wouldn't work well with virtualization.The CPUs aren't virtualization-friendly. But that does say you don'tneed a lot of juice if you're just running condor_collector andcondor_negotiator.
Worth noting that my farm configuration was incredibly sensitive tonegotiation cycle times at the time. I was seeing under 2 minutenegotiation cycles with 1 schedd in the system holding 40k jobs (and afairly heterogeneous job distribution so not a lot of re-clusteringhappening in the negotiator).
    Also, is there any recommendation on vitalization platform as far
    as condor is concerned? Is VMware ESXi a good choice??
I currently have a few farms with central managers virtualized on Xen.The VMs are all running CentOS 5.somethingorother. It's not as stableas I was hoping for. If there's any appreciable amount of disk latencybetween the VM manager program and the disk where the image is basedXen crashes and takes the images down with it. They restart, but it'sannoying. We were hosting the VM images on our NAS to make for easy,nearly instant migration of VMs from machine to machine in case offailure, but we get big latency spikes on our NAS thanks to load fromour farm and the spikes would take out Xen. Had to move 'em to localdisk for now which is very much less useful.
Hope that helps.

- Ian
------------------------------------------------------------------------

_______________________________________________
Condor-users mailing list
To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/condor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/condor-users/

References:
- [Condor-users] Condor and firewall.........which ports to open
  - From: michele pierri
- [Condor-users] machine for central-manager
  - From: Santanu Das
- Re: [Condor-users] machine for central-manager
  - From: Ian Chesal

Prev by Date: Re: [Condor-users] machine for central-manager
Next by Date: [Condor-users] validating exit codes
Previous by thread: Re: [Condor-users] machine for central-manager
Next by thread: Re: [Condor-users] machine for central-manager
Index(es):
- Date
- Thread

Mailing List Archives

Public Access

Re: [Condor-users] machine for central-manager