Mailing List Archives Public Access	UW Madison Computer Sciences Department Computer Systems Lab

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] can Condor somehow be a HA?

Date: Mon, 22 May 2017 11:02:55 -0500
From: Todd Tannenbaum <tannenba@xxxxxxxxxxx>
Subject: Re: [HTCondor-users] can Condor somehow be a HA?

On 5/22/2017 6:45 AM, lejeczek wrote:

hi fellas
I've only started looking at htcondor, not having a good understandingof it yet I wonder - htcondor has that concept of "central manager" andI wonder if this makes it a valid candidate for HA setup?
Does anybody have any experience with/thoughts on htcondor as HA andcould share it here?
many thanks
L.

Hi,

First off, understand that if your installations central manager dies,currently running jobs will continue to run and even new jobs willcontinue to get scheduled in many cases (i.e. new jobs will still getscheduled to claimed slots). Even in production pools, most sites haveno problem with rebooting their central manager or even taking it downfor an hour or two - while the central manger is down, users may noticethat condor_status stops working, but practically all other common toolscontinue to work (condor_submit, condor_q, condor_rm, etc). Thus manypools don't ever bother with an HA solution for the central manager.

If you are still concerned, the HTCondor central manager is actuallyvery lightweight and holds very little state (just user prioirties), andthis is very amenable to a high availability (HA) setup. Youessentially have two choices:

1. HTCondor can be configured to have two central managers (hot/hot),and automatically fail over as needed. See the section in the HTCondorManual titled "High Availability of the Central Manger" at


http://research.cs.wisc.edu/htcondor/manual/v8.6/3_13High_Availability.html#SECTION004132200000000000000

2. If you already run your services in a managed visualized setup(Mesos+Marathan, OpenStack, vSphere, HyperV, etc) that supportsfailover, you could setup your HTCondor central manager for HAleveraging those environments, i.e. same way you would setup a redundantemail server, for instance.



Hope the above helps
Todd

Follow-Ups:
- Re: [HTCondor-users] can Condor somehow be a HA?
  - From: lejeczek

References:
- [HTCondor-users] can Condor somehow be a HA?
  - From: lejeczek

Prev by Date: [HTCondor-users] can Condor somehow be a HA?
Next by Date: [HTCondor-users] HTCondor workshop in Europe 2017: Hurry up!
Previous by thread: [HTCondor-users] can Condor somehow be a HA?
Next by thread: Re: [HTCondor-users] can Condor somehow be a HA?
Index(es):
- Date
- Thread

Mailing List Archives

Public Access

Re: [HTCondor-users] can Condor somehow be a HA?