Mailing List Archives Public Access	UW Madison Computer Sciences Department Computer Systems Lab

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] BOSCO question

Date: Tue, 05 Sep 2017 11:08:39 -0500
From: Greg Thain <gthain@xxxxxxxxxxx>
Subject: Re: [HTCondor-users] BOSCO question

On 09/05/2017 09:50 AM, Zhuo Zhang wrote:

Hi Greg,
Thanks for the reply. But unfortunately flocking is not what I amlooking for. Jobs from pool A flock to pool B will have to go throughmachine A first, which is still a single point of failure.

Ah, if you need a high-availability solution, there are a number ofsolutions. In your case, I know that your condor jobs are part of aproduction system that generates jobs from outside the condor system,and high availability needs to be designed in that context.

For a single schedd, perhaps the easiest way to high availability is torun on top of a virtual machine solution with live migration.


-greg

References:
- Re: [HTCondor-users] BOSCO question
  - From: Zhuo Zhang

Prev by Date: Re: [HTCondor-users] Unable to get _HOOK_PREPARE_JOB to add something to job ClassAd
Next by Date: Re: [HTCondor-users] Drain HTCondor worker by setting instance metadata value
Previous by thread: Re: [HTCondor-users] BOSCO question
Next by thread: [HTCondor-users] htcondor + gpudirect + openmpi
Index(es):
- Date
- Thread

Mailing List Archives

Public Access

Re: [HTCondor-users] BOSCO question