[HTCondor-users] preserve on/off state over restart



Hi,

So far I've used condor_on/condor_off to add worker nodes (startds)
to the cluster and remove them from it. The problem is that neither
the startd nor the master appears to preserve this state across a
reboot of the whole machine or a restart of the service. In a big
cluster with thousands of worker nodes, nodes come and go all the
time, so it would be nice to have a way to turn components off and on
persistently.

I know there's no truly bulletproof solution for this, but I'd be happy
with one that is at least eventually consistent :) Any ideas?

Thanks,
Daniel