
Re: [HTCondor-users] Preparations to move master machine?



Hi Steffen,

As an alternate approach -

You could add both host names to COLLECTOR_HOST and run condor_reconfig on the hosts so that they start reporting to both collectors.

Once both collectors are fully populated, stop the negotiator on the old machine and start it on the new one.

Finally, remove the old collector from the configuration and run condor_reconfig again.

Note that you would want to make this change on the worker nodes first, then on the schedd hosts, and in the user environment only after the cutover (user tools randomly query one collector from the list).
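
A minimal sketch of the sequence above, written as a configuration fragment with the commands shown as comments. The host names old-cm.example.org and new-cm.example.org are placeholders, and the condor_status comparison is just one possible way to judge whether both collectors are populated; adapt it to your pool.

```
# Phase 1 (worker nodes first, then schedd hosts): report to both collectors.
# old-cm.example.org and new-cm.example.org are placeholder host names.
COLLECTOR_HOST = old-cm.example.org, new-cm.example.org
# Apply on each host with:
#   condor_reconfig
#
# Check that the new collector sees the same machines as the old one, e.g.:
#   condor_status -pool old-cm.example.org -total
#   condor_status -pool new-cm.example.org -total
#
# Phase 2 (cutover): move the negotiator.
#   on the old host:  condor_off -daemon negotiator
#   on the new host:  condor_on -daemon negotiator
# (assumes NEGOTIATOR is listed in DAEMON_LIST on the new host)
#
# Phase 3 (all hosts, and only now the user environment):
# replace the COLLECTOR_HOST line above with
#   COLLECTOR_HOST = new-cm.example.org
# and run condor_reconfig once more.
```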

If you need to do this regularly, you can use condor_had to fail over automatically between central managers.
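
A rough sketch of what the central-manager side of such a setup might look like. The host names and the port number are placeholders chosen for illustration, and only the core knobs are shown; further settings, such as those for replicating the negotiator's state between the managers, are omitted, so check the High Availability section of the manual for your HTCondor version before using any of this.

```
# Placeholder host names; substitute your own central managers.
CENTRAL_MANAGER1 = cm1.example.org
CENTRAL_MANAGER2 = cm2.example.org
COLLECTOR_HOST = $(CENTRAL_MANAGER1), $(CENTRAL_MANAGER2)

# Port number chosen for illustration only.
HAD_PORT = 51450
# This list should be identical, and in the same order, on every central manager.
HAD_LIST = $(CENTRAL_MANAGER1):$(HAD_PORT), $(CENTRAL_MANAGER2):$(HAD_PORT)

# Let condor_had, rather than condor_master, decide where the negotiator runs.
MASTER_NEGOTIATOR_CONTROLLER = HAD
DAEMON_LIST = MASTER, COLLECTOR, NEGOTIATOR, HAD
```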

HTH,

Brian


> On May 13, 2020, at 3:13 AM, Steffen Grunewald <steffen.grunewald@xxxxxxxxxx> wrote:
> 
> Good morning,
> 
> I've got to move the central (collector, negotiator) functionality off a failing
> machine, and would like to do this with as little interruption as possible.
> 
> The old machine has an IP of a.b.c.100, the new one is at a.b.c.109, and I'd
> like to use an aliased interface at a.b.c.190 to provide access, first at the
> old machine, then (during a short maintenance) move that to the new one.
> 
> There is no firewall in effect, the a.b.c.0/24 network is purely internal.
> 
> Currently, most nodes still refer to a.b.c.100, and the central manager has
> a matching NETWORK_INTERFACE=a.b.c.100 - since these machines are in production,
> this part cannot be changed.
> 
> Is it possible (without any disruptions) to change NETWORK_INTERFACE to a.b.c.*,
> to answer connection requests on both .100 and .190?
> Would condor_shared_port accept connections on both IP addresses then, enabling
> the machines that got set up with the updated CENTRAL_MANAGER setting?
> Any other pitfalls I didn't see yet? (I've got to think about keeping the
> job history for accounting, but that's phase two.)
> 
> Thanks,
> Steffen
> 
> -- 
> Steffen Grunewald, Cluster Administrator
> Max Planck Institute for Gravitational Physics (Albert Einstein Institute)
> Am Mühlenberg 1 * D-14476 Potsdam-Golm * Germany
> ~~~
> Fon: +49-331-567 7274
> Mail: steffen.grunewald(at)aei.mpg.de
> ~~~