
Re: [HTCondor-users] hostname empty on node restart


I've seen the same behavior with HTCondor-8.2.4 on CentOS 7.0. Unfortunately, I haven't had much time to investigate and don't know how to fix this. As you said, restarting the HTCondor service fixes the problem.


On Wed, May 27, 2015 at 4:33 AM Yngve Inntjore Levinsen <yngve.levinsen@xxxxxxx> wrote:
Dear all,

We have a few machines in our pool now, running either Ubuntu (various
recent-ish versions) or CentOS 7. HTCondor is installed from
repositories on all machines. Version 8.2.8 is installed on all nodes
and the master (which runs CentOS 7).

When a machine is restarted (at least the Ubuntu machines; I forget
whether the CentOS machines still do this), the condor service is
started on the node, but it seems to be started too early in the boot
sequence. As a result, when I run condor_status I get a list like the
one attached, where the restarted node shows up as slotN@ instead of
slotN@hostname.
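
A minimal sketch of how affected slots might be spotted automatically
rather than by reading the condor_status listing by eye. It assumes that
the slot Name attribute really comes back as "slotN@" with nothing after
the "@" on an affected node, and that "condor_status -autoformat Name"
is available on the installed version; both are assumptions, not
something verified on this pool. The function names are illustrative.

```python
import re
import subprocess

# Matches slot names such as "slot1@" or "slot1_2@" that have nothing
# after the "@" (assumed form of the broken names described above).
EMPTY_HOST = re.compile(r"^slot\d+(?:_\d+)?@$")


def slots_missing_hostname(names):
    """Return the slot names whose hostname part (after '@') is empty."""
    stripped = (name.strip() for name in names)
    return [name for name in stripped if EMPTY_HOST.match(name)]


def query_slot_names():
    """Ask the collector for all slot names, one per line.

    Assumes condor_status is on PATH and supports -autoformat.
    """
    result = subprocess.run(
        ["condor_status", "-autoformat", "Name"],
        capture_output=True, text=True, check=True,
    )
    return result.stdout.splitlines()
```

Calling slots_missing_hostname(query_slot_names()) on the master would
list the broken slots. It only detects the problem; the service on the
affected node still has to be restarted.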

This also seems to prevent the slots from being used. Our current
workaround is to manually run 'sudo service condor restart'. After that
the hostname is shown correctly and everything works fine. This is a
bit troublesome because the Ubuntu boxes are office computers on which
we 'condor admins' do not necessarily have sudo access, so we have to
ask the owners to run the command.

Does anyone know why this happens?