
[Condor-users] ERROR "gethostbyname(xxx) failed, errno = 25" at line 173 in file my_hostname.C



Hi,

I am trying to use Condor 6.6.10 on SLES10. I want to set up a personal Condor on a machine with two NICs. I followed section 3.10.7 of the manual, which requires setting the CONDOR_HOST and NETWORK_INTERFACE variables according to the machine's network setup. Even after setting these variables, starting condor_master produces the same error message I got before. Here is what I have tried so far, starting with the network setup:

xxx = the full hostname that condor_master complains about; it is the hostname of the IP address to which I was planning to bind Condor
x = its short form
AAA.AAA.AAA.AAA = the IP address of the NIC on the public network, to which I was planning to bind Condor
yyy = the full hostname of the other interface's IP address
y = its short form
192.168.2.81 = the IP address of the NIC on the private network

Here is what I did to check that the network setup is correct:
asd:~ # ifconfig
eth0      Link encap:Ethernet  HWaddr 00:14:5E:55:6B:1F
          inet addr:192.168.2.81  Bcast:192.168.15.255  Mask:255.255.240.0
          inet6 addr: B6B6B6/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:1530883 errors:0 dropped:0 overruns:0 frame:0
          TX packets:477631 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:185361094 (176.7 Mb)  TX bytes:94566043 (90.1 Mb)
          Interrupt:66

ib0       Link encap:UNSPEC  HWaddr 80-00-04-04-FE-80-00-00-00-00-00-00-00-00-00-00
          inet addr:AAA.AAA.AAA.AAA  Bcast:AAA.AAA.AAA.255  Mask:255.255.255.0
          inet6 addr: A6A6A6/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:65520  Metric:1
          RX packets:35758 errors:0 dropped:0 overruns:0 frame:0
          TX packets:38185 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:128
          RX bytes:2649862 (2.5 Mb)  TX bytes:9957883 (9.4 Mb)

lo        Link encap:Local Loopback
          inet addr:127.0.0.1  Mask:255.0.0.0
          inet6 addr: ::1/128 Scope:Host
          UP LOOPBACK RUNNING  MTU:16436  Metric:1
          RX packets:412 errors:0 dropped:0 overruns:0 frame:0
          TX packets:412 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:0
          RX bytes:167412 (163.4 Kb)  TX bytes:167412 (163.4 Kb)
x:~ # grep xxx /etc/hosts
AAA.AAA.AAA.AAA   xxx x iby
x:~ # grep yyy /etc/hosts
192.168.2.81	yyy y
x:~# hostname -f
xxx
x:~# hostname
xxx
x:~# domainname

x:~# su - condor
condor@x:~> echo $CONDOR_LOCATION
/opt/users/condor/condor
condor@x:~> echo $CONDOR_CONFIG
/opt/users/condor/condor/etc/condor_config
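
As a cross-check of the interface settings above, independent of Condor, the broadcast address can be recomputed from the address and netmask that ifconfig reports for eth0. This is only a minimal sketch using Python's standard ipaddress module; the helper name broadcast_for is made up for illustration and is not part of the original session.

```python
import ipaddress


def broadcast_for(addr, netmask):
    """Return the broadcast address implied by an IPv4 address and its netmask."""
    # ip_interface accepts "address/netmask" and tolerates host bits being set.
    iface = ipaddress.ip_interface(f"{addr}/{netmask}")
    return str(iface.network.broadcast_address)


if __name__ == "__main__":
    # Values taken from the eth0 entry in the ifconfig output.
    print(broadcast_for("192.168.2.81", "255.255.240.0"))  # prints 192.168.15.255
```

The result matches the Bcast value shown for eth0, so address, mask and broadcast are at least consistent with each other on that interface.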

First I tried to configure Condor on the public NIC:

condor@x:~/condor> grep NETWORK_INTERFACE etc/condor_config
NETWORK_INTERFACE = AAA.AAA.AAA.AAA
condor@x:~/condor> grep CONDOR_HOST etc/condor_config
CONDOR_HOST	= xxx
HOSTALLOW_ADMINISTRATOR = $(CONDOR_HOST)
##  COLLECTOR_HOST = $(CONDOR_HOST):1234
COLLECTOR_HOST  = $(CONDOR_HOST)
NEGOTIATOR_HOST = $(CONDOR_HOST)
condor@x:~/condor> grep LOCAL_CONFIG_FILE etc/condor_config
#LOCAL_CONFIG_FILE	= $(LOCAL_DIR)/condor_config.local
LOCAL_CONFIG_FILE = /opt/users/condor/condor/local.x/condor_config.local
#REQUIRE_LOCAL_CONFIG_FILE = TRUE
##  the global config file (not the LOCAL_CONFIG_FILE from above).
condor@x:~/condor> grep CONDOR_HOST local.x/condor_config.local
CONDOR_HOST = xxx
condor@x:~/condor> grep NETWORK_INTERFACE local.x/condor_config.local
NETWORK_INTERFACE = AAA.AAA.AAA.AAA
condor@x:~/condor> sbin/condor_master
ERROR "gethostbyname(xxx) failed, errno = 25" at line 173 in file my_hostname.C

Since I received this error message, I then tried to configure Condor on the private NIC:

condor@x:~/condor> grep NETWORK_INTERFACE etc/condor_config
NETWORK_INTERFACE = 192.168.2.81
condor@x:~/condor> grep CONDOR_HOST etc/condor_config
CONDOR_HOST	= yyy
HOSTALLOW_ADMINISTRATOR = $(CONDOR_HOST)
##  COLLECTOR_HOST = $(CONDOR_HOST):1234
COLLECTOR_HOST  = $(CONDOR_HOST)
NEGOTIATOR_HOST = $(CONDOR_HOST)
condor@x:~/condor> grep LOCAL_CONFIG_FILE etc/condor_config
#LOCAL_CONFIG_FILE	= $(LOCAL_DIR)/condor_config.local
LOCAL_CONFIG_FILE = /opt/users/condor/condor/local.x/condor_config.local
#REQUIRE_LOCAL_CONFIG_FILE = TRUE
##  the global config file (not the LOCAL_CONFIG_FILE from above).
condor@x:~/condor> grep CONDOR_HOST local.x/condor_config.local
CONDOR_HOST = yyy
condor@x:~/condor> grep NETWORK_INTERFACE local.x/condor_config.local
NETWORK_INTERFACE = 192.168.2.81
condor@x:~/condor> sbin/condor_master
ERROR "gethostbyname(xxx) failed, errno = 25" at line 173 in file my_hostname.C


Still no luck, and I have no idea what is missing. The last error is even more puzzling: it still complains about the xxx hostname even though the configuration no longer contains any reference to it:
condor@x:~/condor> grep -ri xxx *
condor@x:~/condor>
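
Since the name xxx no longer appears in the configuration but is what hostname prints, one possible reading (an assumption, not confirmed anywhere in this post) is that the name being looked up comes from the system hostname and not from CONDOR_HOST. A minimal Python sketch, not part of the original session, can ask the system resolver directly whether the machine's own hostname resolves. The function name check_resolution and its resolver parameter are hypothetical and exist only for this illustration.

```python
import socket


def check_resolution(name=None, resolver=socket.gethostbyname_ex):
    """Resolve `name` (default: this machine's own hostname) and report the outcome."""
    if name is None:
        # Same name that the `hostname` command prints.
        name = socket.gethostname()
    try:
        canonical, aliases, addresses = resolver(name)
    except OSError as exc:
        # socket.gaierror and socket.herror are both subclasses of OSError.
        return {"name": name, "ok": False, "error": str(exc)}
    return {
        "name": name,
        "ok": True,
        "canonical": canonical,
        "aliases": aliases,
        "addresses": addresses,
    }


if __name__ == "__main__":
    # Run as the condor user to see what the resolver returns for the local hostname.
    print(check_resolution())
```

If this lookup fails as well, the problem would lie in the resolver setup (for example /etc/hosts or the name service configuration) and not in the Condor configuration files. If it succeeds, the cause is more likely specific to Condor.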

Can you give me a hint about what to do next?

Regards,
Gabor Kecskemeti