[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Condor-users] Installation problems in a future Grid testbed



Hi Condor community!
I'm trying to create a Grid with machines from 2 research labs. First one contains 3 Linux FC3 machines (one is the central manager) and the other lab contains 2 Linux FC1 machines.
I've just tried to install condor-6.7.10. Following the condor documentation, I've issued the following commands (as globus user and sudo permission):


$tar xzf condor-6.7.10-linux-x86-glibc23-dynamic.tar.gz
$cd condor-6.7.10
$sudo ./condor_configure --type=manager,submit,execute --install-dir=/usr/local/condor-6.7.10/ --owner=globus --install

WARNING: Unable to determine local IP address. Condor  might not work
propertly until you set  NETWORK_INTERFACE=<machine IP address>

Use of uninitialized value in concatenation (.) or string at ./condor_configure line 908.

Condor has been installed into:
    /usr/local/condor-6.7.10

It seems strange to me, since NETWORK_INTERFACE was set to the IP address of the specified machine.
Anyway, I continued the process, updating condor_config and condor_config.local properly:

#######/etc/condor/condor_config#############
RELEASE_DIR       = /usr/local/condor-6.7.10
LOCAL_DIR         = /usr/local/condor-6.7.10/local.vivax
CONDOR_ADMIN      = globus@xxxxxxxxxxxxxxxxxx
MAIL              = /bin/mail
FULL_HOSTNAME     = vivax.biowebdb.org
UID_DOMAIN        = $(FULL_HOSTNAME)
FILESYSTEM_DOMAIN = $(FULL_HOSTNAME)
COLLECTOR_NAME    = BioWebDB Pool
CONDOR_IDS        = 504.504
QUEUE_SUPER_USERS = root, condor, globus
#############################################

##/usr/local/condor-6.7.10/local.vivax/condor_config.local###
CONDOR_HOST       = vivax.biowebdb.org vivax
CONDOR_ADMIN      = globus@xxxxxxxxxxxxxxxxxx
UID_DOMAIN        = $(FULL_HOSTNAME)
FILESYSTEM_DOMAIN = $(FULL_HOSTNAME)
CONDOR_IDS        = 504.504
##############################################################

After that I decided to move forward to install/configure Condor in other machine. I was aware about the type parameter for condor_install, so it was "--type=submit,execute".
I don't have a shared file system nor a common UID, so I changed FULL_HOSTNAME to its name either.
The problems come now. After updates all necessary fields in condor_config.local and condor_config files for both machines, I tried to start daemons.
But both machines, after the "condor_master" command issued in each one, execute both machines as masters!
So, how could I deal with that? Is there any configuration missing? What's wrong? Should I reinstall all the stuff?
Please, any glue will be important! I really need this feedback to go on!
Thanks in advance.
Regards,
Fabiano.

__________________________________________________
Converse com seus amigos em tempo real com o Yahoo! Messenger
http://br.download.yahoo.com/messenger/