
Re: [Condor-users] My Pool is empty--why?



Some sanity checks:

Who owns the log files and what user are you using to run condor?

What processes are running on your central manager?
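
For the first check, a minimal Python sketch that lists which user owns each file in a log directory might help. It assumes a Unix host, and the helper name log_owners is made up for illustration; pass it whatever directory your logs actually live in.

```python
import os
import pwd


def log_owners(log_dir):
    """Return {file name: owning user name} for regular files in log_dir."""
    owners = {}
    for name in sorted(os.listdir(log_dir)):
        path = os.path.join(log_dir, name)
        if not os.path.isfile(path):
            continue  # skip subdirectories and other non-regular entries
        uid = os.stat(path).st_uid
        try:
            owners[name] = pwd.getpwuid(uid).pw_name
        except KeyError:
            # No passwd entry for this uid; fall back to the numeric id.
            owners[name] = str(uid)
    return owners
```

Calling log_owners on the log directory and comparing the result with the user the daemons run as should show quickly whether any log file belongs to somebody else.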

cheers

JK


-----Original Message-----
From: condor-users-bounces@xxxxxxxxxxx on behalf of Dave Schneider
Sent: Thu 13/03/2008 21:50
To: Condor-Users Mail List
Subject: [Condor-users] My Pool is empty--why?
 
Hi,

I've recently installed Condor 7.0.1 using an existing set of local 
config files from 6.8.x, and I can't seem to get any machines to show up 
in my pool. For testing purposes, I started up a central manager (which 
is also an execute node) and one machine that should just be an execute 
node. Here's what I'm seeing from the central manager and the execute 
node (running as the condor user):

~/hosts/george/log$ condor_status

~/hosts/george/log$


The only log file that has scary things in it is the CollectorLog, where 
I see this:


3/13 16:41:46 ******************************************************
3/13 16:41:46 ** condor_collector (CONDOR_COLLECTOR) STARTING UP
3/13 16:41:46 ** /home/condor/condor-7.0.1_64/sbin/condor_collector
3/13 16:41:46 ** $CondorVersion: 7.0.1 Feb 26 2008 BuildID: 76180 $
3/13 16:41:46 ** $CondorPlatform: X86_64-LINUX_RHEL3 $
3/13 16:41:46 ** PID = 12958
3/13 16:41:46 ** Log last touched time unavailable (No such file or 
directory)
3/13 16:41:46 ******************************************************
3/13 16:41:46 Using config source: /home/condor/condor_config
3/13 16:41:46 Using local config sources:
3/13 16:41:46    /home/condor/hosts/vela/condor_config.local
3/13 16:41:46 DaemonCore: Command Socket at <10.1.1.81:9618>
3/13 16:41:46 In ViewServer::Init()
3/13 16:41:46 In CollectorDaemon::Init()
3/13 16:41:46 In ViewServer::Config()
3/13 16:41:46 In CollectorDaemon::Config()
3/13 16:41:46 enable: Creating stats hash table
3/13 16:41:50 (Sending 0 ads in response to query)
3/13 16:41:50 Got QUERY_STARTD_PVT_ADS
3/13 16:41:50 (Sending 0 ads in response to query)
3/13 16:41:50 NegotiatorAd  : Inserting ** "< vela.cyc.com >"
3/13 16:41:50 stats: Inserting new hashent for 
'Negotiator':'vela.cyc.com':'10.1.1.81'
3/13 16:41:54 Got QUERY_STARTD_ADS
3/13 16:41:54 (Sending 0 ads in response to query)
3/13 16:41:54 DaemonCore: PERMISSION DENIED to unknown user from host 
<10.1.1.81:34615> for command 1 (UPDATE_SCHEDD_AD), access level 
ADVERTISE_SCHEDD
3/13 16:41:55 DaemonCore: PERMISSION DENIED to unknown user from host 
<10.1.1.81:34616> for command 2 (UPDATE_MASTER_AD), access level 
ADVERTISE_MASTER
3/13 16:41:57 Got QUERY_STARTD_ADS
3/13 16:41:57 (Sending 0 ads in response to query)
3/13 16:41:59 Got QUERY_STARTD_ADS
3/13 16:41:59 (Sending 0 ads in response to query)
3/13 16:42:02 Got QUERY_STARTD_ADS
3/13 16:42:02 (Sending 0 ads in response to query)
3/13 16:42:03 DaemonCore: PERMISSION DENIED to unknown user from host 
<10.1.1.81:34617> for command 0 (UPDATE_STARTD_AD), access level 
ADVERTISE_STARTD
3/13 16:42:04 DaemonCore: PERMISSION DENIED to unknown user from host 
<10.1.1.81:34618> for command 0 (UPDATE_STARTD_AD), access level 
ADVERTISE_STARTD

10.1.1.81 is the central manager.  Later on there are similar access 
denied messages from the execute node.
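
To see at a glance which hosts and access levels are being refused, the PERMISSION DENIED entries can be tallied with a short script. This is only a sketch: the regular expression is modeled on the lines quoted above, not on any documented log format, and the name summarize_denials is invented for illustration.

```python
import re
from collections import Counter

# Pattern modeled on the CollectorLog lines quoted above (an assumption,
# not a documented format).
DENIED_RE = re.compile(
    r"PERMISSION DENIED to (?P<user>.+?) from host <(?P<host>[\d.]+):\d+> "
    r"for command \d+ \((?P<command>\w+)\), access level (?P<level>\w+)"
)


def summarize_denials(log_text):
    """Count denials per (host, command, access level)."""
    # Collapse all whitespace so entries that were wrapped across lines
    # (as in a mail message) still match.
    flat = " ".join(log_text.split())
    return Counter(
        (m["host"], m["command"], m["level"])
        for m in DENIED_RE.finditer(flat)
    )
```

Feeding it the contents of CollectorLog and printing the counter's items gives one line per refused host, command, and access level, which makes it easier to see whether the central manager and the execute node are being rejected at the same levels.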

I tried putting these settings in the master condor_config file (btw, 
even though these are mentioned in the manual for security stuff, 
they're not in the example config files, only the hostallow* settings are 
in there).

ALLOW_READ            = */*
ALLOW_WRITE           = */*
ALLOW_ADMINISTRATOR   = condor@xxxxxxx/*
ALLOW_CONFIG          = condor@xxxxxxx/*
ALLOW_NEGOTIATOR      = condor@xxxxxxx/*
ALLOW_DAEMON          = condor@xxxxxxx/*


Previously, I also tried to use these security settings, but they 
resulted in the same problem:

HOSTALLOW_ADMINISTRATOR = $(CONDOR_HOST)
HOSTALLOW_OWNER = $(FULL_HOSTNAME), $(HOSTALLOW_ADMINISTRATOR)
HOSTALLOW_READ = *
HOSTALLOW_WRITE = *
HOSTALLOW_NEGOTIATOR = $(CONDOR_HOST)
HOSTALLOW_NEGOTIATOR_SCHEDD = $(CONDOR_HOST), $(FLOCK_NEGOTIATOR_HOSTS)
HOSTALLOW_WRITE_COLLECTOR = $(HOSTALLOW_WRITE), $(FLOCK_FROM)
HOSTALLOW_WRITE_STARTD    = $(HOSTALLOW_WRITE), $(FLOCK_FROM)
HOSTALLOW_READ_COLLECTOR  = $(HOSTALLOW_READ), $(FLOCK_FROM)
HOSTALLOW_READ_STARTD     = $(HOSTALLOW_READ), $(FLOCK_FROM)

Anyone have any ideas? I checked DNS (as mentioned in a post to 
condor-users from some time ago), and it appears that the manager is 
findable via both name and number.