[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[HTCondor-users] HA question



Hi,

I have been configuring the HA using the manual section and everything seems to work fine so far, replication log looks good etc.

I used the example setup from the manual and try to 'HA' negotiator & collector using: 

HAD_CONTROLLEE = NEGOTIATOR, COLLECTOR

One thing I recognize that some condor_q commands come back empty since then, on the WN I use: 

CENTRAL_MANAGER1 = condor01.desy.de
CENTRAL_MANAGER2 = condor02.desy.de
CONDOR_HOST = $(CENTRAL_MANAGER1),$(CENTRAL_MANAGER2)

condor01 should be the primary CM and that works fine but whenever I query the second host I get back an empty queue: 

[root@batch0930]# condor_q -global -pool condor01
-- Schedd: grid-arcce0.desy.de : <131.169.223.110:9620?...
 ID      OWNER            SUBMITTED     RUN_TIME ST PRI SIZE CMD               
192957.0   desyusr004      2/17 11:03   0+00:10:16 H  0   732421.9 (test_hartmathde)
197928.0   desysgm000      2/18 11:42   0+00:39:56 I  0   0.0  (ARCTest)
197934.0   desyusr007      2/18 11:45   0+00:31:05 I  0   0.0  (ARCTest)
197956.0   cmsplt036       2/18 11:52   0+00:00:32 I  8   0.0  (gridjob)
263147.0   cmsplt036       3/4  22:47  10+09:59:00 R  0   7324.2 (gridjob)
[ snip ]


[root@batch0930 /etc/condor/config.d]# condor_q -global -pool condor02
All queues are empty

Omitting the '-pool' part results in a random query of both hosts and an empty return list when condor02 is used ...

Is that an expected behaviour or did I miss something ? 

best regards
        ~christoph


-- 
/*   Christoph Beyer     |   Office: Building 2b / 23     *\
 *   DESY                |    Phone: 040-8998-2317        *
 *   - IT -              |      Fax: 040-8994-2317        *
\*   22603 Hamburg       |     http://www.desy.de         */