
Re: [Condor-users] Errors with HAD setup

Have you checked whether the daemons are in condor\bin?
They should be in condor\sbin.
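One way to run that check is sketched below in Python. It is only an illustration: the function name find_daemons, the install root C:\condor, and the assumption that Windows executables carry an .exe suffix are not from this thread, so adjust them to the actual installation.

```python
import os

def find_daemons(root, names, subdirs=("bin", "sbin"), suffixes=("", ".exe")):
    """Return {daemon name: [existing file paths]} found under root/<subdir>.

    Hypothetical helper: 'root' is the assumed Condor install directory,
    and the '.exe' suffix is tried because the hosts run Windows.
    """
    found = {}
    for name in names:
        hits = []
        for sub in subdirs:
            for suffix in suffixes:
                candidate = os.path.join(root, sub, name + suffix)
                if os.path.isfile(candidate):
                    hits.append(candidate)
        found[name] = hits
    return found

if __name__ == "__main__":
    # C:\condor is an assumed install root, taken from the paths in the log.
    root = r"C:\condor"
    daemons = ["condor_had", "condor_replication"]
    for name, hits in find_daemons(root, daemons).items():
        print(name, "->", hits or "not found")
```

An empty list for a daemon means no matching file was found in either directory, which would be consistent with the master reporting "Cannot execute" for that path.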


On Fri, 28 Aug 2009, Fabrice Bouye wrote:

I am in the process of setting up two central managers using HAD and replication under Condor 7.2.4, in order to test the procedure before we set up our entire flock using HAD.
Both central managers run Windows XP SP2 (32-bit), and the test clients are a mix of Windows XP and Linux computers.

On both central managers, I copied over and modified the configuration files from http://www.cs.wisc.edu/condor/manual/v7.0/3_10High_Availability.html#SECTION004102400000000000000

But I get lots of errors related to HAD and replication in the log files:

For example, on the 1st central manager MasterLog file:

8/28 08:05:50 C:\condor/bin/condor_had: Cannot execute
8/28 08:05:50 restarting C:\condor/bin/condor_had in 3600 seconds
8/28 09:01:39 C:\condor/bin/condor_replication: Cannot execute
8/28 09:01:39 restarting C:\condor/bin/condor_replication in 3600 seconds
8/28 09:05:50 C:\condor/bin/condor_had: Cannot execute
8/28 09:05:50 restarting C:\condor/bin/condor_had in 3600 seconds
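To summarize which daemons the master failed to start, the "Cannot execute" lines above can be tallied with a short script. This is a minimal sketch: the regular expression is inferred only from the six log lines quoted here, and the function name failed_daemons is hypothetical.

```python
import re

# Pattern inferred from the quoted MasterLog excerpt:
#   "<month>/<day> <hh:mm:ss> <path>: Cannot execute"
CANNOT_EXECUTE = re.compile(
    r"^(?P<stamp>\d+/\d+ \d+:\d+:\d+) (?P<path>.+): Cannot execute$"
)

def failed_daemons(lines):
    """Count 'Cannot execute' entries per daemon path."""
    counts = {}
    for line in lines:
        match = CANNOT_EXECUTE.match(line.strip())
        if match:
            path = match.group("path")
            counts[path] = counts.get(path, 0) + 1
    return counts

# The log lines quoted in this message.
sample = [
    r"8/28 08:05:50 C:\condor/bin/condor_had: Cannot execute",
    r"8/28 08:05:50 restarting C:\condor/bin/condor_had in 3600 seconds",
    r"8/28 09:01:39 C:\condor/bin/condor_replication: Cannot execute",
    r"8/28 09:01:39 restarting C:\condor/bin/condor_replication in 3600 seconds",
    r"8/28 09:05:50 C:\condor/bin/condor_had: Cannot execute",
    r"8/28 09:05:50 restarting C:\condor/bin/condor_had in 3600 seconds",
]

if __name__ == "__main__":
    for path, count in failed_daemons(sample).items():
        print(path, count)
```

For a real log, the same function can be fed the lines of the MasterLog file instead of the sample list.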

The second central manager MasterLog file exhibits similar errors.
Is that normal?

Apart from that, everything seems to be working OK so far (client slots are listed by condor_status, and the other condor commands seem to work fine).

Fabrice Bouyé (http://fabricebouye.cv.fm/)
Fisheries IT Specialist
Tel: +687 26 20 00 (Ext 411)
Oceanic Fisheries, Pacific Community

Steven C. Timm, Ph.D  (630) 840-8525
timm@xxxxxxxx  http://home.fnal.gov/~timm/
Fermilab Computing Division, Scientific Computing Facilities,
Grid Facilities Department, FermiGrid Services Group, Assistant Group Leader.