[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] Master won't start

On Wed, Mar 08, 2006 at 04:23:37PM -0500, Steve Huston wrote:
> We ran Condor awhile ago and turned it off while we did some changes,
> and now I'm setting it up again.  I've got a Fedora 3 machine (quad
> Opteron) which I'm setting up with Condor 6.7.17, and every time I run
> it I get the following error:
>  Neither the environment variable CONDOR_CONFIG,
>  /etc/condor/, nor ~condor/ contain a condor_config file.
>  Either set CONDOR_CONFIG to point to a valid config file,
>  or put a "condor_config" file in /etc/condor or ~condor/
> There *is* a condor_config file in ~condor/ and this is the same setup
> we used before with the previous machines (though I edited the file so
> it reflects the new setup, and new options since this is also a newer
> version of the software).  If I do as it says and set an environment
> variable, I instead get the following:
>  ERROR: the uid specified in CONDOR_IDS config file variable (9016)
>  does not exist in your password information.
>  Please set CONDOR_IDS to the '.' seperated uid, gid pair that
>  should be used by condor.
> Our authentication is done via LDAP, but the UID 9016 does in fact match
> the condor user.  All the files are owned by the condor user as well.

Is your config file in NFS? Do you have root squash turned on?

> If I instead run the daemon as root (which is what I want to do), I get
> the "no config" error no matter what, even if I set the environment
> variable - *except* if I try to run it through gdb, then I get the "no
> UID" error.  I hoped that there'd be some debugging in the binaries,
> which is how I found the difference running through gdb.
> I've attached an strace of the process if it will help someone, but I
> don't see where it even tries to look up anything to determine that
> there is no file in ~condor, nor does it look in /etc/condor.  This
> trace was when running it as root.

The master forks and goes into the background, and the process that
looks for the config file is the forked child. Run the master with
the '-f' flag:

strace condor_master -f

to get a better trace.