[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[HTCondor-users] Error with global Queue



Hello All,

I recently replaced my Central Manager, and a few odd things have come
up. The only definite error message I can find though happens when
"condor_q -global" is run:

>-- Failed to fetch ads from:
<xxx.xxx.xxx.49:9618?addrs=xxx.xxx.xxx.49-9618+[> : server1.my.domain.com
>AUTHENTICATE:1003:Failed to authenticate with any method
>AUTHENTICATE:1004:Failed to authenticate using GSI
>GSI:5003:Failed to authenticate. Globus is reporting error
(851968:50). There is probably a problem with your credentials. (Did
you run grid-proxy-init?)
>AUTHENTICATE:1004:Failed to authenticate using KERBEROS
>AUTHENTICATE:1004:Failed to authenticate using FS

My basic configuration is Central manager, connected to 2 submit nodes.
Each submit node seems to be able to see it's own queue, one of the
submit nodes off and on seems to be having trouble running jobs, but I
can't seem to find any errors that make sense. For now I'd like to
figure out the global queue error as I suspect they are related.

My config file as far as authentication goes looks like this:


>SEC_PASSWORD_FILE = /etc/condor/pool_password
>SEC_DAEMON_AUTHENTICATION = REQUIRED
>SEC_DAEMON_INTEGRITY = REQUIRED
>SEC_DAEMON_AUTHENTICATION_METHODS = PASSWORD
>SEC_NEGOTIATOR_AUTHENTICATION = REQUIRED
>SEC_NEGOTIATOR_INTEGRITY = REQUIRED
>SEC_NEGOTIATOR_AUTHENTICATION_METHODS = PASSWORD
>SEC_CLIENT_AUTHENTICATION_METHODS = FS, PASSWORD, KERBEROS, GSI

( I didn't do the initial install/configuration of HTcondor on these
systems, I'm just the new admin for them, and still getting my footing)

I've looked through some of the logs, but I can't seem to find any
specific error messages that point me in a new direction. Any
tips/tricks/idea's would be appreciated


--Brandon