
Re: [Condor-users] CEDAR:6001:Failed to fetch ads [ SOLVED!! ]



Hi all,

I've fixed the problem; it's working now. Two things had gone wrong, and neither was reported clearly in the logs.

First, I had a wrong entry in the condor_config file:
FLOCK_TO = FALSE

FLOCK_TO and FLOCK_FROM are not booleans. If either is set to anything other than an empty value, Condor treats that value as a hostname and tries to resolve it. That's why I was getting:
IPVERIFY: unable to resolve IP address of FALSE

/lib64/tls/libc.so.6(__nss_hostname_digits_dots+0x47)[0x33394d7a87]

Once corrected, those errors were gone.
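
In case it helps anyone check their own setup, here is a minimal sketch of a scan for this mistake. It is a plain text check over the config contents, not Condor's own parser; the function name suspicious_flock_entries and the set of "boolean-looking" words are my own assumptions for illustration.

```python
import re

# Words that look like booleans. In a flocking list they would be
# treated as hostnames, so they are almost certainly a mistake.
# (This set is an assumption for illustration, not a Condor rule.)
BOOLEAN_LOOKING = {"true", "false"}


def suspicious_flock_entries(config_text):
    """Return (name, entry) pairs where FLOCK_TO/FLOCK_FROM holds a
    boolean-looking entry instead of a hostname."""
    hits = []
    for line in config_text.splitlines():
        stripped = line.strip()
        # Skip blank lines and comments.
        if not stripped or stripped.startswith("#"):
            continue
        match = re.match(r"(FLOCK_TO|FLOCK_FROM)\s*=\s*(.*)$",
                         stripped, re.IGNORECASE)
        if not match:
            continue
        name, value = match.group(1).upper(), match.group(2)
        # Entries may be separated by commas and/or whitespace.
        for entry in re.split(r"[,\s]+", value.strip()):
            if entry.lower() in BOOLEAN_LOOKING:
                hits.append((name, entry))
    return hits


if __name__ == "__main__":
    sample = "FLOCK_TO = FALSE\nFLOCK_FROM =\n"
    for name, entry in suspicious_flock_entries(sample):
        print(f"{name} contains '{entry}', which would be resolved as a hostname")
```

Note that this only looks at the text you give it; it does not follow local config files or macro expansion, so treat a clean result as a hint rather than proof.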

But the actual problem came from a confusing statement in the supplied condor_config file, which caused all of this trouble.
At line 961 of the condor_config file ($CONDOR_CONFIG), it says:
## The NEGOTIATOR_HOST parameter has been deprecated.

So I kept the NEGOTIATOR_HOST line commented out. That turned out to be a bug, already reported by our colleague at Cambridge eScience, and it is being fixed/tested in v7.5. Until then, we do need to set:
NEGOTIATOR_HOST = $(CONDOR_HOST)

just like the old days. Once I set this, condor_schedd stopped crashing and everything came back to life. If anyone else is having a similar problem, it's worth checking.
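
For a quick check of whether NEGOTIATOR_HOST is actually set (and not just mentioned in a comment), something like the rough sketch below may do. The function name negotiator_host_is_set is hypothetical, and this is a simple line scan of the config text under the assumption that the setting lives in the file you pass in.

```python
import re


def negotiator_host_is_set(config_text):
    """Return True if the text contains an uncommented NEGOTIATOR_HOST
    assignment with a non-empty value."""
    for line in config_text.splitlines():
        stripped = line.strip()
        # Commented-out assignments do not count.
        if stripped.startswith("#"):
            continue
        # Require at least one non-space character after the '='.
        if re.match(r"NEGOTIATOR_HOST\s*=\s*\S", stripped, re.IGNORECASE):
            return True
    return False


if __name__ == "__main__":
    shipped = (
        "## The NEGOTIATOR_HOST parameter has been deprecated.\n"
        "#NEGOTIATOR_HOST = $(CONDOR_HOST)\n"
    )
    fixed = shipped + "NEGOTIATOR_HOST = $(CONDOR_HOST)\n"
    print("shipped config sets NEGOTIATOR_HOST:", negotiator_host_is_set(shipped))
    print("fixed config sets NEGOTIATOR_HOST:", negotiator_host_is_set(fixed))
```

Running condor_config_val NEGOTIATOR_HOST on the machine itself is probably the more reliable check, since it reflects what Condor sees after reading all config files.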

Cheers,
Santanu

    

On 18/11/2010 18:39, Ines Dutra wrote:
Dear all, regarding this problem mentioned by Santanu, I am having the same one here in the Biostats and Med Informatics Department. Besides having these error messages I also have messages such as:

file .... is not readable by condor

when the file actually exists....

I would appreciate any help with this matter as well...

Thank you!

Best,

Inês.

On Wed, Nov 17, 2010 at 5:07 PM, Santanu Das <santanu@xxxxxxxxxxxxxxxxx> wrote:
Hi all,

I'm using the v7.4.4 dynamically linked RHEL3 rpm on Scientific Linux 4, and every time I try to use condor_q I get a "Failed to fetch ads..." error:

       [root@serv07 log]# condor_q

       -- Failed to fetch ads from: <172.24.116.185:9570> : serv07.hep.phy.cam.ac.uk
       CEDAR:6001:Failed to connect to <172.24.116.185:9570>

Looks like condor_schedd is crashing. How can I fix this? Thanks in advance for any help.

Cheers,
Santanu
_______________________________________________
Condor-users mailing list
To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/condor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/condor-users/



--
Inês Dutra
Prof. Auxiliar
Department of Computer Science
University of Porto
CRACS and INESC-Porto LA
Visiting scholar at UW-Madison till Jan 2011
