Re: [Condor-users] ERROR PLZZ HELP OUT Could not fetch ads --- can't find collector



On Mar 8, 2006, at 12:10 AM, kailash raj wrote:

Many thanks to Alain Roy. I've followed your suggestion and things have almost recovered, but I still have some problems.

On both the client and the head node the startd is not running. Even if I start condor_startd manually, it is not running.

Here is the output of the commands.

On the head node:


[root@ca1 ~]# ps -ef | egrep condor_
condor    4567     1  0 11:03 ?        00:00:00 condor_master
condor    4569  4567  0 11:03 ?        00:00:00 condor_schedd -f
condor    4594     1  0 11:03 ?        00:00:00 condor_collector
condor    4603     1  0 11:03 ?        00:00:00 condor_negotiator
condor    4619     1  0 11:04 ?        00:00:00 condor_schedd
root      4658  3773  0 11:04 pts/1    00:00:00 egrep condor_

You have two schedds running on this machine. You should only have one. Kill the one you started manually.
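If it is not obvious which schedd is the manual one, a rough way to tell is to look at the parent PID: in the listing above, the schedd with the -f flag is a child of condor_master (PID 4567), while the other has parent PID 1. The sketch below assumes that pattern holds on your machine, and the function name find_stray_schedd is just an illustrative helper, not part of Condor.

```shell
# Print the PIDs of condor_schedd processes whose parent is NOT a condor_master.
# Reads `ps -ef` style output on stdin, assuming the usual column layout:
#   UID PID PPID C STIME TTY TIME CMD
# find_stray_schedd is a hypothetical helper name, not a Condor tool.
find_stray_schedd() {
    awk '
        # Remember the PID of every condor_master (bare name or full path).
        $8 ~ /(^|\/)condor_master$/ { master[$2] = 1 }
        # Record PID and parent PID of every condor_schedd.
        $8 ~ /(^|\/)condor_schedd$/ { pid[++n] = $2; ppid[n] = $3 }
        END {
            for (i = 1; i <= n; i++)
                if (!(ppid[i] in master)) print pid[i]
        }
    '
}

# Usage (inspect the result before killing anything):
#   ps -ef | find_stray_schedd
```

Check the printed PID against the ps listing by eye first, then kill that process by hand.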

[root@ca1 ~]# condor_status
Error:  Could not fetch ads --- can't find collector

If you run 'condor_config_val condor_host', it should print the hostname of your head node (the machine running the collector). If it prints something different, that means the Condor daemons don't know how to find your collector. Check this on both machines.
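To make that check repeatable on both machines, something like the following could be used. It is only a sketch: check_condor_host is a made-up helper name, and the hostname you pass in is whatever you expect your head node to be called.

```shell
# Compare the configured condor_host with the hostname you expect.
# $1: expected head node hostname (supplied by you; an assumption of this sketch).
# check_condor_host is a hypothetical helper name, not a Condor tool.
check_condor_host() {
    expected=$1
    # condor_config_val prints the value of the named configuration macro.
    actual=$(condor_config_val condor_host) || {
        echo "condor_config_val failed; is the Condor bin directory in PATH?" >&2
        return 2
    }
    if [ "$actual" = "$expected" ]; then
        echo "OK: CONDOR_HOST is $actual"
    else
        echo "MISMATCH: CONDOR_HOST is $actual, expected $expected"
        return 1
    fi
}

# Usage, run on the head node and on the client:
#   check_condor_host ca1
```

The comparison is a plain string match, so a short name on one side and a fully qualified name on the other will be reported as a mismatch even when they refer to the same machine.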

3/8 11:04:12 The STARTD (pid 4622) exited with status 4

Take a look at StartLog in your Condor log directory to see why the startd is exiting.
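If you are not sure where the log directory is, condor_config_val can report it through the LOG setting. A small sketch, assuming StartLog sits directly in that directory under its default name (show_startlog is an illustrative helper name only):

```shell
# Show the last lines of the startd log.
# $1: number of lines to show (optional, defaults to 50).
# Assumes the startd log is named StartLog inside the directory given by LOG.
# show_startlog is a hypothetical helper name, not a Condor tool.
show_startlog() {
    lines=${1:-50}
    # Ask Condor where its log directory is.
    log_dir=$(condor_config_val LOG) || return 2
    tail -n "$lines" "$log_dir/StartLog"
}

# Usage:
#   show_startlog 100
```

The messages just before the startd exits are usually the ones that explain the exit status reported in the master's log.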

+--------------------------------+-----------------------------------+
|           Jaime Frey           | I used to be a heavy gambler.     |
|       jfrey@xxxxxxxxxxx        | But now I just make mental bets.  |
| http://www.cs.wisc.edu/~jfrey/ | That's how I lost my mind.        |
+--------------------------------+-----------------------------------+