
[Condor-users] Condor install / config issues, something wrong with collector configuration?



Hello All,

I'm new to Condor. I think the collector is not configured properly,
but I don't believe I've changed anything from the default collector
configuration.

I appreciate your help!

Details below:
---
I'm installing on a single machine to start with, using
condor-7.0.5-linux-x86_64-rhel5.tar

---
I installed with the command:
./condor_configure --backup --install=/opt/UPS/or/condor/condor-7.0.5 \
    --install-dir=/opt/UPS/or/condor --type=manager,execute,submit \
    --local-dir=/opt/UPS/or/condor/local.uskyarpds0310 --verbose

---
I am not running Condor as root; I run it as the userid condor instead.
I start Condor with "condor_master", then I see these daemons running:

condor   16862     1  0 12:52 ?        00:00:00 condor_master
condor   16863 16862  0 12:52 ?        00:00:00 condor_collector -f
condor   16865 16862  0 12:52 ?        00:00:00 condor_negotiator -f
condor   16866 16862  0 12:52 ?        00:00:00 condor_schedd -f
condor   16867 16862  0 12:52 ?        00:00:05 condor_startd -f
condor   16868 16866  0 12:52 ?        00:00:00 condor_procd -A /tmp/condor-lock.uskyarpds03100.524279833844961/procd_pipe.SCHEDD -S 60 -C 9015

---
Master log has:

MasterLog:1/7 17:52:48 Collector port not defined, will use default: 9618
---
My NegotiatorLog has messages like these repeated in it:
1/7 17:57:51 ---------- Started Negotiation Cycle ----------
1/7 17:57:51 Phase 1:  Obtaining ads from collector ...
1/7 17:57:51   Getting all public ads ...
1/7 17:57:51 Couldn't fetch ads: can't find collector
1/7 17:57:51 Aborting negotiation cycle

---
I see this error in several of the logs:
1/7 17:52:51 ERROR: Unable to find collector info in configuration file!!!
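
To check which logs carry this error, the matching lines can be pulled
out with a short script. This is only a minimal sketch: the function
name find_errors and the sample text are made up for illustration, and
it assumes every log line starts with a date and time as in the
excerpts in this message.

```python
import re

# Matches log lines shaped like "1/7 17:52:51 ERROR: ..." (date, time, message).
LOG_LINE = re.compile(r"^(\d+/\d+) (\d+:\d+:\d+) (.*)$")

def find_errors(log_text):
    """Return (date, time, message) tuples for lines whose message starts with ERROR."""
    hits = []
    for line in log_text.splitlines():
        m = LOG_LINE.match(line.strip())
        if m and m.group(3).startswith("ERROR"):
            hits.append(m.groups())
    return hits

# Sample text copied from the CollectorLog excerpt in this message.
sample = """1/7 17:52:48 DaemonCore: Command Socket at <10.224.217.231:9618>
1/7 17:52:48 ERROR: Unable to find collector info in configuration file!!!
1/7 17:52:48 In ViewServer::Init()"""

errors = find_errors(sample)
for date, time, msg in errors:
    print(date, time, msg)
```

Running it over the text of each log file in turn would show which
daemons report the error and when.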

---
I submitted a job successfully, but it does not start.

condor_q


 Submitter: uskyarpds0310.air.ups.com : <10.224.217.231:46467> : uskyarpds0310.air.ups.com
 ID      OWNER            SUBMITTED     RUN_TIME ST PRI SIZE CMD
  3.0   app3lam         1/7  13:03   0+00:00:00 I  0   0.0  foo

---
And I get errors from these commands:

condor_q -better-analy
Error:  Could not fetch startd ads

condor_status
Error:  Could not fetch ads --- can't find collector

---

The top of my CollectorLog has:

1/7 17:52:48 ******************************************************
1/7 17:52:48 ** condor_collector (CONDOR_COLLECTOR) STARTING UP
1/7 17:52:48 ** /opt/UPS/or/condor/sbin/condor_collector
1/7 17:52:48 ** $CondorVersion: 7.0.5 Sep 20 2008 BuildID: 105846 $
1/7 17:52:48 ** $CondorPlatform: X86_64-LINUX_RHEL5 $
1/7 17:52:48 ** PID = 16863
1/7 17:52:48 ** Log last touched time unavailable (No such file or directory)
1/7 17:52:48 ******************************************************
1/7 17:52:48 Using config source: /opt/UPS/or/condor/etc/condor_config
1/7 17:52:48 Using local config sources:
1/7 17:52:48    /opt/UPS/or/condor/local.uskyarpds0310/condor_config.local
1/7 17:52:48 DaemonCore: Command Socket at <10.224.217.231:9618>
1/7 17:52:48 ERROR: Unable to find collector info in configuration file!!!
1/7 17:52:48 In ViewServer::Init()
1/7 17:52:48 In CollectorDaemon::Init()
1/7 17:52:48 In ViewServer::Config()
1/7 17:52:48 In CollectorDaemon::Config()
1/7 17:52:48 enable: Creating stats hash table
1/7 18:07:48 Housekeeper:  Ready to clean old ads
1/7 18:07:48    Cleaning StartdAds ...
1/7 18:07:48    Cleaning StartdPrivateAds ...
1/7 18:07:48    Cleaning QuillAds ...
1/7 18:07:48    Cleaning ScheddAds ...


---
Any recommendations or pointers to appropriate documents would be
greatly appreciated.

Thx, Lee