
[Condor-users] condor_q -analyze command producing error



Hi Dan,

I am using:
CondorVersion: 7.0.1 Feb 26 2008 BuildID: 76180
CondorPlatform: I386-LINUX_RHEL3

In the central manager config file:

##  How often should the negotiator start a negotiation cycle?
        NEGOTIATOR_INTERVAL    = 150

I have attached two NegotiatorLog files, copied on different dates.

by
Johnson


On Fri, 2008-04-25 at 08:56 -0500, Dan Bradley wrote:
What version of Condor are you running?  How long are your negotiation 
cycles?  There should be a line in the log at the beginning of the cycle 
and at the end:

4/25 08:54:42 ---------- Started Negotiation Cycle ----------
...
4/25 08:54:42 ---------- Finished Negotiation Cycle ----------

--Dan
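
To answer the cycle-length question from a NegotiatorLog, the start and finish markers can be paired up and subtracted. The following is a minimal sketch, not part of Condor: the function name `cycle_durations` and the `year` parameter are assumptions for illustration (the log timestamps carry no year), and marker lines that lack a timestamp are simply skipped.

```python
import re
from datetime import datetime

# Matches lines such as:
#   4/25 08:54:42 ---------- Started Negotiation Cycle ----------
CYCLE_RE = re.compile(
    r"^(\d{1,2}/\d{1,2} \d{2}:\d{2}:\d{2}) -+ (Started|Finished) Negotiation Cycle -+\s*$"
)

def cycle_durations(lines, year=2008):
    """Return a list of (start, finish, seconds) for each completed cycle.

    `year` is an assumption: the log format shown here omits it.
    """
    cycles = []
    start = None
    for line in lines:
        m = CYCLE_RE.match(line)
        if not m:
            continue  # not a cycle marker, or a marker without a timestamp
        stamp = datetime.strptime(f"{year}/{m.group(1)}", "%Y/%m/%d %H:%M:%S")
        if m.group(2) == "Started":
            start = stamp
        elif start is not None:
            cycles.append((start, stamp, (stamp - start).total_seconds()))
            start = None
    return cycles
```

It can be fed the lines of an open log file, for example `cycle_durations(open(path))`, where `path` is wherever the NegotiatorLog lives on the central manager.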

JohnsonKoilraj wrote:

> Hi,
>
> bash-3.1$ condor_q -analyze
> *Error: Could not connect to negotiator ((null))*
>
> After checking the NegotiatorLog file, I noticed that the command 
> produces the desired result only once every 5 minutes, for about 1 minute.
>
> Why does it behave like this? How can I get the status every time I run 
> condor_q -analyze?
>
> Please help me with this.
>
> by
> johnson


 ---------- Finished Negotiation Cycle ----------
4/28 10:13:15 Getting monitoring info for pid 27771
4/28 10:14:15 ---------- Started Negotiation Cycle ----------
4/28 10:14:15 Phase 1:  Obtaining ads from collector ...
4/28 10:14:15   Getting all public ads ...
4/28 10:14:15 Trying to query collector <10.201.42.242:9618>
4/28 10:14:35   Sorting 49 ads ...
4/28 10:14:35   Getting startd private ads ...
4/28 10:14:35 Trying to query collector <10.201.42.242:9618>
4/28 10:14:35 Got ads: 49 public and 19 private
4/28 10:14:35 Public ads include 1 submitter, 23 startd
4/28 10:14:35 Entering compute_signficant_attrs()
4/28 10:14:35 Leaving compute_signficant_attrs() - result=JobUniverse,LastCheckpointPlatform,NumCkpts
4/28 10:14:35 Phase 2:  Performing accounting ...
4/28 10:14:35 Phase 3:  Sorting submitter ads by priority ...
4/28 10:14:35 Phase 4.1:  Negotiating with schedds ...
4/28 10:14:35     NumStartdAds = 23
4/28 10:14:35     NormalFactor = 1.000000
4/28 10:14:35     MaxPrioValue = 1.457145
4/28 10:14:35     NumScheddAds = 1
4/28 10:14:35   Negotiating with idealgrid@xxxxxxxxxxxxxxxxxxxxxxxxx skipped because no idle jobs
4/28 10:14:35   Schedd idealgrid@xxxxxxxxxxxxxxxxxxxxxxxxx got all it wants; removing it.
4/28 10:14:35 ---------- Finished Negotiation Cycle ----------
4/28 10:14:35 enter Matchmaker::updateCollector
4/28 10:14:35 Trying to update collector <10.201.42.242:9618>
4/28 10:14:35 Attempting to send update via UDP to collector scorpio.pesgrid.wipro.com <10.201.42.242:9618>
4/28 10:14:35 exit Matchmaker::UpdateCollector
 ---------- Started Negotiation Cycle ----------
4/28 10:17:05 Phase 1:  Obtaining ads from collector ...
4/28 10:17:05   Getting all public ads ...
4/28 10:17:05 Trying to query collector <10.201.42.242:9618>
4/28 10:17:05   Sorting 49 ads ...
4/28 10:17:05   Getting startd private ads ...
4/28 10:17:05 Trying to query collector <10.201.42.242:9618>
4/28 10:17:05 Got ads: 49 public and 23 private
4/28 10:17:05 Public ads include 1 submitter, 23 startd
4/28 10:17:05 Entering compute_signficant_attrs()
4/28 10:17:05 Leaving compute_signficant_attrs() - result=JobUniverse,LastCheckpointPlatform,NumCkpts
4/28 10:17:05 Phase 2:  Performing accounting ...
4/28 10:17:05 Phase 3:  Sorting submitter ads by priority ...
4/28 10:17:05 Phase 4.1:  Negotiating with schedds ...
4/28 10:17:05     NumStartdAds = 23
4/28 10:17:05     NormalFactor = 1.000000
4/28 10:17:05     MaxPrioValue = 1.468622
4/28 10:17:05     NumScheddAds = 1
4/28 10:17:05   Negotiating with idealgrid@xxxxxxxxxxxxxxxxxxxxxxxxx skipped because no idle jobs
4/28 10:17:05   Schedd idealgrid@xxxxxxxxxxxxxxxxxxxxxxxxx got all it wants; removing it.
4/28 10:17:05 ---------- Finished Negotiation Cycle ----------
4/28 10:17:15 Getting monitoring info for pid 27771
4/28 10:19:35 enter Matchmaker::updateCollector
4/28 10:19:35 Trying to update collector <10.201.42.242:9618>
4/28 10:19:35 Attempting to send update via UDP to collector scorpio.pesgrid.wipro.com <10.201.42.242:9618>
4/28 10:19:35 exit Matchmaker::UpdateCollector
4/28 10:19:35 ---------- Started Negotiation Cycle ----------
4/28 10:19:35 Phase 1:  Obtaining ads from collector ...

6/25 10:28:36 ******************************************************
6/25 10:28:36 ** condor_negotiator (CONDOR_NEGOTIATOR) STARTING UP
6/25 10:28:36 ** /home/condor-7.0.1/sbin/condor_negotiator
6/25 10:28:36 ** $CondorVersion: 7.0.1 Feb 26 2008 BuildID: 76180 $
6/25 10:28:36 ** $CondorPlatform: I386-LINUX_RHEL3 $
6/25 10:28:36 ** PID = 29398
6/25 10:28:36 ** Log last touched 6/25 10:27:58
6/25 10:28:36 ******************************************************
6/25 10:28:36 Using config source: /home/condor-7.0.1/etc/condor_config
6/25 10:28:36 Using local config sources: 
6/25 10:28:36    /home/condor-7.0.1/local.scorpio/condor_config.local
6/25 10:28:36 Running as root.  Enabling specialized core dump routines
6/25 10:28:36 DaemonCore: Command Socket at <10.201.42.242:42409>
6/25 10:28:36 Initialized the following authorization table:
6/25 10:28:36 host 10.207.123.106: user *: READ,WRITE,DAEMON,ADVERTISE_STARTD,ADVERTISE_SCHEDD,ADVERTISE_MASTER
6/25 10:28:36 host 10.201.42.242: user *: READ,WRITE,NEGOTIATOR,ADMINISTRATOR,OWNER,DAEMON,ADVERTISE_STARTD,ADVERTISE_SCHEDD,ADVERTISE_MASTER
6/25 10:28:36 host 10.207.123.56: user *: READ,WRITE,DAEMON,ADVERTISE_STARTD,ADVERTISE_SCHEDD,ADVERTISE_MASTER
6/25 10:28:36 Will use UDP to update collector scorpio.pesgrid.wipro.com <10.201.42.242:9618>
6/25 10:28:36 NEGOTIATOR_SOCKET_CACHE_SIZE = 16
6/25 10:28:36 PREEMPTION_REQUIREMENTS = ( (CurrentTime - EnteredCurrentState) > (1 * (60 * 60)) && RemoteUserPrio > SubmittorPrio * 1.2 ) || (MY.NiceUser == True)
6/25 10:28:36 ACCOUNTANT_HOST = None (local)
6/25 10:28:36 NEGOTIATOR_INTERVAL = 300 sec
6/25 10:28:36 NEGOTIATOR_TIMEOUT = 30 sec
6/25 10:28:36 MAX_TIME_PER_SUBMITTER = 31536000 sec
6/25 10:28:36 MAX_TIME_PER_PIESPIN = 31536000 sec
6/25 10:28:36 PREEMPTION_RANK = (RemoteUserPrio * 1000000) - TARGET.ImageSize
6/25 10:28:36 NEGOTIATOR_PRE_JOB_RANK = RemoteOwner =?= UNDEFINED
6/25 10:28:36 NEGOTIATOR_POST_JOB_RANK = None
6/25 10:28:36 Getting monitoring info for pid 29398
6/25 10:28:36 ---------- Started Negotiation Cycle ----------
6/25 10:28:36 Phase 1:  Obtaining ads from collector ...
6/25 10:28:36   Getting all public ads ...
6/25 10:28:36 Trying to query collector <10.201.42.242:9618>
6/25 10:28:36   Sorting 0 ads ...
6/25 10:28:36   Getting startd private ads ...
6/25 10:28:36 Trying to query collector <10.201.42.242:9618>
6/25 10:28:36 Got ads: 0 public and 0 private
6/25 10:28:36 Public ads include 0 submitter, 0 startd
6/25 10:28:36 Entering compute_signficant_attrs()
6/25 10:28:36 Phase 2:  Performing accounting ...
6/25 10:28:36 Phase 3:  Sorting submitter ads by priority ...
6/25 10:28:36 Phase 4.1:  Negotiating with schedds ...
6/25 10:28:36     NumStartdAds = 0
6/25 10:28:36     NormalFactor = 0.000000
6/25 10:28:36     MaxPrioValue = 0.000000
6/25 10:28:36     NumScheddAds = 0
6/25 10:28:36 ---------- Finished Negotiation Cycle ----------
6/25 10:28:36 enter Matchmaker::updateCollector
6/25 10:28:36 Trying to update collector <10.201.42.242:9618>
6/25 10:28:36 Attempting to send update via UDP to collector scorpio.pesgrid.wipro.com <10.201.42.242:9618>
6/25 10:28:36 exit Matchmaker::UpdateCollector
6/25 10:28:37 DaemonCore: in SendAliveToParent()
6/25 10:28:37 DaemonCore: Leaving SendAliveToParent() - success
6/25 10:29:03 Getting state information from the accountant
6/25 10:29:11 Getting state information from the accountant
6/25 10:29:12 Getting state information from the accountant
6/25 10:32:36 Getting monitoring info for pid 29398
6/25 10:33:36 ---------- Started Negotiation Cycle ----------
6/25 10:33:36 Phase 1:  Obtaining ads from collector ...
6/25 10:33:36   Getting all public ads ...
6/25 10:33:36 Trying to query collector <10.201.42.242:9618>
6/25 10:33:36   Sorting 22 ads ...
6/25 10:33:36   Getting startd private ads ...
6/25 10:33:36 Trying to query collector <10.201.42.242:9618>
6/25 10:33:36 Got ads: 22 public and 10 private
6/25 10:33:36 Public ads include 1 submitter, 10 startd
6/25 10:33:36 Entering compute_signficant_attrs()
6/25 10:33:36 Leaving compute_signficant_attrs() - result=JobUniverse,LastCheckpointPlatform,NumCkpts
6/25 10:33:36 Phase 2:  Performing accounting ...
6/25 10:33:36 Phase 3:  Sorting submitter ads by priority ...
6/25 10:33:36 Phase 4.1:  Negotiating with schedds ...
6/25 10:33:36     NumStartdAds = 10
6/25 10:33:36     NormalFactor = 1.000000
6/25 10:33:36     MaxPrioValue = 5.447453
6/25 10:33:36     NumScheddAds = 1
6/25 10:33:36   Negotiating with idealgrid@xxxxxxxxxxxxxxxxxxxxxxxxx at <10.201.42.242:58361>
6/25 10:33:36 0 seconds so far
6/25 10:33:36   Calculating schedd limit with the following parameters
6/25 10:33:36     ScheddPrio       = 5.447453
6/25 10:33:36     ScheddPrioFactor = 1.000000
6/25 10:33:36     scheddShare      = 0.000000
6/25 10:33:36     scheddAbsShare   = 1.000000
6/25 10:33:36     ScheddUsage      = 0
6/25 10:33:36     scheddLimit      = 10
6/25 10:33:36     userprioCrumbs   = 0 (0)
6/25 10:33:36     MaxscheddLimit   = 10
6/25 10:33:36 Socket to <10.201.42.242:58361> not in cache, creating one
6/25 10:33:36 SocketCache:  Found unused slot 0
6/25 10:33:36     Sending SEND_JOB_INFO/eom
6/25 10:33:36     Getting reply from schedd ...
6/25 10:33:36     Got JOB_INFO command; getting classad/eom
6/25 10:33:36     Request 00047.00000:
6/25 10:33:36       Rejected 47.0 idealgrid@xxxxxxxxxxxxxxxxxxxxxxxxx <10.201.42.242:58361>: no match found
6/25 10:33:36     Sending SEND_JOB_INFO/eom
6/25 10:33:36     Getting reply from schedd ...
6/25 10:33:36     Got NO_MORE_JOBS;  done negotiating
6/25 10:33:36   Schedd idealgrid@xxxxxxxxxxxxxxxxxxxxxxxxx got all it wants; removing it.
6/25 10:33:36 ---------- Finished Negotiation Cycle ----------
6/25 10:33:36 enter Matchmaker::updateCollector
6/25 10:33:36 Trying to update collector <10.201.42.242:9618>
6/25 10:33:36 Attempting to send update via UDP to collector scorpio.pesgrid.wipro.com <10.201.42.242:9618>
6/25 10:33:36 exit Matchmaker::UpdateCollector
6/25 10:36:36 Getting monitoring info for pid 29398
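
The startup banner in the second log prints the settings the negotiator actually loaded (for example `NEGOTIATOR_INTERVAL = 300 sec`), which can be compared against the value in the config file. The following is a small sketch for pulling those lines out of a log; the function name `logged_settings` is hypothetical, and the pattern assumes the `NAME = value` layout seen in the excerpt above rather than any documented log format.

```python
import re

# Matches startup lines such as:
#   6/25 10:28:36 NEGOTIATOR_INTERVAL = 300 sec
# Indented lines (NumStartdAds etc.) and "** PID = ..." lines do not match.
SETTING_RE = re.compile(
    r"^\d{1,2}/\d{1,2} \d{2}:\d{2}:\d{2} ([A-Z][A-Z0-9_]*) = (.*)$"
)

def logged_settings(lines):
    """Return {name: value} for settings echoed in the log; later lines win."""
    settings = {}
    for line in lines:
        m = SETTING_RE.match(line.rstrip("\n"))
        if m:
            settings[m.group(1)] = m.group(2).strip()
    return settings
```

Looking up `logged_settings(open(path))["NEGOTIATOR_INTERVAL"]` then shows the interval the running daemon reported, where `path` is again the location of the NegotiatorLog.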