[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Condor-users] Is it negotiator problem?




I have condor in two machines, jobs can be send in both machines but
condor do not distribute them propertly. What can cause this?



negotiator log file looks like here
2/8 08:37:30 ******************************************************
2/8 08:37:30 ** condor_negotiator (CONDOR_NEGOTIATOR) STARTING UP
2/8 08:37:30 ** /home/condor/condor/sbin/condor_negotiator
2/8 08:37:30 ** $CondorVersion: 6.6.10 Jun 13 2005 $
2/8 08:37:30 ** $CondorPlatform: I386-LINUX_RH9 $
2/8 08:37:30 ** PID = 3487
2/8 08:37:30 ******************************************************
2/8 08:37:30 Using config file: /home/condor/condor/etc/condor_config
2/8 08:37:30 Using local config files: /home/condor/condor_config.local
2/8 08:37:30 DaemonCore: Command Socket at <172.16.16.42:9614>
2/8 08:37:30 SEC_DEFAULT_SESSION_DURATION is undefined, using default value of 3600
2/8 08:37:30 NEGOTIATOR_TIMEOUT_MULTIPLIER is undefined, using default value of 0
2/8 08:37:30 About to truncate log /home/condor/spool/Accountantnew.log
2/8 08:37:30 ACCOUNTANT_HOST = None (local)
2/8 08:37:30 NEGOTIATOR_INTERVAL = 300 sec
2/8 08:37:30 NEGOTIATOR_TIMEOUT = 30 sec
2/8 08:37:30 PREEMPTION_REQUIREMENTS = (CurrentTime - EnteredCurrentState) > (1 * (60 * 60)) && RemoteUserPrio > SubmittorPrio * 1.2
2/8 08:37:30 PREEMPTION_RANK = (RemoteUserPrio * 1000000) - TARGET.ImageSize
2/8 08:37:30 ---------- Started Negotiation Cycle ----------
2/8 08:37:30 Phase 1:  Obtaining ads from collector ...
2/8 08:37:30   Getting all public ads ...
2/8 08:37:30 NEGOTIATOR_TIMEOUT_MULTIPLIER is undefined, using default value of 0
2/8 08:37:30   Sorting 0 ads ...
2/8 08:37:30   Getting startd private ads ...
2/8 08:37:30 NEGOTIATOR_TIMEOUT_MULTIPLIER is undefined, using default value of 0
2/8 08:37:30 condor_read(): recv() returned -1, errno = 104, assuming failure.
2/8 08:37:30 Couldn't fetch ads: communication error
2/8 08:37:30 Aborting negotiation cycle
2/8 08:37:31 DaemonCore: in SendAliveToParent()
2/8 08:37:31 DaemonCore: attempting to connect to '<172.16.16.42:32781>'
2/8 08:37:31 NEGOTIATOR_TIMEOUT_MULTIPLIER is undefined, using default value of 0
2/8 08:42:30 ---------- Started Negotiation Cycle ----------
2/8 08:42:30 Phase 1:  Obtaining ads from collector ...
2/8 08:42:30   Getting all public ads ...
2/8 08:42:30 NEGOTIATOR_TIMEOUT_MULTIPLIER is undefined, using default value of 0
2/8 08:42:30 SEC_DEBUG_PRINT_KEYS is undefined, using default value of False
2/8 08:42:30   Sorting 7 ads ...
2/8 08:42:30   Getting startd private ads ...
2/8 08:42:30 NEGOTIATOR_TIMEOUT_MULTIPLIER is undefined, using default value of 0
2/8 08:42:30 SEC_DEBUG_PRINT_KEYS is undefined, using default value of False
2/8 08:42:30 condor_read(): recv() returned -1, errno = 104, assuming failure.
2/8 08:42:30 Couldn't fetch ads: communication error
2/8 08:42:30 Aborting negotiation cycle
2/8 08:47:30 ---------- Started Negotiation Cycle ----------
2/8 08:47:30 Phase 1:  Obtaining ads from collector ...
2/8 08:47:30   Getting all public ads ...
2/8 08:47:30 NEGOTIATOR_TIMEOUT_MULTIPLIER is undefined, using default value of 0
2/8 08:47:30 SEC_DEBUG_PRINT_KEYS is undefined, using default value of False
2/8 08:47:30   Sorting 7 ads ...
2/8 08:47:30   Getting startd private ads ...
2/8 08:47:30 NEGOTIATOR_TIMEOUT_MULTIPLIER is undefined, using default value of 0
2/8 08:47:30 SEC_DEBUG_PRINT_KEYS is undefined, using default value of False
2/8 08:47:30 condor_read(): recv() returned -1, errno = 104, assuming failure.
2/8 08:47:30 Couldn't fetch ads: communication error
2/8 08:47:30 Aborting negotiation cycle
2/8 08:52:30 ---------- Started Negotiation Cycle ----------
2/8 08:52:30 Phase 1:  Obtaining ads from collector ...
2/8 08:52:30   Getting all public ads ...
2/8 08:52:30 NEGOTIATOR_TIMEOUT_MULTIPLIER is undefined, using default value of 0
2/8 08:52:30 SEC_DEBUG_PRINT_KEYS is undefined, using default value of False
2/8 08:52:30   Sorting 7 ads ...
2/8 08:52:30   Getting startd private ads ...
2/8 08:52:30 NEGOTIATOR_TIMEOUT_MULTIPLIER is undefined, using default value of 0
2/8 08:52:30 SEC_DEBUG_PRINT_KEYS is undefined, using default value of False
2/8 08:52:30 condor_read(): recv() returned -1, errno = 104, assuming failure.
2/8 08:52:30 Couldn't fetch ads: communication error
2/8 08:52:30 Aborting negotiation cycle
2/8 08:57:01 DaemonCore: in SendAliveToParent()
2/8 08:57:01 DaemonCore: attempting to connect to '<172.16.16.42:32781>'
2/8 08:57:01 NEGOTIATOR_TIMEOUT_MULTIPLIER is undefined, using default value of 0
2/8 08:57:01 SEC_DEBUG_PRINT_KEYS is undefined, using default value of False
2/8 08:57:30 ---------- Started Negotiation Cycle ----------
2/8 08:57:30 Phase 1:  Obtaining ads from collector ...
2/8 08:57:30   Getting all public ads ...
2/8 08:57:30 NEGOTIATOR_TIMEOUT_MULTIPLIER is undefined, using default value of 0
2/8 08:57:30 SEC_DEBUG_PRINT_KEYS is undefined, using default value of False
2/8 08:57:30   Sorting 7 ads ...
2/8 08:57:30   Getting startd private ads ...
2/8 08:57:30 NEGOTIATOR_TIMEOUT_MULTIPLIER is undefined, using default value of 0
2/8 08:57:30 SEC_DEBUG_PRINT_KEYS is undefined, using default value of False
2/8 08:57:30 condor_read(): recv() returned -1, errno = 104, assuming failure.
2/8 08:57:30 Couldn't fetch ads: communication error
2/8 08:57:30 Aborting negotiation cycle
2/8 09:02:30 ---------- Started Negotiation Cycle ----------
2/8 09:02:30 Phase 1:  Obtaining ads from collector ...
2/8 09:02:30   Getting all public ads ...
2/8 09:02:30 NEGOTIATOR_TIMEOUT_MULTIPLIER is undefined, using default value of 0
2/8 09:02:30 SEC_DEBUG_PRINT_KEYS is undefined, using default value of False
2/8 09:02:30   Sorting 7 ads ...
2/8 09:02:30   Getting startd private ads ...
2/8 09:02:30 NEGOTIATOR_TIMEOUT_MULTIPLIER is undefined, using default value of 0
2/8 09:02:30 SEC_DEBUG_PRINT_KEYS is undefined, using default value of False
2/8 09:02:30 condor_read(): recv() returned -1, errno = 104, assuming failure.
2/8 09:02:30 Couldn't fetch ads: communication error
2/8 09:02:30 Aborting negotiation cycle
2/8 09:07:30 ---------- Started Negotiation Cycle ----------
2/8 09:07:30 Phase 1:  Obtaining ads from collector ...
2/8 09:07:30   Getting all public ads ...
2/8 09:07:30 NEGOTIATOR_TIMEOUT_MULTIPLIER is undefined, using default value of 0
2/8 09:07:30 SEC_DEBUG_PRINT_KEYS is undefined, using default value of False
2/8 09:07:30   Sorting 7 ads ...
2/8 09:07:30   Getting startd private ads ...
2/8 09:07:30 NEGOTIATOR_TIMEOUT_MULTIPLIER is undefined, using default value of 0
2/8 09:07:30 SEC_DEBUG_PRINT_KEYS is undefined, using default value of False
2/8 09:07:30 condor_read(): recv() returned -1, errno = 104, assuming failure.
2/8 09:07:30 Couldn't fetch ads: communication error
2/8 09:07:30 Aborting negotiation cycle
2/8 09:12:30 ---------- Started Negotiation Cycle ----------
2/8 09:12:30 Phase 1:  Obtaining ads from collector ...
2/8 09:12:30   Getting all public ads ...
2/8 09:12:30 NEGOTIATOR_TIMEOUT_MULTIPLIER is undefined, using default value of 0
2/8 09:12:30 SEC_DEBUG_PRINT_KEYS is undefined, using default value of False
2/8 09:12:30   Sorting 7 ads ...
2/8 09:12:30   Getting startd private ads ...
2/8 09:12:30 NEGOTIATOR_TIMEOUT_MULTIPLIER is undefined, using default value of 0
2/8 09:12:30 SEC_DEBUG_PRINT_KEYS is undefined, using default value of False
2/8 09:12:30 condor_read(): recv() returned -1, errno = 104, assuming failure.
2/8 09:12:30 Couldn't fetch ads: communication error
2/8 09:12:30 Aborting negotiation cycle
2/8 09:16:31 DaemonCore: in SendAliveToParent()
2/8 09:16:31 DaemonCore: attempting to connect to '<172.16.16.42:32781>'
2/8 09:16:31 NEGOTIATOR_TIMEOUT_MULTIPLIER is undefined, using default value of 0
2/8 09:16:31 SEC_DEBUG_PRINT_KEYS is undefined, using default value of False
2/8 09:17:30 ---------- Started Negotiation Cycle ----------
2/8 09:17:30 Phase 1:  Obtaining ads from collector ...
2/8 09:17:30   Getting all public ads ...
2/8 09:17:30 NEGOTIATOR_TIMEOUT_MULTIPLIER is undefined, using default value of 0
2/8 09:17:30 SEC_DEBUG_PRINT_KEYS is undefined, using default value of False
2/8 09:17:30   Sorting 7 ads ...
2/8 09:17:30   Getting startd private ads ...
2/8 09:17:30 NEGOTIATOR_TIMEOUT_MULTIPLIER is undefined, using default value of 0
2/8 09:17:30 SEC_DEBUG_PRINT_KEYS is undefined, using default value of False
2/8 09:17:30 condor_read(): recv() returned -1, errno = 104, assuming failure.
2/8 09:17:30 Couldn't fetch ads: communication error
2/8 09:17:30 Aborting negotiation cycle
2/8 09:22:30 ---------- Started Negotiation Cycle ----------
2/8 09:22:30 Phase 1:  Obtaining ads from collector ...
2/8 09:22:30   Getting all public ads ...
2/8 09:22:30 NEGOTIATOR_TIMEOUT_MULTIPLIER is undefined, using default value of 0
2/8 09:22:30 SEC_DEBUG_PRINT_KEYS is undefined, using default value of False
2/8 09:22:30   Sorting 7 ads ...
2/8 09:22:30   Getting startd private ads ...
2/8 09:22:30 NEGOTIATOR_TIMEOUT_MULTIPLIER is undefined, using default value of 0
2/8 09:22:30 SEC_DEBUG_PRINT_KEYS is undefined, using default value of False
2/8 09:22:30 condor_read(): recv() returned -1, errno = 104, assuming failure.
2/8 09:22:30 Couldn't fetch ads: communication error
2/8 09:22:30 Aborting negotiation cycle
--
Thanks and regards,
Srinivas.Malyala