
Re: [HTCondor-users] [Condor-users] how to fix 'DC_AUTHENTICATE: Unable to reconcile!'?



Dear Zach:
  
  I added 'TOOL_DEBUG = D_ALL' and 'SUBMIT_DEBUG = D_ALL' to condor_config; the output of 'condor_submit -debug **' follows. Any idea which part leads to 'ERROR: Failed to connect to local queue manager; SECMAN:2007:Failed to end classad message'?

  Please note that this machine, valtical00, runs 'COLLECTOR, MASTER, NEGOTIATOR, SCHEDD, STARTD'.

  Enclosed are the 2 configuration files and all the logs (old logs already removed).

  
-bash-4.1$ condor_submit -debug valtical00.job 
11/02/13 12:51:49 (fd:3) (pid:28613) config: using subsystem 'SUBMIT', local ''
11/02/13 12:51:49 (fd:3) (pid:28613) OpSysMajorVersion:  6 
11/02/13 12:51:49 (fd:3) (pid:28613) OpSysShortName:  SLCern 
11/02/13 12:51:49 (fd:3) (pid:28613) OpSysLongName:  Scientific Linux CERN SLC release 6.4 (Carbon) 
11/02/13 12:51:49 (fd:3) (pid:28613) OpSysAndVer:  SLCern6 
11/02/13 12:51:49 (fd:3) (pid:28613) OpSysLegacy:  LINUX 
11/02/13 12:51:49 (fd:3) (pid:28613) OpSysName:  SLCern 
11/02/13 12:51:49 (fd:3) (pid:28613) OpSysVer:  604 
11/02/13 12:51:49 (fd:3) (pid:28613) OpSys:  LINUX 
11/02/13 12:51:49 (fd:3) (pid:28613) Reading from /proc/cpuinfo
11/02/13 12:51:49 (fd:3) (pid:28613) Found: Physical-IDs:True; Core-IDs:True
11/02/13 12:51:49 (fd:3) (pid:28613) Analyzing 16 processors using IDs...
11/02/13 12:51:49 (fd:3) (pid:28613) Looking at processor #0 (PID:0, CID:0):
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#0   and P#1  : pid:0!=0 or  cid:0!=1 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#0   and P#2  : pid:0!=0 or  cid:0!=2 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#0   and P#3  : pid:0!=0 or  cid:0!=3 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#0   and P#4  : pid:0!=1 or  cid:0!=0 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#0   and P#5  : pid:0!=1 or  cid:0!=1 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#0   and P#6  : pid:0!=1 or  cid:0!=2 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#0   and P#7  : pid:0!=1 or  cid:0!=3 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#0   and P#8  : pid:0==0 and cid:0==0 (match=2)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#0   and P#9  : pid:0!=0 or  cid:0!=1 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#0   and P#10 : pid:0!=0 or  cid:0!=2 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#0   and P#11 : pid:0!=0 or  cid:0!=3 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#0   and P#12 : pid:0!=1 or  cid:0!=0 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#0   and P#13 : pid:0!=1 or  cid:0!=1 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#0   and P#14 : pid:0!=1 or  cid:0!=2 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#0   and P#15 : pid:0!=1 or  cid:0!=3 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) ncpus = 1
11/02/13 12:51:49 (fd:3) (pid:28613) P0: match->2
11/02/13 12:51:49 (fd:3) (pid:28613) P8: match->2
11/02/13 12:51:49 (fd:3) (pid:28613) Looking at processor #1 (PID:0, CID:1):
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#1   and P#2  : pid:0!=0 or  cid:1!=2 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#1   and P#3  : pid:0!=0 or  cid:1!=3 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#1   and P#4  : pid:0!=1 or  cid:1!=0 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#1   and P#5  : pid:0!=1 or  cid:1!=1 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#1   and P#6  : pid:0!=1 or  cid:1!=2 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#1   and P#7  : pid:0!=1 or  cid:1!=3 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#1   and P#8  : pid:0!=0 or  cid:1!=0 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#1   and P#9  : pid:0==0 and cid:1==1 (match=2)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#1   and P#10 : pid:0!=0 or  cid:1!=2 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#1   and P#11 : pid:0!=0 or  cid:1!=3 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#1   and P#12 : pid:0!=1 or  cid:1!=0 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#1   and P#13 : pid:0!=1 or  cid:1!=1 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#1   and P#14 : pid:0!=1 or  cid:1!=2 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#1   and P#15 : pid:0!=1 or  cid:1!=3 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) ncpus = 2
11/02/13 12:51:49 (fd:3) (pid:28613) P1: match->2
11/02/13 12:51:49 (fd:3) (pid:28613) P9: match->2
11/02/13 12:51:49 (fd:3) (pid:28613) Looking at processor #2 (PID:0, CID:2):
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#2   and P#3  : pid:0!=0 or  cid:2!=3 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#2   and P#4  : pid:0!=1 or  cid:2!=0 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#2   and P#5  : pid:0!=1 or  cid:2!=1 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#2   and P#6  : pid:0!=1 or  cid:2!=2 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#2   and P#7  : pid:0!=1 or  cid:2!=3 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#2   and P#8  : pid:0!=0 or  cid:2!=0 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#2   and P#9  : pid:0!=0 or  cid:2!=1 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#2   and P#10 : pid:0==0 and cid:2==2 (match=2)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#2   and P#11 : pid:0!=0 or  cid:2!=3 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#2   and P#12 : pid:0!=1 or  cid:2!=0 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#2   and P#13 : pid:0!=1 or  cid:2!=1 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#2   and P#14 : pid:0!=1 or  cid:2!=2 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#2   and P#15 : pid:0!=1 or  cid:2!=3 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) ncpus = 3
11/02/13 12:51:49 (fd:3) (pid:28613) P2: match->2
11/02/13 12:51:49 (fd:3) (pid:28613) P10: match->2
11/02/13 12:51:49 (fd:3) (pid:28613) Looking at processor #3 (PID:0, CID:3):
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#3   and P#4  : pid:0!=1 or  cid:3!=0 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#3   and P#5  : pid:0!=1 or  cid:3!=1 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#3   and P#6  : pid:0!=1 or  cid:3!=2 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#3   and P#7  : pid:0!=1 or  cid:3!=3 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#3   and P#8  : pid:0!=0 or  cid:3!=0 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#3   and P#9  : pid:0!=0 or  cid:3!=1 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#3   and P#10 : pid:0!=0 or  cid:3!=2 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#3   and P#11 : pid:0==0 and cid:3==3 (match=2)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#3   and P#12 : pid:0!=1 or  cid:3!=0 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#3   and P#13 : pid:0!=1 or  cid:3!=1 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#3   and P#14 : pid:0!=1 or  cid:3!=2 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#3   and P#15 : pid:0!=1 or  cid:3!=3 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) ncpus = 4
11/02/13 12:51:49 (fd:3) (pid:28613) P3: match->2
11/02/13 12:51:49 (fd:3) (pid:28613) P11: match->2
11/02/13 12:51:49 (fd:3) (pid:28613) Looking at processor #4 (PID:1, CID:0):
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#4   and P#5  : pid:1!=1 or  cid:0!=1 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#4   and P#6  : pid:1!=1 or  cid:0!=2 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#4   and P#7  : pid:1!=1 or  cid:0!=3 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#4   and P#8  : pid:1!=0 or  cid:0!=0 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#4   and P#9  : pid:1!=0 or  cid:0!=1 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#4   and P#10 : pid:1!=0 or  cid:0!=2 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#4   and P#11 : pid:1!=0 or  cid:0!=3 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#4   and P#12 : pid:1==1 and cid:0==0 (match=2)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#4   and P#13 : pid:1!=1 or  cid:0!=1 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#4   and P#14 : pid:1!=1 or  cid:0!=2 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#4   and P#15 : pid:1!=1 or  cid:0!=3 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) ncpus = 5
11/02/13 12:51:49 (fd:3) (pid:28613) P4: match->2
11/02/13 12:51:49 (fd:3) (pid:28613) P12: match->2
11/02/13 12:51:49 (fd:3) (pid:28613) Looking at processor #5 (PID:1, CID:1):
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#5   and P#6  : pid:1!=1 or  cid:1!=2 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#5   and P#7  : pid:1!=1 or  cid:1!=3 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#5   and P#8  : pid:1!=0 or  cid:1!=0 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#5   and P#9  : pid:1!=0 or  cid:1!=1 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#5   and P#10 : pid:1!=0 or  cid:1!=2 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#5   and P#11 : pid:1!=0 or  cid:1!=3 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#5   and P#12 : pid:1!=1 or  cid:1!=0 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#5   and P#13 : pid:1==1 and cid:1==1 (match=2)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#5   and P#14 : pid:1!=1 or  cid:1!=2 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#5   and P#15 : pid:1!=1 or  cid:1!=3 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) ncpus = 6
11/02/13 12:51:49 (fd:3) (pid:28613) P5: match->2
11/02/13 12:51:49 (fd:3) (pid:28613) P13: match->2
11/02/13 12:51:49 (fd:3) (pid:28613) Looking at processor #6 (PID:1, CID:2):
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#6   and P#7  : pid:1!=1 or  cid:2!=3 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#6   and P#8  : pid:1!=0 or  cid:2!=0 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#6   and P#9  : pid:1!=0 or  cid:2!=1 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#6   and P#10 : pid:1!=0 or  cid:2!=2 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#6   and P#11 : pid:1!=0 or  cid:2!=3 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#6   and P#12 : pid:1!=1 or  cid:2!=0 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#6   and P#13 : pid:1!=1 or  cid:2!=1 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#6   and P#14 : pid:1==1 and cid:2==2 (match=2)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#6   and P#15 : pid:1!=1 or  cid:2!=3 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) ncpus = 7
11/02/13 12:51:49 (fd:3) (pid:28613) P6: match->2
11/02/13 12:51:49 (fd:3) (pid:28613) P14: match->2
11/02/13 12:51:49 (fd:3) (pid:28613) Looking at processor #7 (PID:1, CID:3):
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#7   and P#8  : pid:1!=0 or  cid:3!=0 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#7   and P#9  : pid:1!=0 or  cid:3!=1 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#7   and P#10 : pid:1!=0 or  cid:3!=2 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#7   and P#11 : pid:1!=0 or  cid:3!=3 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#7   and P#12 : pid:1!=1 or  cid:3!=0 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#7   and P#13 : pid:1!=1 or  cid:3!=1 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#7   and P#14 : pid:1!=1 or  cid:3!=2 (match=No)
11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#7   and P#15 : pid:1==1 and cid:3==3 (match=2)
11/02/13 12:51:49 (fd:3) (pid:28613) ncpus = 8
11/02/13 12:51:49 (fd:3) (pid:28613) P7: match->2
11/02/13 12:51:49 (fd:3) (pid:28613) P15: match->2
11/02/13 12:51:49 (fd:3) (pid:28613) Looking at processor #8 (PID:0, CID:0):
11/02/13 12:51:49 (fd:3) (pid:28613) Looking at processor #9 (PID:0, CID:1):
11/02/13 12:51:49 (fd:3) (pid:28613) Looking at processor #10 (PID:0, CID:2):
11/02/13 12:51:49 (fd:3) (pid:28613) Looking at processor #11 (PID:0, CID:3):
11/02/13 12:51:49 (fd:3) (pid:28613) Looking at processor #12 (PID:1, CID:0):
11/02/13 12:51:49 (fd:3) (pid:28613) Looking at processor #13 (PID:1, CID:1):
11/02/13 12:51:49 (fd:3) (pid:28613) Looking at processor #14 (PID:1, CID:2):
11/02/13 12:51:49 (fd:3) (pid:28613) Looking at processor #15 (PID:1, CID:3):
11/02/13 12:51:49 (fd:3) (pid:28613) Using IDs: 16 processors, 8 CPUs, 8 HTs
11/02/13 12:51:49 (fd:3) (pid:28613) Reading condor configuration from '/etc/condor/condor_config'
11/02/13 12:51:49 (fd:3) (pid:28613) condor_gethostname() claims we are valtical00.cern.ch
11/02/13 12:51:49 (fd:3) (pid:28613) NETWORK_INTERFACE=* matches lo 127.0.0.1, eth0 137.138.40.140, virbr0 192.168.122.1, choosing IP 137.138.40.140
11/02/13 12:51:49 (fd:3) (pid:28613) ENABLE_IPV6 is undefined, using default value of False
11/02/13 12:51:49 (fd:3) (pid:28613) Considering valtical00.cern.ch (Ranked at 3) as possible local hostname versus valtical00.cern.ch/ (0)
11/02/13 12:51:49 (fd:3) (pid:28613) Identifying myself as: Short:: valtical00, Long: valtical00.cern.ch, IP: 137.138.40.140
11/02/13 12:51:49 (fd:3) (pid:28613) Trying to getting network interface informations (after reading config)
11/02/13 12:51:49 (fd:3) (pid:28613) NETWORK_INTERFACE=* matches lo 127.0.0.1, eth0 137.138.40.140, virbr0 192.168.122.1, choosing IP 137.138.40.140
11/02/13 12:51:49 (fd:3) (pid:28613) condor_gethostname() claims we are valtical00.cern.ch
11/02/13 12:51:49 (fd:3) (pid:28613) NETWORK_INTERFACE=* matches lo 127.0.0.1, eth0 137.138.40.140, virbr0 192.168.122.1, choosing IP 137.138.40.140
11/02/13 12:51:49 (fd:3) (pid:28613) Considering valtical00.cern.ch (Ranked at 3) as possible local hostname versus valtical00.cern.ch/valtical00.cern.ch (0)
11/02/13 12:51:49 (fd:3) (pid:28613) Identifying myself as: Short:: valtical00, Long: valtical00.cern.ch, IP: 137.138.40.140
11/02/13 12:51:49 (fd:3) (pid:28613) CONDOR_FSYNC is undefined, using default value of True
11/02/13 12:51:49 (fd:3) (pid:28613) WARN_ON_UNUSED_SUBMIT_FILE_MACROS is undefined, using default value of True
11/02/13 12:51:49 (fd:3) (pid:28613) TOOL_LOG_KEEP_OPEN is undefined, using default value of True
11/02/13 12:51:49 (fd:3) (pid:28613) SUBMIT_SKIP_FILECHECKS is undefined, using default value of False
11/02/13 12:51:49 (fd:3) (pid:28613) SUBMIT_MAX_PROCS_IN_CLUSTER is undefined, using default value of 0
11/02/13 12:51:49 (fd:3) (pid:28613) KEYCACHE: created: 0x2684c70
11/02/13 12:51:49 (fd:3) (pid:28613) TIMEOUT_MULTIPLIER is undefined, using default value of 0
11/02/13 12:51:49 (fd:3) (pid:28613) SUBMIT_TIMEOUT_MULTIPLIER is undefined, using default value of 0
11/02/13 12:51:49 (fd:3) (pid:28613) *** TIMEOUT_MULTIPLIER :: 0
11/02/13 12:51:49 (fd:3) (pid:28613) New Daemon obj (schedd) name: "NULL", pool: "NULL", addr: "NULL"
11/02/13 12:51:49 (fd:3) (pid:28613) Neither name nor addr specified, using local values - name: "valtical00.cern.ch", full host: "valtical00.cern.ch"
11/02/13 12:51:49 (fd:3) (pid:28613) Finding classad for local daemon, SCHEDD_DAEMON_AD_FILE is "/var/lib/condor/spool/.schedd_classad"
11/02/13 12:51:49 (fd:4) (pid:28613) STRICT_CLASSAD_EVALUATION is undefined, using default value of False
11/02/13 12:51:49 (fd:3) (pid:28613) Found Name in ClassAd, using "valtical00.cern.ch"
11/02/13 12:51:49 (fd:3) (pid:28613) Found SCHEDDIpAddr in ClassAd, using "<137.138.40.140:39738>"
11/02/13 12:51:49 (fd:3) (pid:28613) Found CondorVersion in ClassAd, using "$CondorVersion: 7.8.8 Jun 17 2013 $"
11/02/13 12:51:49 (fd:3) (pid:28613) Found CondorPlatform in ClassAd, using "$CondorPlatform: X86_64-CentOS_6.4 $"
11/02/13 12:51:49 (fd:3) (pid:28613) Found Machine in ClassAd, using "valtical00.cern.ch"
11/02/13 12:51:49 (fd:3) (pid:28613) validate <137.138.40.140:39738>
11/02/13 12:51:49 (fd:3) (pid:28613) success
11/02/13 12:51:49 (fd:3) (pid:28613) Using port 39738 based on address "<137.138.40.140:39738>"
Submitting job(s)11/02/13 12:51:49 (fd:4) (pid:28613) TIMEOUT_MULTIPLIER is undefined, using default value of 0
11/02/13 12:51:49 (fd:4) (pid:28613) SUBMIT_TIMEOUT_MULTIPLIER is undefined, using default value of 0
11/02/13 12:51:49 (fd:4) (pid:28613) *** TIMEOUT_MULTIPLIER :: 0
11/02/13 12:51:49 (fd:4) (pid:28613) validate <137.138.40.140:39738>
11/02/13 12:51:49 (fd:4) (pid:28613) success
11/02/13 12:51:49 (fd:4) (pid:28613) New Daemon obj (schedd) name: "NULL", pool: "NULL", addr: "<137.138.40.140:39738>"
11/02/13 12:51:49 (fd:4) (pid:28613) validate <137.138.40.140:39738>
11/02/13 12:51:49 (fd:4) (pid:28613) success
11/02/13 12:51:49 (fd:4) (pid:28613) Already have address, no info to locate
11/02/13 12:51:49 (fd:4) (pid:28613) validate <137.138.40.140:39738>
11/02/13 12:51:49 (fd:4) (pid:28613) success
11/02/13 12:51:49 (fd:4) (pid:28613) Using port 39738 based on address "<137.138.40.140:39738>"
11/02/13 12:51:49 (fd:4) (pid:28613) Guess address string for host = <137.138.40.140:39738>, port = 0
11/02/13 12:51:49 (fd:4) (pid:28613) it was sinful string. ip = 137.138.40.140, port = 39738
11/02/13 12:51:49 (fd:5) (pid:28613) OUT_LOWPORT is undefined, using default value of 0
11/02/13 12:51:49 (fd:5) (pid:28613) LOWPORT is undefined, using default value of 0
11/02/13 12:51:49 (fd:5) (pid:28613) CONNECT bound to <137.138.40.140:46124> fd=4 peer=<137.138.40.140:39738>
11/02/13 12:51:49 (fd:5) (pid:28613) SECMAN: command 1112 QMGMT_WRITE_CMD to schedd at <137.138.40.140:39738> from TCP port 46124 (blocking).
11/02/13 12:51:49 (fd:5) (pid:28613) SEC_SUBMIT_CLIENT_SESSION_DURATION is undefined, using default value of 0
11/02/13 12:51:49 (fd:5) (pid:28613) SEC_SUBMIT_DEFAULT_SESSION_DURATION is undefined, using default value of 0
11/02/13 12:51:49 (fd:5) (pid:28613) SEC_CLIENT_SESSION_DURATION is undefined, using default value of 0
11/02/13 12:51:49 (fd:5) (pid:28613) SEC_DEFAULT_SESSION_DURATION is undefined, using default value of 0
11/02/13 12:51:49 (fd:5) (pid:28613) SEC_CLIENT_SESSION_LEASE is undefined, using default value of 0
11/02/13 12:51:49 (fd:5) (pid:28613) SEC_DEFAULT_SESSION_LEASE is undefined, using default value of 0
11/02/13 12:51:49 (fd:5) (pid:28613) SECMAN: no cached key for {<137.138.40.140:39738>,<1112>}.
11/02/13 12:51:49 (fd:5) (pid:28613) SECMAN: Security Policy:
AuthMethods = "FS,KERBEROS,GSI"
SessionDuration = "60"
Authentication = "NEVER"
Enact = "NO"
Subsystem = "SUBMIT"
Integrity = "NEVER"
NewSession = "YES"
CryptoMethods = "3DES,BLOWFISH"
OutgoingNegotiation = "PREFERRED"
Encryption = "NEVER"
CurrentTime = time()
SessionLease = 3600
ServerPid = 28613
11/02/13 12:51:49 (fd:5) (pid:28613) SECMAN: negotiating security for command 1112.
11/02/13 12:51:49 (fd:5) (pid:28613) SECMAN: sending DC_AUTHENTICATE command
11/02/13 12:51:49 (fd:5) (pid:28613) SECMAN: sending following classad:
Command = 1112
AuthMethods = "FS,KERBEROS,GSI"
SessionDuration = "60"
Authentication = "NEVER"
Enact = "NO"
Subsystem = "SUBMIT"
Integrity = "NEVER"
RemoteVersion = "$CondorVersion: 7.8.8 Jun 17 2013 $"
NewSession = "YES"
CryptoMethods = "3DES,BLOWFISH"
OutgoingNegotiation = "PREFERRED"
Encryption = "NEVER"
CurrentTime = time()
SessionLease = 3600
ServerPid = 28613
11/02/13 12:51:49 (fd:5) (pid:28613) selector 0x7fff9f3ca7e0 resetting
11/02/13 12:51:49 (fd:5) (pid:28613) condor_write(fd=4 schedd at <137.138.40.140:39738>,,size=370,timeout=0,flags=0)
11/02/13 12:51:49 (fd:5) (pid:28613) selector 0x7fff9f3ca7e0 adding fd 4 (socket:[1039394])
11/02/13 12:51:49 (fd:5) (pid:28613) selector 0x7fff9f3ca7e0 adding fd 4 (socket:[1039394])
11/02/13 12:51:49 (fd:5) (pid:28613) selector 0x7fff9f3ca7e0 adding fd 4 (socket:[1039394])
11/02/13 12:51:49 (fd:5) (pid:28613) selector 0x7fff9f3ca6d0 resetting
11/02/13 12:51:49 (fd:5) (pid:28613) condor_read(fd=4 schedd at <137.138.40.140:39738>,,size=5,timeout=0,flags=0)
11/02/13 12:51:49 (fd:5) (pid:28613) selector 0x7fff9f3ca6d0 adding fd 4 (socket:[1039394])
11/02/13 12:51:49 (fd:5) (pid:28613) condor_read(): Socket closed when trying to read 5 bytes from schedd at <137.138.40.140:39738>
11/02/13 12:51:49 (fd:5) (pid:28613) IO: EOF reading packet header
11/02/13 12:51:49 (fd:5) (pid:28613) Stream::get(int) failed to read padding
11/02/13 12:51:49 (fd:5) (pid:28613) SECMAN: no classad from server, failing
11/02/13 12:51:49 (fd:5) (pid:28613) CLOSE <137.138.40.140:46124> fd=4
11/02/13 12:51:49 (fd:4) (pid:28613) Destroying Daemon object:
11/02/13 12:51:49 (fd:4) (pid:28613) Type: 3 (schedd), Name: (null), Addr: <137.138.40.140:39738>
11/02/13 12:51:49 (fd:4) (pid:28613) FullHost: (null), Host: (null), Pool: (null), Port: 39738
11/02/13 12:51:49 (fd:4) (pid:28613) IsLocal: N, IdStr: schedd at <137.138.40.140:39738>, Error: (null)
11/02/13 12:51:49 (fd:4) (pid:28613)  --- End of Daemon object info ---

ERROR: Failed to connect to local queue manager
SECMAN:2007:Failed to end classad message.
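
To make the relevant part of the output easier to spot, here is a minimal Python sketch that pulls the security attributes out of the ClassAd the client sent. The excerpt is copied from the output above; the function name sent_classad and the parsing rule (stop at the first line that is not 'Name = value') are assumptions for illustration, not part of any HTCondor tool.

```python
import re

# Excerpt copied from the condor_submit -debug output above.
DEBUG_OUTPUT = '''\
11/02/13 12:51:49 (fd:5) (pid:28613) SECMAN: sending following classad:
Command = 1112
AuthMethods = "FS,KERBEROS,GSI"
Authentication = "NEVER"
Integrity = "NEVER"
Encryption = "NEVER"
11/02/13 12:51:49 (fd:5) (pid:28613) selector 0x7fff9f3ca7e0 resetting
'''

# A ClassAd attribute line as printed in the log: Name = value
ATTR = re.compile(r'^(\w+)\s*=\s*(.*)$')


def sent_classad(text):
    """Return the attributes listed after 'sending following classad:'.

    Hypothetical helper: it only understands the plain 'Name = value'
    layout seen in this log, not general ClassAd syntax.
    """
    attrs, inside = {}, False
    for line in text.splitlines():
        if "SECMAN: sending following classad:" in line:
            inside = True
            continue
        if inside:
            match = ATTR.match(line)
            if not match:
                break  # a timestamped log line ends the ad
            attrs[match.group(1)] = match.group(2).strip('"')
    return attrs


policy = sent_classad(DEBUG_OUTPUT)
print(policy["Authentication"], policy["Integrity"], policy["Encryption"])
```

In this run the client offers Authentication, Integrity and Encryption all set to "NEVER", and the schedd closes the socket right after receiving that ad.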


Cheers,
Gang


________________________________________
From: condor-users-bounces@xxxxxxxxxxx [condor-users-bounces@xxxxxxxxxxx] on behalf of Zachary Miller [zmiller@xxxxxxxxxxx]
Sent: 31 August 2012 01:14
To: Condor-Users Mail List
Subject: Re: [Condor-users] how to fix 'DC_AUTHENTICATE: Unable to reconcile!'?

On Thu, Aug 30, 2012 at 10:05:39PM +0000, Gang Qin wrote:
> Dear expert:
>
>   Today I try to add a new machine as condor submitter, after adding 'SCHEDD'
> to the DAEMON_LIST and restarting the condor service, condor_q and
> condor_status could work. But when I try to submit a job, it fails with the
> following error:
>
>   ERROR: Failed to connect to local queue manager
> SECMAN:2007:Failed to end classad message.
>
>   And in SchedLog , I see the following error message:
>
> 08/30/12 23:54:33 (pid:12036) DC_AUTHENTICATE: Unable to reconcile!

this means the security policy can't be agreed on by the client and server.
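
As a rough illustration of what "can't be agreed on" means, the following Python sketch models how one security feature (authentication, integrity or encryption) is negotiated from the client's and the server's settings. It follows the negotiation table in the HTCondor manual as I understand it; the function name reconcile and the use of None for "no agreement" are assumptions made for this example, not HTCondor code.

```python
LEVELS = ("NEVER", "OPTIONAL", "PREFERRED", "REQUIRED")


def reconcile(client, server):
    """Negotiate one security feature between client and server.

    Returns True if the feature is used, False if it is not,
    and None if the two settings cannot be reconciled.
    (Hypothetical helper modelled on the documented table.)
    """
    for level in (client, server):
        if level not in LEVELS:
            raise ValueError("unknown security level: %r" % (level,))
    pair = {client, server}
    if pair == {"REQUIRED", "NEVER"}:
        return None   # one side insists, the other refuses
    if "REQUIRED" in pair:
        return True   # the other side is at least willing
    if "NEVER" in pair:
        return False  # the other side does not insist
    # Only OPTIONAL and PREFERRED remain: used if either side prefers it.
    return "PREFERRED" in pair


# A client that offers Authentication = "NEVER" cannot talk to a
# server that requires authentication for the command.
print(reconcile("NEVER", "REQUIRED"))
print(reconcile("OPTIONAL", "REQUIRED"))
```

Under this model, a client sending "NEVER" to a daemon configured with "REQUIRED" is the only combination that fails outright, which is one plausible way to end up with the "Unable to reconcile!" message. The daemon's actual setting is not shown in this thread, so that part is a guess.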

you can get more info by setting
  TOOL_DEBUG = D_ALL
  SUBMIT_DEBUG = D_ALL

in the condor config file, and then running:
  condor_submit -debug <your submit file>

if you want to send me the output of that (offlist is fine) i'll see if i can
find the problem.


cheers,
-zach


Attachment: condor.tar.gz
Description: condor.tar.gz