[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Condor-users] condor_read()&condor_write() problem



There are some problems with my condor, and I need your help!
 
Following is the MasterLog in the Central Manager:
 
9/4 13:54:22 ******************************************************
9/4 13:54:22 ** condor_master (CONDOR_MASTER) STARTING UP
9/4 13:54:22 ** /usr/local/condor/sbin/condor_master
9/4 13:54:22 ** $CondorVersion: 6.8.0 Jul 19 2006 $
9/4 13:54:22 ** $CondorPlatform: I386-LINUX_RHEL3 $
9/4 13:54:22 ** PID = 2164
9/4 13:54:22 ** Log last touched 9/4 13:50:31
9/4 13:54:22 ******************************************************
9/4 13:54:22 Using config source: /home/condor/condor_config
9/4 13:54:22 Using local config sources:
9/4 13:54:22    /home/condor/condor_config.local
9/4 13:54:22 DaemonCore: Command Socket at < 219.239.227.121:32770>
9/4 13:54:22 Started DaemonCore process "/usr/local/condor/sbin/condor_collector", pid and pgroup = 2165
9/4 13:54:22 Started DaemonCore process "/usr/local/condor/sbin/condor_negotiator", pid and pgroup = 2166
9/4 13:54:22 Started DaemonCore process "/usr/local/condor/sbin/condor_startd", pid and pgroup = 2167
9/4 13:54:22 Started DaemonCore process "/usr/local/condor/sbin/condor_schedd", pid and pgroup = 2168
9/4 13:54:47 condor_read(): timeout reading buffer.
9/4 14:54:22 Preen pid is 3027
9/4 14:54:22 Child 3027 died, but not a daemon -- Ignored
9/5 14:54:22 Preen pid is 5757
9/5 14:54:22 Child 5757 died, but not a daemon -- Ignored

-----------------------------------------------------------------------------------------
Following is the SchedLog in the Central Manager
 
9/4 13:54:24 (pid:2168) ******************************************************
9/4 13:54:24 (pid:2168) ** condor_schedd (CONDOR_SCHEDD) STARTING UP
9/4 13:54:24 (pid:2168) ** /usr/local/condor/sbin/condor_schedd
9/4 13:54:24 (pid:2168) ** $CondorVersion: 6.8.0 Jul 19 2006 $
9/4 13:54:24 (pid:2168) ** $CondorPlatform: I386-LINUX_RHEL3 $
9/4 13:54:24 (pid:2168) ** PID = 2168
9/4 13:54:24 (pid:2168) ** Log last touched 9/4 13:50:31
9/4 13:54:24 (pid:2168) ******************************************************
9/4 13:54:24 (pid:2168) Using config source: /home/condor/condor_config
9/4 13:54:24 (pid:2168) Using local config sources:
9/4 13:54:24 (pid:2168)    /home/condor/condor_config.local
9/4 13:54:24 (pid:2168) DaemonCore: Command Socket at <219.239.227.121:32773>
9/4 13:54:24 (pid:2168) History file rotation is enabled.
9/4 13:54:24 (pid:2168)   Maximum history file size is: 20971520 bytes
9/4 13:54:24 (pid:2168)   Number of rotated history files is: 2
9/4 13:54:46 (pid:2168) condor_read(): timeout reading buffer.
-------------------------------------------------------------------------------------------------------
Following is the MasterLog in the Condor Machine
 
9/5 23:52:40 ******************************************************
9/5 23:52:40 ** condor_master (CONDOR_MASTER) STARTING UP
9/5 23:52:40 ** /usr/local/condor/sbin/condor_master
9/5 23:52:40 ** $CondorVersion: 6.8.0 Jul 19 2006 $
9/5 23:52:40 ** $CondorPlatform: I386-LINUX_RHEL3 $
9/5 23:52:40 ** PID = 1845
9/5 23:52:40 ** Log last touched 9/5 23:42:45
9/5 23:52:40 ******************************************************
9/5 23:52:40 Using config source: /home/condor/condor_config
9/5 23:52:40 Using local config sources:
9/5 23:52:40    /home/condor/condor_config.local
9/5 23:52:40 DaemonCore: Command Socket at < 211.71.7.172:32771>
9/5 23:52:40 Started DaemonCore process "/usr/local/condor/sbin/condor_startd", pid and pgroup = 1846
9/5 23:52:40 Started DaemonCore process "/usr/local/condor/sbin/condor_schedd", pid and pgroup = 1847
9/5 23:52:47 attempt to connect to <211.71.7.172:32780> failed
9/5 23:52:48 condor_write(): Socket closed when trying to write buffer, fd is 9, errno=107
9/5 23:52:48 Buf::write(): condor_write() failed
9/5 23:52:48 SECMAN: failed to end classad message
9/5 23:52:48 ERROR: SECMAN:2004:Failed to start a session to <211.71.7.172:32770> with TCP|SECMAN:2007:Failed to end classad message
9/5 23:52:48 Failed to start non-blocking update to <211.71.7.172:32770>.
 
-----------------------------------------------------------------------------------------------------
Following is the SchedLog in the Condor Machine
 
9/5 23:52:40 (pid:1847) ******************************************************
9/5 23:52:40 (pid:1847) ** condor_schedd (CONDOR_SCHEDD) STARTING UP
9/5 23:52:40 (pid:1847) ** /usr/local/condor/sbin/condor_schedd
9/5 23:52:40 (pid:1847) ** $CondorVersion: 6.8.0 Jul 19 2006 $
9/5 23:52:40 (pid:1847) ** $CondorPlatform: I386-LINUX_RHEL3 $
9/5 23:52:40 (pid:1847) ** PID = 1847
9/5 23:52:40 (pid:1847) ** Log last touched 9/5 23:42:45
9/5 23:52:40 (pid:1847) ******************************************************
9/5 23:52:40 (pid:1847) Using config source: /home/condor/condor_config
9/5 23:52:40 (pid:1847) Using local config sources:
9/5 23:52:40 (pid:1847)    /home/condor/condor_config.local
9/5 23:52:40 (pid:1847) DaemonCore: Command Socket at <211.71.7.172:32773>
9/5 23:52:40 (pid:1847) History file rotation is enabled.
9/5 23:52:40 (pid:1847)   Maximum history file size is: 20971520 bytes
9/5 23:52:40 (pid:1847)   Number of rotated history files is: 2
9/5 23:52:43 (pid:1847) attempt to connect to <211.71.7.172:32775> failed
9/5 23:52:44 (pid:1847) condor_write(): Socket closed when trying to write buffer, fd is 11, errno=107
9/5 23:52:44 (pid:1847) Buf::write(): condor_write() failed
9/5 23:52:44 (pid:1847) SECMAN: failed to end classad message
9/5 23:52:44 (pid:1847) ERROR: SECMAN:2004:Failed to start a session to < 211.71.7.172:32769> with TCP|SECMAN:2007:Failed to end classad message
9/5 23:52:44 (pid:1847) Failed to start non-blocking update to <211.71.7.172:32769>.

---------------------------------------------------------
Dongbo Yang
16 Floor, Shining Building
BeiHang University
37 Xueyuan RD.
Haidian District
Bejing, PR China
100083