[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Condor-users] SCHEDD restarting



Hi, 

if I restart my server, or stop condor running and then try to start it using 

Condor_master

all Proceses apart from SCHEDD start up. 

looking in Masterlog is see

04/27 18:37:50 restarting /usr/sbin/condor_schedd in 3600 seconds
04/27 19:37:50 Started DaemonCore process "/usr/sbin/condor_schedd", pid and pgroup = 3334
04/27 19:37:50 The SCHEDD (pid 3334) exited with status 4
04/27 19:37:50 restarting /usr/sbin/condor_schedd in 3600 seconds
04/27 20:37:50 Started DaemonCore process "/usr/sbin/condor_schedd", pid and pgroup = 3764
04/27 20:37:50 The SCHEDD (pid 3764) exited with status 4
04/27 20:37:50 restarting /usr/sbin/condor_schedd in 3600 seconds
04/27 21:37:50 Started DaemonCore process "/usr/sbin/condor_schedd", pid and pgroup = 3982
04/27 21:37:50 The SCHEDD (pid 3982) exited with status 4
04/27 21:37:50 restarting /usr/sbin/condor_schedd in 3600 seconds


the schdlog shows 

4/27 21:37:50 (pid:3982) ******************************************************
04/27 21:37:50 (pid:3982) ** condor_schedd (CONDOR_SCHEDD) STARTING UP
04/27 21:37:50 (pid:3982) ** /usr/sbin/condor_schedd
04/27 21:37:50 (pid:3982) ** SubsystemInfo: name=SCHEDD type=SCHEDD(5) class=DAEMON(1)
04/27 21:37:50 (pid:3982) ** Configuration: subsystem:SCHEDD local:<NONE> class:DAEMON
04/27 21:37:50 (pid:3982) ** $CondorVersion: 7.4.2 Mar 29 2010 BuildID: 227044 $
04/27 21:37:50 (pid:3982) ** $CondorPlatform: I386-LINUX_DEBIAN50 $
04/27 21:37:50 (pid:3982) ** PID = 3982
04/27 21:37:50 (pid:3982) ** Log last touched 4/27 20:37:50
04/27 21:37:50 (pid:3982) ******************************************************
04/27 21:37:50 (pid:3982) Using config source: /etc/condor/condor_config
04/27 21:37:50 (pid:3982) Using local config sources:
04/27 21:37:50 (pid:3982)    /etc/condor/condor_config.local
04/27 21:37:50 (pid:3982) DaemonCore: Command Socket at <127.0.1.1:53483>
04/27 21:37:50 (pid:3982) error opening watchdog pipe /var/run/condor/procd_pipe.SCHEDD.watchdog: No such file or directory (2)
04/27 21:37:50 (pid:3982) ProcFamilyClient: error initializing LocalClient
04/27 21:37:50 (pid:3982) ProcFamilyProxy: error initializing ProcFamilyClient
04/27 21:37:50 (pid:3982) ERROR "ProcD has failed" at line 599 in file proc_family_proxy.cpp

does any one know what status 4 means, or what is going on? 

strangely if I reinstall condor then it starts ok? 

For some reason I just cant get this to start. is there some thing else i need to start running first?

Cheers





Get a new e-mail account with Hotmail - Free. Sign-up now.