
Re: [Condor-users] SCHEDD restarting




To debug this, it may help to turn on logging in the procd.  Example:

PROCD_LOG = $(LOG)/ProcLog

Adding more debug verbosity may also help:

SCHEDD_DEBUG = $(SCHEDD_DEBUG) D_PROCFAMILY D_FULLDEBUG
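
Once the extra logging is in place, it can be handy to pull the daemon exit lines out of the MasterLog to see how often the schedd dies and with what status. What follows is only a minimal Python sketch, not part of Condor: the function name find_daemon_exits and the regular expression are assumptions made for illustration, modelled on the "exited with status" lines quoted below.

```python
import re

# Matches MasterLog lines of the form quoted in this thread, e.g.
#   04/27 19:37:50 The SCHEDD (pid 3334) exited with status 4
# The pattern is an assumption based on those quoted lines only.
EXIT_RE = re.compile(
    r"^(?P<stamp>\d{2}/\d{2} \d{2}:\d{2}:\d{2}) "
    r"The (?P<daemon>\w+) \(pid (?P<pid>\d+)\) "
    r"exited with status (?P<status>\d+)"
)


def find_daemon_exits(lines):
    """Return (timestamp, daemon, pid, status) for each daemon-exit line."""
    exits = []
    for line in lines:
        match = EXIT_RE.match(line.strip())
        if match:
            exits.append(
                (
                    match.group("stamp"),
                    match.group("daemon"),
                    int(match.group("pid")),
                    int(match.group("status")),
                )
            )
    return exits


# Sample input copied from the MasterLog excerpt in the quoted message.
sample = [
    '04/27 19:37:50 Started DaemonCore process "/usr/sbin/condor_schedd", pid and pgroup = 3334',
    "04/27 19:37:50 The SCHEDD (pid 3334) exited with status 4",
    "04/27 19:37:50 restarting /usr/sbin/condor_schedd in 3600 seconds",
]

for stamp, daemon, pid, status in find_daemon_exits(sample):
    print(stamp, daemon, pid, status)
```

To run it against a real log, pass an open file instead of the sample list, e.g. find_daemon_exits(open(path)), where path is wherever your MasterLog lives.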

--Dan

Dave STREET wrote:
Hi, if I restart my server, or stop Condor and then try to start it using
condor_master

all processes apart from the SCHEDD start up.
Looking in the MasterLog I see

04/27 18:37:50 restarting /usr/sbin/condor_schedd in 3600 seconds
04/27 19:37:50 Started DaemonCore process "/usr/sbin/condor_schedd", pid and pgroup = 3334
04/27 19:37:50 The SCHEDD (pid 3334) exited with status 4
04/27 19:37:50 restarting /usr/sbin/condor_schedd in 3600 seconds
04/27 20:37:50 Started DaemonCore process "/usr/sbin/condor_schedd", pid and pgroup = 3764
04/27 20:37:50 The SCHEDD (pid 3764) exited with status 4
04/27 20:37:50 restarting /usr/sbin/condor_schedd in 3600 seconds
04/27 21:37:50 Started DaemonCore process "/usr/sbin/condor_schedd", pid and pgroup = 3982
04/27 21:37:50 The SCHEDD (pid 3982) exited with status 4
04/27 21:37:50 restarting /usr/sbin/condor_schedd in 3600 seconds


The SchedLog shows:

04/27 21:37:50 (pid:3982) ******************************************************
04/27 21:37:50 (pid:3982) ** condor_schedd (CONDOR_SCHEDD) STARTING UP
04/27 21:37:50 (pid:3982) ** /usr/sbin/condor_schedd
04/27 21:37:50 (pid:3982) ** SubsystemInfo: name=SCHEDD type=SCHEDD(5) class=DAEMON(1)
04/27 21:37:50 (pid:3982) ** Configuration: subsystem:SCHEDD local:<NONE> class:DAEMON
04/27 21:37:50 (pid:3982) ** $CondorVersion: 7.4.2 Mar 29 2010 BuildID: 227044 $
04/27 21:37:50 (pid:3982) ** $CondorPlatform: I386-LINUX_DEBIAN50 $
04/27 21:37:50 (pid:3982) ** PID = 3982
04/27 21:37:50 (pid:3982) ** Log last touched 4/27 20:37:50
04/27 21:37:50 (pid:3982) ******************************************************
04/27 21:37:50 (pid:3982) Using config source: /etc/condor/condor_config
04/27 21:37:50 (pid:3982) Using local config sources:
04/27 21:37:50 (pid:3982)    /etc/condor/condor_config.local
04/27 21:37:50 (pid:3982) DaemonCore: Command Socket at <127.0.1.1:53483>
04/27 21:37:50 (pid:3982) error opening watchdog pipe /var/run/condor/procd_pipe.SCHEDD.watchdog: No such file or directory (2)
04/27 21:37:50 (pid:3982) ProcFamilyClient: error initializing LocalClient
04/27 21:37:50 (pid:3982) ProcFamilyProxy: error initializing ProcFamilyClient
04/27 21:37:50 (pid:3982) ERROR "ProcD has failed" at line 599 in file proc_family_proxy.cpp

Does anyone know what status 4 means, or what is going on? Strangely, if I reinstall Condor then it starts OK. Otherwise I just can't get the SCHEDD to start. Is there something else I need to start running first?

Cheers





_______________________________________________
Condor-users mailing list
To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/condor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/condor-users/