[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] Strange things about my configuration, and the reality...



On Oct 11, 2005, at 7:40 AM, Nicolas GUIOT wrote:

I want to have condor on 4 machines :
io : the central manager only
vrubel, goya : execute machine only
chagall : submit machine only.

in the local config files, I setup the values to "TESTINGMODE_*" (I wanted to set it up in the main config file, but it seems it didn't take it into account...., and here what looks like a local config file for vrubel or goya (execute) :

# Start only as EXECUTE machine
DAEMON_LIST = MASTER, STARTD

##  What machines have administrative rights for your pool?  This
##  defaults to your central manager.  You should set it to the
##  machine(s) where whoever is the condor administrator(s) works
##  (assuming you trust all the users who log into that/those
##  machine(s), since this is machine-wide access you're granting).
HOSTALLOW_ADMINISTRATOR = $(CONDOR_HOST), localhost


##################################################################### ## This is a Configuration that will cause your Condor jobs to ## always run. This is intended for testing only. ######################################################################

##  This mode will cause your jobs to start on a machine an will let
##  them run to completion.  Condor will ignore all of what is going
##  on in the machine (load average, keyboard activity, etc.)

TESTINGMODE_WANT_SUSPEND        = False
TESTINGMODE_WANT_VACATE         = False
TESTINGMODE_START               = True
TESTINGMODE_SUSPEND             = False
TESTINGMODE_CONTINUE            = True
TESTINGMODE_PREEMPT             = False
TESTINGMODE_KILL                = False
TESTINGMODE_PERIODIC_CHECKPOINT = False
TESTINGMODE_PREEMPTION_REQUIREMENTS = False
TESTINGMODE_PREEMPTION_RANK = 0


######################################################################

_______end of local config_file____________

I still have the problem that the "schedd" daemon is running.
So, I tried to "condor_restart", but it doesn' want, neither with "root" nor with "condor" user :


root@vrubel:~# tail /scratch/condor/log/MasterLog
10/11 14:13:31 ** $CondorVersion: 6.7.10 Aug 3 2005 $
10/11 14:13:31 ** $CondorPlatform: I386-LINUX_RH9 $
10/11 14:13:31 ** PID = 21382
10/11 14:13:31 ******************************************************
10/11 14:13:31 Using config file: /ibpc/io/condor/etc/condor_config
10/11 14:13:31 Using local config files: /scratch/condor/ condor_config.local
10/11 14:13:31 DaemonCore: Command Socket at <193.49.27.11:33429>
10/11 14:13:31 Started DaemonCore process "/ibpc/io/condor/sbin/ condor_startd", pid and pgroup = 21383
10/11 14:13:31 Started DaemonCore process "/ibpc/io/condor/sbin/ condor_schedd", pid and pgroup = 21384
10/11 14:14:14 DaemonCore: PERMISSION DENIED to unknown user from host <193.49.27.11:33442> for command 453 (RESTART)
root@vrubel:~#


How can I solve both problems ? (permission denied, and daemons which run)

You have to issue condor_restart from a machine listed in HOSTALLOW_ADMINISTRATOR in the config file. By default, only the central manager is included. Since anyone on listed machine can issue these commands, you want limit the list to machines with restricted logins (or set up strong, user-based authentication). You can also just kill and restart the master.


Your change to DAEMON_LIST should prevent the schedd from being started, but the master won't automatically stop an already-running schedd. If the schedd is still being started when you restart the master, run 'condor_config_val -v DAEMON_LIST' on the machine. That will tell you what value of DAEMON_LIST is being used and where it's being defined.

+----------------------------------+---------------------------------+
|            Jaime Frey            |  Public Split on Whether        |
|        jfrey@xxxxxxxxxxx         |  Bush Is a Divider              |
|  http://www.cs.wisc.edu/~jfrey/  |         -- CNN Scrolling Banner |
+----------------------------------+---------------------------------+