[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] how to ask an execute machine "stop after this job" ?



I tried that, and also "condor_config_val -rset START=False", and it gives me the following in the MasterLog : 

5/15 17:52:51 DaemonCore: PERMISSION DENIED to unknown user from host <172.27.1.7:35148> for command 484 (DAEMONS_OFF_PEACEFUL) 
5/15 17:57:30 DaemonCore: Command received via TCP from host <172.27.1.7:35172> 
5/15 17:57:30 DaemonCore: received command 60002 (DC_CONFIG_PERSIST), calling handler (handle_config()) 
5/15 17:57:30 WARNING: Someone at 172.27.1.7 is trying to modify "START" 
5/15 17:57:30 WARNING: Potential security problem, request refused 
5/15 17:58:42 DaemonCore: Command received via TCP from host <172.27.1.7:35173> 
5/15 17:58:42 DaemonCore: received command 60003 (DC_CONFIG_RUNTIME), calling handler (handle_config()) 
5/15 17:58:42 WARNING: Someone at 172.27.1.7 is trying to modify "START" 
5/15 17:58:42 WARNING: Potential security problem, request refused 
5/15 17:59:54 DaemonCore: Command received via TCP from host <172.27.1.7:35176> 
5/15 17:59:54 DaemonCore: received command 60003 (DC_CONFIG_RUNTIME), calling handler (handle_config()) 
5/15 17:59:54 WARNING: Someone at 172.27.1.7 is trying to modify "START" 
5/15 17:59:54 WARNING: Potential security problem, request refused 
5/15 18:01:44 DaemonCore: PERMISSION DENIED to unknown user from host <172.27.1.7:35194> for command 484 (DAEMONS_OFF_PEACEFUL) 
5/15 18:02:58 DaemonCore: PERMISSION DENIED to unknown user from host <172.27.1.7:35201> for command 483 (DAEMON_OFF_PEACEFUL)

I tried as root, condor, from the central manager or on the localhost, although I have this (io is the central manager) : 

$ condor_config_val -verbose HOSTALLOW_ADMINISTRATOR
HOSTALLOW_ADMINISTRATOR: io.lbt.ibpc.fr, localhost
  Defined in '/scratch/condor/condor_config.local', line 9.

Any idea ?

Nicolas

----------------
On Tue, 15 May 2007 11:49:33 -0400
Ian Chesal wrote:

> On 5/15/07, Nicolas GUIOT  wrote:
> >
> > Hi,
> >
> > I need to make some maintenance on some execute machines.
> > How can I tell them "Finish the job you are running right now, but don't
> > start any new one (until I authorize it again...)" ?
> 
> 
> condor_off -peaceful -subst startd -name <hostname>
> 
> See: http://www.cs.wisc.edu/condor/manual/v6.8/condor_off.html#53615
> 
> I can't recall if that prevents new jobs from running on other VMs on the
> machine if one VM takes a particularly long time for it's job to finish. It
> should. Seems like a bug if that's not what it does.
> 
> - Ian

----------

----------------------------------------------------
CNRS - UPR 9080 : Laboratoire de Biochimie Theorique

Institut de Biologie Physico-Chimique
13 rue Pierre et Marie Curie
75005 PARIS - FRANCE

Tel : +33 158 41 51 70
Fax : +33 158 41 50 26
----------------------------------------------------