
Re: [Condor-users] condor daemon questions



On Wed, 30 Sep 2009, Mag Gam wrote:

What happens if the collector, negotiator, dbmsd, quill, schedd and
shadow are restarted very slowly? Does that kill jobs?


If the schedd is down for too long (longer than JobLeaseDuration),
the jobs will time out and get killed by the startd.

The negotiator can be down indefinitely; it just means no new jobs
will get matched.
If the collector is down, running jobs will stay running, but
eventually things will start timing out.
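
The schedd rule above can be written down as a small check. This is only a sketch of the rule as stated in this message, not Condor code: the function name and the example lease value of 1200 seconds are made up for illustration, so check the JobLeaseDuration actually set on your jobs.

```python
def jobs_survive_schedd_outage(outage_seconds, job_lease_duration_seconds):
    """Return True if running jobs are expected to survive a schedd outage.

    Encodes the rule from this thread: jobs are killed by the startd only
    when the schedd is down for longer than JobLeaseDuration.
    """
    return outage_seconds <= job_lease_duration_seconds


if __name__ == "__main__":
    lease = 1200  # hypothetical lease in seconds, for illustration only
    for outage in (60, 1200, 1800):
        print(outage, jobs_survive_schedd_outage(outage, lease))
```

The negotiator has no such limit in this model, since a stopped negotiator only delays new matches.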

Steve



On Tue, Sep 29, 2009 at 10:35 PM, Steven Timm <timm@xxxxxxxx> wrote:
On Tue, 29 Sep 2009, Mag Gam wrote:

I have these processes running in my pool:

condor_master
condor_startd
condor_quill
condor_dbmsd
condor_starter
condor_schedd
condor_shadow
condor_collector
condor_negotiator


Which processes can I stop completely and restart without
affecting any running jobs?  Which processes is it okay for me to -HUP
(condor_restart -subsystem )?

collector, negotiator, dbmsd, and quill can all be restarted.
schedd and shadow too, if you do the restart fast enough.

A restart of condor_startd will always kill the jobs, and a
restart of the master will cause a restart of the startd, and with
it the jobs.
I think condor_starter doesn't take a normal reset signal, but
I could be wrong.  In any case, if the condor_starter is killed,
the job that it started dies too.
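
For quick reference, the answers above can be collected into a lookup table. This is a sketch that only restates what is said in this message; the names RESTART_IMPACT and safe_to_restart are invented for illustration and are not part of Condor, and "fast" is left for the caller to judge.

```python
# Restart impact per daemon, as described in this thread (not from the manual).
RESTART_IMPACT = {
    "collector": "safe",
    "negotiator": "safe",
    "dbmsd": "safe",
    "quill": "safe",
    "schedd": "safe_if_fast",   # fine only if the restart is quick enough
    "shadow": "safe_if_fast",
    "startd": "kills_jobs",
    "master": "kills_jobs",     # restarting the master restarts the startd
    "starter": "kills_jobs",    # if the starter is killed, its job dies too
}


def safe_to_restart(daemon, fast=False):
    """Return True if restarting `daemon` should leave running jobs alone.

    `daemon` may be given with or without the "condor_" prefix.
    `fast` is the caller's own judgment that the restart is quick enough.
    """
    name = daemon.lower()
    if name.startswith("condor_"):
        name = name[len("condor_"):]
    impact = RESTART_IMPACT[name]
    if impact == "safe":
        return True
    if impact == "safe_if_fast":
        return fast
    return False


if __name__ == "__main__":
    for d in ("condor_collector", "condor_schedd", "condor_startd"):
        print(d, safe_to_restart(d, fast=True))
```

An unknown daemon name raises KeyError on purpose, so a typo is not mistaken for a safe restart.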

Steve



TIA
_______________________________________________
Condor-users mailing list
To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/condor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/condor-users/


--
------------------------------------------------------------------
Steven C. Timm, Ph.D  (630) 840-8525
timm@xxxxxxxx  http://home.fnal.gov/~timm/
Fermilab Computing Division, Scientific Computing Facilities,
Grid Facilities Department, FermiGrid Services Group, Assistant Group Leader.