[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

RE: [Condor-users] about priorities






Any suggestions for the following condor_config entries on the execute
machines?

WANT_SUSPEND = FALSE
WANT_VACATE = FALSE
START = TRUE
SUSPEND = FALSE
CONTINUE          = $(UWCS_CONTINUE)
PREEMPT = FALSE
KILL              = $(UWCS_KILL)
PERIODIC_CHECKPOINT     = $(UWCS_PERIODIC_CHECKPOINT)
PREEMPTION_REQUIREMENTS = $(UWCS_PREEMPTION_REQUIREMENTS)
PREEMPTION_RANK         = $(UWCS_PREEMPTION_RANK)


Thanks,
Bob Nordlund



|---------+-------------------------------->
|         |           "Matt Hope"          |
|         |           <Matt.Hope@xxxxxxxx> |
|         |           Sent by:             |
|         |           condor-users-bounces@|
|         |           cs.wisc.edu          |
|         |                                |
|         |                                |
|         |           08/05/2004 10:36 AM  |
|         |           Please respond to    |
|         |           Condor-Users Mail    |
|         |           List                 |
|         |                                |
|---------+-------------------------------->
  >--------------------------------------------------------------------------------------------------------------|
  |                                                                                                              |
  |       To:       "Condor-Users Mail List" <condor-users@xxxxxxxxxxx>                                          |
  |       cc:                                                                                                    |
  |       Subject:  RE: [Condor-users] about priorities                                                          |
  >--------------------------------------------------------------------------------------------------------------|




the cluster/proc option should work fine for you then...

just alter the rank expression on the execute machines and, assuming all
other settings are at their defaults, it should all just work...

try it with cpusoak test jobs, send a vacate to one of the machines and see
what happens.

> -----Original Message-----
> From: condor-users-bounces@xxxxxxxxxxx
> [mailto:condor-users-bounces@xxxxxxxxxxx]On Behalf Of
> Robert.Nordlund@xxxxxxxxxxxxxxxx
> Sent: 05 August 2004 15:25
> To: Condor-Users Mail List
> Subject: RE: [Condor-users] about priorities
>
>
>
>
>
>
> Matt,
>
> Thanks for the quick reply.  I'll try to clarify my
> situation.  Jobs are
> submitted with a five-second delay.  In the case where the
> first ten jobs
> are running and utilizing all available pool resources and
> then a hardware
> failure causes the available pool resources to decrease and
> thereby cause a
> job failure, is there a way to remove an active
> lower-priority job(s) to
> make room for the higher priority job that failed?  This is a
> dedicated
> pool with only one scheduler.  I control the horizontal and
> the vertical.
>
> Thanks,
> Bob Nordlund
>
>
>
>
> |---------+-------------------------------->
> |         |           "Matt Hope"          |
> |         |           <Matt.Hope@xxxxxxxx> |
> |         |           Sent by:             |
> |         |           condor-users-bounces@|
> |         |           cs.wisc.edu          |
> |         |                                |
> |         |                                |
> |         |           08/05/2004 10:12 AM  |
> |         |           Please respond to    |
> |         |           Condor-Users Mail    |
> |         |           List                 |
> |         |                                |
> |---------+-------------------------------->
>
> >-------------------------------------------------------------
> -------------------------------------------------|
>   |
>                                                    |
>   |       To:       "Condor-Users Mail List"
> <condor-users@xxxxxxxxxxx>                                          |
>   |       cc:
>                                                    |
>   |       Subject:  RE: [Condor-users] about priorities
>                                                    |
>
> >-------------------------------------------------------------
> -------------------------------------------------|
>
>
>
>
> assumption - jobs are submitted in time order with sufficient
> gap between
> them to account for time differences between submit machines...
>
> place
> RANK=-TARGET.QDate
> on all execute machines.
>
> assumption - the jobs are all submitted from one cluster..
> place
> RANK=-TARGET.ProcId
> on all execute machines.
> If you wish to have multiple sets of these ordered jobs at
> once make sure
> you either include the cluster id as so (works for jobs up to
> 100000000
> processes big!)
> RANK= (-TARGET.ClusterId * 100000000) - TARGET.ProcId
>
>
> assumption you can control the submission process such that
> every job is
> given a unique monotonic increasing/decreasing number (easy
> if all done
> from one cluster but still possible otherwise)
> use for example
> +MY_ORDER_NUMBER=[x]
> in the submit file and ensure this ticks on every time
>
> increasing
> RANK=-TARGET.MY_ORDER_NUMBER
>
> decreasing
> RANK=TARGET.MY_ORDER_NUMBER
>
> if you have to play nicely with other machines this rank
> expression must be
> modified to correctly handle jobs which don't have this
> attribute (using
> =!= and/or =?=)
>
> Igf you haven't got control of the execute machines rank
> you'll have some
> issues...
>
> also you will have to deal with the possibility that:
>
> jobs 1-10 start
> job 5 is vacated before finishing for some reasona at roughly
> the same time
> job 11 starts.
> job 11 then completes before the negotiation cycle can swap 6
> for it. job
> 11 has now run before 6 and nothing can change that.
>
> This may not be a problem but this was not clear from your request.
>
> all sorts of options available really.
>
> > -----Original Message-----
> > From: condor-users-bounces@xxxxxxxxxxx
> > [mailto:condor-users-bounces@xxxxxxxxxxx]On Behalf Of
> > Robert.Nordlund@xxxxxxxxxxxxxxxx
> > Sent: 05 August 2004 14:51
> > To: Condor-Users Mail List
> > Subject: Re: [Condor-users] about priorities
> >
> >
> >
> >
> >
> >
> > Hello all.
> >
> > I am running a dedicated pool of Windows 2000 servers with
> one central
> > submit machine.  I submit 20 jobs with multiple processes and
> > want to make
> > sure that they are run in succession, i.e. Job1 is more
> important than
> > Job2, etc.  If the first 10 jobs are concurrently running how
> > can I make
> > sure that if Job1 fails due to a hardware failure that it
> > will preempt the
> > the claim of the lowest priority job?  Is this something
> > fairly easy to
> > accomplish with priorities/preemption?
> >
> > Thank you,
> >
> > Bob Nordlund
> >
> >
> >
> > |---------+-------------------------------->
> > |         |           Erik Paulson         |
> > |         |           <epaulson@xxxxxxxxxxx|
> > |         |           >                    |
> > |         |           Sent by:             |
> > |         |           condor-users-bounces@|
> > |         |           cs.wisc.edu          |
> > |         |                                |
> > |         |                                |
> > |         |           08/04/2004 10:43 PM  |
> > |         |           Please respond to    |
> > |         |           Condor-Users Mail    |
> > |         |           List                 |
> > |         |                                |
> > |---------+-------------------------------->
> >
> > >-------------------------------------------------------------
> > -------------------------------------------------|
> >   |
> >                                                    |
> >   |       To:       Condor-Users Mail List
> > <condor-users@xxxxxxxxxxx>
> >         |
> >   |       cc:
> >                                                    |
> >   |       Subject:  Re: [Condor-users] about priorities
> >                                                    |
> >
> > >-------------------------------------------------------------
> > -------------------------------------------------|
> >
> >
> >
> >
> > On Wed, Aug 04, 2004 at 06:16:45PM -0500, David A. Kotz wrote:
> > > I think what you want to do is to set the following in your
> > > condor_config:
> > >
> > > SUSPEND = FALSE
> > > PREEMPT = FALSE
> > > KILL = FALSE
> > >
> >
> > That will help prevent the local machine from evicting the
> > job (you'll also
> > probably want have START = TRUE, or some such)
> >
> > Those expresssions have nothing to do with preemption for priority
> > purposes, though. Condor can "preempt" a job for two reasons - one,
> > because the local conditions on the machine force us to get
> > rid of a job,
> > or two, because there is a user with a higher priority who
> > wants to run a
> > job. That preemption is controlled by
> PREEMPTION_REQUIREMENTS. It is a
> > pool-wide setting that is evaluated only when Condor has a higher
> > priority user who wants to run a job. It is tested against
> > each machine
> > in the pool to see if the job currently running on that machine
> > can be preempted.
> >
> > To prevent priority preemption, set:
> >
> > PREEMPTION_REQUIREMENTS = FALSE
> >
> > in the config file of your central manager.
> >
> > -Erik
> >
> > each machine of the pool.
> > > - dave
> > >
> > >
> > > On Wed, 2004-08-04 at 03:44 -0700, Fernando Rannou wrote:
> > > > Hello,
> > > > I'm a little confused on how Condor handles priorities.
> > > > On Section 2.7.2 of the Condor manual it reads:
> > > > ---------
> > > > "The priority of each individual user changes according
> > to the number
> > > > of resources the individual is using.
> > > > ...
> > > >
> > > > "For instance, if a low priority user is utilizing all available
> > machines
> > > > and suddenly a higher priority user submits jobs, Condor
> > > > will immediately checkpoint and vacate jobs belonging to
> > > > the lower priority user."
> > > >
> > > > ---------
> > > > Which priority do they refer to? Effective, Real, Factor??
> > > >
> > > > We need that once a job is running it never, ever
> > > > gets preempted, suspended, killed, or whatever
> > > > no matter how long it's been running and
> > > > no matter how low its priority is.
> > > >
> > > > How can I accomplished that?
> > > > Thanks
> > > >
> > > > Fernando
> > > >
> > > >
> > > > ----------------------------------------------------------
> > > > IMPORTANT WARNING:  This email (and any attachments) is
> > only intended
> > for the use of the person or entity to which it is
> addressed, and may
> > contain information that is privileged and confidential.  You, the
> > recipient, are obligated to maintain it in a safe, secure and
> > confidential
> > manner.  Unauthorized redisclosure or failure to maintain
> > confidentiality
> > may subject you to federal and state penalties. If you are not the
> > recipient, please immediately notify us by return email, and
> > delete this
> > message from your computer.
> > > > ----------------------------------------------------------
> > > > _______________________________________________
> > > > Condor-users mailing list
> > > > Condor-users@xxxxxxxxxxx
> > > > http://lists.cs.wisc.edu/mailman/listinfo/condor-users
> > > --
> > > David A. Kotz <dkotz@xxxxxxxxxxxxx>
> > >
> > > _______________________________________________
> > > Condor-users mailing list
> > > Condor-users@xxxxxxxxxxx
> > > http://lists.cs.wisc.edu/mailman/listinfo/condor-users
> > _______________________________________________
> > Condor-users mailing list
> > Condor-users@xxxxxxxxxxx
> > http://lists.cs.wisc.edu/mailman/listinfo/condor-users
> >
> >
> >
> >
> >
> > **************************************************************
> > ***********
> > PRIVILEGED AND CONFIDENTIAL: This communication, including
> > attachments, is for the exclusive use of addressee and may
> > contain proprietary, confidential and/or privileged
> > information.  If you are not the intended recipient, any use,
> > copying, disclosure, dissemination or distribution is
> > strictly prohibited.  If you are not the intended recipient,
> > please notify the sender immediately by return e-mail, delete
> > this communication and destroy all copies.
> > **************************************************************
> > ***********
> >
> > _______________________________________________
> > Condor-users mailing list
> > Condor-users@xxxxxxxxxxx
> > http://lists.cs.wisc.edu/mailman/listinfo/condor-users
> >
>
>
> *****************************************************************
> Gloucester Research Limited believes the information
> provided herein is reliable. While every care has been
> taken to ensure accuracy, the information is furnished
> to the recipients with no warranty as to the completeness
> and accuracy of its contents and on condition that any
> errors or omissions shall not be made the basis for any
> claim, demand or cause for action.
> *****************************************************************
>
> _______________________________________________
> Condor-users mailing list
> Condor-users@xxxxxxxxxxx
> http://lists.cs.wisc.edu/mailman/listinfo/condor-users
>
>
>
>
>
> **************************************************************
> ***********
> PRIVILEGED AND CONFIDENTIAL: This communication, including
> attachments, is for the exclusive use of addressee and may
> contain proprietary, confidential and/or privileged
> information.  If you are not the intended recipient, any use,
> copying, disclosure, dissemination or distribution is
> strictly prohibited.  If you are not the intended recipient,
> please notify the sender immediately by return e-mail, delete
> this communication and destroy all copies.
> **************************************************************
> ***********
>
> _______________________________________________
> Condor-users mailing list
> Condor-users@xxxxxxxxxxx
> http://lists.cs.wisc.edu/mailman/listinfo/condor-users
>
_______________________________________________
Condor-users mailing list
Condor-users@xxxxxxxxxxx
http://lists.cs.wisc.edu/mailman/listinfo/condor-users





*************************************************************************
PRIVILEGED AND CONFIDENTIAL: This communication, including attachments, is for the exclusive use of addressee and may contain proprietary, confidential and/or privileged information.  If you are not the intended recipient, any use, copying, disclosure, dissemination or distribution is strictly prohibited.  If you are not the intended recipient, please notify the sender immediately by return e-mail, delete this communication and destroy all copies.
*************************************************************************