Mailing List Archives Public Access	UW Madison Computer Sciences Department Computer Systems Lab

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] htcondor cgroups and memory limits on CentOS7

Date: Mon, 23 Oct 2017 17:43:49 -0500
From: Todd Tannenbaum <tannenba@xxxxxxxxxxx>
Subject: Re: [HTCondor-users] htcondor cgroups and memory limits on CentOS7

On 10/23/2017 10:07 AM, Alessandra Forti wrote:

1) Why SYSTEM_PERIODIC_REMOVEÂ didn't work?
Because the (system_)periodic_remove expressions are evaluated by thecondor_shadow while the job is running, and the *_RAW attributes areonly updated in the condor_schedd.
A simple solution is to use attribute MemoryUsage instead ofResidentSetSize_RAW.Â So I think things will work as you want if youinstead did:
Â RemoveMemoryUsage = ( MemoryUsage > 2*RequestMemory )
Â SYSTEM_PERIODIC_REMOVE = $(RemoveMemoryUsage)Â || <OtherParameters>
let me get this straight if I replace ResidentSetSize_RAW withMemoryUsage it should work?



Yes, that is correct, with MemoryUsage it should work.

Also you may want to place your memory limit policy on the executenodes via startd policy expression, instead of having them enforced onthe submit machine (what I think you are calling the head node).Â Thereason is the execute node policy is evaluated every five seconds,while the submit machine policy is evaluated every several minutes.
I read that the submit machine evaluates the expression every 60 secondssince version 7.4 (though admitedly the blog I read is quite old sothings might have changed again(https://spinningmatt.wordpress.com/2009/12/05/cap-job-runtime-debugging-periodic-job-policy-in-a-condor-pool/)

But realize that there is a lot of polling going on here. Thecondor_starter on the execute machine (worker node) will poll theoperating system for the resource utilization of the job, and sendupdated job attributes like MemoryUsage to both the condor_startd andthe condor_shadow every STARTER_UPDATE_INTERVAL seconds (300secs bydefault). Then, for a running job, the condor_shadow will evaluate yourSYSTEM_PERIODIC_REMOVE expression every PERIODIC_EXPR_INTERVAL seconds(60 by default). The condor_shadow will also push updated jobattributes up to the condor_schedd every SHADOW_QUEUE_UPDATE_INTERVALseconds (900secs by default).

The above polling/update parameters are set how they are by default tolimit the update rates to accommodate one schedd managing many thousandsof live jobs.

So... given the above, note the default config means yourSYSTEM_PERIODIC_REMOVE expression could take up to 5 or 6 minutes beforeit removes a large memory job. And if you are monitoring MemoryUsageand/or ResidentSetSize job attributes via condor_q, it will take 15minutes (up to 20 minutes) for condor_q to show a MemoryUsage spike.

A runaway job could consume a lot of memory in a few minutes :).
Do you mean I should move SYSTEM_PERIODIC_REMOVE to the WN? or is thereanother recipe?

Yes, if you have control over the config of the worker node, it may bebetter to configure the worker node to simply kill a job that exceedsyour memory policy instead of waiting for the memory usage informationto propagate back to the submit node. The worker node would (bydefault) kill the job either immediately if using cgroups with the hardmemory policy, or within 5 seconds if you want a custom PREEMPTexpression that could state things like only kill if the job is using 2xthe provisioned memory (still don't understand why you want to allow thejob to use twice the memory it requested...).


Hope the above helps,
Todd

--
Todd Tannenbaum <tannenba@xxxxxxxxxxx> University of Wisconsin-Madison
Center for High Throughput Computing   Department of Computer Sciences
HTCondor Technical Lead                1210 W. Dayton St. Rm #4257
Phone: (608) 263-7132                  Madison, WI 53706-1685

References:
- [HTCondor-users] htcondor cgroups and memory limits on CentOS7
  - From: Alessandra Forti
- Re: [HTCondor-users] htcondor cgroups and memory limits on CentOS7
  - From: Alessandra Forti
- Re: [HTCondor-users] htcondor cgroups and memory limits on CentOS7
  - From: Todd Tannenbaum
- Re: [HTCondor-users] htcondor cgroups and memory limits on CentOS7
  - From: Alessandra Forti

Prev by Date: Re: [HTCondor-users] htcondor cgroups and memory limits on CentOS7
Next by Date: Re: [HTCondor-users] Forwarding Kerberos-Credentials
Previous by thread: Re: [HTCondor-users] htcondor cgroups and memory limits on CentOS7
Next by thread: Re: [HTCondor-users] htcondor cgroups and memory limits on CentOS7
Index(es):
- Date
- Thread

Mailing List Archives

Public Access

Re: [HTCondor-users] htcondor cgroups and memory limits on CentOS7