[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] cgroups question/problem



It is interesting, we are getting a fair number of OOM Holds these days, but only a few seem to end this way with the WN locked up. One I just observed started near the end of what I would call a small storm in the number of kernel process creations on the WN. Typical is around <25/s, and this one was running around 600-700/s. I am leaving this WN "as-is" until at least tomorrow should there be anything I could pull out of this for you.

bob

On 3/16/2016 5:41 PM, Martin Bukatovic wrote:
On Tue, Mar 15, 2016 at 12:46:24PM -0500, Greg Thain wrote:
What OS and kernel version are you running?  As I recall, we saw something
like this for a while on an older rhel 6 kernel.
While it looks similar, this issue seems to be a bit different compared
to that RHEL 6 bug (BZ 870011). Moreover it looks like Bob is running
kernel which already contains a fix for the BZ.