[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] out-of-memory event?



found further evidence in the starterlog

"job was held due to OOM event: job has encountered an out-of-memory event"

however, when i look through the system logs, the OOM killer doesn't
seem to have killed anything.



On Thu, Oct 12, 2017 at 10:17 AM, Michael Di Domenico
<mdidomenico4@xxxxxxxxx> wrote:
> ever since i upgraded our condor pool from 8.4.x to 8.6.1 a lot (but
> not all) of my jobs are getting put on hold with "job has encountered
> an out-of-memory event".
>
> there were a lot of condor/system changes at the same time, so it's
> certainly and very possible that a config setting got changed.
>
> the problem is i can't seem to locate which knob/knobs produces this error
>
> our config is fairly generic and we use most of the default condor settings