[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [HTCondor-users] File last modification time or job last write() attribute?
- Date: Thu, 26 May 2016 14:06:24 -0400
- From: Michael V Pelletier <Michael.V.Pelletier@xxxxxxxxxxxx>
- Subject: Re: [HTCondor-users] File last modification time or job last write() attribute?
From: MIRON LIVNY <miron@xxxxxxxxxxx>
Date: 05/26/2016 01:46 PM
> You do not have an algorithm to decide when a job stopped making progress
> based on its Output behavior after it consumed one hour of CPU time.
> What am I missing?
Ah, I see what you're getting at now.
Regardless of how much time the job has spent in slot,
we can decide
that it is hung and needs to be terminated if it has
gone at least one
hour (for example) without making any updates to a