So, to clarify, these are the possible reasons a job dies, written as:
state + precondition -> behaviour : postcondition
admin/user does condor_rm -> no timeout (I hope) : is machine still claimed?
admin/user does condor_vacate -> no timeout (I hope) : is machine still claimed?
PREEMPT evaluates to true -> timeout? : I read this as machine unclaimed

Yes. If the machine promised the job some retirement time and the job accepted it, then it certainly applies in this case. Once the job retires, the machine is claimed by the preempting job.

user prio is higher and PREEMPTION_REQUIREMENTS evaluates to true -> retirement timeout : I read this as machine unclaimed
Graceful shutdown requested -> retirement timeout is used instead of the normal graceful timeout? : I read this as machine unclaimed
Machine ranks a job higher than an existing one - relative user prio immaterial -> timeout : Unclaimed
It is this last one that is critical to me: if it only works when the user prio is higher, then it's not much use for implementing job-based rather than user-based allocation policies...
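
To make the last case concrete, here is a rough sketch of the kind of startd policy in question. It is only an illustration: IsUrgentJob is a hypothetical job ClassAd attribute, the values are arbitrary, and it assumes the retirement time is honoured for rank preemption, which is exactly what is being asked.

```
# Hypothetical startd policy sketch, not a tested configuration.
# Prefer jobs that advertise IsUrgentJob = True, whatever the user's priority.
RANK = ifThenElse(TARGET.IsUrgentJob =?= True, 10, 0)
# Give a running job up to one hour (in seconds) to finish before it is evicted.
MAXJOBRETIREMENTTIME = 3600
```

With something like this, the hope is that a higher-ranked job arriving on the machine leaves the running job its retirement time before the claim changes hands.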