[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] windows xp log off kills jobs



On Dec 27, 2007 10:00 PM, Finch, Ralph <rfinch@xxxxxxxxxxxx> wrote:
> condor -version
> $CondorVersion: 6.8.3 Jan  5 2007 $
> $CondorPlatform: INTEL-WINNT50 $
>
> I am submitting jobs from machine1 to a pool, all windows xp.  If I then
> remote login to a machine running my jobs--say machine2--then logoff,
> the jobs on machine2 are killed and new jobs restart a few minutes later
> from the idle jobs in the pool.  Damn annoying as you can guess.
>
> In this thread
> https://lists.cs.wisc.edu/archive/condor-users/2004-November/msg00076.sh
> tml
>
> the poster had the same problem but seemed to think it was only Java
> jobs.  Mine are not Java, my executable is a windows .bat file which
> then runs a compiled exe.  He had a klugy solution to his Java jobs
> which I doubt would work with mine, plus it seems a serious deficiency
> and should have a better solution.  I'm believing I'm not the first
> person to hit on this problem so is there a good solution?

What are the values of SUSPEND and PREEMPT on these machines.

It is possible the standard 'kick a job off this machine if the owner
wants to use it' routines are kicking in.
You may wish to change that behaviour...

Matt