Mailing List Archives Public Access	UW Madison Computer Sciences Department Computer Systems Lab

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] using idle computers in computer labs for CFD jobs

Date: Tue, 08 Mar 2016 14:00:30 -0600
From: Todd L Miller <tlmiller@xxxxxxxxxxx>
Subject: Re: [HTCondor-users] using idle computers in computer labs for CFD jobs

Now, this type of checkpoint is distinct from the standard universe'scheckpoint, as it's managed internally by the application rather thanthe standard universe wrapper applied by condor_compile. For Fluent andsimilar applications which can't be relinked in this way, we need tofigure out how to signal Fluent itself to checkpoint periodically.

We expect to be releasing a new developer version (8.5.3) ofHTCondor soon, which will contain some experimental features to helpsimplify situations like this. It sounds like you'd still need to write awrapper script, but that may be easier than changing the configuration ofyour execute nodes. At any rate, if you'd like to help test the newfeatures (or are just curious about what they'll probably be), pleasecontact me off-list.

I think that the alternative would have to be having a wrapper scriptaround the Fluent executable which would be able to recognize theeviction signals from HTCondor and create the exit-fluent flag file whensuch a signal is received.

IIRC, the 'KillSig' job attribute determines which signal is senton an eviction, so if you'd rather not trap SIGTERM, you can choosesomething else.


- ToddM

References:
- Re: [HTCondor-users] using idle computers in computer labs for CFD jobs
  - From: Peter Ellevseth
- Re: [HTCondor-users] using idle computers in computer labs for CFD jobs
  - From: Michael V Pelletier

Prev by Date: Re: [HTCondor-users] Universe Docker: Cannot start container
Next by Date: [HTCondor-users] HTCondor helps LIGO confirm last unproven Albert Einstein theory
Previous by thread: Re: [HTCondor-users] using idle computers in computer labs for CFD jobs
Next by thread: [HTCondor-users] qedit question
Index(es):
- Date
- Thread

Mailing List Archives

Public Access

Re: [HTCondor-users] using idle computers in computer labs for CFD jobs