[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] hanging file transfers



> The manual has JobStatus 6 for "transferring output" but none transferring input, is that correct?

yes.  but there is a Boolean attribute that is TRUE while a job is transferring input, and a second one
that indicates if it is waiting in the queue or actually transferring, so the expression

TransferringInput && ! TransferQueued

Will evaluate to true if the job is actually transferring input data.

By the way, there is also a TransferringOutput Boolean, in addition to the job status.

-tj

-----Original Message-----
From: HTCondor-users <htcondor-users-bounces@xxxxxxxxxxx> On Behalf Of Dimitri Maziuk via HTCondor-users
Sent: Wednesday, August 21, 2019 1:47 PM
To: htcondor-users@xxxxxxxxxxx
Cc: Dimitri Maziuk <dmaziuk@xxxxxxxxxxxxx>
Subject: [HTCondor-users] hanging file transfers

Hi all,

every once in a while we get jobs stuck on http transfers. E.g.

> # ps -AF | grep condor
...
> bbee     13689 13688  0 41582  5868   0 Jul29 ?        00:01:14 /usr/libexec/condor/curl_plugin http://proxy.chtc.wisc.edu/SQUID/bmrb/3.8/combined.tgz.enc /var/lib/condor/execute/dir_13685/combined.tgz.enc
...
> bbee     13817 13816  0 41582  5876   0 Jul29 ?        00:01:17 /usr/libexec/condor/curl_plugin http://proxy.chtc.wisc.edu/SQUID/bmrb/3.8/combined.tgz.enc /var/lib/condor/execute/dir_13813/combined.tgz.enc

Is there a knob I could use to auto-kill these? The manual has JobStatus
6 for "transferring output" but none transferring input, is that correct?

This is v. 8.8.4-1.el7.x86_64

TIA,
-- 
Dimitri Maziuk
Programmer/sysadmin
BioMagResBank, UW-Madison -- http://www.bmrb.wisc.edu