[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] HAS_CVMFS_something ?



Hi Brian,

On Mon, 2023-04-24 at 12:33:50 +0000, Bockelman, Brian wrote:
> Hi Steffan,
> 
> That's a custom attribute.  Is the user perhaps copying a job they ran from a different HTCondor pool?

I suspect OSG...

> For example, in the OSPool, we set that attribute using the following GlideinWMS configuration script snippet:
> 
> https://github.com/opensciencegrid/osg-flock/blob/master/ospool-pilot/main/pilot/advertise-base#L380-L516
> 
> Feel free to borrow as needed.  We have a goal of extracting commonly-useful logic from the OSPool and putting it into HTCondor itself (as an option); but that's more of a "TODO this year" rather than "it already exists".

Looks interesting, but I obviously need lines 49--68 as well, and have to find out
which interface provides the "add_*" functionality - and how to possibly reproduce
this for "condor_config only".

Thanks so far,
 Steffen

> 
> Brian
> 
> > On Apr 24, 2023, at 5:17 AM, Steffen Grunewald <steffen.grunewald@xxxxxxxxxx> wrote:
> > 
> > Good morning/afternoon/...,
> > 
> > today, after a rather lengthy upgrade of our HTCondor pool, I found some jobs
> > in "idle" state, despite plenty of resources still available.
> > 
> > Investigation (better_analyze) shows, for one of the affected jobs:
> > 
> > 
> > The Requirements expression for job 27474.000 is
> > 
> >    ((HAS_CVMFS_singularity_opensciencegrid_org is true)) && (TARGET.Arch == "X86_64") && (TARGET.OpSys == "LINUX") && (TARGET.Disk >= RequestDisk) && (TARGET.Memory >= RequestMemory) && (TARGET.Cpus >= RequestCpus) && (TARGET.HasFileTransfer && versioncmp(split(TARGET.CondorVersion)[1],"8.9.7") >= 0) &&
> >    TARGET.HasSelfCheckpointTransfers
> > 
> > Job 27474.000 defines the following attributes:
> > 
> >    RequestCpus = 16
> >    RequestDisk = 10485760
> >    RequestMemory = 40000
> > 
> > The Requirements expression for job 27474.000 reduces to these conditions:
> > 
> >         Slots
> > Step    Matched  Condition
> > -----  --------  ---------
> > [0]           0  HAS_CVMFS_singularity_opensciencegrid_org is true
> > [11]        585  TARGET.HasFileTransfer
> > [12]        585  versioncmp(split(TARGET.CondorVersion)[1],"8.9.7") >= 0
> > 
> > 
> > Older jobs from the same user (and similar DAG) didn't require that HAS_CVMFS_...
> > to be set.
> > 
> > While all machines have CVMFS access, `ls /cvmfs/singularity.opensciencegrid.org`
> > works on all pool nodes, this (never seen before) requirement needs to be fulfilled
> > - but I have no idea whether this has to be done via setting that HAS_CVMFS_... via
> > a configuration file (any suggestion is welcome) or whether HTCondor could take
> > care of it itself (perhaps triggered by another setting in the config).
> > 
> > I've searched the latest documentation and all of my notes, to no avail.
> > 
> > Is there somebody who can cure my (obvious?) blindness?
> > 
> > Thanks,
> > Steffen

-- 
Steffen Grunewald, Cluster Administrator
Max Planck Institute for Gravitational Physics (Albert Einstein Institute)
Am Mühlenberg 1 * D-14476 Potsdam-Golm * Germany
~~~
Fon: +49-331-567 7274
Mail: steffen.grunewald(at)aei.mpg.de
~~~