Hello,
We are submitting HTCondor jobs that run inside Singularity containers.
The startds pass the --nv option to Singularity to bring GPU support
into the containers for machine learning applications:
SINGULARITY_EXTRA_ARGUMENTS = --nv
SINGULARITY_JOB = !isUndefined(TARGET.SingularityImage)
SINGULARITY_IMAGE_EXPR = TARGET.SingularityImage
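For context, a job opts in by setting the SingularityImage attribute in
its submit description. A minimal sketch, where the executable name and
the image path are placeholders rather than our actual values:

```
# Hypothetical submit description; train.sh and the image path are placeholders.
universe          = vanilla
executable        = train.sh
request_gpus      = 1
# Custom job attribute referenced by SINGULARITY_JOB and SINGULARITY_IMAGE_EXPR.
+SingularityImage = "/path/to/image.sif"
queue
```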
This works great. However, when we use condor_ssh_to_job, we lose the
libcuda-related environment that --nv sets up, see [1]. Could it be
that HTCondor does not pass --nv when entering the container?