[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [HTCondor-users] htcondor + gpudirect + openmpi



On Monday 11 September 2017 22:10:18 Michael Pelletier wrote:
> I've been using the job ad file for non-dynamic values. For dynamic stuff
> you could use condor_chirp get_job_attr.
> 
> 
>      condor_q -jobads $_CONDOR_JOB_AD -autoformat RequestCpus

Thanks!

> 
> -Michael Pelletier.
> 
> > -----Original Message-----
> > From: HTCondor-users [mailto:htcondor-users-bounces@xxxxxxxxxxx] On
> > Behalf Of Harald van Pee
> > Sent: Monday, September 11, 2017 3:45 PM
> > To: htcondor-users@xxxxxxxxxxx
> > Subject: Re: [HTCondor-users] htcondor + gpudirect + openmpi
> > 
> > Hi Jason,
> > 
> > I think I have done something wrong, or its just working since I have
> > installed Mellanox OFED 4.1.
> > 
> > Up to now I have tested it only with openmpi-2.0.2a1 but at least for
> > this version its working if I make a loop over the requested gpus to
> > just get more lines in the machine file and for the -n argument I have
> > to multiply $_CONDOR_NPROCS with the requested gpus.
> > 
> > How I get the number of requested gpus in the script?
> > At the moment I would parse
> > _CONDOR_AssignedGPUs and count them.
> > 
> > OMP_NUM_THREADS can be used to get the request_cpus value but in general
> > this is not the number of mpinodes per node.
> > 
> > Up to now I just test the cpus but I hope I can start with real gpu jobs
> > soon.
> 
> _______________________________________________
> HTCondor-users mailing list
> To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
> subject: Unsubscribe
> You can also unsubscribe by visiting
> https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users
> 
> The archives can be found at:
> https://lists.cs.wisc.edu/archive/htcondor-users/