
Re: [HTCondor-users] Translating GPU device assignments?




> -----Original Message-----
> From: HTCondor-users [mailto:htcondor-users-bounces@xxxxxxxxxxx] On Behalf
> Of John M Knoeller
> Sent: Thursday, July 06, 2017 5:04 PM
> 
> GPU_DEVICE_ORDINAL is the equivalent of CUDA_VISIBLE_DEVICES for OpenCL.
> It would be incorrect for us to renumber it.
> 
> It sounds like you are saying that the job shouldn't look at
> CUDA_VISIBLE_DEVICES at all; it should just look at the number of GPUs it
> has been assigned and then start from 0.
[Michael Pelletier] 

Yes, at least in the way Caffe handles CUDA devices. I don't know whether this is the normal behavior for CUDA applications in general, though. If it is normal, is a "CUDA_DEVICE_ORDINAL" needed?

As far as I can tell, CUDA_VISIBLE_DEVICES is interpreted by the CUDA library, not by Caffe. From Caffe's perspective, it just sees a sequential list of ID numbers, starting from 0, corresponding to the visible devices.
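
To illustrate the renumbering I mean, here is a minimal Python sketch of the mapping as I understand it. The function name logical_to_physical is made up for this example, and it assumes the variable holds only plain integer indices separated by commas (no UUIDs and no invalid entries). It is not anything HTCondor, CUDA, or Caffe provides.

```python
def logical_to_physical(visible):
    """Map the device IDs an application sees to the host's device IDs.

    `visible` is the text of CUDA_VISIBLE_DEVICES, e.g. "2,3".
    Assumption: every entry is a plain integer index.
    """
    # Position in the list becomes the ID the application uses (0, 1, ...).
    physical = [int(tok) for tok in visible.split(",") if tok.strip()]
    return dict(enumerate(physical))


# A job assigned host devices 2 and 3 would address them as 0 and 1.
mapping = logical_to_physical("2,3")
```

So in this sketch a job that was handed "2,3" would pass 0 or 1 to the application, never 2 or 3.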

	-Michael Pelletier.