
[HTCondor-users] condor_gpu_discovery: wrong amount of GPU memory



Dear all,

 

We are using HTCondor 8.8 on Windows (Windows 10 and Windows Server 2016, specifically) with CUDA 10.0 installed. Some systems have large GPUs, e.g. with 12 GB or even 32 GB of memory. Nevertheless, condor_gpu_discovery reports at most

CUDA0GlobalMemoryMb=4096

for these cards. When I call cudaGetDeviceProperties from my own code, the memory size is returned correctly, so I am not sure what is going on here. Any ideas what the reason might be? By the way, I am using the 64-bit build of HTCondor.
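
To compare the reported value against the known card size on several machines, something like the following Python sketch could be used. It relies only on the `CUDA0GlobalMemoryMb=4096` line format quoted above and ignores every other line of the tool's output. The function names and the expected sizes passed in are hypothetical, chosen for illustration; they are not part of HTCondor.

```python
import re

# Matches attribute lines such as "CUDA0GlobalMemoryMb=4096".
# Any other line of the discovery output is ignored (assumption: only this
# attribute format, as quoted in the post, is relied upon).
_MEM_LINE = re.compile(r"^(CUDA\d+)GlobalMemoryMb\s*=\s*(\d+)$")


def parse_global_memory_mb(output):
    """Return {device_prefix: reported_memory_in_MB} from discovery output text."""
    reported = {}
    for line in output.splitlines():
        match = _MEM_LINE.match(line.strip())
        if match:
            reported[match.group(1)] = int(match.group(2))
    return reported


def find_underreported(output, expected_mb):
    """Return the devices whose reported memory is below the expected size.

    expected_mb is a hypothetical mapping supplied by the caller,
    e.g. {"CUDA0": 12288} for a 12 GB card.
    """
    reported = parse_global_memory_mb(output)
    return sorted(
        device
        for device, mb in reported.items()
        if device in expected_mb and mb < expected_mb[device]
    )


if __name__ == "__main__":
    sample = "CUDA0GlobalMemoryMb=4096\n"
    print(find_underreported(sample, {"CUDA0": 12288}))
```

The text passed to `find_underreported` would be the captured output of condor_gpu_discovery run with whatever options produce the line above.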

 

Besides that, I discovered that condor_gpu_discovery tries to access the registry key

"SOFTWARE\\NVIDIA Corporation\\GPU Computing Toolkit\\CUDA"

which does not seem to exist on any of our systems. Could you please tell me under which circumstances you would expect this key to exist?

 

Thanks a lot,

 

Jens