
[HTCondor-users] NVIDIA L40 Identified as an OCL Device



Trying this again.

Benedikt

On Wed, Jul 5, 2023 at 09:14 Benedikt Riedel <briedel@xxxxxxxxxxxxxxxx> wrote:
Hi,

We recently deployed a machine with NVIDIA L40s. Condor identifies the GPU as an OCL device rather than a CUDA device; see the job event below:

005 (193200029.000.000) 2023-07-05 07:07:52 Job terminated.
    (1) Normal termination (return value 1)
        Usr 0 00:00:01, Sys 0 00:00:00  -  Run Remote Usage
        Usr 0 00:00:00, Sys 0 00:00:00  -  Run Local Usage
        Usr 0 00:00:01, Sys 0 00:00:00  -  Total Remote Usage
        Usr 0 00:00:00, Sys 0 00:00:00  -  Total Local Usage
    3064  -  Run Bytes Sent By Job
    181  -  Run Bytes Received By Job
    3064  -  Total Bytes Sent By Job
    181  -  Total Bytes Received By Job
    Partitionable Resources :   Usage  Request Allocated Assigned
       admin_mutex_1        :              0
       admin_mutex_2        :              0
       admin_mutex_3        :              0
       Cpus                 :         1     1
       Disk (KB)            :    20  1000000  1060093
       Gpus (Average)       :         1     1 "OCL0"
       machine_token        :              0
       Memory (MB)          :     0   4096    4096

    Job terminated of its own accord at 2023-07-05T12:07:52Z.
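
To check many job logs for the same symptom, the Assigned column of the Gpus row can be pulled out with a short script. This is only a minimal sketch, assuming the event log layout shown above. The function names `assigned_gpu_ids` and `non_cuda_ids` are hypothetical helpers, not part of HTCondor, and treating multiple assigned devices as a comma-separated list inside the quotes is an assumption.

```python
import re

# Matches the Gpus row of the "Partitionable Resources" table and captures
# the quoted value in the Assigned column, e.g. "OCL0".
GPUS_ROW = re.compile(r'^\s*Gpus\b[^:]*:.*?"([^"]*)"\s*$', re.MULTILINE)


def assigned_gpu_ids(event_text):
    """Return the device IDs found in the Assigned column of each Gpus row."""
    ids = []
    for match in GPUS_ROW.finditer(event_text):
        # Assumption: several devices would be comma-separated in the quotes.
        for part in match.group(1).split(","):
            if part.strip():
                ids.append(part.strip())
    return ids


def non_cuda_ids(ids):
    """Keep only the IDs that do not carry the CUDA prefix."""
    return [gpu_id for gpu_id in ids if not gpu_id.startswith("CUDA")]


# Excerpt of the job event shown above.
sample = '''    Partitionable Resources :   Usage  Request Allocated Assigned
       Cpus                 :         1     1
       Gpus (Average)       :         1     1 "OCL0"
'''

ids = assigned_gpu_ids(sample)
print(ids, non_cuda_ids(ids))
```

Any ID returned by `non_cuda_ids` marks a job that was handed a device not labelled as CUDA.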


Is there a setting we missed, or a way to force it to be identified as a "CUDA" device?

Thanks,

Benedikt


--
Benedikt Riedel
Global Computing Coordinator IceCube Neutrino Observatory
Technical Coordinator IceCube Neutrino Observatory
Computing Manager Wisconsin IceCube Particle Astrophysics Center
University of Wisconsin-Madison