Skip to content

Tracking not supported for NVIDIA Blackwell GPUs #1037

@raphischer

Description

@raphischer

Issue:

When testing CodeCarbon on an NVIDIA DGX Spark, I noticed that the tracker cannot be correctly initialized. When debugging into that, I noticed that calling pynvml.nvmlDeviceGetMemoryInfo in codecarbon/core/gpu.py throws an pynvml.NVMLError_NotSupported, probably due to the shared memore architecture of the NVIDIA Blackwell GPU (internally read as NVIDIA GB10). nvidia-smi currently also lists N/A for some static information.

Solution:

Catch the error and instead return negative default values. As far as I see, this information is only statically stored and thus does not negatively impact the downstream functionality of CodeCarbon.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions