Set LIB_PATH to libnvidia-ml.so.1 instead of libnvidia-ml.so on Linux #63

dmitryduev · 2024-08-15T18:19:45Z

In the official Go bindings for NVML, they use libnvidia-ml.so.1: https://github.com/NVIDIA/go-nvml/blob/0e815c71ca6e8184387d8b502b2ef2d2722165b9/pkg/nvml/lib.go#L30, and I believe the same is true for pynvml.
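For illustration, the change proposed here amounts to picking the versioned soname as the default library name on Linux. The sketch below is not the crate's actual source, which is not shown in this thread; the function name `default_lib_path` is hypothetical, and the Windows name `nvml.dll` is an assumption added only to make the example complete.

```rust
/// Hypothetical helper: the default NVML library name handed to the
/// dynamic loader on each platform.
pub fn default_lib_path() -> &'static str {
    if cfg!(windows) {
        // Assumption: the Windows driver exposes NVML as `nvml.dll`.
        "nvml.dll"
    } else {
        // Versioned soname, matching what the Go bindings load.
        // The unversioned `libnvidia-ml.so` may not exist at runtime.
        "libnvidia-ml.so.1"
    }
}
```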


scaronni · 2024-11-22T19:08:06Z

Yes please. In driver 560 and above, as shipped in the CUDA repository, we've also removed the symlink to the unversioned library.

Loading libnvidia-ml.so.1 is the correct approach. The unversioned library should be used only for linking when building against it.

Previous driver versions did ship an nvidia-driver-devel subpackage that contained the unversioned libnvidia-ml.so library. That was a mistake and a leftover: the package could not really be used, because the unversioned libraries in it came without any headers to compile against.

The few remaining unversioned libraries that are required in the driver have been moved to the main library packages.

Regarding NVML, the NVML stub and the headers are in the cuda-nvml-devel package, so if you need to link to it that's what should be installed.

Sample output for the RPM (deb is similar):

$ rpm -qpl cuda-nvml-devel-12-6-12.6.77-1.x86_64.rpm | grep targets
/usr/local/cuda-12.6/targets
/usr/local/cuda-12.6/targets/x86_64-linux
/usr/local/cuda-12.6/targets/x86_64-linux/include
/usr/local/cuda-12.6/targets/x86_64-linux/include/nvml.h
/usr/local/cuda-12.6/targets/x86_64-linux/lib
/usr/local/cuda-12.6/targets/x86_64-linux/lib/stubs
/usr/local/cuda-12.6/targets/x86_64-linux/lib/stubs/libnvidia-ml.a
/usr/local/cuda-12.6/targets/x86_64-linux/lib/stubs/libnvidia-ml.so

Again, please stick to loading libnvidia-ml.so.1 which is the correct approach. Thanks!

dmitryduev · 2024-11-22T23:24:09Z

@Cldfire can I please get a stamp?

scaronni · 2024-11-25T12:17:49Z

This is actually #47 again.

Cldfire · 2024-12-13T22:10:37Z

Hi folks. My apologies for the delay here, and thank you for the PR and the information :)

I've recently started a job at Apple which makes it difficult for me to continue maintaining this library. I am in the process of finding new ownership for this repository, and I've also reached out to a contact at NVIDIA to see if there's any interest on their side in making this crate more official.

I'll provide an update in the coming weeks. In the meantime please continue to use NvmlBuilder to load libnvidia-ml.so.1.

Recently, the NVIDIA CUDA repository packages started shipping only the `libnvidia-ml.so.1` file, without `libnvidia-ml.so`. The upstream `nvml-wrapper` package has a proposed fix (Cldfire/nvml-wrapper#63), but the package is currently in search of a maintainer. To allow `bottom` to correctly detect NVIDIA GPUs on Ubuntu with the official NVIDIA packages, add a wrapper around `Nvml::init` that is more persistent in its search for the NVML library.
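A minimal sketch of such a "more persistent" search, under stated assumptions: it is not the code merged into `bottom`, and the names `NVML_CANDIDATES` and `init_with_fallback` are hypothetical. The candidate order (versioned name first) is also an assumption, chosen to follow the advice earlier in this thread. The loader is passed in as a closure so the example stays self-contained and does not depend on the `nvml-wrapper` crate.

```rust
/// Hypothetical candidate list: library names to try, in order.
pub const NVML_CANDIDATES: &[&str] = &["libnvidia-ml.so.1", "libnvidia-ml.so"];

/// Calls `load` with each candidate name in order and returns the first
/// successful result. If every attempt fails, returns all failures paired
/// with the name that produced them.
pub fn init_with_fallback<T, E>(
    candidates: &[&str],
    mut load: impl FnMut(&str) -> Result<T, E>,
) -> Result<T, Vec<(String, E)>> {
    let mut failures = Vec::new();
    for &name in candidates {
        match load(name) {
            // Stop at the first library that loads.
            Ok(handle) => return Ok(handle),
            // Remember why this candidate failed and try the next one.
            Err(err) => failures.push((name.to_string(), err)),
        }
    }
    Err(failures)
}
```

With `nvml-wrapper`, the closure would presumably go through the builder mentioned above, along the lines of `|name| Nvml::builder().lib_path(OsStr::new(name)).init()`; check that call against the crate documentation for the version in use. Collecting every failure, rather than only the last one, keeps the original error visible when none of the names resolve.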

Set LIB_PATH to libnvidia-ml.so.1 instead of libnvidia-ml.so on Linux

c63c33b

dmitryduev mentioned this pull request Aug 15, 2024

failing to load driver nvml from wsl2 #51

Open

scaronni mentioned this pull request Nov 25, 2024

libnvidia-ml.so symlink in the wrong location causes issues negativo17/cuda-nvml#1

Closed

codifryed mentioned this pull request Dec 8, 2024

Update lib path codifryed/nvml-wrapper#1

Merged

al42and mentioned this pull request Jan 4, 2025

other: handle systems with only libnvidia-ml.so.1 ClementTsang/bottom#1655

Merged
