
[Feature Request] Using the cuda dlls installed with pip from official Nvidia python packages in onnxruntime-gpu #19350

Open
martinResearch opened this issue Jan 31, 2024 · 22 comments · May be fixed by #22506
Assignees
Labels
ep:CUDA issues related to the CUDA execution provider feature request request for unsupported feature or enhancement platform:windows issues related to the Windows platform

Comments

@martinResearch

Describe the feature request

Short Description

Enable onnxruntime-gpu to use CUDA DLLs installed via pip from the NVIDIA Python index, or include the CUDA DLLs in the onnxruntime-gpu wheel so that they can be installed with pip install onnxruntime-gpu[cuda_dlls].

Problem

Installing the CUDA DLLs for onnxruntime-gpu currently has several limitations:

  1. Mandatory user account creation on the NVIDIA website.
  2. Dependency on admin rights, restricting installation on machines without such privileges.
  3. Risk of installing incompatible CUDA versions.
  4. Inconvenience of updating CUDA_PATH when switching Python environments with different CUDA versions.

In contrast, PyTorch on Windows includes the CUDA DLLs in its wheels, simplifying the installation process and reducing version-mismatch risks. On Linux, PyTorch seems to use NVIDIA packages from the feed https://pypi.ngc.nvidia.com (installable with pip install nvidia-pyindex), though I did not double-check this.

Possible Solutions

To streamline the installation process for onnxruntime-gpu, the following solutions could be considered:

  1. Packaged CUDA DLLs with onnxruntime-gpu Wheels:

    • Create onnxruntime-gpu wheels that include CUDA DLLs, allowing users to install them conveniently with pip install onnxruntime-gpu[cuda_dlls].
  2. Dependency Configuration via onnxruntime-gpu Wheel:

    • Create an onnxruntime-gpu wheel installable with pip install onnxruntime-gpu[cuda_dlls].
    • This wheel would list packages from the NVIDIA package index as install dependencies (e.g., nvidia-cudnn-cu12), which can be installed with pip install nvidia-pyindex followed by pip install nvidia-cudnn-cu12.
    • Configure onnxruntime to utilize these DLLs instead of those in CUDA_PATH.

The second solution may facilitate reuse of the same CUDA DLLs by other packages like CuPy or PyTorch, potentially reducing the overall size of the Python environment.
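For illustration, here is a hypothetical sketch (not the actual onnxruntime setup.py) of how the cuda_dlls extra from the second solution could be declared, so that pip install onnxruntime-gpu[cuda_dlls] pulls the NVIDIA wheels as dependencies. The exact package list is an assumption based on this thread:

```python
# Hypothetical extras_require entry for a setup.py; the package names
# are the nvidia-* wheels discussed in this thread, and the final list
# onnxruntime would need might differ.
extras_require = {
    "cuda_dlls": [
        "nvidia-cuda-runtime-cu12",
        "nvidia-cublas-cu12",
        "nvidia-cufft-cu12",
        "nvidia-curand-cu12",
        "nvidia-cudnn-cu12",
    ],
}
```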

Describe scenario use case

  • allow full automation of the CUDA DLL installation
  • allow different CUDA DLL versions in different Python environments without editing CUDA_PATH
@martinResearch martinResearch added the feature request request for unsupported feature or enhancement label Jan 31, 2024
@github-actions github-actions bot added ep:CUDA issues related to the CUDA execution provider platform:windows issues related to the Windows platform labels Jan 31, 2024
@snnn
Member

snnn commented Jan 31, 2024

Due to some internal requirements, we are not allowed to use a second feed.

@snnn snnn closed this as completed Jan 31, 2024
@snnn
Member

snnn commented Jan 31, 2024

The CUDA DLLs are so huge that they cannot be hosted on PyPI, so they must be hosted somewhere else. However, due to security concerns we are not allowed to use a second feed, so this is a dead end.

@snnn
Member

snnn commented Jan 31, 2024

I will keep this issue open so we can continue discussing the details offline.

@snnn snnn reopened this Jan 31, 2024
@martinResearch
Author

Is the problem that you are not allowed to consume packages from the external feed https://pypi.ngc.nvidia.com/ in your development environment and/or in your CI test pipelines (GitHub Actions) for security reasons?

@snnn
Member

snnn commented Feb 1, 2024

In CI.

@martinResearch
Author

martinResearch commented Feb 15, 2024

Could you possibly add https://pypi.ngc.nvidia.com/ as an upstream to the feed you are currently using?

@snnn
Member

snnn commented Feb 15, 2024

Azure DevOps Artifacts' upstream feature doesn't support that; it only supports pypi.org. Correct me if I'm wrong.

@martinResearch
Author

Indeed, it seems I cannot use https://pypi.ngc.nvidia.com/ as an upstream in Azure DevOps.
However, it seems that all the NVIDIA packages required to get the DLLs for CUDA 12 are actually on pypi.org, so there is no need to add https://pypi.ngc.nvidia.com/ as an upstream.

Here is a list of packages on pypi.org that allowed me to get all the DLLs I needed to use onnxruntime-gpu (some might not actually be required):

nvidia-nvjitlink-cu12
nvidia-nvtx-cu12
nvidia-cuda-runtime-cu12
nvidia-cublas-cu12
nvidia-cuda-cupti-cu12
nvidia-cuda-nvrtc-cu12
nvidia-cudnn-cu12
nvidia-cufft-cu12
nvidia-curand-cu12
nvidia-cusolver-cu12
nvidia-cusparse-cu12

So for CUDA 12 it seems we could potentially get onnxruntime-gpu to use the DLLs from these packages.
Unfortunately, onnxruntime-gpu for CUDA 12 is not on pypi.org (see #19438), which I hope will be solved soon.
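As a sanity check, here is a small sketch (standard library only, Python 3.8+; the function name is my own) that reports which of the packages above are installed in the current environment:

```python
from importlib.metadata import version, PackageNotFoundError

def installed_versions(names):
    """Map each distribution name to its installed version, or None if absent."""
    found = {}
    for name in names:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            found[name] = None
    return found

# A subset of the CUDA 12 wheels listed above (some might not be required).
cuda12_packages = [
    "nvidia-cuda-runtime-cu12",
    "nvidia-cublas-cu12",
    "nvidia-cudnn-cu12",
]

for name, ver in installed_versions(cuda12_packages).items():
    print(f"{name}: {ver or 'not installed'}")
```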

@snnn
Member

snnn commented Apr 8, 2024

Are they real? The file size of nvidia-cuda-runtime-cu12 is less than 1MB.
https://pypi.org/project/nvidia-cuda-runtime-cu12/12.4.127/#files

@martinResearch
Author

Are they real? The file size of nvidia-cuda-runtime-cu12 is less than 1MB. https://pypi.org/project/nvidia-cuda-runtime-cu12/12.4.127/#files

I think so. The .whl file contains cudart64_12.dll, which is about 540 KB, roughly the same size as the DLL we get from the official CUDA toolkit installer.

@snnn
Member

snnn commented Apr 11, 2024

But the cudnn files look weird. The latest one (9.0) only has binaries for ARM64, while a previous version (8.9) only has binaries for Windows/Linux x64. And we don't know whether the ARM one is for SBSA or Jetson.

@gedoensmax
Contributor

@snnn These cudnn wheels look reasonable to me: https://pypi.org/project/nvidia-cudnn-cu12/#files
Which ones did you look at?

@snnn
Member

snnn commented May 28, 2024

Now it is good. Thanks!

@snnn snnn assigned jchen351 and unassigned snnn and pranavsharma May 28, 2024
@gedoensmax
Contributor

@snnn or @jchen351, did you start any work on this yet? The problem propagates to ORT GenAI as well, which is even more Python focused. pip deployment will be much easier with the nvidia-* packages.

@snnn
Member

snnn commented Oct 14, 2024

Sorry, the work has not been started.

@snnn
Member

snnn commented Oct 14, 2024

The first solution is not available to us

Packaged CUDA DLLs with onnxruntime-gpu Wheels:

Because these DLLs are large and we are very tight on space.

We can try the second one.

Dependency Configuration via onnxruntime-gpu Wheel:

@jchen351
Contributor

@martinResearch Do you know if libcuda.so should be installed via pip? If so, do you know where I can find it?

@martinResearch
Author

@martinResearch Do you know if libcuda.so should be installed via pip? If so, do you know where I can find it?

I don't know. Is it required for onnxruntime-gpu? It is not in the Python environment I use to run onnxruntime-gpu. You can find libcudart.so.12 in https://files.pythonhosted.org/packages/f0/62/65c05e161eeddbafeca24dc461f47de550d9fa8a7e04eb213e32b55cfd99/nvidia_cuda_runtime_cu12-12.6.77-py3-none-manylinux2014_x86_64.whl

@gedoensmax
Contributor

libcuda.so is a driver library and is installed with the NVIDIA driver. In a Docker container it is mounted when using the NVIDIA Container Toolkit.

@martinResearch
Author

martinResearch commented Nov 4, 2024

When installing the packages nvidia-cuda-nvrtc-cu12 and nvidia-cudnn-cu12, the DLLs end up in lib\site-packages\nvidia\cuda_nvrtc\bin and lib\site-packages\nvidia\cudnn\bin respectively, and I wonder if that makes using these DLLs more complicated on the onnxruntime side.
Maybe NVIDIA could structure the published packages using namespace packages so that the DLLs for the different packages end up in the same lib\site-packages\nvidia\bin folder? I don't know whether that solution is pip-compliant and would still allow uninstalling individual NVIDIA packages, though.
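To illustrate the per-package layout, here is a hedged sketch (the function names are my own, not an onnxruntime API) that collects every site-packages\nvidia\<pkg>\bin directory so the set could be registered with os.add_dll_directory before loading onnxruntime:

```python
import os
from pathlib import Path

def nvidia_dll_dirs(site_packages):
    """Return every existing <site-packages>/nvidia/<pkg>/bin directory."""
    root = Path(site_packages) / "nvidia"
    return sorted(str(p) for p in root.glob("*/bin") if p.is_dir())

def register_dll_dirs(dirs):
    # os.add_dll_directory only exists on Windows (Python 3.8+); on Linux
    # the dynamic loader would instead need rpath entries or LD_LIBRARY_PATH.
    if hasattr(os, "add_dll_directory"):
        for d in dirs:
            os.add_dll_directory(d)
```

With the current per-package layout the caller has to enumerate one bin directory per nvidia-* wheel, which is exactly the extra complexity a shared nvidia\bin folder would avoid.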

@martinResearch
Author

A similar effort is planned in the CuPy repository (cupy/cupy#8013). It might provide some valuable information on how to proceed.

@snnn
Member

snnn commented Dec 12, 2024

Configure onnxruntime to utilize these DLLs instead of those in CUDA_PATH.

Would you mind explaining more about this part? Do we need to manually preload the libraries, or do we just need to set up some search paths (e.g. https://docs.python.org/3/library/os.html#os.add_dll_directory)?
