Describe the issue
System: 2x RTX 4090, fully working (PyTorch etc. run fine).
I am trying to implement multi-GPU inference (by splitting my inputs up across the devices).
The problem occurs when I create my memory allocator for Ort tensors, as shown in the snippet under "To reproduce" below.
The session was of course created on the correct device as well.
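For reference, a minimal sketch of the per-device session setup, assuming the stock CUDA helper in Microsoft.ML.OnnxRuntime (the model path and variable names are placeholders, not from the original report):

```
using Microsoft.ML.OnnxRuntime;

// Sketch: pin the session to one GPU via the CUDA execution provider.
// deviceId here matches the deviceId later passed to OrtMemoryInfo (0 or 1).
var options = SessionOptions.MakeSessionOptionWithCudaProvider(deviceId);
var session = new InferenceSession("model.onnx", options); // placeholder path
```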
This works fine for my first card (deviceId = 0), but for deviceId = 1 I get an error:
[ErrorCode:InvalidArgument] No requested allocator available
Could this be a bug, or is there some incantation required to make this work?
To reproduce
```
// Note: OrtMemType is not a [Flags] enum; CpuInput (-2) | CpuOutput (-1)
// evaluates to -1, i.e. plain CpuOutput.
var memoryInfo = new OrtMemoryInfo(OrtMemoryInfo.allocatorCUDA_PINNED,
                                   OrtAllocatorType.DeviceAllocator,
                                   deviceId,
                                   OrtMemType.CpuInput | OrtMemType.CpuOutput);
var ortAllocator = new OrtAllocator(session, memoryInfo);
```
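For context, the allocator is then intended to back tensor allocations along these lines (hypothetical usage, not part of the original report; the shape and element type are placeholders):

```
using Microsoft.ML.OnnxRuntime;
using Microsoft.ML.OnnxRuntime.Tensors;

// Hypothetical follow-up: allocate an OrtValue tensor through the
// allocator created above (API available since ORT 1.16).
long[] shape = { 1, 3, 224, 224 }; // placeholder shape
using var tensor = OrtValue.CreateAllocatedTensorValue(
    ortAllocator, TensorElementType.Float, shape);
```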
Urgency
No response
Platform
Windows
OS Version
Fedora 41
ONNX Runtime Installation
Released Package
ONNX Runtime Version or Commit ID
1.21.0
ONNX Runtime API
C#
Architecture
X64
Execution Provider
CUDA
Execution Provider Library Version
CUDA 12.8