Skip to content

Limiting GPU Resource Usage in Onnxruntime with DirectML Provider #1069

Closed Answered by kunal-vaishnavi
asynclee asked this question in Q&A
Discussion options

You must be logged in to vote

You can use ONNX Runtime's execution provider options with ONNX Runtime GenAI. You can add them in the genai_config.json.

The CPU EP's options are available via ONNX Runtime's SessionOptions.

"session_options": {
    "log_id": "onnxruntime-genai",
    "provider_options": [],
    "cpu_ep_option_1_key": cpu_ep_option_1_value (e.g. false, 0, etc.),
    ...
}

For other EPs such as CUDA or DirectML, their options are available via ONNX Runtime's ProviderOptions.

"session_options": {
    "log_id": "onnxruntime-genai",
    "provider_options": [
        "ep_name": {
            "ep_name_option_1_key": "ep_name_option_1_value",
            ...
        }
    ],
}

You can also set the provider opt…

Replies: 3 comments

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Answer selected by kunal-vaishnavi
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants