Wasm/WebGPU backends? #484
josephrocca started this conversation in Support for Targets (OS / EPs / Hardware)
Replies: 1 comment
-
If I read this correctly, you are asking if we plan to ship onnxruntime-genai on top of ort-web.
-
Wondering if there'll be an ORT Web version of this package? Phi-3 can run at >20 tok/s on WebGPU using ORT Web, even at this early stage of WebGPU development, and the model can be cached in the browser. Large-ish models in the browser may therefore become practical fairly soon, and broad support for web backends would help with that (as exemplified by ORT's web backend spawning projects like Transformers.js).