As AI workloads move into the browser, Linux lacks a unified low-level acceleration layer equivalent to Windows's DirectML or Apple's Core ML, and that gap creates major bottlenecks. In this talk, we explore how WebNN and next-generation WebLLM can unlock efficient on-device inference on RISC-V, using Tenstorrent hardware and the emerging, vector-length-agnostic RVV 1.0 vector ISA. We cover the challenges of integrating WebNN on Linux, the importance of WebAssembly (WASM) engine support for RVV, and demonstrate progress on running modern LLMs directly in the browser. We will also detail an RVV-enabled WASM implementation path for WebNN and what’s needed upstream.
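For context, this is roughly what WebNN looks like from the page's side: a minimal sketch that builds and runs a tiny matmul-plus-relu graph. The helper name `runTinyGraph` is ours, and details such as the descriptor fields (`shape` vs. `dimensions`) and the compute entry point have shifted between W3C spec drafts, so treat this as illustrative rather than a stable contract.

```typescript
// WebNN is not yet in TypeScript's lib.dom.d.ts, so declare loose types
// for this sketch rather than pulling in experimental declarations.
declare const MLGraphBuilder: any;
const ml = (navigator as any).ml;

async function runTinyGraph(): Promise<Float32Array> {
  // Ask the browser for an ML context. On Linux, this is exactly where a
  // DirectML/Core ML-equivalent backend would need to plug in.
  const context = await ml.createContext({ deviceType: "gpu" });
  const builder = new MLGraphBuilder(context);

  // 2x2 float32 operands ("shape" in recent drafts, "dimensions" in older ones).
  const desc = { dataType: "float32", shape: [2, 2] };
  const a = builder.input("a", desc);
  const b = builder.input("b", desc);

  // y = relu(a × b), expressed as a graph the backend can compile and fuse.
  const y = builder.relu(builder.matmul(a, b));
  const graph = await builder.build({ y });

  // Execute with typed-array buffers; earlier drafts return an
  // MLComputeResult holding the (transferred) output buffers.
  const inputs = {
    a: new Float32Array([1, 2, 3, 4]),
    b: new Float32Array([5, 6, 7, 8]),
  };
  const outputs = { y: new Float32Array(4) };
  const result = await context.compute(graph, inputs, outputs);
  return result.outputs.y;
}
```

On RISC-V, the interesting question is what sits beneath `createContext`: without a system-level acceleration layer, graphs like this fall back to a WASM CPU path, which is why RVV-aware WASM code generation matters.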