Single-source cross-platform GPU LLM inference with Slang and Rust

2026-01-31T13:05:00+01:00 for 00:20

Leveraging Rust and Khronos' emerging Slang initiative, we introduce our efforts toward a cross-platform GPU LLM inference ecosystem. With a single-source approach, we aim to minimize backend-specific code and foster community participation by writing inference kernels once and running them everywhere.

AI Plumbers Track

View on FOSDEM site