Best AI News — Updated Every 3 Hours
Story Page
← All Stories
Home Community Story
Community

Run Qwen3.5-4B on AMD NPU

Via r/LocalLlama
Wednesday, Mar 25, 2026 · 3:41PM
Summary

Tested on Ryzen AI 7 350 (XDNA2 NPU), 32GB RAM, using Lemonade v10.0.1 and FastFlowLM v0.9.36. Features Low-power Well below 50°C without screen recording Tool-calling support Up to 256k tokens (not on this 32GB machine) VLMEvalKit score: 85.6% FLM supports all XDNA 2 NPUs. Some links: Perf. benchma

Continue reading the full article
Read at r/LocalLlama
www.reddit.com
Back to all stories