Community

Run Qwen3.5-4B on AMD NPU

Via r/LocalLlama

Wednesday, Mar 25, 2026 · 3:41PM

Summary

Tested on Ryzen AI 7 350 (XDNA2 NPU), 32GB RAM, using Lemonade v10.0.1 and FastFlowLM v0.9.36. Features Low-power Well below 50°C without screen recording Tool-calling support Up to 256k tokens (not on this 32GB machine) VLMEvalKit score: 85.6% FLM supports all XDNA 2 NPUs. Some links: Perf. benchma

Continue reading the full article

Read at r/LocalLlama

www.reddit.com