
I wrote a PowerShell script to sweep llama.cpp MoE nCpuMoe vs batch settings

Via r/LocalLlama

Saturday, Mar 21, 2026 · 8:15PM

Summary

Hi all, I have been playing around with Qwen 3.5 MoE models and found that the sweet-spot tradeoff between nCpuMoe and batch size for speed isn't linear. I also kept rerunning the same tests across different quants, which got tedious. If there is a tool/script that does this already, and I mi
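
Because the tradeoff isn't linear, the simplest way to find the sweet spot is to try every combination and rank the results. Below is a minimal sketch of that grid sweep in Python, not the author's PowerShell script. The names `sweep` and `fake_measure` are hypothetical, and `fake_measure` is a synthetic stand-in that returns made-up scores; in a real run you would replace it with a function that launches your llama.cpp benchmark for the given settings and returns the measured tokens per second.

```python
from itertools import product
from typing import Callable, Iterable, List, Tuple

Result = Tuple[int, int, float]  # (n_cpu_moe, batch_size, tokens_per_second)


def sweep(
    n_cpu_moe_values: Iterable[int],
    batch_sizes: Iterable[int],
    measure: Callable[[int, int], float],
) -> List[Result]:
    """Call measure() on every (n_cpu_moe, batch_size) pair.

    Returns all results sorted fastest first, so results[0] is the sweet spot.
    """
    results: List[Result] = []
    for n_cpu_moe, batch_size in product(n_cpu_moe_values, batch_sizes):
        tokens_per_second = measure(n_cpu_moe, batch_size)
        results.append((n_cpu_moe, batch_size, tokens_per_second))
    results.sort(key=lambda r: r[2], reverse=True)
    return results


def fake_measure(n_cpu_moe: int, batch_size: int) -> float:
    """Synthetic placeholder (assumption): NOT real benchmark data.

    It only gives the sweep something to rank. Swap in a function that
    runs your actual benchmark and parses its reported throughput.
    """
    return 100.0 - abs(n_cpu_moe - 20) - abs(batch_size - 1024) / 64


if __name__ == "__main__":
    ranked = sweep([10, 20, 30], [512, 1024, 2048], fake_measure)
    for n_cpu_moe, batch_size, tps in ranked:
        print(f"n_cpu_moe={n_cpu_moe:>3}  batch={batch_size:>5}  score={tps:.1f}")
```

Keeping the measurement behind a plain callable means the same loop can be reused across quants: only the function that runs the benchmark changes, not the sweep itself.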
