Best AI News — Updated Every 3 Hours
Community

I wrote a PowerShell script to sweep llama.cpp MoE nCpuMoe vs batch settings

Via r/LocalLlama
Saturday, Mar 21, 2026 · 8:15PM
Summary

Hi all, I have been playing around with Qwen 3.5 MoE models and found that the sweet-spot tradeoff between nCpuMoe and batch size for speed isn't linear. I also kept rerunning the same tests across different quants, which got tedious. If there is a tool/script that does this already, and I mi

Read at r/LocalLlama
www.reddit.com