Best AI News — Updated Every 3 Hours
Story Page
← All Stories
Home Community Story
Community

Fixing Qwen thinking repetition

Via r/LocalLlama
Saturday, Mar 21, 2026 · 2:08PM
Summary

ok so I found the fix to Qwen thinking repetition. I discovered that pasting this system prompt from Claude fixes it completely. Other long system prompts might also work. I use 1.5 presence penalty, everything else llama.cpp webui defaults, no kv cache quant (f16), and i use a q6k static quan

Continue reading the full article
Read at r/LocalLlama
www.reddit.com
Back to all stories