Community

Fixing Qwen thinking repetition

Via r/LocalLlama

Saturday, Mar 21, 2026 · 2:08PM

Summary

ok so I found the fix to Qwen thinking repetition. I discovered that pasting this system prompt from Claude fixes it completely. Other long system prompts might also work. I use 1.5 presence penalty, everything else llama.cpp webui defaults, no kv cache quant (f16), and i use a q6k static quan

Continue reading the full article

Read at r/LocalLlama

www.reddit.com

More from Best AI News

95% of UK students now use AI and their experiences couldn't be more divided

The Decoder · Industry & Money

Chinese AI model MiniMax M2.7 reportedly helped develop itself

The Decoder · Industry & Money

I Tried DoorDash’s Tasks App and Saw the Bleak Future of AI Gig Work

Wired AI · Policy & Culture

OpenAI's chief scientist trusts AI with experiments but says it's not at the level to design complex systems

The Decoder · Industry & Money

Back to all stories