ok so I found the fix to Qwen thinking repetition. I discovered that pasting this system prompt from Claude fixes it completely. Other long system prompts might also work. I use 1.5 presence penalty, everything else llama.cpp webui defaults, no kv cache quant (f16), and i use a q6k static quan