Right now I'm running a 12GB GPU with Qwen3-30B-A3B and Omnicoder. I'm looking at a new 16GB card, yet I don't see what better model I could run on it: Qwen 27B would take at least ~24GB. I would pretty much be running the same 30B A3B with a slightly better quantization and a little more context. Am