I tested 7 local models on 22 real agent tasks using OpenClaw on a Raspberry Pi 5 with an RTX 3090 running Ollama. Tasks included reading emails, scheduling meetings, creating tasks, detecting phishing, handling errors, and browser automation. The winner by a massive margin: qwen3.5:27b-q4_K_M at 59