This isn't a repo, it's just how my Linux workstation is built. My setup:

- vLLM Docker container: easy deployment and parallel inference.
- Claude Code: vibecoding and Agent Teams orchestration. Points at the vLLM localhost endpoint instead of a cloud provider.
- gpt-oss:120
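A minimal sketch of how a setup like this might be launched, assuming vLLM's official OpenAI-compatible Docker image. The image tag, model id, port, and GPU split below are illustrative assumptions, not details from the post:

```shell
# Sketch only: serve an open-weight model locally with vLLM's
# OpenAI-compatible server. Model id and flags are assumptions.
docker run --gpus all --ipc=host -p 8000:8000 \
    vllm/vllm-openai:latest \
    --model openai/gpt-oss-120b \
    --tensor-parallel-size 2

# A client (e.g. Claude Code, typically through a translation proxy)
# can then be pointed at the local endpoint instead of a cloud provider:
#   http://localhost:8000/v1
```

The `--tensor-parallel-size` flag splits the model across GPUs, which is what makes a large model like this servable on a single multi-GPU workstation.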