27 models from 5 providers in LiteLLM, exposed to OpenCode through smart routers that pick the right model tier by prompt content, not context size. Runs locally via Docker with caching, spend tracking, and one endpoint.
12 open coding models benchmarked against Claude and GPT-5.5. DeepSeek V4 Flash handles 70% of tasks at 12x cheaper than DeepSeek V4 Pro. MiMo-V2.5 is now the cheapest high-volume option at 30,100 req/5h. Qwen3.7 Max leads on SWE-bench Pro (60.6%). Kimi K2.6 leads on agentic coding. Here’s how to route between them.