AI
Models
Opus · Sonnet · Haiku · ChatGPT · OpenRouter.
The model choice matters more than the prompt. I run a routing layer — Opus for hard thinking, Sonnet for production defaults, Haiku for fast/cheap, GPT-5 for second opinions, OpenRouter for everything else.
- Frontier · production · fast tier per task
- Cost ceilings enforced per pipeline
- Model swap behind a single interface (LiteLLM / OpenRouter)
Why
Picking one model and forcing every task through it is how AI bills blow up. A routing layer maps each task to the cheapest model that can do it well — and lets you swap when something better ships.
How
- Tier the work: frontier · production · fast
- Hard cost ceiling per pipeline, fail loud if hit
- All providers behind one interface (LiteLLM / OpenRouter)
Proof
- Cost reduction vs. flat-Opus
- ~60%
- Providers wired
- Claude · OpenAI · Gemini · Mistral · local
- Pipelines on routing
- all prod