AI

Models

Opus · Sonnet · Haiku · ChatGPT · OpenRouter.

The model choice matters more than the prompt. I run a routing layer — Opus for hard thinking, Sonnet for production defaults, Haiku for fast/cheap, GPT-5 for second opinions, OpenRouter for everything else.

  • Frontier · production · fast tier per task
  • Cost ceilings enforced per pipeline
  • Model swap behind a single interface (LiteLLM / OpenRouter)
Why

Picking one model and forcing every task through it is how AI bills blow up. A routing layer maps each task to the cheapest model that can do it well — and lets you swap when something better ships.

How
  • Tier the work: frontier · production · fast
  • Hard cost ceiling per pipeline, fail loud if hit
  • All providers behind one interface (LiteLLM / OpenRouter)
Proof
Cost reduction vs. flat-Opus
~60%
Providers wired
Claude · OpenAI · Gemini · Mistral · local
Pipelines on routing
all prod