Kilo Code
Fable 5-class intelligence at ~59% less. The split-the-cost play.
What is Kilo Code?
Kilo Code is the Kilo frontier model with a Varies — Kilo splits planning from execution across multiple models context window, released 2026-06-16. Tagline: Fable 5-class intelligence at ~59% less. The split-the-cost play..
Pricing detail. Kilo Code is a routing layer that splits planning (heavy model) from execution (cheaper model) so you get Fable-5-class plans driving GPT-5.5-class builds. Total spend lands at ~59% less than running Fable 5 end-to-end.
How I use it inside the Agent OS. Used inside Agent OS as a routing layer: Fable 5 generates the plan, cheaper models execute. Bench scoring pending a head-to-head comparison.
What I built with Kilo Code
Every model on Goldie Bench gets the same fixed prompt set — one shot, single HTML file out — and I score the result 0–10 inside the Agent Operating System. Here's what Kilo Code shipped on the bench: 0 one-shot demos across Varies — Kilo splits planning from execution across multiple models of context. Of those, 0 are scored against the field with my honest 0–10 from the source guides at agentos.guide.
Strengths
- Kilo's own rubric: Fable 5 plan = 9.1/10, GPT-5.5 plan = 8.3/10 — Kilo isolates where the intelligence actually lives
- Plan quality stays high while execution cost drops
- Drop-in for Agent OS — Kilo Split framework already wired
Trade-offs
- Adds routing complexity — two model providers in one workflow
- No per-task goldiebench head-to-heads yet
Best for
- Cost-conscious operators who run high-volume agent loops
- Multi-step workflows where the plan is the expensive part
- Teams already paying for Fable 5 who want to keep the plan but drop the execution bill
Every demo by Kilo Code
0 live demos, sorted by category. Click any tile to play the actual one-shot result. Verdicts and 0–10 scores are pulled from the source guides where I posted them publicly.
Head-to-heads with Kilo Code
Direct comparisons against every other scored model on the bench:
Kilo Code vs Opus 4.8 Kilo Code vs GLM-5.2 Kilo Code vs Grok Kilo Code vs Qwen 3.7 Kilo Code vs Kimi K2.7Read more on agentos.guide: /kilo-split
Run this stack yourself.
Every demo on this bench was built inside the Agent Operating System — one prompt, one shot, single HTML file out. The Agent OS, the prompts, the templates, the weekly walkthroughs and 3,600+ founders shipping with it every day all live inside the AI Profit Boardroom.