Moonshot AI

Kimi K2.7

The heavy lifter — frontier coder at flat-rate.

Context: 256,000 tokens
Pricing: Flat plan (no per-token bill)
Tasks tested: 23
Avg score: 7.25/10
Medals: 🥇 3, 🥈 2, 🥉 9
Release: 2026-06

What is Kimi K2.7?

Kimi K2.7 is Moonshot AI's frontier model with a 256,000-token context window, released 2026-06.

Pricing detail. Available on Moonshot's flat-rate subscription plan — no per-token billing for individual builders. The plan covers all three speed modes (Fast, No-Think, Quality).

How I use it inside the Agent OS. It is wired in as the heavy lifter for game/sim prototypes and Kanban-dispatched code work. I toggle the mode per task: Quality for one-shot games, Fast for short bursts.
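As a rough illustration of the per-task toggle described above, here is a minimal sketch. The task labels and the `pick_mode` helper are hypothetical names invented for this example, not part of the Agent OS or any Moonshot API. Only the two pairings stated in the text are mapped; anything else raises an error rather than guessing a mode.

```python
from enum import Enum


class Mode(Enum):
    """The three speed modes named on the plan."""
    FAST = "Fast"
    NO_THINK = "No-Think"
    QUALITY = "Quality"


# Hypothetical task labels. Only these two pairings come from the text above.
MODE_BY_TASK = {
    "one_shot_game": Mode.QUALITY,
    "short_burst": Mode.FAST,
}


def pick_mode(task_kind: str) -> Mode:
    """Return the mode for a task kind, or fail loudly if none is mapped."""
    try:
        return MODE_BY_TASK[task_kind]
    except KeyError:
        raise ValueError(f"no mode mapped for task kind {task_kind!r}") from None
```

Failing on unmapped task kinds is a deliberate choice in this sketch: the text does not say when No-Think is used, so the example leaves that decision to whoever extends the table.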

What I built with Kimi K2.7

Every model on Goldie Bench gets the same fixed prompt set (one shot, single HTML file out), and I score the result 0–10 inside the Agent Operating System. Kimi K2.7 shipped 23 one-shot demos on the bench. Of those, 14 are scored against the field with my honest 0–10 scores from the source guides at agentos.guide.
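The header numbers can be cross-checked with a little arithmetic. The sketch below assumes that every scored demo earned exactly one medal; the page does not state this, but the medal counts (3 + 2 + 9) do add up to the 14 scored demos. The `summarize` helper is a hypothetical name used only for this example.

```python
def summarize(medals: dict[str, int], tasks_tested: int) -> dict[str, int]:
    """Derive scored and unscored demo counts from the medal tally.

    Assumption: one medal per scored demo (not stated on the page).
    """
    scored = sum(medals.values())
    return {"scored": scored, "unscored": tasks_tested - scored}


# Figures taken from the model card at the top of the page.
kimi_medals = {"gold": 3, "silver": 2, "bronze": 9}
summary = summarize(kimi_medals, tasks_tested=23)
```

Under that assumption, 14 of the 23 demos are scored and the remaining 9 are live but unscored.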

Strengths

  • Best-of-three on interactive games — raycaster, DOOM, monster AI
  • Three speed modes (Fast / No-Think / Quality) you can swap per task
  • Flat-rate plan eliminates the per-token meter, so extra iterations add no per-token cost

Trade-offs

  • Plays plainest on abstract visual prompts — synthwave grids, fluid sims, aurora — where GLM and Opus add more flair
  • Bronze average on Goldie Bench despite the gold-medal games: its visual builds are accurate but understated

Best for

  • Interactive game prototypes you want shippable on the first prompt
  • High-iteration agent loops where per-token cost would dominate
  • Long-context refactors using the 256K window inside Agent OS
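For the long-context use case, a quick pre-flight check can estimate whether a set of files is likely to fit in the 256,000-token window. This is a sketch under stated assumptions: the 4-characters-per-token ratio is a rough heuristic and not Kimi's actual tokenizer, and the output reserve is an arbitrary illustrative value. Both helper names are hypothetical.

```python
CONTEXT_WINDOW_TOKENS = 256_000  # from the model card on this page
CHARS_PER_TOKEN = 4              # assumption: rough heuristic, not the real tokenizer


def estimate_tokens(text: str) -> int:
    """Estimate token count by ceiling-dividing character length."""
    return -(-len(text) // CHARS_PER_TOKEN)


def fits_in_context(texts: list[str], reserve_for_output: int = 16_000) -> bool:
    """Check whether the texts fit, leaving headroom for the model's reply.

    reserve_for_output is an illustrative default, not a documented limit.
    """
    budget = CONTEXT_WINDOW_TOKENS - reserve_for_output
    return sum(estimate_tokens(t) for t in texts) <= budget
```

A real check would use the provider's own token counter if one is available; the heuristic here only gives an order-of-magnitude answer.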

Every demo by Kimi K2.7

23 live demos, sorted by category. Click any tile to play the actual one-shot result. Verdicts and 0–10 scores are pulled from the source guides where I posted them publicly.

Head-to-heads with Kimi K2.7

Direct comparisons against every other scored model on the bench:

  • Kimi K2.7 vs Opus 4.8
  • Kimi K2.7 vs GLM-5.2
  • Kimi K2.7 vs Qwen 3.7

Read more on agentos.guide: /kimi-code, /kimi-hermes, /kimi-modes-head-to-head, /three-dragons

The same stack Julian uses

Run this stack yourself.

Every demo on this bench was built inside the Agent Operating System — one prompt, one shot, single HTML file out. The Agent OS, the prompts, the templates, the weekly walkthroughs and 3,600+ founders shipping with it every day all live inside the AI Profit Boardroom.

3,600+ founders
258 documented wins
38 countries
$100k+/mo community MRR