Vendor

All Moonshot AI AI models

Beijing-based vendor behind Kimi K2.7 — a frontier coder available on a flat plan.

Models on bench4

Total task attempts188

Scored cells25

Gold medals🥇 1

Vendor avg score7.46/10

Top modelKimi K2.7

My take on Moonshot AI

Moonshot AI's Kimi K2.7 is the sleeper hit of the bench — frontier-class coder on a flat-rate plan, which changes the economics of agent loops entirely. The three modes (Fast, No-Think, Quality) are useful in different ways: Fast for snappy iteration, Quality for one-shot polish, No-Think when the plan is already in the prompt. The flat plan means I can iterate without watching a meter.

Where I use Moonshot AI inside the Agent OS. Each model below has a "How I use it" line in its detail page — that's the daily-usage view, not the marketing pitch.

Every Moonshot AI model on Goldie Bench

Click any card for the full model card, every demo, and direct head-to-head comparisons.

Kimi K2.7 Moonshot AI

The heavy lifter — frontier coder at flat-rate.

7.46avg

47tasks

1🥇

2🥈

Kimi K2.7 · Fast Moonshot AI

Fast mode — top speed, minimal thinking.

Kimi K2.7 · No-Think Moonshot AI

Pure execution mode — no chain of thought.

Kimi K2.7 · Quality Moonshot AI

Quality mode — deepest thinking, best output.

How I tested Moonshot AI's models

Every model on this page received the exact same fixed prompt as every other model on the bench. One shot, single HTML file out, scored 0–10 by me on three axes (runs, hits the brief, looks good). The scoring is published in my source comparison guides on agentos.guide — see the methodology page for full data provenance.

Vendor: moonshot.ai ↗

The same stack Julian uses

Run this stack yourself.

Every demo on this bench was built inside the Agent Operating System — one prompt, one shot, single HTML file out. The Agent OS, the prompts, the templates, the weekly walkthroughs and 4,000+ founders shipping with it every day all live inside the AI Profit Boardroom.

4,000+founders

258documented wins

38countries

$59/momonthly

Join AIPB · $59/mo → Read the Agent OS guides →