Webos

Web-OS Desktop — a working desktop with windows, dock, Notes / Paint / Terminal apps.

Category: Page
Models tested: 5
Scored: 4/5
Avg score: 8.62/10
Winner: Fusion

What I asked each model — the Webos prompt

Every model on this page got this exact prompt inside the Agent Operating System: Web-OS Desktop — a working desktop with windows, dock, Notes / Paint / Terminal apps.

Single HTML file out. No iteration. No examples in the system prompt. Whatever each model produced on the first run is what's on this page. Five frontier models have attempted it so far: Fusion, Grok, Kimi K2.7, MiniMax M3, GLM-5.2.

Why this task matters. Webos is a textbook test of page-class capability: windows, a dock, and the Notes, Paint, and Terminal apps all have to work together in one file, which exposes whether a model is pattern-matching or actually reasoning. Shipping this cleanly is the floor for what I expect from a frontier model, and every model on the leaderboard should at least attempt it.

How each model handled Webos

Ranked by my 0–10 score from the source comparison guides on agentos.guide. Click any to play the actual one-shot HTML the model produced.

Fusion OpenRouter
🥇 9.0/10 · winner · most ambitious app

What I saw: A tiny working desktop OS in 24KB — wallpaper, taskbar dock with app icons, draggable resizable windows for Notes, Paint, Terminal (echo-only), Calculator. Apple-Sequoia aesthetic. Most ambitious application build on the bench.

▶ Play Fusion's attempt →

Grok xAI
🥇 9.0/10 · winner · ambitious desktop

What I saw: Web-OS desktop with wallpaper, dock, draggable resizable windows for Notes/Paint/Terminal/Calculator. 33KB — beats Fusion's 24KB attempt on density.

▶ Play Grok's attempt →

MiniMax M3 MiniMax
🥉 8.5/10

What I saw: 38KB working desktop: wallpaper, dock, draggable Notes/Paint/Terminal/Calculator windows.

▶ Play MiniMax M3's attempt →

Kimi K2.7 Moonshot AI
• 8.0/10

What I saw: 30KB working desktop: wallpaper, dock, draggable windows.

▶ Play Kimi K2.7's attempt →

GLM-5.2 Zhipu / Z.ai
• unranked

Demo on the bench. Not scored yet — play it and form your own opinion.

▶ Play GLM-5.2's attempt →
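
As a quick sanity check on the header numbers, here is a minimal Python sketch that recomputes the "Scored" count, the average, and the ranking from the scores listed above. The dictionary layout and variable names are my own illustration, not part of the bench tooling; the unscored GLM-5.2 entry is represented as `None` and left out of the average.

```python
from statistics import mean

# Scores as listed on this page; None marks a model that has not been scored yet.
scores = {
    "Fusion": 9.0,
    "Grok": 9.0,
    "Kimi K2.7": 8.0,
    "MiniMax M3": 8.5,
    "GLM-5.2": None,
}

# Only scored models count toward the average and the ranking.
scored = {name: s for name, s in scores.items() if s is not None}

# Sort highest score first; Python's sort is stable, so ties keep listing order.
ranked = sorted(scored.items(), key=lambda item: item[1], reverse=True)

average = mean(scored.values())

print(f"Scored: {len(scored)}/{len(scores)}")  # Scored: 4/5
print(f"Average: {average}")                   # Average: 8.625
for name, score in ranked:
    print(f"{name}: {score}/10")
```

The exact mean of the four scored models is 8.625, which the page header shows as 8.62/10.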

The winner on Webos

Fusion took gold on this task with 9.0/10, tagged winner and most ambitious app. Grok matched that 9.0/10 score.

See Fusion's full model card: /models/fusion.

How I scored Webos — methodology

Three axes, 0–10 each, averaged.

Runs: drop the .html in a browser; if it opens to a broken page, it scores zero.
Hits the brief: did the model ship the thing the prompt asked for, or a different thing it found easier?
Looks good: visual polish, motion, interactivity. This is where most of the gap between gold and silver lives.
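
To make the arithmetic concrete, here is a minimal Python sketch of the three-axis average. The `AxisScores` class, the `overall` function, and the per-axis values in the usage lines are all hypothetical; this page only publishes the final score per model, not the per-axis breakdown. I am also assuming that a broken page zeroes the Runs axis only, rather than the whole score.

```python
from dataclasses import dataclass


@dataclass
class AxisScores:
    """Hypothetical container for the three axes, each on a 0-10 scale."""
    runs: float        # 0 if the .html opens to a broken page (assumption)
    hits_brief: float  # did the model ship what the prompt asked for
    looks_good: float  # visual polish, motion, interactivity


def overall(axes: AxisScores) -> float:
    """Return the plain average of the three axis scores."""
    values = (axes.runs, axes.hits_brief, axes.looks_good)
    for value in values:
        if not 0 <= value <= 10:
            raise ValueError(f"axis score out of range: {value}")
    return sum(values) / len(values)


# Illustrative values only, not the real per-axis scores of any model.
working = AxisScores(runs=10, hits_brief=9, looks_good=8)
broken = AxisScores(runs=0, hits_brief=9, looks_good=8)

print(overall(working))  # 9.0
print(overall(broken) < overall(working))  # True
```

Under this reading, a page that fails to open still earns credit on the other two axes; if a broken page is meant to zero the whole score, `overall` would return 0 whenever `runs` is 0.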

My scores trace back to the source comparison guides on agentos.guide. See the full methodology page for data provenance, including which source guide each cell's score came from.
