Q: What's the prompt for the Synthwave test?

Synthwave — sunset-grid synthwave loop. Every model receives this exact prompt, one shot, single HTML file out.

Visual

Synthwave

Q: What's the best AI model for Synthwave?

GLM-5.2 — This is GLM's. A cyan wireframe mountain range scrolling under a scanline synthwave sun — the single most beautiful frame in the whole shoot-out. Opus's clean Tron grid and magenta horizon is a close, cooler-toned second. Kimi got the idea but blew the exposure — the grid washes out to near-white. GLM wins this one going away.

Q: How many AI models attempted Synthwave?

23 models on Goldie Bench have attempted Synthwave: Claude Fable 5, Fugu Ultra, Fugu Mini, Fusion, Gemini 3.6 Flash, GLM-5.2, GPT-5.6 Sol, Grok, Inkling, Kimi K2.7, Kimi K3, MiniMax M3, Hermes MoA, Opus 4.8, Claude Opus 5, Qwen 3.8, Qwen 3.7, Claude Sonnet 5, Kimi K2.7 · Fast, Kimi K2.7 · No-Think, Kimi K2.7 · Quality, DeepSeek V4 Pro, DeepSeek V4 Flash.

Synthwave — sunset-grid synthwave loop.

Category: Visual

Models tested: 23

Scored: 18/23

Avg score: 8.09/10

Winner: GLM-5.2

What I asked each model — the Synthwave prompt

Every model on this page got this exact prompt inside the Agent Operating System: Synthwave — sunset-grid synthwave loop.

Single HTML file out. No iteration. No examples in the system prompt. Whatever each model produced on the first run is what's on this page. 23 frontier models have attempted it so far.

Why this task matters. Synthwave is a textbook test of visual-class capability — the kind of build that exposes whether a model is doing pattern-matching or actual reasoning. Shipping this cleanly is the floor for what I expect from a frontier model — every model on the leaderboard should at least attempt it.

How each model handled Synthwave

Scores are my 0–10 ratings from the source comparison guides on agentos.guide. Click any entry to play the actual one-shot HTML the model produced.

Claude Fable 5 Anthropic

• 8.6/10 · textbook synthwave sunset

What I saw: Nails every synthwave staple: banded gradient sun, glowing perspective grid, layered mountain silhouettes, starfield, and neon pylons, with a strong purple/pink/cyan palette and a clean typographic title. The one oddity is the pink pylon clipping too close to the camera, but overall it's a polished, on-brief render that competes for the top.

▶ Play Claude Fable 5's attempt →

Fugu Ultra Sakana AI

• 8.5/10

What I saw: Ultra v2 — synthwave terrain flythrough. Smoke-test PASS (4.3% pixel diff).

▶ Play Fugu Ultra's attempt →
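The "4.3% pixel diff" in Fugu Ultra's smoke test is not defined on this page. One plausible reading is the share of pixels that change between two captured frames, which would confirm that the loop actually animates. The sketch below computes that figure under this assumption. The function name `pixel_diff_percent`, the `tolerance` parameter, and the flat list-of-RGB-tuples frame format are all hypothetical, not the harness's real API.

```python
from typing import Sequence, Tuple

Pixel = Tuple[int, int, int]  # assumed frame format: flat sequence of (R, G, B)


def pixel_diff_percent(
    frame_a: Sequence[Pixel],
    frame_b: Sequence[Pixel],
    tolerance: int = 0,
) -> float:
    """Return the percentage of pixels that differ between two frames.

    A pixel counts as changed when any channel differs by more than
    `tolerance` (a hypothetical knob for ignoring tiny rendering noise).
    """
    if len(frame_a) != len(frame_b):
        raise ValueError("frames must have the same number of pixels")
    if not frame_a:
        return 0.0
    changed = sum(
        1
        for a, b in zip(frame_a, frame_b)
        if any(abs(ca - cb) > tolerance for ca, cb in zip(a, b))
    )
    return 100.0 * changed / len(frame_a)


# Toy frames: 1000 black pixels, 43 of which turn magenta in the second frame.
first = [(0, 0, 0)] * 1000
second = [(255, 0, 255)] * 43 + [(0, 0, 0)] * 957
print(f"{pixel_diff_percent(first, second):.1f}% pixel diff")
```

What threshold counts as a PASS is not stated in the source, so the sketch only reports the percentage and leaves the pass/fail cut-off to the caller.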

Fugu Mini Sakana AI

• 8.0/10

What I saw: Synthwave terrain flythrough with grid + scanline sun. Smoke-test PASS.

▶ Play Fugu Mini's attempt →

Fusion OpenRouter

• 8.0/10

What I saw: Synthwave flythrough on three.js + WebGL. Neon grid below, scanline sun, mountain silhouettes. CRT effect on top.

▶ Play Fusion's attempt →

Gemini 3.6 Flash Google

• 8.4/10

What I saw: Strong synthwave build with a glowing neon grid road, wireframe mountains, retro arch/sun, chiptune audio synthesis, and a polished CRT-scanline HUD. The pink/cyan palette and vaporwave title nail the brief, though the tunnel arch reads slightly muddy and the road wireframe near the camera looks a touch busy next to a cleaner, classic sunset-sun composition.

▶ Play Gemini 3.6 Flash's attempt →

GLM-5.2 Zhipu / Z.ai

🥇 9.0/10 · winner · best frame here

What I saw: This is GLM's. A cyan wireframe mountain range scrolling under a scanline synthwave sun — the single most beautiful frame in the whole shoot-out. Opus's clean Tron grid and magenta horizon is a close, cooler-toned second. Kimi got the idea but blew the exposure — the grid washes out to near-white. GLM wins this one going away.

▶ Play GLM-5.2's attempt →

GPT-5.6 Sol OpenAI

🥉 8.7/10 · textbook synthwave sunset

What I saw: Nails every synthwave cue (striped sunset, layered neon mountains, twinkling stars, a glowing perspective grid with a clean vanishing point, and steer/pulse interactivity) with polished typography and gradients. The only nit is the paused status showing on capture; otherwise it's a textbook, task-topping render.

▶ Play GPT-5.6 Sol's attempt →

Grok xAI

• 8.0/10

What I saw: A scanline sun behind a bold silhouette, a neon grid running to the horizon. Proper retro atmosphere, zero clicks.

▶ Play Grok's attempt →

Inkling Thinking Machines

• 7.2/10

What I saw: Strong typography and neon grid perspective land the synthwave vibe, but the sun renders as a flat pale-white disc (not a warm gradient sunset) and the emissive cyan 'lines' read as stray beams shooting through the sun rather than a polished retro loop, keeping it below the field's best.

▶ Play Inkling's attempt →

Kimi K2.7 Moonshot AI

• 6.5/10

▶ Play Kimi K2.7's attempt →
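As a quick arithmetic check, the ten scores shown in the cards above can be sorted and averaged in a few lines. This is only a sketch over the entries visible here. The page reports 18 of 23 attempts scored, so the published figures may draw on entries not listed in this section.

```python
from statistics import mean

# Scores copied from the ten cards above (model name -> 0-10 score).
scores = {
    "Claude Fable 5": 8.6,
    "Fugu Ultra": 8.5,
    "Fugu Mini": 8.0,
    "Fusion": 8.0,
    "Gemini 3.6 Flash": 8.4,
    "GLM-5.2": 9.0,
    "GPT-5.6 Sol": 8.7,
    "Grok": 8.0,
    "Inkling": 7.2,
    "Kimi K2.7": 6.5,
}

# Sort highest score first; ties keep the order in which they were listed.
ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
average = round(mean(scores.values()), 2)

for place, (model, score) in enumerate(ranked, start=1):
    print(f"{place:>2}. {model:<18} {score:.1f}")
print(f"Average of listed scores: {average:.2f}/10")
```

Over these ten entries the mean comes out to 8.09, with GLM-5.2 first and Kimi K2.7 last.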

The winner on Synthwave

GLM-5.2 took gold on this task with a 9.0/10: the winner, and the best frame here.

See GLM-5.2's full model card: /models/glm.

Every attempt — live, playable

Side by side. Click any tile to run that model's actual one-shot HTML in a new tab.
