What's the prompt for the Neonracer test?

Neon Racer — fullscreen neon racer with vapor-trail particle effects. Every model receives this exact prompt, one shot, single HTML file out.

Game

Neonracer

Q: What's the best AI model for Neonracer?

Claude Opus 5 — Gorgeous vaporwave 3D scene with glowing vapor-trail particles, neon road edges, radar with tracked threats, and rich HUD — the vapor plume behind the craft is exactly on-brief and the twin-moon/skyline backdrop sells the atmosphere. Enemies/threats are present (radar shows 5, triangles ahead) and combat systems are wired up, edging past the field's best.

Q: How many AI models attempted Neonracer?

25 models on Goldie Bench have attempted Neonracer: Claude Fable 5, Fugu Ultra, Fugu Ultra 1.1, Fugu Mini, Fusion, Gemini 3.6 Flash, GLM-5.2, GPT-5.6 Sol, Grok, Inkling, Kimi K2.7, Kimi K3, MiniMax M3, Hermes MoA, Muse Spark 1.2, Opus 4.8, Claude Opus 5, Qwen 3.8, Qwen 3.7, Claude Sonnet 5, Kimi K2.7 · Fast, Kimi K2.7 · No-Think, Kimi K2.7 · Quality, DeepSeek V4 Pro, DeepSeek V4 Flash.

Neon Racer — fullscreen neon racer with vapor-trail particle effects.

CategoryGame

Models tested25

Scored20/25

Avg score8.07/10

WinnerClaude Opus 5

What I asked each model — the Neonracer prompt

Every model on this page got this exact prompt inside the Agent Operating System: Neon Racer — fullscreen neon racer with vapor-trail particle effects.

Single HTML file out. No iteration. No examples in the system prompt. Whatever each model produced on the first run is what's on this page. 25 frontier models have attempted it so far: Claude Fable 5, Fugu Ultra, Fugu Ultra 1.1, Fugu Mini, Fusion, Gemini 3.6 Flash, GLM-5.2, GPT-5.6 Sol, Grok, Inkling, Kimi K2.7, Kimi K3, MiniMax M3, Hermes MoA, Muse Spark 1.2, Opus 4.8, Claude Opus 5, Qwen 3.8, Qwen 3.7, Claude Sonnet 5, Kimi K2.7 · Fast, Kimi K2.7 · No-Think, Kimi K2.7 · Quality, DeepSeek V4 Pro, DeepSeek V4 Flash.

Why this task matters. Neonracer is a textbook test of game-class capability — the kind of build that exposes whether a model is doing pattern-matching or actual reasoning. A model that ships this in one shot is usually safe to wire into your agent loop for harder tasks of the same shape.

How each model handled Neonracer

Ranked by my 0–10 score from the source comparison guides on agentos.guide. Click any to play the actual one-shot HTML the model produced.

Claude Fable 5 Anthropic

• 8.3/10 · polished neon racer

What I saw: Renders cleanly with strong on-brief neon aesthetic — glowing pink/cyan road edges, retro grid, colorful pylons and dashes create a convincing synthwave scene. Weaknesses: the vapor trail particles aren't clearly visible in the shot and the player car is a bit blocky, keeping it just shy of the top.

▶ Play Claude Fable 5's attempt →

Fugu Ultra Sakana AI

• 8.0/10

What I saw: Ultra v2 — top-down neon racer. Smoke-test PASS.

▶ Play Fugu Ultra's attempt →

Fugu Ultra 1.1 Sakana AI

• 8.0/10

What I saw: Strong neon vaporwave scene with detailed craft, gradient sky, retro sun stripes, and a polished HUD showing 9 enemies to hunt; the vapor-trail particle effects aren't visible in this static shot and no enemies are yet in frame, keeping it just shy of the field's best.

▶ Play Fugu Ultra 1.1's attempt →

Fugu Mini Sakana AI

• 8.0/10

What I saw: Top-down neon racer with vapor trails. Smoke-test PASS.

▶ Play Fugu Mini's attempt →

Fusion OpenRouter

• 8.5/10

What I saw: Top-down neon racer with vapor-trail particles, synthwave aesthetic, drift physics, lap timer, procedural track. 22KB.

▶ Play Fusion's attempt →

Gemini 3.6 Flash Google

• 8.3/10

What I saw: Strong synthwave aesthetic with glowing neon grid, retro sun/skyline, polished HUD, and a detailed ship with vapor-trail particles; enemies (8 hostiles) are present with combat wired in but appear small/distant, keeping it just shy of the field's best.

▶ Play Gemini 3.6 Flash's attempt →

GLM-5.2 Zhipu / Z.ai

• 7.5/10

What I saw: 39KB · plays clean · plain

▶ Play GLM-5.2's attempt →

GPT-5.6 Sol OpenAI

🥈 8.6/10 · polished synthwave racer

What I saw: Strong synthwave scene renders beautifully — retro sun with scanlines, glowing neon arches, layered city skyline, and a clean 3D car with cohesive HUD (nitro, speed, brand). Only nit is the vapor-trail particles aren't visible in this static frame, but the overall polish and on-brief execution clearly top the field.

▶ Play GPT-5.6 Sol's attempt →

Grok xAI

• 8.0/10

What I saw: Top-down neon racer with vapor trails, synthwave aesthetic, drift, lap timer, procedural track. 22KB.

▶ Play Grok's attempt →

Inkling Thinking Machines

• 7.4/10

What I saw: Strong neon aesthetic with glowing title, cyan/pink pillars and a clear car, and the code has a working follow-cam, boost and vapor particles; but the screenshot shows almost no visible particle trail and the pillars float/scatter awkwardly, leaving it polished-looking but generic rather than a standout racer.

▶ Play Inkling's attempt →

The winner on Neonracer

Claude Opus 5 took gold on this task. neon vaporwave racer.

What I saw: Gorgeous vaporwave 3D scene with glowing vapor-trail particles, neon road edges, radar with tracked threats, and rich HUD — the vapor plume behind the craft is exactly on-brief and the twin-moon/skyline backdrop sells the atmosphere. Enemies/threats are present (radar shows 5, triangles ahead) and combat systems are wired up, edging past the field's best.

See Claude Opus 5's full model card: /models/opus5. Direct head-to-head against the runner-up: Claude Opus 5 vs GPT-5.6 Sol.

Every attempt — live, playable

Side by side. Click any tile to run that model's actual one-shot HTML in a new tab.

Neonracer

What I asked each model — the Neonracer prompt

How each model handled Neonracer

The winner on Neonracer

Every attempt — live, playable

How I scored Neonracer — methodology

Related

Run this stack yourself.