What's the prompt for the Fluid test?

Fluid — WebGL fluid simulation with swirling particles. Every model receives this exact prompt, one shot, single HTML file out.

Sim

Fluid

Q: What's the best AI model for Fluid?

GLM-5.2 — GLM filled the bowl with glowing liquid that actually sloshes — the most convincing 'liquid in a bowl'. Opus's particles glowed but clumped to the centre. Kimi's collapsed into a tiny blob.

Q: How many AI models attempted Fluid?

23 models on Goldie Bench have attempted Fluid: Claude Fable 5, Fugu Ultra, Fugu Mini, Fusion, Gemini 3.6 Flash, GLM-5.2, GPT-5.6 Sol, Grok, Inkling, Kimi K2.7, Kimi K3, MiniMax M3, Hermes MoA, Opus 4.8, Claude Opus 5, Qwen 3.8, Qwen 3.7, Claude Sonnet 5, DeepSeek V4 Pro, DeepSeek V4 Flash, Kimi K2.7 · Fast, Kimi K2.7 · No-Think, Kimi K2.7 · Quality.

Fluid — WebGL fluid simulation with swirling particles.

CategorySim

Models tested23

Scored18/23

Avg score7.76/10

WinnerGLM-5.2

What I asked each model — the Fluid prompt

Every model on this page got this exact prompt inside the Agent Operating System: Fluid — WebGL fluid simulation with swirling particles.

Single HTML file out. No iteration. No examples in the system prompt. Whatever each model produced on the first run is what's on this page. 23 frontier models have attempted it so far: Claude Fable 5, Fugu Ultra, Fugu Mini, Fusion, Gemini 3.6 Flash, GLM-5.2, GPT-5.6 Sol, Grok, Inkling, Kimi K2.7, Kimi K3, MiniMax M3, Hermes MoA, Opus 4.8, Claude Opus 5, Qwen 3.8, Qwen 3.7, Claude Sonnet 5, DeepSeek V4 Pro, DeepSeek V4 Flash, Kimi K2.7 · Fast, Kimi K2.7 · No-Think, Kimi K2.7 · Quality.

Why this task matters. Fluid is a textbook test of sim-class capability — the kind of build that exposes whether a model is doing pattern-matching or actual reasoning. Shipping this cleanly is the floor for what I expect from a frontier model — every model on the leaderboard should at least attempt it.

How each model handled Fluid

Ranked by my 0–10 score from the source comparison guides on agentos.guide. Click any to play the actual one-shot HTML the model produced.

Claude Fable 5 Anthropic

• 8.3/10

What I saw: Renders beautifully with elegant curl-noise flow lines and a lovely cyan-to-magenta speed-based gradient, plus autopilot vortex and clean pointer interaction. Strong and shippable, but it's more a flow-field particle field than a true fluid sim with density diffusion, so it falls just short of topping the best.

▶ Play Claude Fable 5's attempt →

Fugu Ultra Sakana AI

• 6.5/10

What I saw: Ultra v2 — fluid sim. Smoke-test MAYBE (0.0% diff) — needs click-drag injection which generic input didn't replicate.

▶ Play Fugu Ultra's attempt →

Fugu Mini Sakana AI

• 8.0/10

What I saw: 2D fluid simulation, click-drag density+velocity. Smoke-test PASS.

▶ Play Fugu Mini's attempt →

Fusion OpenRouter

• 8.5/10

What I saw: Stable-fluids style 2D simulation with click-and-drag density+velocity injection. Polished glass HUD, gradient title, live pill indicator, right-side controls panel. Touch-action set so it works on mobile. Visually on par with Opus's blob-in-bowl.

▶ Play Fusion's attempt →

Gemini 3.6 Flash Google

• 7.8/10

What I saw: Strong particle density (75k) with glowing additive blending, polished glassmorphism UI, presets and shockwave controls all render cleanly; but the center is blown out to solid white and it reads more as a bright particle cloud than an elegant swirling fluid, keeping it just short of the field's best.

▶ Play Gemini 3.6 Flash's attempt →

GLM-5.2 Zhipu / Z.ai

🥇 9.0/10 · winner · best liquid

What I saw: GLM filled the bowl with glowing liquid that actually sloshes — the most convincing 'liquid in a bowl'. Opus's particles glowed but clumped to the centre. Kimi's collapsed into a tiny blob.

▶ Play GLM-5.2's attempt →

GPT-5.6 Sol OpenAI

🥉 8.6/10 · swirling vortex flow

What I saw: Gorgeous swirling vortex of multicolored particle trails around a glowing core — the cyan/violet/pink palette, additive-blended trails, and clean UI chrome (title, metrics, custom cursor) make it genuinely polished and clearly on-brief. Not a literal Navier-Stokes fluid sim but the flow-field aesthetic reads as fluid convincingly, and the composition tops the field.

▶ Play GPT-5.6 Sol's attempt →

Grok xAI

• 7.5/10

What I saw: A round bowl of ~500 distinct glowing droplets that fall, pool at the bottom and slosh when you tilt with the mouse, with colour themes. The first version melted into one white blob; this one keeps every droplet separate, like real liquid.

▶ Play Grok's attempt →

Inkling Thinking Machines

• 6.8/10

What I saw: Renders cleanly with a polished Orbitron title and a dense particle cloud with pleasing color mix, but it reads as a scattered particle sphere rather than convincing swirling fluid motion — the flow field is subtle and the CPU per-particle loop limits it to a generic point cloud rather than the cohesive vortex the brief implies.

▶ Play Inkling's attempt →

Kimi K2.7 Moonshot AI

• 5.0/10

▶ Play Kimi K2.7's attempt →

The winner on Fluid

GLM-5.2 took gold on this task. winner · best liquid.

See GLM-5.2's full model card: /models/glm.

Every attempt — live, playable

Side by side. Click any tile to run that model's actual one-shot HTML in a new tab.

Fluid

What I asked each model — the Fluid prompt

How each model handled Fluid

The winner on Fluid

Every attempt — live, playable

How I scored Fluid — methodology

Related

Run this stack yourself.