xAI

Grok

Snappy + real-time — the X-native model.

Context256,000 tokens
PricingSubscription via X Premium
Tasks tested13
Avg scorecurrently unranked
Medals🥇0 🥈0 🥉0
Release2026-04

What is Grok?

Grok is the xAI frontier model with a 256,000 tokens context window, released 2026-04. Tagline: Snappy + real-time — the X-native model..

Pricing detail. Bundled with X (Twitter) Premium subscription — no per-token bill for end users, no individual API pricing for the chat product.

How I use it inside the Agent OS. Used for real-time content workflows where the model needs current X timeline context. Standalone bench scoring pending.

What I built with Grok

Every model on Goldie Bench gets the same fixed prompt set — one shot, single HTML file out — and I score the result 0–10 inside the Agent Operating System. Here's what Grok shipped on the bench: 13 one-shot demos across 256,000 tokens of context. Of those, 0 are scored against the field with my honest 0–10 from the source guides at agentos.guide.

Strengths

  • Real-time access to X timeline data — unique signal no other model has
  • Snappy latency on shorter prompts
  • 256K context window keeps pace with the open-weights field

Trade-offs

  • 13 demos on the bench but zero have curated 0–10 verdicts yet — currently unranked
  • API access is gated behind X Premium, awkward for backend agent loops

Best for

  • Workflows that need live X / Twitter context
  • Snappy prompts where latency matters
  • Researchers comparing X-native models against the rest of the field

Head-to-heads with Grok

Direct comparisons against every other scored model on the bench:

Grok vs Opus 4.8 Grok vs GLM-5.2 Grok vs Qwen 3.7 Grok vs Kimi K2.7

Read more on agentos.guide: /grok-build

The same stack Julian uses

Run this stack yourself.

Every demo on this bench was built inside the Agent Operating System — one prompt, one shot, single HTML file out. The Agent OS, the prompts, the templates, the weekly walkthroughs and 3,600+ founders shipping with it every day all live inside the AI Profit Boardroom.

3,600+founders
258documented wins
38countries
$100k+/mocommunity MRR