Evalry
ArenaBenchmarksRankingsModels
  1. Models
  2. Group: xAI

xAI

All xAI models

4 models

Highlights

The standout models in this collection.

Best Score

40%

Grok 4.3

Fastest

90 tok/s

Grok 4.3

Cheapest

$1.00 / 1M

Grok Build 0.1

Performance Analysis

Benchmark 3 more

Based on 1 of 4 models with benchmark data.

Price vs Quality

Top-left = best value

Speed vs Quality

Top-right = ideal

Find the best model for you

Run your prompts against these models and see which works best for you.

Start Benchmark

All Models

4 of 4

Every model in this collection.

#
1
Grok 4.20

Grok 4.20 is a reasoning model from xAI with industry-leading speed and agentic tool calling capabilities. It combines the lowest hallucination rate on the market with strict prompt adherance, delivering...

xAI2.0M
$1.25
$2.50 out
—
2
Grok 4.20 Multi-Agent

Grok 4.20 Multi-Agent is a variant of xAI’s Grok 4.20 designed for collaborative, agent-based workflows. Multiple agents operate in parallel to conduct deep research, coordinate tool use, and synthesize information...

xAI2.0M
$2.00
$6.00 out
—
3
Grok 4.3

Grok 4.3 is a reasoning model from xAI. It accepts text and image inputs with text output, and is suited for agentic workflows, instruction-following tasks, and applications requiring high factual...

xAI1.0M
$1.25
$2.50 out
40%
4
Grok Build 0.1

Grok Build 0.1 is xAI’s fast coding model trained specifically for agentic software engineering workflows. It supports text and image inputs with text output, and is optimized for interactive coding...

xAI256K
$1.00
$2.00 out
—
Evalry

No artificial analysis, but the best LLM for you.

LinkedInGitHubEmail
HomeBenchmarksRankingsModelsChangelogTermsLegalPrivacy

© 2026 apistemic GmbH. All rights reserved.