Evalry

xAI

All xAI models

3 models

Highlights

The standout models in this collection.

Best Score: 47% (Grok 4.3)

Fastest: 94 tok/s (Grok 4.3)

Cheapest: $1.25 / 1M (Grok 4.3)

Performance Analysis


Based on 1 of 3 models with benchmark data.

[Chart: Price vs Quality. Top-left = best value.]

[Chart: Speed vs Quality. Top-right = ideal.]

Find the best model for you

Run your prompts against these models and see which works best for you.


All Models

3 of 3

Every model in this collection.

1. Grok 4.20

Grok 4.20 is a reasoning model from xAI with industry-leading speed and agentic tool calling capabilities. It combines the lowest hallucination rate on the market with strict prompt adherence, delivering...

xAI · 2.0M · $1.25 in, $2.50 out · no benchmark score
2. Grok 4.20 Multi-Agent

Grok 4.20 Multi-Agent is a variant of xAI’s Grok 4.20 designed for collaborative, agent-based workflows. Multiple agents operate in parallel to conduct deep research, coordinate tool use, and synthesize information...

xAI · 2.0M · $2.00 in, $6.00 out · no benchmark score
3. Grok 4.3

Grok 4.3 is a reasoning model from xAI. It accepts text and image inputs with text output, and is suited for agentic workflows, instruction-following tasks, and applications requiring high factual...

xAI · 1.0M · $1.25 in, $2.50 out · score 47%
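
The listed prices can be turned into a rough per-request cost estimate. The sketch below is illustrative only: it assumes the first price in each row is the input price and the "out" price is the output price, both in USD per 1M tokens (the unit shown in the Highlights). The function name `estimate_cost` and the token counts are hypothetical.

```python
# Assumption: prices are USD per 1M tokens; the first figure is the input
# price and the "out" figure is the output price.
PRICES_PER_MILLION = {
    "Grok 4.20": (1.25, 2.50),
    "Grok 4.20 Multi-Agent": (2.00, 6.00),
    "Grok 4.3": (1.25, 2.50),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for one request (hypothetical helper)."""
    in_price, out_price = PRICES_PER_MILLION[model]
    return (input_tokens * in_price + output_tokens * out_price) / 1_000_000


if __name__ == "__main__":
    # Example request: 10,000 input tokens and 2,000 output tokens.
    for name in PRICES_PER_MILLION:
        print(f"{name}: ${estimate_cost(name, 10_000, 2_000):.4f}")
```

Under these assumptions, Grok 4.20 and Grok 4.3 cost the same per request, so the choice between them rests on score, speed, and the remaining listed figure rather than on price.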

© 2026 apistemic GmbH. All rights reserved.