All models

xAI: Grok 4.1 Fast

by xAI

Grok 4.1 Fast is xAI's best agentic tool calling model that shines in real-world use cases like customer support and deep research. 2M context window. Reasoning can be enabled/disabled using the `reasoning` `enabled` parameter in the API. [Learn more in our docs](https://openrouter.ai/docs/use-cases/reasoning-tokens#controlling-reasoning-tokens)

Avg Score

79.3%

231 answers

Avg Latency

7.7s

13 runs

Pricing

$0.20

input

/

$0.50

output

per 1M tokens

Context

2000K

tokens

Alternatives

Models with similar or better quality but different tradeoffs

Same Quality, Cheaper

Models with similar or better performance at a lower cost per token.

Same Quality, Faster

Models with similar or better performance but lower latency.

Same Cost, Better

Models at a similar price point with higher benchmark scores.

Other Models from xAI

Compare performance with other models from the same creator

ModelScoreLatencyCost/1M
xAI: Grok 482.5%24.3s$9.00
xAI: Grok 4 Fast$0.35
xAI: Grok Code Fast 1$0.85
xAI: Grok 3 Mini$0.40
xAI: Grok 3$9.00
xAI: Grok 3 Mini Beta$0.40
xAI: Grok 3 Beta$9.00

Benchmark Performance

How this model performs across different benchmarks

Price vs Performance

Compare cost efficiency across all models

Current model
Other models
X-axis uses log scale for better visualization of price range

Score Over Time

Performance trends across all benchmark runs

Benchmark Activity

Number of benchmark runs over time

Quickstart

Get started with this model using OpenRouter

View on OpenRouter
import { OpenRouter } from "@openrouter/sdk";

const openrouter = new OpenRouter({
  apiKey: "<OPENROUTER_API_KEY>"
});

const completion = await openrouter.chat.completions.create({
  model: "x-ai/grok-4.1-fast",
  messages: [
    {
      role: "user",
      content: "Hello!"
    }
  ]
});

console.log(completion.choices[0].message.content);

Get your API key at openrouter.ai/keys