Meta: Llama 4 Maverick

by Meta

Llama 4 Maverick 17B Instruct (128E) is a high-capacity multimodal language model from Meta, built on a mixture-of-experts (MoE) architecture with 128 experts and 17 billion active parameters per forward pass (400B total). It supports multilingual text and image input, and produces multilingual text and code output across 12 supported languages. Optimized for vision-language tasks, Maverick is instruction-tuned for assistant-like behavior, image reasoning, and general-purpose multimodal interaction. Maverick features early fusion for native multimodality and a 1 million token context window. It was trained on a curated mixture of public, licensed, and Meta-platform data, covering ~22 trillion tokens, with a knowledge cutoff in August 2024. Released on April 5, 2025 under the Llama 4 Community License, Maverick is suited for research and commercial applications requiring advanced multimodal understanding and high model throughput.

Avg Score

76.4%

240 answers

Avg Latency

2.6s

15 runs

Pricing

$0.15

input

$0.60

output

per 1M tokens

Context

1049K

tokens

Alternatives

Models with similar or better quality but different tradeoffs

Same Quality, Cheaper

Models with similar or better performance at a lower cost per token.

Model	Cost
Meta: Llama 4 Scout	-73%
Google: Gemini 2.0 Flash	-63%
Google: Gemini 2.5 Flash Lite	-42%
DeepSeek: DeepSeek V3.2	-13%

Same Quality, Faster

Models with similar or better performance but lower latency.

Model	Latency
Google: Gemini 2.5 Flash Lite	-58%
Google: Gemini 2.0 Flash	-57%
Google: Gemini 2.5 Flash	-47%
Anthropic: Claude 3 Haiku	-36%
Google: Gemini 3 Flash Preview	-30%

Same Cost, Better

Models at a similar price point with higher benchmark scores.

Model	Score
DeepSeek: DeepSeek V3.2	+3%
Google: Gemini 2.0 Flash	+2%

Other Models from Meta

Compare performance with other models from the same creator

Model	Score	Latency	Cost/1M
Meta: Llama 4 Scout	74.3%	2.8s	$0.19
Meta: Llama 3.3 70B Instruct	71.7%	3.5s	$0.21
Meta: Llama 3.2 3B Instruct	53.9%	3.0s	$0.02
Meta: Llama 3.2 1B Instruct	39.3%	4.4s	$0.11
Meta: Llama 3.1 405B (base)	3.2%	45.0s	$4.00
Meta: Llama Guard 4 12B	—	—	$0.18
Llama Guard 3 8B	—	—	$0.04
Meta: Llama 3.3 70B Instruct	—	—	$0.0000
Meta: Llama 3.2 3B Instruct	—	—	$0.0000
Meta: Llama 3.2 90B Vision Instruct	—	—	$0.38
Meta: Llama 3.2 11B Vision Instruct	—	—	$0.05
Meta: Llama 3.1 8B Instruct	—	—	$0.03
Meta: Llama 3.1 405B Instruct	—	—	$0.0000
Meta: Llama 3.1 405B Instruct	—	—	$3.50
Meta: Llama 3.1 70B Instruct	—	—	$0.40
Meta: LlamaGuard 2 8B	—	—	$0.20
Meta: Llama 3 70B Instruct	—	—	$0.35
Meta: Llama 3 8B Instruct	—	—	$0.04

Benchmark Performance

How this model performs across different benchmarks

Benchmark	Score	Rank
Math Bench: Addition (1-10)	100.0%	19 / 35
Math Bench: Subtraction (1-10)	100.0%	21 / 35
Math Bench: Multiplication (1-10)	100.0%	21 / 35
Math Bench: Arithmetic (1-100)	100.0%	22 / 35
Venture Capital Terms Benchmark	99.2%	14 / 25
Math Bench: Division (1-10)	97.0%	26 / 35
Math Bench: Arithmetic (1-1000)	95.0%	15 / 35
Spatial Reasoning: Germany	90.9%	18 / 35
Character Frequency Bench	90.9%	13 / 35
Motor production test	90.0%	33 / 36
Karlsruhe Local Knowledge Benchmark	68.9%	24 / 35
Product Recommendations	50.0%	33 / 36
Money Boy Cultural Literacy Test	28.6%	18 / 35
German Memelord Bench	11.4%	23 / 35
Karl Lorey knowlege	1.1%	25 / 36

Price vs Performance

Compare cost efficiency across all models

Current model

Other models

X-axis uses log scale for better visualization of price range

Score Over Time

Performance trends across all benchmark runs

Benchmark Activity

Number of benchmark runs over time

Quickstart

Get started with this model using OpenRouter

View on OpenRouter

import { OpenRouter } from "@openrouter/sdk";

const openrouter = new OpenRouter({
  apiKey: "<OPENROUTER_API_KEY>"
});

const completion = await openrouter.chat.completions.create({
  model: "meta-llama/llama-4-maverick",
  messages: [
    {
      role: "user",
      content: "Hello!"
    }
  ]
});

console.log(completion.choices[0].message.content);

Get your API key at openrouter.ai/keys