Llama 3.1 405B Instruct

The highly anticipated 400B class of Llama3 is here! Clocking in at 128k context with impressive eval scores, the Meta AI team continues to push the frontier of open-source LLMs. Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 405B instruct-tuned version is optimized for high quality dialogue usecases. It has demonstrated strong performance compared to leading closed-source models including GPT-4o and Claude 3.5 Sonnet in evaluations. To read more about the model release, [click here](https://ai.meta.com/blog/meta-llama-3-1/). Usage of this model is subject to [Meta's Acceptable Use Policy](https://llama.meta.com/llama3/use-policy/).

by Meta

Overview

Quick stats across all benchmark runs.

Score

—

9 benchmarks

Avg Latency

48.9s

11 requests

Pricing

Free in / Free out

per 1M tokens

Context

131K

tokens

Alternatives

Models with similar or better quality but different tradeoffs

No alternatives found

Run benchmarks on this model to discover alternatives

Benchmark Performance

How this model performs across different benchmarks

No benchmark data available

Run benchmarks with this model to see performance breakdown

Price vs Performance

Compare cost efficiency across all models

Current model (baseline)

Other models (relative score)

Y-axis shows score difference from shared benchmarks. X-axis uses log scale.

Score Over Time

Performance trends across all benchmark runs

Benchmark Activity

Number of benchmark runs over time

Quickstart

View on OpenRouter

Get started with this model using OpenRouter

import { OpenRouter } from "@openrouter/sdk";

const openrouter = new OpenRouter({
  apiKey: "<OPENROUTER_API_KEY>"
});

const completion = await openrouter.chat.completions.create({
  model: "meta-llama/llama-3.1-405b-instruct:free",
  messages: [
    {
      role: "user",
      content: "Hello!"
    }
  ]
});

console.log(completion.choices[0].message.content);

Get your API key at openrouter.ai/keys

Other Models from Meta

Compare performance with other models from the same creator

Model	Latency	Cost/1M	Score
Llama 4 Scout	6.6s	$0.19	45%
Llama 3.3 70B Instruct	20.5s	$0.21	42%
Llama 4 Maverick	4.7s	$0.38	34%
Llama 3.2 1B Instruct	30.7s	$0.11	23%
Llama 3.2 3B Instruct	30.9s	$0.19	11%
Llama Guard 3 8B	6.6s	$0.26	—
Llama 3.1 70B Instruct	14.5s	$0.40	—
Llama 3.2 90B Vision Instruct	—	$0.38	—
Llama Guard 4 12B	513ms	$0.18	—
Llama 3 70B Instruct	20.1s	$0.63	—
Llama 3 8B Instruct	8.8s	$0.04	—
Llama 3.1 8B Instruct	23.0s	$0.02	—
Llama 3.2 11B Vision Instruct	44.8s	$0.24	—
Llama 3.1 405B (base)	39.5s	$4.00	—
LlamaGuard 2 8B	1.6s	$0.20	—