Hermes 4 405B

Hermes 4 is a large-scale reasoning model built on Meta-Llama-3.1-405B and released by Nous Research. It introduces a hybrid reasoning mode, where the model can choose to deliberate internally with...

by NousResearch

Overview

Quick stats across all benchmark runs.

Score

10 benchmarks

Avg Latency

16.0s

14 requests

Pricing

$1.00 in / $3.00 out

per 1M tokens

Context

131K

tokens

Alternatives

Models with similar or better quality but different tradeoffs

No alternatives found

Run benchmarks on this model to discover alternatives

Benchmark Performance

How this model performs across different benchmarks

BenchmarkScoreRank
Evalry Knowledge Benchmark
0%
1 / 8

Price vs Performance

Compare cost efficiency across all models

Current model (baseline)
Other models (relative score)
Y-axis shows score difference from shared benchmarks. X-axis uses log scale.

Score Over Time

Performance trends across all benchmark runs

Benchmark Activity

Number of benchmark runs over time

Get started with this model using OpenRouter

import { OpenRouter } from "@openrouter/sdk";

const openrouter = new OpenRouter({
  apiKey: "<OPENROUTER_API_KEY>"
});

const completion = await openrouter.chat.completions.create({
  model: "nousresearch/hermes-4-405b",
  messages: [
    {
      role: "user",
      content: "Hello!"
    }
  ]
});

console.log(completion.choices[0].message.content);

Get your API key at openrouter.ai/keys

Other Models from NousResearch

Compare performance with other models from the same creator

ModelLatencyCost/1MScore
Hermes 4 70B7.0s$0.27
Hermes 2 Pro - Llama-3 8B12.8s$0.14
Hermes 3 405B Instruct22.4s$1.00
Hermes 3 70B Instruct165.0s$0.30
DeepHermes 3 Mistral 24B Preview$0.06