Nemotron Nano 9B V2

NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and...

by NVIDIA

Overview

Quick stats across all benchmark runs.

Metric	Value	Detail
Score	—	9 benchmarks
Avg Latency	39.7s	21 requests
Pricing	$0.04 in / $0.16 out	per 1M tokens
Context	131K	tokens
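
To make the pricing concrete, here is a minimal sketch that turns the listed rates ($0.04 in / $0.16 out per 1M tokens) into a per-request estimate. The function name `estimateCostUsd` and the constant names are illustrative assumptions, not part of any SDK, and actual billing may differ from this simple arithmetic.

```typescript
// Listed rates for this model, in USD per 1M tokens.
const INPUT_USD_PER_MILLION = 0.04;
const OUTPUT_USD_PER_MILLION = 0.16;

// Hypothetical helper: estimates the cost of one request in USD
// from its input (prompt) and output (completion) token counts.
function estimateCostUsd(inputTokens: number, outputTokens: number): number {
  const inputCost = (inputTokens / 1_000_000) * INPUT_USD_PER_MILLION;
  const outputCost = (outputTokens / 1_000_000) * OUTPUT_USD_PER_MILLION;
  return inputCost + outputCost;
}
```

For example, `estimateCostUsd(2000, 500)` comes to roughly $0.00016: the 500 output tokens cost about as much as the 2,000 input tokens because output is priced at four times the input rate.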

Price vs Performance

[Chart: cost efficiency compared across all models. The current model is the baseline and other models are plotted by relative score. The Y-axis shows the score difference on shared benchmarks; the X-axis uses a log scale.]

Quickstart

Get started with this model using OpenRouter

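import { OpenRouter } from "@openrouter/sdk";

// Create a client; replace the placeholder with your own key.
const openrouter = new OpenRouter({
  apiKey: "<OPENROUTER_API_KEY>"
});

// Send a single user message to the model.
// Top-level await requires an ES module context.
const completion = await openrouter.chat.completions.create({
  model: "nvidia/nemotron-nano-9b-v2",
  messages: [
    {
      role: "user",
      content: "Hello!"
    }
  ]
});

// Print the text of the first returned choice.
console.log(completion.choices[0].message.content);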

Get your API key at openrouter.ai/keys
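
If you would rather not install the SDK, the same request can be sketched against OpenRouter's HTTP chat completions endpoint using `fetch`. The helper names `buildChatBody` and `askNemotron` are hypothetical, and the response handling assumes the same `choices[0].message.content` shape used in the snippet above.

```typescript
// OpenRouter's chat completions endpoint.
const OPENROUTER_URL = "https://openrouter.ai/api/v1/chat/completions";

interface ChatMessage {
  role: "system" | "user" | "assistant";
  content: string;
}

// Hypothetical helper: builds the JSON body for a single-turn request.
function buildChatBody(prompt: string): { model: string; messages: ChatMessage[] } {
  return {
    model: "nvidia/nemotron-nano-9b-v2",
    messages: [{ role: "user", content: prompt }],
  };
}

// Hypothetical helper: sends the request and returns the reply text.
// Throws on any non-2xx HTTP status.
async function askNemotron(apiKey: string, prompt: string): Promise<string> {
  const response = await fetch(OPENROUTER_URL, {
    method: "POST",
    headers: {
      Authorization: `Bearer ${apiKey}`,
      "Content-Type": "application/json",
    },
    body: JSON.stringify(buildChatBody(prompt)),
  });
  if (!response.ok) {
    throw new Error(`OpenRouter request failed with HTTP ${response.status}`);
  }
  // Assumed response shape, matching the SDK example above.
  const data = (await response.json()) as {
    choices: { message: { content: string } }[];
  };
  return data.choices[0].message.content;
}
```

A call such as `await askNemotron(myKey, "Hello!")` would then return the model's reply as a string. Passing the key as an argument keeps it out of the source file.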

Other Models from NVIDIA

Compare performance with other models from the same creator

Model	Avg Latency	Cost per 1M tokens	Score
Nemotron 3 Nano 30B A3B	5.1s	$0.13	—
Llama 3.3 Nemotron Super 49B V1.5	31.1s	$0.25	—
Nemotron 3.5 Content Safety (free)	—	Free	—
Nemotron 3 Ultra	—	Free	—
Nemotron 3 Nano Omni (free)	—	Free	—
Nemotron 3 Super	—	Free	—
Llama 3.1 Nemotron 70B Instruct	16.7s	$1.20	—
Llama 3.1 Nemotron Ultra 253B v1	61.9s	$1.20	—
Nemotron Nano 12B 2 VL	34.2s	$0.40	—