Qwen3 4B (free)

Qwen3-4B is a 4 billion parameter dense language model from the Qwen3 series, designed to support both general-purpose and reasoning-intensive tasks. It introduces a dual-mode architecture—thinking and non-thinking—allowing dynamic switching between high-precision logical reasoning and efficient dialogue generation. This makes it well-suited for multi-turn chat, instruction following, and complex agent workflows.

by Qwen

Overview

Quick stats across all benchmark runs.

Score: not available (0 benchmarks)
Avg Latency: 0ms (0 requests)
Pricing: Free in / Free out, per 1M tokens
Context: 41K tokens

Get started with this model using OpenRouter

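import { OpenRouter } from "@openrouter/sdk";

// Replace the placeholder with your own key; avoid committing it to source control.
const openrouter = new OpenRouter({
  apiKey: "<OPENROUTER_API_KEY>"
});

// Top-level await requires an ES module context.
const completion = await openrouter.chat.completions.create({
  model: "qwen/qwen3-4b:free",
  messages: [
    {
      role: "user",
      content: "Hello!"
    }
  ]
});

// Print the text of the first returned choice.
console.log(completion.choices[0].message.content);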

Get your API key at openrouter.ai/keys
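
The description mentions thinking and non-thinking modes. As a rough sketch of how a caller might pick one per request, the snippet below appends Qwen3's documented soft switches (/think and /no_think) to the user turn and posts the body to OpenRouter's OpenAI-compatible chat completions endpoint with fetch. The helper names buildChatRequest and sendChat are hypothetical, and whether this free endpoint honors the soft switch is an assumption, not something this page confirms.

```typescript
type ChatMessage = { role: "system" | "user" | "assistant"; content: string };

interface ChatRequest {
  model: string;
  messages: ChatMessage[];
}

// Hypothetical helper: builds a request body and appends a Qwen3 soft switch
// ("/think" or "/no_think") to the user prompt.
function buildChatRequest(prompt: string, thinking: boolean): ChatRequest {
  const suffix = thinking ? " /think" : " /no_think";
  return {
    model: "qwen/qwen3-4b:free",
    messages: [{ role: "user", content: prompt + suffix }],
  };
}

// Hypothetical helper: sends the body to OpenRouter's chat completions
// endpoint and returns the text of the first choice.
async function sendChat(body: ChatRequest, apiKey: string): Promise<string> {
  const res = await fetch("https://openrouter.ai/api/v1/chat/completions", {
    method: "POST",
    headers: {
      Authorization: `Bearer ${apiKey}`,
      "Content-Type": "application/json",
    },
    body: JSON.stringify(body),
  });
  if (!res.ok) {
    throw new Error(`OpenRouter request failed with status ${res.status}`);
  }
  const data = await res.json();
  return data.choices[0].message.content;
}
```

A call such as sendChat(buildChatRequest("Hello!", false), apiKey) would request a direct answer, while passing true asks the model to reason first. Keeping the body builder separate from the network call makes the mode switch easy to check without spending requests.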

Other Models from Qwen

Compare performance with other models from the same creator

Model                          Latency  Cost/1M  Score
Qwen3.6 Max Preview            21.5s    $3.64    53%
Qwen3.6 Plus                   33.1s    $1.14    45%
Qwen3.6 Flash                  9.6s     $0.66    38%
Qwen3 235B A22B Instruct 2507  29.7s    $0.09    -
Qwen3 VL 8B Instruct           123.8s   $0.29    -
Qwen3 Next 80B A3B Instruct    23.1s    $0.59    -
Qwen2.5 VL 72B Instruct        22.1s    $0.50    -
Qwen2.5 Coder 32B Instruct     19.7s    $0.83    -
Qwen3 Coder Flash              13.3s    $0.58    -
Qwen3 Coder 30B A3B Instruct   30.8s    $0.17    -
Qwen2.5 7B Instruct            12.6s    $0.07    -
Qwen3 Coder 480B A35B          17.1s    $1.01    -
Qwen3.7 Plus                   -        $1.00    -
Qwen3 235B A22B                78.3s    $1.14    -
Qwen3 VL 235B A22B Instruct    37.5s    $0.54    -
Qwen3 32B                      28.3s    $0.18    -
Qwen3 8B                       111.5s   $0.22    -
Qwen3 30B A3B Thinking 2507    44.0s    $0.24    -
Qwen3.5 Plus 2026-04-20        -        $1.05    -
Qwen3 VL 30B A3B Instruct      38.3s    $0.33    -
Qwen3.7 Max                    -        $2.50    -
Qwen3 VL 235B A22B Thinking    112.0s   $1.43    -
Qwen3 Coder Plus               20.5s    $1.95    -
Qwen3 235B A22B Thinking 2507  248.4s   $0.10    -
Qwen3 Next 80B A3B Thinking    28.8s    $0.44    -
Qwen Plus 0728                 19.6s    $0.52    -
Qwen3 VL 30B A3B Thinking      83.6s    $0.85    -
Qwen3 30B A3B Instruct 2507    21.6s    $0.12    -
Qwen3.5-122B-A10B              -        $1.17    -
Qwen3.5-Flash                  14.7s    $0.16    -
Qwen-Plus                      22.0s    $0.52    -
Qwen3 VL 32B Instruct          21.0s    $0.26    -
Qwen3 Max Thinking             -        $2.34    -
Qwen3 30B A3B                  149.8s   $0.27    -
Qwen2.5 72B Instruct           22.2s    $0.38    -
Qwen3 Max                      31.7s    $2.34    -
Qwen3 14B                      70.4s    $0.17    -
Qwen3 VL 8B Thinking           79.4s    $0.74    -
Qwen3 Coder Next               2.0s     $0.45    -
Qwen2.5-VL 7B Instruct         10.9s    $0.20    -
Qwen2.5 Coder 7B Instruct      6.4s     $0.06    -
Qwen2.5 VL 32B Instruct        39.1s    $0.14    -
QwQ 32B                        143.6s   $0.28    -
Qwen VL Plus                   10.4s    $0.42    -
Qwen VL Max                    36.4s    $2.00    -
Qwen-Turbo                     18.6s    $0.13    -
Qwen-Max                       12.7s    $4.00    -
Qwen3.5 Plus 2026-02-15        44.8s    $0.91    -
Qwen3.5-27B                    55.5s    $0.88    -
Qwen3.5-9B                     112.8s   $0.09    -
Qwen3.5 397B A17B              3.5s     $1.36    -
Qwen3.6 35B A3B                13.4s    $0.57    -
Qwen3.5-35B-A3B                43.1s    $0.57    -
Qwen3.6 27B                    22.3s    $1.74    -
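
To make the Cost/1M column easier to compare against a real workload, a per-1M-token rate can be converted into an estimated spend. This is only a sketch: the page does not say whether the rate covers input tokens, output tokens, or a blend, so it assumes one rate applies to every token. The helper name estimateCostUsd and the 250,000-token workload are illustrative, not part of the site.

```typescript
// Hypothetical helper: converts a per-1M-token rate into an estimated cost.
// Assumes a single rate applies to all tokens (input and output alike).
function estimateCostUsd(tokens: number, ratePerMillionUsd: number): number {
  if (tokens < 0 || ratePerMillionUsd < 0) {
    throw new RangeError("tokens and rate must be non-negative");
  }
  return (tokens / 1000000) * ratePerMillionUsd;
}

// Rates in USD per 1M tokens: two copied from the comparison table,
// plus the free tier listed under Pricing.
const sampleRates: { model: string; rate: number }[] = [
  { model: "Qwen3 32B", rate: 0.18 },
  { model: "Qwen3 Max", rate: 2.34 },
  { model: "Qwen3 4B (free)", rate: 0 },
];

// Illustrative workload size, not a figure from this page.
const workloadTokens = 250000;

for (const { model, rate } of sampleRates) {
  const cost = estimateCostUsd(workloadTokens, rate);
  console.log(`${model}: $${cost.toFixed(4)} for ${workloadTokens} tokens`);
}
```

Rates alone do not settle the choice; the Latency column varies by more than an order of magnitude across these models and may matter more for interactive use.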