DeepSeek V4 Flash

DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with 284B total parameters and 13B activated parameters, supporting a 1M-token context window. It is designed for fast inference and...

by DeepSeek

Overview

Quick stats across all benchmark runs.

Score

55%

3 benchmarks

Avg Latency

5.0s

16 requests

Pricing

Free in / Free out

per 1M tokens

Context

256K

tokens

Alternatives

Models with similar or better quality but different tradeoffs

No alternatives found

Run benchmarks on this model to discover alternatives

Benchmark Performance

How this model performs across different benchmarks

BenchmarkScoreRank
Money Boy Cultural Literacy Test
63%
13 / 107
Categorization Bench
38%
35 / 54

Price vs Performance

Compare cost efficiency across all models

Current model (baseline)
Other models (relative score)
Y-axis shows score difference from shared benchmarks. X-axis uses log scale.

Score Over Time

Performance trends across all benchmark runs

Benchmark Activity

Number of benchmark runs over time

Get started with this model using OpenRouter

import { OpenRouter } from "@openrouter/sdk";

const openrouter = new OpenRouter({
  apiKey: "<OPENROUTER_API_KEY>"
});

const completion = await openrouter.chat.completions.create({
  model: "deepseek/deepseek-v4-flash:free",
  messages: [
    {
      role: "user",
      content: "Hello!"
    }
  ]
});

console.log(completion.choices[0].message.content);

Get your API key at openrouter.ai/keys

Other Models from DeepSeek

Compare performance with other models from the same creator

ModelLatencyCost/1MScore
DeepSeek V4 Pro12.3s$0.6559%
DeepSeek V3.2 Speciale149.1s$0.3652%
DeepSeek V3.236.7s$0.3240%
DeepSeek V3.152.2s$0.50
DeepSeek R1 0528 Qwen3 8B$0.07
DeepSeek Prover V2$1.34
R1 Distill Qwen 14B$0.15
R1 Distill Qwen 32B53.5s$0.29
DeepSeek V3.1 Terminus20.0s$0.61
R1109.7s$1.60
R1 0528120.7s$1.32
DeepSeek V3.2 Exp79.4s$0.34
DeepSeek V319.8s$0.60
R1 Distill Llama 70B33.8s$0.75
DeepSeek V3 032415.2s$0.49