Qwen2.5-VL 7B Instruct

Qwen2.5 VL 7B is a multimodal LLM from the Qwen Team with the following key enhancements: - SoTA understanding of images of various resolution & ratio: Qwen2.5-VL achieves state-of-the-art performance on visual understanding benchmarks, including MathVista, DocVQA, RealWorldQA, MTVQA, etc. - Understanding videos of 20min+: Qwen2.5-VL can understand videos over 20 minutes for high-quality video-based question answering, dialog, content creation, etc. - Agent that can operate your mobiles, robots, etc.: with the abilities of complex reasoning and decision making, Qwen2.5-VL can be integrated with devices like mobile phones, robots, etc., for automatic operation based on visual environment and text instructions. - Multilingual Support: to serve global users, besides English and Chinese, Qwen2.5-VL now supports the understanding of texts in different languages inside images, including most European languages, Japanese, Korean, Arabic, Vietnamese, etc. For more details, see this [blog post](https://qwenlm.github.io/blog/qwen2-vl/) and [GitHub repo](https://github.com/QwenLM/Qwen2-VL). Usage of this model is subject to [Tongyi Qianwen LICENSE AGREEMENT](https://huggingface.co/Qwen/Qwen1.5-110B-Chat/blob/main/LICENSE).

by Qwen

Overview

Quick stats across all benchmark runs.

Score

9 benchmarks

Avg Latency

14.8s

20 requests

Pricing

$0.20 in / $0.20 out

per 1M tokens

Context

33K

tokens

Alternatives

Models with similar or better quality but different tradeoffs

No alternatives found

Run benchmarks on this model to discover alternatives

Benchmark Performance

How this model performs across different benchmarks

No benchmark data available

Run benchmarks with this model to see performance breakdown

Price vs Performance

Compare cost efficiency across all models

Current model (baseline)
Other models (relative score)
Y-axis shows score difference from shared benchmarks. X-axis uses log scale.

Score Over Time

Performance trends across all benchmark runs

Benchmark Activity

Number of benchmark runs over time

Get started with this model using OpenRouter

import { OpenRouter } from "@openrouter/sdk";

const openrouter = new OpenRouter({
  apiKey: "<OPENROUTER_API_KEY>"
});

const completion = await openrouter.chat.completions.create({
  model: "qwen/qwen-2.5-vl-7b-instruct",
  messages: [
    {
      role: "user",
      content: "Hello!"
    }
  ]
});

console.log(completion.choices[0].message.content);

Get your API key at openrouter.ai/keys

Other Models from Qwen

Compare performance with other models from the same creator

ModelLatencyCost/1MScore
Qwen3.6 Max Preview21.5s$3.6449%
Qwen3.6 Plus32.3s$1.1437%
Qwen3.6 Flash9.2s$0.6633%
Qwen3.5 Plus 2026-04-20$1.05
Qwen3.6 35B A3B11.3s$0.57
Qwen3.6 27B22.3s$1.76
Qwen3.5-9B97.6s$0.09
Qwen3.5-35B-A3B43.1s$0.57
Qwen3.5-27B57.6s$0.88
Qwen3.5-122B-A10B$1.17
Qwen3.5-Flash$0.16
Qwen3.5 Plus 2026-02-1531.8s$0.91
Qwen3.5 397B A17B3.5s$1.36
Qwen3 Max Thinking$2.34
Qwen3 Coder Next1.9s$0.45
Qwen3 VL 32B Instruct21.8s$0.26
Qwen3 VL 8B Thinking79.4s$0.74
Qwen3 VL 8B Instruct123.8s$0.29
Qwen3 VL 30B A3B Thinking83.6s$0.85
Qwen3 VL 30B A3B Instruct38.3s$0.33
Qwen3 VL 235B A22B Thinking112.0s$1.43
Qwen3 VL 235B A22B Instruct37.5s$0.54
Qwen3 Max31.7s$2.34
Qwen3 Coder Plus20.5s$1.95
Qwen3 Coder Flash13.3s$0.58
Qwen3 Next 80B A3B Thinking28.8s$0.44
Qwen3 Next 80B A3B InstructFree
Qwen Plus 072849.2s$0.52
Qwen3 30B A3B Thinking 250744.0s$0.24
Qwen3 Coder 30B A3B Instruct30.8s$0.17
Qwen3 30B A3B Instruct 250725.3s$0.19
Qwen3 235B A22B Thinking 2507248.4s$0.82
Qwen3 Coder 480B A35B6.2s$1.01
Qwen3 235B A22B Instruct 250724.4s$0.09
Qwen3 30B A3B149.8s$0.27
Qwen3 8B111.5s$0.22
Qwen3 14B70.4s$0.17
Qwen3 32B30.4s$0.18
Qwen3 235B A22B78.3s$1.14
Qwen2.5 VL 72B Instruct22.1s$0.50
Qwen2.5 7B Instruct12.6s$0.07
Qwen VL Max36.4s$2.00
Qwen-Plus22.0s$0.52
Qwen VL Plus10.4s$0.42
Qwen3 4B (free)Free
Qwen-Turbo18.6s$0.13
Qwen2.5 Coder 7B Instruct6.4s$0.06
QwQ 32B143.6s$0.28
Qwen2.5 VL 32B Instruct39.1s$0.14
Qwen-Max 12.7s$4.00
Qwen2.5 Coder 32B Instruct19.7s$0.83
Qwen2.5 72B Instruct22.2s$0.38