All models

Google: Gemini 2.5 Flash Lite

by Google

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance across common benchmarks compared to earlier Flash models. By default, "thinking" (i.e. multi-pass reasoning) is disabled to prioritize speed, but developers can enable it via the [Reasoning API parameter](https://openrouter.ai/docs/use-cases/reasoning-tokens) to selectively trade off cost for intelligence.

Avg Score

74.8%

285 answers

Avg Latency

1.8s

44 runs

Pricing

$0.10

input

/

$0.40

output

per 1M tokens

Context

1049K

tokens

Alternatives

Models with similar or better quality but different tradeoffs

Same Quality, Cheaper

Models with similar or better performance at a lower cost per token.

Same Quality, Faster

Models with similar or better performance but lower latency.

Same Cost, Better

Models at a similar price point with higher benchmark scores.

Other Models from Google

Compare performance with other models from the same creator

Benchmark Performance

How this model performs across different benchmarks

BenchmarkScoreRank
moose
100.0%
11 / 17
Math Bench: Subtraction (1-10)
100.0%
17 / 35
Math Bench: Division (1-10)
100.0%
22 / 35
Single Character Instruction Adherence
100.0%
7 / 17
Breakfast
100.0%
10 / 17
my test
100.0%
7 / 17
Math Bench: Arithmetic (1-100)
100.0%
18 / 35
Customer service
100.0%
8 / 17
Venture Capital Terms Benchmark
100.0%
11 / 25
Best bathroom technique
100.0%
8 / 17
Repetitive Onomatopoeia Response Benchmark
100.0%
7 / 17
Math Bench: Addition (1-10)
100.0%
15 / 35
Customer support
100.0%
4 / 17
Instruction Adherence and Logical Constraints
100.0%
9 / 17
Cheeky_test
100.0%
8 / 17
Customer Support Ticket Generation
100.0%
8 / 17
Math Bench: Multiplication (1-10)
100.0%
17 / 35
Structured Customer Support Extraction
100.0%
9 / 17
Smell test
100.0%
7 / 17
SaaS Customer Support Extraction and Strategy
100.0%
9 / 17
Motor production test
95.0%
25 / 36
Maak een erp systeem
95.0%
3 / 17
Spatial Reasoning: Germany
94.3%
14 / 35
Product Recommendations
93.3%
6 / 17
Enhancing FIDO UV Consistency with UXWG 2026 priority plan
90.0%
16 / 25
KUKA force Torque control programming skills
90.0%
10 / 17
Westat Test
90.0%
8 / 292
Math Bench: Arithmetic (1-1000)
90.0%
23 / 35
Omnishambles
85.0%
9 / 17
Karlsruhe Local Knowledge Benchmark
66.2%
25 / 35
cemantle solve initiator
60.0%
22 / 25
Product Recommendations
52.5%
32 / 36
Rust performance
50.0%
6 / 16
Character Frequency Bench
41.8%
32 / 35
Testing_website
40.0%
16 / 17
solve n-queens in clojure in one shot
40.0%
10 / 17
Define data taxonomy for product catalog suitable for PIM
40.0%
17 / 17
Money Boy Cultural Literacy Test
28.6%
21 / 35
Docs
28.3%
11 / 17
German Memelord Bench
7.6%
28 / 35
Karl Lorey knowlege
0.0%
30 / 36
Current Global Political Leaders
0.0%
6 / 17

Price vs Performance

Compare cost efficiency across all models

Current model
Other models
X-axis uses log scale for better visualization of price range

Score Over Time

Performance trends across all benchmark runs

Benchmark Activity

Number of benchmark runs over time

Quickstart

Get started with this model using OpenRouter

View on OpenRouter
import { OpenRouter } from "@openrouter/sdk";

const openrouter = new OpenRouter({
  apiKey: "<OPENROUTER_API_KEY>"
});

const completion = await openrouter.chat.completions.create({
  model: "google/gemini-2.5-flash-lite",
  messages: [
    {
      role: "user",
      content: "Hello!"
    }
  ]
});

console.log(completion.choices[0].message.content);

Get your API key at openrouter.ai/keys