All models

Z.AI: GLM 4.5 Air

by Z.AI

GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications. Like GLM-4.5, it adopts the Mixture-of-Experts (MoE) architecture but with a more compact parameter size. GLM-4.5-Air also supports hybrid inference modes, offering a "thinking mode" for advanced reasoning and tool use, and a "non-thinking mode" for real-time interaction. Users can control the reasoning behaviour with the `reasoning` `enabled` boolean. [Learn more in our docs](https://openrouter.ai/docs/use-cases/reasoning-tokens#enable-reasoning-with-default-config)

Avg Score

86.2%

21 answers

Avg Latency

56.6s

9 runs

Pricing

Free

input

/

Free

output

per 1M tokens

Context

131K

tokens

Alternatives

Models with similar or better quality but different tradeoffs

Same Quality, Cheaper

Models with similar or better performance at a lower cost per token.

Same Quality, Faster

Models with similar or better performance but lower latency.

Same Cost, Better

Models at a similar price point with higher benchmark scores.

Other Models from Z.AI

Compare performance with other models from the same creator

ModelScoreLatencyCost/1M
Z.AI: GLM 4.696.9%36.3s$0.93
Z.AI: GLM 4.592.1%48.4s$0.95
Z.AI: GLM 4.691.9%41.7s$1.10
Z.AI: GLM 4.782.7%56.9s$0.95
Z.AI: GLM 4.5V80.8%51.3s$1.20
Z.AI: GLM 4.6V71.4%57.3s$0.60
Z.AI: GLM 4 32B 67.7%18.7s$0.10
Z.AI: GLM 4.7 Flash54.2%175.4s$0.23

Benchmark Performance

How this model performs across different benchmarks

No benchmark data available

Run benchmarks with this model to see performance breakdown

Price vs Performance

Compare cost efficiency across all models

Current model (baseline)
Other models (relative score)
Y-axis shows score difference from shared benchmarks. X-axis uses log scale.

Score Over Time

Performance trends across all benchmark runs

Benchmark Activity

Number of benchmark runs over time

Quickstart

Get started with this model using OpenRouter

View on OpenRouter
import { OpenRouter } from "@openrouter/sdk";

const openrouter = new OpenRouter({
  apiKey: "<OPENROUTER_API_KEY>"
});

const completion = await openrouter.chat.completions.create({
  model: "z-ai/glm-4.5-air:free",
  messages: [
    {
      role: "user",
      content: "Hello!"
    }
  ]
});

console.log(completion.choices[0].message.content);

Get your API key at openrouter.ai/keys