Z.AI: GLM 4.5

by Z.AI

GLM-4.5 is our latest flagship foundation model, purpose-built for agent-based applications. It leverages a Mixture-of-Experts (MoE) architecture and supports a context length of up to 128k tokens. GLM-4.5 delivers significantly enhanced capabilities in reasoning, code generation, and agent alignment. It supports a hybrid inference mode with two options, a "thinking mode" designed for complex reasoning and tool use, and a "non-thinking mode" optimized for instant responses. Users can control the reasoning behaviour with the `reasoning` `enabled` boolean. [Learn more in our docs](https://openrouter.ai/docs/use-cases/reasoning-tokens#enable-reasoning-with-default-config)

Avg Score

92.1%

12 answers

Avg Latency

48.4s

8 runs

Pricing

$0.35

input

$1.55

output

per 1M tokens

Context

131K

tokens

Alternatives

Models with similar or better quality but different tradeoffs

No alternatives found

Run benchmarks on this model to discover alternatives

Other Models from Z.AI

Compare performance with other models from the same creator

Model	Score	Latency	Cost/1M
Z.AI: GLM 4.6	96.9%	36.3s	$0.93
Z.AI: GLM 4.6	91.9%	41.7s	$1.10
Z.AI: GLM 4.5 Air	86.5%	35.6s	$0.14
Z.AI: GLM 4.5 Air	85.6%	90.8s	Free
Z.AI: GLM 4.7	82.7%	56.9s	$0.95
Z.AI: GLM 4.5V	80.8%	51.3s	$1.20
Z.AI: GLM 4.6V	71.4%	57.3s	$0.60
Z.AI: GLM 4 32B	67.7%	18.7s	$0.10
Z.AI: GLM 4.7 Flash	54.2%	175.4s	$0.23

Benchmark Performance

How this model performs across different benchmarks

No benchmark data available

Run benchmarks with this model to see performance breakdown

Price vs Performance

Compare cost efficiency across all models

Current model (baseline)

Other models (relative score)

Y-axis shows score difference from shared benchmarks. X-axis uses log scale.

Score Over Time

Performance trends across all benchmark runs

Benchmark Activity

Number of benchmark runs over time

Quickstart

Get started with this model using OpenRouter

View on OpenRouter

import { OpenRouter } from "@openrouter/sdk";

const openrouter = new OpenRouter({
  apiKey: "<OPENROUTER_API_KEY>"
});

const completion = await openrouter.chat.completions.create({
  model: "z-ai/glm-4.5",
  messages: [
    {
      role: "user",
      content: "Hello!"
    }
  ]
});

console.log(completion.choices[0].message.content);

Get your API key at openrouter.ai/keys