GLM 4.7 Flash

As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning,...

by Z.ai

Overview

Quick stats across all benchmark runs.

Score: 29% (6 benchmarks)

Avg Latency: 31.6s (23 requests)

Pricing: $0.06 in / $0.40 out per 1M tokens

Context: 200K tokens
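
To make the per-token pricing concrete, here is a minimal sketch of the arithmetic for a single request. The two price constants are copied from the Pricing stat above; the helper name `estimateCostUsd` and the example token counts are assumptions made for illustration, not part of any OpenRouter or Z.ai API.

```typescript
// Prices copied from the Pricing stat above (USD per 1M tokens).
const INPUT_USD_PER_MILLION = 0.06;
const OUTPUT_USD_PER_MILLION = 0.40;

// Hypothetical helper: estimates the cost of one request in USD
// from its input (prompt) and output (completion) token counts.
function estimateCostUsd(inputTokens: number, outputTokens: number): number {
  if (inputTokens < 0 || outputTokens < 0) {
    throw new RangeError("token counts must be non-negative");
  }
  const inputCost = (inputTokens / 1_000_000) * INPUT_USD_PER_MILLION;
  const outputCost = (outputTokens / 1_000_000) * OUTPUT_USD_PER_MILLION;
  return inputCost + outputCost;
}

// Example: a 10,000-token prompt with a 2,000-token reply.
console.log(estimateCostUsd(10_000, 2_000).toFixed(4)); // prints "0.0014"
```

Output tokens cost more than six times as much as input tokens here, so long completions are likely to dominate the bill for most workloads.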

Benchmark Performance

How this model performs across different benchmarks

Benchmark                          Score   Rank
Money Boy Cultural Literacy Test   38%     68 / 107
Categorization Bench               34%     41 / 54

Price vs Performance

Compare cost efficiency across all models

[Chart: the current model is the baseline and other models are plotted by relative score. The Y-axis shows the score difference on shared benchmarks; the X-axis uses a log scale.]

Score Over Time

Performance trends across all benchmark runs

Benchmark Activity

Number of benchmark runs over time

Get started with this model using OpenRouter

import { OpenRouter } from "@openrouter/sdk";

const openrouter = new OpenRouter({
  apiKey: "<OPENROUTER_API_KEY>"
});

const completion = await openrouter.chat.completions.create({
  model: "z-ai/glm-4.7-flash",
  messages: [
    {
      role: "user",
      content: "Hello!"
    }
  ]
});

console.log(completion.choices[0].message.content);

Get your API key at openrouter.ai/keys
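
For environments where installing the SDK is not an option, the same request can be sketched against OpenRouter's HTTP chat completions endpoint. This is a minimal sketch under stated assumptions, not the SDK's implementation: the `ChatMessage` and `ChatRequest` types and the `buildChatRequest` and `sendChat` helpers are hypothetical names introduced here, and the code assumes a runtime with a global `fetch` (for example Node 18 or later).

```typescript
// Hypothetical types for this sketch; they are not part of any SDK.
interface ChatMessage {
  role: "system" | "user" | "assistant";
  content: string;
}

interface ChatRequest {
  url: string;
  method: "POST";
  headers: Record<string, string>;
  body: string;
}

const OPENROUTER_CHAT_URL = "https://openrouter.ai/api/v1/chat/completions";

// Builds the HTTP request without sending it, so it can be inspected or tested offline.
function buildChatRequest(
  apiKey: string,
  messages: ChatMessage[],
  model: string = "z-ai/glm-4.7-flash"
): ChatRequest {
  if (!apiKey) {
    throw new Error("missing OpenRouter API key");
  }
  return {
    url: OPENROUTER_CHAT_URL,
    method: "POST",
    headers: {
      Authorization: `Bearer ${apiKey}`,
      "Content-Type": "application/json",
    },
    body: JSON.stringify({ model, messages }),
  };
}

// Sends the request and returns the text of the first choice.
// Assumes the OpenAI-style response shape that the snippet above also reads.
async function sendChat(request: ChatRequest): Promise<string> {
  const response = await fetch(request.url, {
    method: request.method,
    headers: request.headers,
    body: request.body,
  });
  if (!response.ok) {
    throw new Error(`OpenRouter request failed with HTTP ${response.status}`);
  }
  const data = (await response.json()) as {
    choices: { message: { content: string } }[];
  };
  return data.choices[0].message.content;
}
```

A call would look like `await sendChat(buildChatRequest("<OPENROUTER_API_KEY>", [{ role: "user", content: "Hello!" }]))`. Keeping request construction separate from the network call makes it easy to check the model ID and headers without spending tokens.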

Other Models from Z.ai

Compare performance with other models from the same creator

Model          Latency   Cost/1M   Score
GLM 4.7        53.4s     $1.07     45%
GLM 5.1        19.9s     Free      44%
GLM 4.5 Air    30.1s     $0.49     -
GLM 4.6V       43.5s     $0.60     -
GLM 4 32B      18.7s     $0.10     -
GLM 4.5V       51.3s     $1.20     -
GLM 4.5        48.4s     $1.40     -
GLM 4.6        36.3s     $1.08     -
GLM 5V Turbo   -         $2.60     -
GLM 5 Turbo    9.2s      $2.60     -
GLM 5          6.2s      $1.26     -