Karl Lorey knowlege

Test specific knowledge of the LLM about Karl Lorey.

Jan 19, 2026

9 tasks

110 models

$1.3943

user_c636b9d7

Link only

ResultsPreliminary

Vote in the arena

30 of 110 models on the leaderboard so far. More join with each arena vote.

Gemini 3 Flash Preview

by Google

2.4s

$0.0034

33%

score

DeepSeek V3.2 Speciale

by DeepSeek

66.8s

$0.0184

17%

score

DeepSeek V3.2

by DeepSeek

5.4s

$0.0009

16%

score

Gemini 2.0 Flash

by Google

1.2s

$0.0003

11%

score

Gemini 2.5 Flash

by Google

1.3s

$0.0029

11%

score

Prompt Details

Expand each prompt to see per-model responses and reasoning.

Model Comparison

Compare performance across models and prompts.

Gemini 3 Flash Preview

by Google on OpenRouter

2.4s

$0.0034

33%

DeepSeek V3.2 Speciale

by DeepSeek on OpenRouter

66.8s

$0.0184

17%

DeepSeek V3.2

by DeepSeek on OpenRouter

5.4s

$0.0009

16%

Gemini 2.0 Flash

by Google on OpenRouter

1.2s

$0.0003

11%

Gemini 2.5 Flash

by Google on OpenRouter

1.3s

$0.0029

11%

Claude Sonnet 4.5

by Anthropic on OpenRouter

3.9s

$0.0156

11%

Auto Router

by OpenRouter on OpenRouter

5.5s

$0.0205

11%

Kimi K2 Thinking

by MoonshotAI on OpenRouter

23.6s

$0.0188

10%

DeepSeek V3 0324

by DeepSeek on OpenRouter

4.3s

$0.0009

GPT-5 Mini

by OpenAI on OpenRouter

11.7s

$0.0101

Model	Duration	Cost	Score
Gemini 3 Flash Preview by Google on OpenRouter	2.4s	$0.0034	33%
DeepSeek V3.2 Speciale by DeepSeek on OpenRouter	66.8s	$0.0184	17%
DeepSeek V3.2 by DeepSeek on OpenRouter	5.4s	$0.0009	16%
Gemini 2.0 Flash by Google on OpenRouter	1.2s	$0.0003	11%
Gemini 2.5 Flash by Google on OpenRouter	1.3s	$0.0029	11%
Claude Sonnet 4.5 by Anthropic on OpenRouter	3.9s	$0.0156	11%
Auto Router by OpenRouter on OpenRouter	5.5s	$0.0205	11%
Kimi K2 Thinking by MoonshotAI on OpenRouter	23.6s	$0.0188	10%
DeepSeek V3 0324 by DeepSeek on OpenRouter	4.3s	$0.0009	9%
GPT-5 Mini by OpenAI on OpenRouter	11.7s	$0.0101	9%

Value Analysis

Find models with the best balance of quality, cost, and speed.

Best value frontier

Best value

Size = duration

Highlighted models offer the best score at their price point. Larger dots take longer to produce a result.

Token Usage

Average tokens used per model across all prompts.

GLM 4.7OpenRouter

4,141 avg (27 in / 4,114 out)

DeepSeek V3.2 SpecialeOpenRouter

1,720 avg (26 in / 1,694 out)

Gemini 2.5 ProOpenRouter

1,422 avg (24 in / 1,398 out)

GPT-5 NanoOpenRouter

1,194 avg (27 in / 1,166 out)

gpt-oss-20bOpenRouter

1,067 avg (88 in / 979 out)