Karl Lorey knowlege

Test specific knowledge of the LLM about Karl Lorey.

Jan 19, 2026
9 tasks
110 models
$1.3943
user_c636b9d7
Link only

ResultsPreliminary

Vote in the arena

30 of 110 models on the leaderboard so far. More join with each arena vote.

Gemini 3 Flash Preview
by Google
33%
score
DeepSeek V3.2 Speciale
by DeepSeek
17%
score
DeepSeek V3.2
by DeepSeek
16%
score
4
Gemini 2.0 Flash
by Google
11%
score
5
Gemini 2.5 Flash
by Google
11%
score

Prompt Details

Expand each prompt to see per-model responses and reasoning.

Model Comparison

Compare performance across models and prompts.

Gemini 3 Flash Preview
by Google on OpenRouter
2.4s
$0.0034
33%
DeepSeek V3.2 Speciale
by DeepSeek on OpenRouter
66.8s
$0.0184
17%
DeepSeek V3.2
by DeepSeek on OpenRouter
5.4s
$0.0009
16%
Gemini 2.0 Flash
by Google on OpenRouter
1.2s
$0.0003
11%
Gemini 2.5 Flash
by Google on OpenRouter
1.3s
$0.0029
11%
Claude Sonnet 4.5
by Anthropic on OpenRouter
3.9s
$0.0156
11%
Auto Router
by OpenRouter on OpenRouter
5.5s
$0.0205
11%
Kimi K2 Thinking
by MoonshotAI on OpenRouter
23.6s
$0.0188
10%
DeepSeek V3 0324
by DeepSeek on OpenRouter
4.3s
$0.0009
9%
GPT-5 Mini
by OpenAI on OpenRouter
11.7s
$0.0101
9%

Value Analysis

Find models with the best balance of quality, cost, and speed.

Best value frontier
Best value
Size = duration

Highlighted models offer the best score at their price point. Larger dots take longer to produce a result.

Token Usage

Average tokens used per model across all prompts.

GLM 4.7OpenRouter
4,141 avg (27 in / 4,114 out)
DeepSeek V3.2 SpecialeOpenRouter
1,720 avg (26 in / 1,694 out)
Gemini 2.5 ProOpenRouter
1,422 avg (24 in / 1,398 out)
GPT-5 NanoOpenRouter
1,194 avg (27 in / 1,166 out)
gpt-oss-20bOpenRouter
1,067 avg (88 in / 979 out)