Niederstetten Benchmark

Jan 28, 2026
5 tasks
110 models
$0.5276
karllorey
Link only

Tests

Each test is one prompt sent to every model in the benchmark.

5 tests × 110 models = 1100 arena votes for reliable rankings.