Untitled Benchmark

May 26, 2026
50 tasks
41 models
$0.0092
user_c636b9d7
Public

Tests

Each test is one prompt sent to every model in the benchmark.

50 tests × 41 models = 4100 arena votes for reliable rankings.