Product Recommendation Bench

Benchmarks product recommendations for a diverse set of SaaS products

Jan 13, 2026
8 tasks
110 models
$0.0214
user_c636b9d7
Public

ResultsPreliminary

Vote in the arena

14 of 110 models on the leaderboard so far. More join with each arena vote.

Prompt Details

Expand each prompt to see per-model responses and reasoning.

Model Comparison

Compare performance across models and prompts.

Value Analysis

Find models with the best balance of quality, cost, and speed.

Best value frontier
Best value
Size = duration

Highlighted models offer the best score at their price point. Larger dots take longer to produce a result.

Token Usage

Average tokens used per model across all prompts.