Molmo2 8B (free)

Molmo2-8B is an open vision-language model developed by the Allen Institute for AI (Ai2) as part of the Molmo2 family, supporting image, video, and multi-image understanding and grounding. It is based on Qwen3-8B and uses SigLIP 2 as its vision backbone, outperforming other open-weight, open-data models on short videos, counting, and captioning, while remaining competitive on long-video tasks.

by AllenAI

Overview

Quick stats across all benchmark runs.

Score: — (4 benchmarks)
Avg Latency: 3.8s (8 requests)
Pricing: Free in / Free out (per 1M tokens)
Context: 37K tokens

Alternatives

Models with similar or better quality but different tradeoffs

No alternatives found

Run benchmarks on this model to discover alternatives

Benchmark Performance

How this model performs across different benchmarks

No benchmark data available

Run benchmarks with this model to see performance breakdown

Quickstart

Get started with this model using OpenRouter

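// Run as an ES module: the snippet uses top-level await.
import { OpenRouter } from "@openrouter/sdk";

// Replace the placeholder with your own API key.
const openrouter = new OpenRouter({
  apiKey: "<OPENROUTER_API_KEY>"
});

// Send a single user message to the free Molmo2 8B endpoint.
const completion = await openrouter.chat.completions.create({
  model: "allenai/molmo-2-8b:free",
  messages: [
    {
      role: "user",
      content: "Hello!"
    }
  ]
});

// Print the text of the first returned choice.
console.log(completion.choices[0].message.content);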

Get your API key at openrouter.ai/keys
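
The quickstart above sends text only, while the model description highlights image understanding. The following is a minimal sketch of an image question, assuming OpenRouter's OpenAI-compatible chat completions endpoint and its `image_url` content parts are accepted for this model. The helper names `buildImageRequest` and `askAboutImage` are hypothetical, and the image URL in the usage note is a placeholder.

```typescript
// Sketch only: the payload shape assumes OpenRouter's OpenAI-compatible
// chat completions API. Helper names are hypothetical, not part of any SDK.
type ContentPart =
  | { type: "text"; text: string }
  | { type: "image_url"; image_url: { url: string } };

interface ChatRequest {
  model: string;
  messages: { role: "user"; content: ContentPart[] }[];
}

const MODEL = "allenai/molmo-2-8b:free";

// Build a single-turn request that pairs a question with one image.
function buildImageRequest(question: string, imageUrl: string): ChatRequest {
  return {
    model: MODEL,
    messages: [
      {
        role: "user",
        content: [
          { type: "text", text: question },
          { type: "image_url", image_url: { url: imageUrl } },
        ],
      },
    ],
  };
}

// POST the request and return the first choice's text (empty string if absent).
async function askAboutImage(
  apiKey: string,
  question: string,
  imageUrl: string
): Promise<string> {
  const response = await fetch("https://openrouter.ai/api/v1/chat/completions", {
    method: "POST",
    headers: {
      Authorization: `Bearer ${apiKey}`,
      "Content-Type": "application/json",
    },
    body: JSON.stringify(buildImageRequest(question, imageUrl)),
  });
  if (!response.ok) {
    throw new Error(`OpenRouter request failed with status ${response.status}`);
  }
  const data = (await response.json()) as {
    choices?: { message?: { content?: string } }[];
  };
  return data.choices?.[0]?.message?.content ?? "";
}
```

A call might look like `await askAboutImage("<OPENROUTER_API_KEY>", "How many dogs are in this picture?", "https://example.com/dogs.jpg")`. Keeping the payload builder separate from the network call makes the request shape easy to inspect before anything is sent.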

Other Models from AllenAI

Compare performance with other models from the same creator

Model	Latency	Cost/1M tokens	Score
Olmo 3.1 32B Think	80.3s	$0.32	—
Olmo 3.1 32B Instruct	23.8s	$0.40	—
Olmo 3 7B Instruct	27.3s	$0.15	—
Olmo 3 7B Think	32.6s	$0.16	—
Olmo 2 32B Instruct	—	$0.13	—
Olmo 3 32B Think	82.2s	$0.32	—