The best open LLMs for your use case:

1Kimi K2.6Moonshot

1T-parameter MoE flagship from Moonshot with long-horizon coding, agent swarms scaling to 300 sub-agents, and state-of-the-art reasoning.

Speed:

Intelligence:

Price: (1M Tokens)

$1.20 / 4.50

Cached input: (1M Tokens)

$0.20

Context: (tokens)

262,144

Inputs:

ImageText

Benchmarks:

EQBench

Creative Writing

1561

SciCode

Coding Agents

52.2

MCP-Mark

Agents and Function Calling

55.9

MMMU-Pro

Multimodal - Vision

79.4

Apex Agents

Agents and Function Calling

27.9

FrontierCode

Coding Agents

3.8

LiveCodeBench

Coding Agents

89.6

Terminal-Bench 2.0

Coding Agents

66.7

EQBench

Creative Writing

1561

SciCode

Coding Agents

52.2

MCP-Mark

Agents and Function Calling

55.9

MMMU-Pro

Multimodal - Vision

79.4

Apex Agents

Agents and Function Calling

27.9

FrontierCode

Coding Agents

3.8

LiveCodeBench

Coding Agents

89.6

Terminal-Bench 2.0

Coding Agents

66.7

Try it out

2Qwen3.5 9BQwen

Compact 9B dense model from Qwen punching above its weight class on knowledge and coding benchmarks at a fraction of the cost.

Speed:

Intelligence:

Price: (1M Tokens)

$0.17 / 0.25

Context: (tokens)

262,144

Inputs:

ImageText

Benchmarks:

AA-LCR

Summarization

LongBenchv2

Summarization

55.2

MMMU

Multimodal - Vision

78.4

BFCL

Agents and Function Calling

66.1

TAU2-Bench

Agents and Function Calling

79.1

Multilingual MMLU

Multilingual

81.2

MMLU-Pro

General Knowledge

82.5

LiveCodeBench

Coding Agents

65.6

AA-LCR

Summarization

LongBenchv2

Summarization

55.2

MMMU

Multimodal - Vision

78.4

BFCL

Agents and Function Calling

66.1

TAU2-Bench

Agents and Function Calling

79.1

Multilingual MMLU

Multilingual

81.2

MMLU-Pro

General Knowledge

82.5

LiveCodeBench

Coding Agents

65.6

Try it out

Use case:

Creative Writing

Features:

Low Latency