The best LLMs for your use case:

1. DeepSeek-R1 (DeepSeek)

SOTA reasoning model trained with reinforcement learning, delivering strong performance on math, code, and logic tasks.

Speed:

Intelligence:

Price (per 1M tokens): $3.00

Inputs: Image, Text

JSON Mode:

Function Calling:

Benchmarks (rank, benchmark, category, score):

#1   LongBenchv2      Summarization       59.3
#1   GPQA-Diamond     General Knowledge   71.5
#1   MMLU-Pro         General Knowledge   84
#2   LiveBench        General Knowledge   72.49
#2   LMArena          Chat                1359
#2   LiveCodeBench    Code                65.9
#2   WebDevArena      Code                1199
#2   Aider Polyglot   Code                56.9
#3   MGSM             Multilingual        92.4

2. Qwen3 235B A22B (Qwen)

Hybrid instruct + reasoning model (MoE with 235B total parameters, 22B active) optimized for high-throughput, cost-efficient inference and distillation.

Speed:

Intelligence:

Price (per 1M tokens): $0.20

Inputs: Image, Text

JSON Mode:

Function Calling:

Benchmarks (rank, benchmark, category, score):

#2   LongBenchv2         Summarization                 50.1
#1   LiveBench           General Knowledge             73.23
#1   EQBench             Creative Writing              1271.6
#1   LiveCodeBench       Code                          80.4
#1   Aider Polyglot      Code                          59.6
#1   MGSM                Multilingual                  92.7
#1   BFCL                Agents and Function Calling   70.8
#2   GPQA-Diamond        General Knowledge             70
#2   MMLU-Pro            General Knowledge             83.66
#2   Multilingual MMLU   Multilingual                  82.8
#3   WebDevArena         Code                          1186
#11  LMArena             Chat                          45.92

Use case:

Summarization
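
To make the listed prices easier to compare, here is a rough sketch of how a per-1M-token price translates into the cost of a workload. It assumes the listed figure is a single blended price that applies to all tokens (the listing does not say whether input and output tokens are billed differently), and the workload size is a made-up example. The names PRICE_PER_MILLION and estimate_cost are illustrative, not part of any provider's API.

```python
# Prices in USD per 1M tokens, as listed for each model above.
PRICE_PER_MILLION = {
    "DeepSeek-R1": 3.00,
    "Qwen3 235B A22B": 0.20,
}


def estimate_cost(model: str, tokens: int) -> float:
    """Return the estimated USD cost of processing `tokens` tokens.

    Assumption: one blended price covers both input and output tokens.
    """
    return PRICE_PER_MILLION[model] * tokens / 1_000_000


# Hypothetical workload: summarizing 500 documents of about 4,000 tokens each.
workload_tokens = 500 * 4_000

for model in PRICE_PER_MILLION:
    print(f"{model}: ${estimate_cost(model, workload_tokens):.2f}")
```

Under these assumptions the same workload costs 15 times more on DeepSeek-R1 than on Qwen3 235B A22B, which is worth weighing against the gap in their LongBenchv2 summarization scores.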