The best LLMs for your use case:

1. DeepSeek-R1-0528 (DeepSeek)

SOTA reasoning model trained with reinforcement learning, delivering strong performance on math, code, and logic tasks.

Speed:

Intelligence:

Price (per 1M tokens): $3.00

Inputs:

ImageText

JSON Mode:

Function Calling:

Benchmarks:

#1 LongBenchv2 (Summarization): 59.3
#1 GPQA-Diamond (General Knowledge): 81
#1 MMLU-Pro (General Knowledge): 85
#1 WebDevArena (Code): 1409
#2 Aider Polyglot (Code): 71.4
#3 MGSM (Multilingual): 92.4
#4 LMArena (Chat): 1411
#4 LiveCodeBench (Code): 73.3
#4 SimpleQA (General Knowledge): 27.8
#6 EQBench (Creative Writing): 1270
#8 LiveBench (General Knowledge): 70.1
#17 BFCL (Agents and Function Calling): 37
2. Qwen3 235B A22B (Qwen)

Hybrid instruct + reasoning model (MoE with 235B total parameters, 22B active) optimized for high-throughput, cost-efficient inference and distillation.

Speed:

Intelligence:

Price (per 1M tokens): $0.20

Inputs:

ImageText

JSON Mode:

Function Calling:

Benchmarks:

#2 LongBenchv2 (Summarization): 50.1
#1 LiveCodeBench (Code): 80.4
#1 MGSM (Multilingual): 92.7
#2 Multilingual MMLU (Multilingual): 82.8
#4 MMLU-Pro (General Knowledge): 83.66
#4 Aider Polyglot (Code): 59.6
#5 GPQA-Diamond (General Knowledge): 70
#5 EQBench (Creative Writing): 1271.6
#5 BFCL (Agents and Function Calling): 70.9
#6 LiveBench (General Knowledge): 73.23
#8 WebDevArena (Code): 1186
#10 SimpleQA (General Knowledge): 11
#18 LMArena (Chat): 45.92

Use case:

Summarization
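
To put the listed prices in perspective, here is a minimal sketch that turns a per-1M-token price into an estimated cost for a given workload. The two prices come from the listings above. The function name `estimated_cost` and the 250,000-token workload are hypothetical. The sketch also assumes a single flat rate per token, since the page does not say whether input and output tokens are billed differently.

```python
# Listed prices in USD per 1M tokens, copied from the two model cards.
PRICE_PER_MILLION = {
    "DeepSeek-R1-0528": 3.00,
    "Qwen3 235B A22B": 0.20,
}


def estimated_cost(model: str, tokens: int) -> float:
    """Return the estimated USD cost of processing `tokens` tokens.

    Assumes one flat per-token rate (an assumption, not stated on the page).
    """
    return PRICE_PER_MILLION[model] * tokens / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: 250,000 tokens of summarization traffic.
    for model in PRICE_PER_MILLION:
        print(f"{model}: ${estimated_cost(model, 250_000):.2f}")
```

At these list prices the first model costs 15 times as much per token as the second, so the trade-off is between the benchmark ranks above and the volume of tokens you expect to process.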