The best LLMs for your use case:

1GLM-5.1Z.ai

Agentic-focused refinement of GLM-5 with improved coding, tool use, and reasoning capabilities.

Speed:

Intelligence:

Price: (1M Tokens)

$1.40 / 4.40

Inputs:

ImageText

JSON Mode:

Function Calling:

Benchmarks:

#1

LMArena

Chat

1467
#2

WebDevArena

Code

1449
#3

SimpleQA

General Knowledge

48
#4

GPQA-Diamond

General Knowledge

86.2
#4

MMLU-Pro

General Knowledge

87
#4

LiveCodeBench

Code

84.9
#1

LMArena

Chat

1467
#2

WebDevArena

Code

1449
#3

SimpleQA

General Knowledge

48
#4

GPQA-Diamond

General Knowledge

86.2
#4

MMLU-Pro

General Knowledge

87
#4

LiveCodeBench

Code

84.9
2Kimi K2.6Moonshot

1T-parameter MoE flagship from Moonshot with long-horizon coding, agent swarms scaling to 300 sub-agents, and state-of-the-art reasoning.

Speed:

Intelligence:

Price: (1M Tokens)

$1.20 / 4.50

Inputs:

ImageText

JSON Mode:

Function Calling:

Benchmarks:

#2

LMArena

Chat

1447
#1

GPQA-Diamond

General Knowledge

90.5
#1

EQBench

Creative Writing

1565.3
#2

LiveCodeBench

Code

89.6
#2

MMMU

Multimodal - Vision

84.3
#3

MMLU-Pro

General Knowledge

87.1
#3

WebDevArena

Code

1446
#3

LongBenchv2

Summarization

61
#3

BFCL

Agents and Function Calling

71.1
#5

SimpleQA

General Knowledge

36.9
#2

LMArena

Chat

1447
#1

GPQA-Diamond

General Knowledge

90.5
#1

EQBench

Creative Writing

1565.3
#2

LiveCodeBench

Code

89.6
#2

MMMU

Multimodal - Vision

84.3
#3

MMLU-Pro

General Knowledge

87.1
#3

WebDevArena

Code

1446
#3

LongBenchv2

Summarization

61
#3

BFCL

Agents and Function Calling

71.1
#5

SimpleQA

General Knowledge

36.9

Use case:

Chat

Features:

Long Context Handling