The best LLMs for your use case:

1. GLM-5.1 (Z.ai)

Agentic-focused refinement of GLM-5 with improved coding, tool use, and reasoning capabilities.

Speed:

Intelligence:

Price (per 1M tokens): $1.40 / $4.40

Inputs: Image, Text

JSON Mode:

Function Calling:

Benchmarks:

#1 LMArena (Chat): 1467
#2 WebDevArena (Code): 1449
#3 SimpleQA (General Knowledge): 48
#4 GPQA-Diamond (General Knowledge): 86.2
#4 MMLU-Pro (General Knowledge): 87
#4 LiveCodeBench (Code): 84.9

2. GPT-OSS 120B (OpenAI)

OpenAI's open-source 120B-parameter model, with MXFP4 quantization for efficient inference.

Speed:

Intelligence:

Price (per 1M tokens): $0.15 / $0.60

Inputs: Image, Text

JSON Mode:

Function Calling:

Benchmarks:

#4 LMArena (Chat): 1355
#2 Multilingual MMLU (Multilingual): 79.3
#4 EQBench (Creative Writing): 1152
#5 WebDevArena (Code): 1090
#7 GPQA-Diamond (General Knowledge): 73.1
#8 SimpleQA (General Knowledge): 16.8

Use case: Chat

Features: Low Latency
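
To compare the two listed prices on a concrete workload, the per-1M-token figures can be turned into a rough cost estimate. This is a minimal sketch, not part of the listing: it assumes the first figure in each price pair is the input price and the second is the output price, and both the function name `estimate_cost` and the token counts are hypothetical.

```python
# Prices copied from the listing, in USD per 1M tokens.
# Assumption: the pair is (input price, output price); the listing does not label them.
PRICES_PER_MILLION = {
    "GLM-5.1": (1.40, 4.40),
    "GPT-OSS 120B": (0.15, 0.60),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of a workload for the given model."""
    input_price, output_price = PRICES_PER_MILLION[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


# Hypothetical chat workload: 2M input tokens and 500k output tokens.
for name in PRICES_PER_MILLION:
    print(f"{name}: ${estimate_cost(name, 2_000_000, 500_000):.2f}")
```

Under these assumptions, the same workload costs several times more on GLM-5.1 than on GPT-OSS 120B, so the benchmark gap between the two has to be weighed against that price difference.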