The best LLMs for your use case:
Agentic-focused refinement of GLM-5 with improved coding, tool use, and reasoning capabilities.
Speed:
Intelligence:
Price: (1M Tokens)
$1.40 / 4.40Inputs:
JSON Mode:
Function Calling:
Benchmarks:
LMArena
Chat
WebDevArena
Code
SimpleQA
General Knowledge
GPQA-Diamond
General Knowledge
MMLU-Pro
General Knowledge
LiveCodeBench
Code
LMArena
Chat
WebDevArena
Code
SimpleQA
General Knowledge
GPQA-Diamond
General Knowledge
MMLU-Pro
General Knowledge
LiveCodeBench
Code
OpenAI's open-source 120B parameter model with MXFP4 quantization for efficient inference.
Speed:
Intelligence:
Price: (1M Tokens)
$0.15 / 0.60Inputs:
JSON Mode:
Function Calling:
Benchmarks:
LMArena
Chat
Multilingual MMLU
Multilingual
EQBench
Creative Writing
WebDevArena
Code
GPQA-Diamond
General Knowledge
SimpleQA
General Knowledge
LMArena
Chat
Multilingual MMLU
Multilingual
EQBench
Creative Writing
WebDevArena
Code
GPQA-Diamond
General Knowledge
SimpleQA
General Knowledge
Use case:
Chat
Features:
Low Latency