The best LLMs for your use case:
Hybrid instruct + reasoning model (232Bx22B MoE) optimized for high-throughput, cost-efficient inference and distillation.
Speed:
Intelligence:
Price: (1M Tokens)
$0.20Inputs:
JSON Mode:
Function Calling:
Benchmarks:
WebDevArena
Code
Aider Polyglot
Code
LiveCodeBench
Code
LiveBench
General Knowledge
EQBench
Creative Writing
MGSM
Multilingual
BFCL
Agents and Function Calling
GPQA-Diamond
General Knowledge
MMLU-Pro
General Knowledge
LongBenchv2
Summarization
Multilingual MMLU
Multilingual
LMArena
Chat
WebDevArena
Code
Aider Polyglot
Code
LiveCodeBench
Code
LiveBench
General Knowledge
EQBench
Creative Writing
MGSM
Multilingual
BFCL
Agents and Function Calling
GPQA-Diamond
General Knowledge
MMLU-Pro
General Knowledge
LongBenchv2
Summarization
Multilingual MMLU
Multilingual
LMArena
Chat
SOTA reasoning model trained with reinforcement learning, delivering strong performance on math, code, and logic tasks.
Speed:
Intelligence:
Price: (1M Tokens)
$3.00Inputs:
JSON Mode:
Function Calling:
Benchmarks:
LiveCodeBench
Code
Aider Polyglot
Code
WebDevArena
Code
GPQA-Diamond
General Knowledge
MMLU-Pro
General Knowledge
LMArena
Chat
LongBenchv2
Summarization
LiveBench
General Knowledge
MGSM
Multilingual
LiveCodeBench
Code
Aider Polyglot
Code
WebDevArena
Code
GPQA-Diamond
General Knowledge
MMLU-Pro
General Knowledge
LMArena
Chat
LongBenchv2
Summarization
LiveBench
General Knowledge
MGSM
Multilingual
Use case:
Code