The best LLMs for your use case:
Hybrid instruct + reasoning model (232Bx22B MoE) optimized for high-throughput, cost-efficient inference and distillation.
Speed:
Intelligence:
Price: (1M Tokens)
$0.20Inputs:
JSON Mode:
Function Calling:
Benchmarks:
EQBench
Creative Writing
LiveBench
General Knowledge
LiveCodeBench
Code
Aider Polyglot
Code
MGSM
Multilingual
BFCL
Agents and Function Calling
GPQA-Diamond
General Knowledge
MMLU-Pro
General Knowledge
LongBenchv2
Summarization
Multilingual MMLU
Multilingual
WebDevArena
Code
LMArena
Chat
EQBench
Creative Writing
LiveBench
General Knowledge
LiveCodeBench
Code
Aider Polyglot
Code
MGSM
Multilingual
BFCL
Agents and Function Calling
GPQA-Diamond
General Knowledge
MMLU-Pro
General Knowledge
LongBenchv2
Summarization
Multilingual MMLU
Multilingual
WebDevArena
Code
LMArena
Chat
Qwen series reasoning model excelling in complex tasks, outperforming conventional instruction-tuned models on hard problems.
Speed:
Intelligence:
Price: (1M Tokens)
$1.20Inputs:
JSON Mode:
Function Calling:
Benchmarks:
EQBench
Creative Writing
LiveBench
General Knowledge
LMArena
Chat
LiveCodeBench
Code
LongBenchv2
Summarization
Aider Polyglot
Code
MMLU-Pro
General Knowledge
BFCL
Agents and Function Calling
MGSM
Multilingual
EQBench
Creative Writing
LiveBench
General Knowledge
LMArena
Chat
LiveCodeBench
Code
LongBenchv2
Summarization
Aider Polyglot
Code
MMLU-Pro
General Knowledge
BFCL
Agents and Function Calling
MGSM
Multilingual
Use case:
Creative Writing