The best LLMs for your use case:
1T-parameter MoE flagship from Moonshot with long-horizon coding, agent swarms scaling to 300 sub-agents, and state-of-the-art reasoning.
Speed:
Intelligence:
Price: (1M Tokens)
$1.20 / 4.50Inputs:
JSON Mode:
Function Calling:
Benchmarks:
EQBench
Creative Writing
GPQA-Diamond
General Knowledge
LMArena
Chat
LiveCodeBench
Code
MMMU
Multimodal - Vision
MMLU-Pro
General Knowledge
WebDevArena
Code
LongBenchv2
Summarization
BFCL
Agents and Function Calling
SimpleQA
General Knowledge
EQBench
Creative Writing
GPQA-Diamond
General Knowledge
LMArena
Chat
LiveCodeBench
Code
MMMU
Multimodal - Vision
MMLU-Pro
General Knowledge
WebDevArena
Code
LongBenchv2
Summarization
BFCL
Agents and Function Calling
SimpleQA
General Knowledge
Qwen's native multimodal MoE model with 397B total parameters and 17B active, featuring hybrid Gated Delta Networks for strong reasoning and vision capabilities.
Speed:
Intelligence:
Price: (1M Tokens)
$0.60 / 3.60Inputs:
JSON Mode:
Function Calling:
Benchmarks:
EQBench
Creative Writing
MMLU-Pro
General Knowledge
Multilingual MMLU
Multilingual
MMMU
Multimodal - Vision
LongBenchv2
Summarization
BFCL
Agents and Function Calling
SimpleQA
General Knowledge
GPQA-Diamond
General Knowledge
LMArena
Chat
WebDevArena
Code
LiveCodeBench
Code
EQBench
Creative Writing
MMLU-Pro
General Knowledge
Multilingual MMLU
Multilingual
MMMU
Multimodal - Vision
LongBenchv2
Summarization
BFCL
Agents and Function Calling
SimpleQA
General Knowledge
GPQA-Diamond
General Knowledge
LMArena
Chat
WebDevArena
Code
LiveCodeBench
Code
Use case:
Creative Writing
Features:
JSON Mode