The best LLMs for your use case:
Compact 9B dense model from Qwen punching above its weight class on knowledge and coding benchmarks at a fraction of the cost.
Speed:
Intelligence:
Price: (1M Tokens)
$0.10 / 0.15Inputs:
JSON Mode:
Function Calling:
Benchmarks:
LongBenchv2
Summarization
EQBench
Creative Writing
Multilingual MMLU
Multilingual
BFCL
Agents and Function Calling
MMLU-Pro
General Knowledge
GPQA-Diamond
General Knowledge
LMArena
Chat
LiveCodeBench
Code
SimpleQA
General Knowledge
LongBenchv2
Summarization
EQBench
Creative Writing
Multilingual MMLU
Multilingual
BFCL
Agents and Function Calling
MMLU-Pro
General Knowledge
GPQA-Diamond
General Knowledge
LMArena
Chat
LiveCodeBench
Code
SimpleQA
General Knowledge
70B multilingual LLM optimized for dialogue, excelling in benchmarks and surpassing many chat models
Speed:
Intelligence:
Price: (1M Tokens)
$0.88Inputs:
JSON Mode:
Function Calling:
Benchmarks:
LongBenchv2
Summarization
Multilingual MMLU
Multilingual
BFCL
Agents and Function Calling
SimpleQA
General Knowledge
MMLU-Pro
General Knowledge
LMArena
Chat
LiveCodeBench
Code
GPQA-Diamond
General Knowledge
LongBenchv2
Summarization
Multilingual MMLU
Multilingual
BFCL
Agents and Function Calling
SimpleQA
General Knowledge
MMLU-Pro
General Knowledge
LMArena
Chat
LiveCodeBench
Code
GPQA-Diamond
General Knowledge
Use case:
Summarization
Features:
Low Latency