The best LLMs for your use case:

1. Qwen3.5 9B (Qwen)

Compact 9B dense model from Qwen punching above its weight class on knowledge and coding benchmarks at a fraction of the cost.

Speed:

Intelligence:

Price (per 1M tokens):

$0.10 / $0.15

Inputs:

Image, Text

JSON Mode:

Function Calling:

Benchmarks:

Rank  Benchmark          Category                     Score
#4    LongBenchv2        Summarization                48.9
#3    EQBench            Creative Writing             1210.1
#3    Multilingual MMLU  Multilingual                 75
#4    BFCL               Agents and Function Calling  65
#5    MMLU-Pro           General Knowledge            82.5
#6    GPQA-Diamond       General Knowledge            81.7
#6    LMArena            Chat                         1313
#6    LiveCodeBench      Code                         82.7
#7    SimpleQA           General Knowledge            18
2. Llama 3.3 70B Instruct Turbo (Meta)

70B multilingual LLM optimized for dialogue, excelling in benchmarks and surpassing many chat models.

Speed:

Intelligence:

Price (per 1M tokens):

$0.88

Inputs:

Image, Text

JSON Mode:

Function Calling:

Benchmarks:

Rank  Benchmark          Category                     Score
#5    LongBenchv2        Summarization                36.2
#5    Multilingual MMLU  Multilingual                 71.67
#5    BFCL               Agents and Function Calling  52.24
#6    SimpleQA           General Knowledge            20.9
#7    MMLU-Pro           General Knowledge            68.9
#7    LMArena            Chat                         1257
#7    LiveCodeBench      Code                         33.3
#9    GPQA-Diamond       General Knowledge            50.5

Use case:

Summarization

Features:

Low Latency