The best LLMs for your use case:

1. Kimi K2 Instruct 0905 (Moonshot)

Advanced multimodal AI model from Moonshot with 256k context length for instruction following.

Speed:

Intelligence:

Price (per 1M tokens): $1.20

Inputs: Image, Text

JSON Mode:

Function Calling:

Benchmarks:

#1 LMArena (Chat): 1421
#1 EQBench (Creative Writing): 1565.3
#3 GPQA-Diamond (General Knowledge): 75.1
#3 LiveBench (General Knowledge): 76.4
#3 Aider Polyglot (Code): 60
#3 SimpleQA (General Knowledge): 31
#4 WebDevArena (Code): 1314
#4 LongBenchv2 (Summarization): 44.9
#4 BFCL (Agents and Function Calling): 71.1
#9 MMLU-Pro (General Knowledge): 81.1
#10 LiveCodeBench (Code): 53.7
2. DeepSeek-V3.1 (DeepSeek)

Advanced Mixture-of-Experts model with improved efficiency and performance over V3, delivering enhanced reasoning capabilities at competitive cost.

Speed:

Intelligence:

Price (per 1M tokens): $0.71

Inputs: Image, Text

JSON Mode:

Function Calling:

Benchmarks:

#2 LMArena (Chat): 1419
#1 Aider Polyglot (Code): 76.3
#1 SimpleQA (General Knowledge): 93.4
#2 GPQA-Diamond (General Knowledge): 80.1
#2 MMLU-Pro (General Knowledge): 84.8
#2 LiveCodeBench (Code): 74.8
#2 WebDevArena (Code): 1364
#9 LiveBench (General Knowledge): 70.1

Use case:

Chat