Shyliuli 在 [持续更新]livebench0425+aider综合榜单(更新至qwen3 32b) 中发帖
更新至qwen3 32b
Model
Organization
Global Average
Reasoning Average
aider
Mathematics Average
Data Analysis Average
Language Average
IF Average
o3 High
OpenAI
81.19
93.33
79.6
85
67.02
76
86.17
o4-Mini High
OpenAI
77.39
88.11
72
84.9
68.33
66.05
84.96
Gemini 2.5 Pro Preview
76.99
87.53
72.9
89.16
62.47
69.31
80.59
Claude 3.7 Sonnet Thinking
Anthropic
73.12
76.17
64.9
79
69.11
68...