F-Droid posted in "DeepSeek open-sources its 685-billion-parameter V3 model: good news for wrapper companies?"
The results are this impressive:
| model | average | reasoning | coding | math | data_analysis | language | if | company |
|---|---|---|---|---|---|---|---|---|
| o1-2024-12-17-high | 75.67 | 91.58 | 69.69 | 80.32 | 65.47 | 65.39 | 81.55 | OpenAI |
| o1-preview-2024-09-12 | 65.79 | 67.42 | 50.85 | 65.49 | 67.69 | 68.72 | 74.60 | OpenAI |
| gemini-exp-1206 | 64.09 | 57.00 | 63.41 | 72.36 | 63.16 | 51.29 | 77.34 | |
| deepseek-v3 | 61.97 | 53.3 | 62.1 | 61.9 | 58.6 | 52.9 | 83.0 | DeepSeek |
gemini-2.0-flash-thinking-exp-1219
61.83
64....