F-DroidDeepSeek开源6850亿参数V3模型,利好套壳公司吗? 中发帖

成绩如此耀眼: 




model
average
reasoning
coding
math
data_analysis
language
if
company




o1-2024-12-17-high
75.67
91.58
69.69
80.32
65.47
65.39
81.55
OpenAI


o1-preview-2024-09-12
65.79
67.42
50.85
65.49
67.69
68.72
74.60
OpenAI


gemini-exp-1206
64.09
57.00
63.41
72.36
63.16
51.29
77.34
Google


deepseek-v3
61.97
53.3
62.1
61.9
58.6
52.9
83.0
DeepSeek


gemini-2.0-flash-thinking-exp-1219
61.83
64....