@lieyanqzu posted in "sonnet3.7 livebench scores are out, coding ability up by only 0.36 points"
[QQ screenshot 20250225091201]
[QQ screenshot 20250225091246]
The thinking model's coding score exceeds o1-high, but it is still well behind o3mini-high.