@lieyanqzu posted in "sonnet3.7 LiveBench scores are out: coding ability improved by only 0.36 points"

[QQ screenshot 20250225091201]
[QQ screenshot 20250225091246]
The thinking model's coding score exceeds o1-high, but it still trails o3-mini-high by a wide margin.