@popy posted in "opus 4.6 < opus 4.5"
According to evaluations on two platforms, opus 4.6's overall capability is actually worse than opus 4.5's.
[image]
SWE-Bench Verified Leaderboard
[image]