飛空 (@feikong) posted in "Overseas Vibe Code benchmark rankings: opus 4.7, gpt-5.5, deepseek V4, Kimi K2.6…, etc."

Key Takeaways

Claude Opus 4.7 now leads at 71.00% overall accuracy, ahead of GPT 5.4 (67.42%), GPT 5.3 Codex (61.77%), and Claude Opus 4.6 (Nonthinking) (57.57%).
The top seven models are relatively tightly clustered (71.00% down to 51.48%), followed by a sharp drop to the ...