@PSP posted in "MRCR v2 (needle in a haystack, 8 rounds) update: 1M results, GLM 5.2 ahead of V4 Pro, behind Gemini 3.5 Flash"
Leaderboard – Context Arena
1M-context results for selected models
AUC@1M scores
50.9%: gpt-5.5
46.9%: claude-opus-4.6
44.4%: claude-sonnet-4.6
43.3%: gemini-3.5-flash
41.8%: claude-opus-4.8
40.0%: gemini-3.1-pro-preview
38.2%: gpt-5.4
35.8%: gemini-3-flash-preview
33.0%: glm-5.2
28.3%: deepseek-v4-pro
25.4%: deepseek-v4-flash
15.8%: mimo-v2.5
15.3%: mimo-v2.5-pro
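
For anyone who wants to check the comparison in the thread title against the numbers above, here is a minimal Python sketch. The scores are copied by hand from the list; the `scores` dict, the `rank` mapping and the `gap` helper are hypothetical names used only for illustration and are not part of Context Arena.

```python
# AUC@1M scores (percent), copied from the list above.
scores = {
    "gpt-5.5": 50.9,
    "claude-opus-4.6": 46.9,
    "claude-sonnet-4.6": 44.4,
    "gemini-3.5-flash": 43.3,
    "claude-opus-4.8": 41.8,
    "gemini-3.1-pro-preview": 40.0,
    "gpt-5.4": 38.2,
    "gemini-3-flash-preview": 35.8,
    "glm-5.2": 33.0,
    "deepseek-v4-pro": 28.3,
    "deepseek-v4-flash": 25.4,
    "mimo-v2.5": 15.8,
    "mimo-v2.5-pro": 15.3,
}

# Rank models from highest to lowest score (1 = best among those listed).
ranked = sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
rank = {name: position + 1 for position, (name, _) in enumerate(ranked)}


def gap(a: str, b: str) -> float:
    """Difference in percentage points between model a and model b."""
    return round(scores[a] - scores[b], 1)


print("glm-5.2 rank among listed models:", rank["glm-5.2"])
print("glm-5.2 minus deepseek-v4-pro:", gap("glm-5.2", "deepseek-v4-pro"))
print("gemini-3.5-flash minus glm-5.2:", gap("gemini-3.5-flash", "glm-5.2"))
```

With the figures as listed, glm-5.2 sits between deepseek-v4-pro and gemini-3.5-flash, which matches the thread title. The ranking covers only the models quoted here, not the full leaderboard.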