Bunn (@BunnHack) 在 DeepSeek R1 在 Confabulations(幻覺)基準測試中的表現優於 o3-mini-medium 中发帖
[deepseek-r1-outperforms-o3-mini-medium-on-the-v0-yz8n6c9nycie1]
[deepseek-r1-outperforms-o3-mini-medium-on-the-v0-yz8n6c9nycie1]