𝓵𝓮𝔃𝓲𝓼𝓱𝓮𝓷 (@lezishen)“DeepSeek-V3 基于我们的架构打造”,Mistral CEO Arthur Mensch 逆天发言被喷 中发帖

[image] 
[image]
[image]
[image]
[image]
[image]
[image]
[image]
[image]
[image]
[image]
论文链接:

Mixtral:[2401.04088] Mixtral of Experts
DeepSeek:[2401.06066] DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
“DeepSeek-V3 基于我们的架构打造”,Mistral CEO Arthur Mensch 逆天发言被喷 - IT之家