@HCPTangHYDeepSeek开源稀疏注意力模型DeepSeek-V3.2-Exp 中发帖

deepseek-ai/DeepSeek-V3.2-Exp · Hugging Face — deepseek-ai/DeepSeek-V3.2-Exp · Hugging Face 
[image]
可能是对年初新科研成果 [2502.11089] Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention的技术验证