DeepSeek-v3.2-Exp: Long-Context Efficiency with DeepSeek Sparse Attention [pdf]

(github.com)

4 points | by g42gregory 8 hours ago

No comments yet.