Implement Flash Attention Back End in SGLang – Basics and KV Cache

(hebiao064.github.io)

35 points | by latchkey 3 days ago ago

5 comments