FlashAttention-T: Towards Tensorized Attention

(dl.acm.org)

72 points | by matt_d 4 hours ago

27 comments