How to Write a Fast Matrix Multiplication from Scratch with Tensor Cores (2024)

(alexarmbr.github.io)

147 points | by skidrow 5 days ago ago

18 comments