AutoMegaKernel: Compiling an LLM into a single CUDA kernel

(arxiv.org)

3 points | by OsamaJaber 12 hours ago

No comments yet.