Hierarchical Autoregressive Modeling for Memory-Efficient Language Generation

(arxiv.org)

46 points | by PaulHoule 3 days ago

3 comments