AdapTive-LeArning Speculator System (ATLAS): Faster LLM inference

(together.ai)

198 points | by alecco 4 days ago

48 comments