Speculative cascades – A hybrid approach for smarter, faster LLM inference

(research.google)

5 points | by emschwartz 14 hours ago ago

No comments yet.