vLLM-MLX – Run LLMs on Mac at 464 tok/s

(github.com)

33 points | by waybarrios 3 days ago ago

3 comments