Pushing the Limits of LLM Quantization via the Linearity Theorem

(arxiv.org)

95 points | by felineflock 4 days ago ago

3 comments