Profiling LLM Inference on Apple Silicon: A Quantization Perspective

(arxiv.org)

2 points | by diggan 20 hours ago

No comments yet.