DeepSeek-V3: Achieving Efficient LLM Scaling with 2,048 GPUs

(arxiv.org)

6 points | by qtwhat 20 hours ago

1 comment