4 points | by adityaathalye a day ago
1 comment
Earlier...
Training Compute-Optimal Large Language Models (2022) https://arxiv.org/abs/2203.15556
Chinchilla Scaling: A replication attempt (2024) https://arxiv.org/abs/2404.10102