PopuLoRA: Co-Evolving LLM Populations for Reasoning Self- Play

(vmax.ai)

29 points | by AMavorParker 2 hours ago ago

6 comments