Search-R1: Training LLMs to Reason and Leverage Search Engines with RL

(arxiv.org)

97 points | by jonbaer 2 days ago

12 comments