Summary of METR's predeployment evaluation of GPT-5.6 Sol

(metr.org)

6 points | by pongogogo 10 hours ago ago

5 comments