GDPVal: Measuring the performance of our models on real-world tasks

(openai.com)

25 points | by BGyss 8 hours ago

9 comments