HealthBench – An evaluation for AI systems and human health

(openai.com)

173 points | by mfiguiere 4 days ago

171 comments