EnigmaEval: A Benchmark of Long Multimodal Reasoning Challenges

(arxiv.org)

17 points | by apsec112 4 days ago

1 comment