Show HN: Cua-Bench – a benchmark for AI agents in GUI environments

(github.com)

34 points | by someguy101010 2 days ago

7 comments