A benchmark called OSWorld-Verified, designed to monitor AI's ability to navigate desktop environments, found that GPT 5.4 ...