Google’s new Android Bench ranks the top AI models for Android coding, with Gemini 3.1 Pro Preview leading Claude Opus 4.6 and GPT-5.2-Codex.
Abstract: We present a research-based framework for measuring successful environments on software teams for long-term and sustainable sociocognitive problem-solving. Across 1,282 full-time developers ...
OpenAI launches GPT-5.4, calling it its most capable and efficient AI model yet, with AI agents, computer control, improved reasoning, and a 1M-token context.