Add Popular Science (opens in a new tab) More information Adding us as a Preferred Source in Google by using this link indicates that you would like to see more of our content in Google News results.
We live in a world where AI companies like OpenAI and Google are constantly looking for new ways to pit their AI models against each other. One of the most recent attempts to measure how top AI models ...
OpenAI’s o3 defeated Elon Musk’s Grok 4 at chess Magnus Carlsen delivered biting commentary on the quality of Grok's logic Grok 4 made repeated blunders, while o3 played steady The AI chess tournament ...
The world’s top performing artificial intelligence models, including OpenAI’s o3 and 04-mini, Google LLC’s Gemini 2.5 Pro and Gemini 2.5 Flash, Anthropic’s Claude Opus 4, and xAI Corp.’s Grok 4 are ...
Booth is a reporter at TIME. Virtual chess pieces in the data matrix. 3d illustration. Booth is a reporter at TIME. Complex games like chess and Go have long been used to test AI models’ capabilities.
Palisade Research recently detailed a ChatGPT experiment in which a reasoning model was told to play chess against a more powerful opponent and win. Rather than attempt to beat the stronger opponent, ...
When IBM’s Deep Blue first defeated Garry Kasparov in 1997, the world chess champion accused the company of cheating. There was no way, he thought, that the computer could have beaten him without ...