Forum: TFSI

OpenAI o3 checkmates Grok in a chess showdown, and it wasn't even

From TechnologyDaily@1337:1/100 to All on Thu Aug 14 04:15:06 2025

OpenAI o3 checkmates Grok in a chess showdown, and it wasn't even close

Date:
Thu, 14 Aug 2025 03:00:00 +0000

Description:
OpenAIs o3 crushed Grok 4 in an AI chess tournament, 4-0.

FULL STORY ======================================================================OpenAIs o3 defeated Elon Musks Grok 4 at chess Magnus Carlsen delivered biting commentary on the quality of Grok's logic Grok 4 made repeated blunders,
while o3 played steady

The AI chess tournament between OpenAIs o3 model and xAI's Grok 4 invited plenty of speculation as a kind of proxy battle between the two companies and their respective CEOs. Any comparison to the days of Deep Blue and Bobby Fischer soon faded, though, as OpenAI o3 repeatedly wiped out Grok 4, winning four games in a row, accompanied by the derisive commentary of former world chess champion Magnus Carlsen and grandmaster David Howell.

The showdown happened on Kaggles Game Arena, a digital coliseum where AI models battle in chess and other games. The tournament featured eight of the most prominent LLMs in the business: OpenAIs o3 and o4-mini, Googles Gemini 2.5 Pro and Flash, Anthropics Claude Opus, Moonshots DeepSeek and Kimi, and xAIs Grok 4. The final came down to Grok and o3, but Grok's performance in
the final round didn't seem like a battle of champions.

Carlsen and Howell veered between serious commentary and a roast as Groks performance came off as somewhat erratic. In the first game, it quickly sacrificed its bishop, then began trading pieces like it was in a hurry to go home. Things didn't improve in the next game for Grok.

[Grok] is like that one guy in a club tournament who has learnt theory and literally knows nothing else," Carlsen said during the second game. "Makes
the worst blunders after that.

Groks performance was so off-the-rails that Carlsen rated it around 800 ELO, or slightly above a beginner. He gave o3 a modest but respectable 1200, in
the middle of most hobby players. Though o3 didnt play brilliantly, it didnt have to. It played solid chess. It didnt blunder pieces. It converted its advantages and carried out the classic chess moves.

o3 is fairly ruthless in conversions; it looks like a chess player. Grok
looks like it learnt a few opening moves and knows the rules, but not much more.," Carlsen said. "Groks moves are chess-related moves. They just came at the wrong time and in weird sequences. Chess AI

The chess wasn't the main point of the tournament, despite its prominence. It was about how general-purpose AI models handle events with strict rules like chess games. Turns out, they're not great, but o3 is the best of the limited sample. As AI becomes embedded in everything, the ability to follow rules and spot patterns becomes essential. Chess is a uniquely transparent way to observe that. You either made the right move or you didnt. When a model plays well, you can see the logic; otherwise, queens fall like dominoes, and the game becomes as confused as that metaphor.

Chess is a window into how well an AI can plan, evaluate options, avoid catastrophic mistakes, and stay logically consistent. If Grok throws away a queen because it doesnt grasp long-term consequences, what might it do in a legal document, or when booking travel?

That the final was between OpenAI and xAI did add some drama with Sam Altman and Elon Musk at loggerheads in public . The chess final didnt resolve the battle between them, but it did give OpenAI a PR win in the realm of public perception, and a limited but very real compliment from Magnus Carlsen. You might also like ChatGPT is no match for a 40-year-old digital Pocket Chess game, and I bet Garry Kasparov would be pleased Grok may start remembering everything you ask it to do, according to new reports I tried Groks new AI image editing features theyre fun but wont replace Photoshop any time soon Grok 3s voice mode is unhinged, and thats the point

======================================================================
Link to news story: https://www.techradar.com/ai-platforms-assistants/openai-o3-checkmates-grok-in -a-chess-showdown-and-it-wasnt-even-close

--- Mystic BBS v1.12 A49 (Linux/64)
* Origin: tqwNet Technology News (1337:1/100)

Who's Online
Recent Visitors
- Guest
  Fri Aug 22 00:08:52 2025
  from Vitoria via Telnet
- gretchiie
  Mon Aug 18 07:04:13 2025
  from austin tx via Telnet
- Guest
  Sat Aug 16 15:41:31 2025
  from Awelfhbwlefu via SSH
- CyberNix
  Thu Aug 7 22:36:50 2025
  from London, UK via SSH

System Info

Sysop:	CyberNix
Location:	London, UK
Users:	22
Nodes:	10 (0 / 10)
Uptime:	77:36:32
Calls:	899
Files:	4,596
Messages:	688,281

OpenAI o3 checkmates Grok in a chess showdown, and it wasn't even

Who's Online

Recent Visitors

System Info