HeadlinesBriefing favicon HeadlinesBriefing.com

AI training flaws exposed in simple games

Ars Technica •
×

Google's DeepMind Alpha AIs have mastered complex games like chess and Go through self-play training, yet researchers discovered these systems fail dramatically at simpler games requiring mathematical reasoning. Bei Zhou and Soren Riis found that AlphaZero's training approach works poorly for games like Nim, where victory depends on understanding a parity function. Performance gains essentially stopped after 500 training iterations on seven-row boards.

The problem stems from how these AIs learn. AlphaZero excels at associative learning, connecting board configurations with winning probabilities, but cannot develop the symbolic reasoning needed to understand mathematical functions behind impartial games. When researchers tested the system on larger Nim boards, the trained AI performed no better than one making random moves.

This limitation extends beyond Nim - similar blind spots exist in chess-playing AIs that miss certain long-term mating combinations. The research reveals a "tangible, catastrophic failure mode" in current AI training approaches, particularly concerning for math problems requiring symbolic reasoning. Understanding these limitations becomes crucial as AI systems take on more complex decision-making tasks.