Evidence of Learned Look-Ahead in a Chess-Playing Neural Network

Jenner, Erik; Kapur, Shreyas; Georgiev, Vasil; Allen, Cameron; Emmons, Scott; Russell, Stuart

Computer Science > Machine Learning

arXiv:2406.00877 (cs)

[Submitted on 2 Jun 2024]

Title:Evidence of Learned Look-Ahead in a Chess-Playing Neural Network

Authors:Erik Jenner, Shreyas Kapur, Vasil Georgiev, Cameron Allen, Scott Emmons, Stuart Russell

View PDF HTML (experimental)

Abstract:Do neural networks learn to implement algorithms such as look-ahead or search "in the wild"? Or do they rely purely on collections of simple heuristics? We present evidence of learned look-ahead in the policy network of Leela Chess Zero, the currently strongest neural chess engine. We find that Leela internally represents future optimal moves and that these representations are crucial for its final output in certain board states. Concretely, we exploit the fact that Leela is a transformer that treats every chessboard square like a token in language models, and give three lines of evidence (1) activations on certain squares of future moves are unusually important causally; (2) we find attention heads that move important information "forward and backward in time," e.g., from squares of future moves to squares of earlier ones; and (3) we train a simple probe that can predict the optimal move 2 turns ahead with 92% accuracy (in board states where Leela finds a single best line). These findings are an existence proof of learned look-ahead in neural networks and might be a step towards a better understanding of their capabilities.

Comments:	Project page: this https URL
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2406.00877 [cs.LG]
	(or arXiv:2406.00877v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2406.00877

Submission history

From: Erik Jenner [view email]
[v1] Sun, 2 Jun 2024 21:57:32 UTC (2,458 KB)

Computer Science > Machine Learning

Title:Evidence of Learned Look-Ahead in a Chess-Playing Neural Network

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Evidence of Learned Look-Ahead in a Chess-Playing Neural Network

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators