Hierarchical autoregressive neural networks for statistical systems

Białas, Piotr; Korcyl, Piotr; Stebel, Tomasz

doi:10.1016/j.cpc.2022.108502

arXiv:2203.10989 (cond-mat)

[Submitted on 21 Mar 2022 (v1), last revised 15 Nov 2022 (this version, v2)]

Abstract: It was recently proposed that neural networks could be used to approximate many-dimensional probability distributions that appear, e.g., in lattice field theories or statistical mechanics. Subsequently, they can be used as variational approximators to assess extensive properties of statistical systems, such as the free energy, and also as neural samplers in Monte Carlo simulations. The practical application of this approach is unfortunately limited by the unfavorable scaling, with the system size, of both the numerical cost required for training and the memory requirements. This is because the original proposal involved a neural network whose width scaled with the total number of degrees of freedom, e.g. $L^2$ in the case of a two-dimensional $L\times L$ lattice. In this work we propose a hierarchical association of physical degrees of freedom, for instance spins, to neurons, which replaces this scaling with a scaling with the linear extent $L$ of the system. We demonstrate our approach on the two-dimensional Ising model by simulating lattices of various sizes up to $128 \times 128$ spins, with time benchmarks reaching lattices of size $512 \times 512$. We observe that our proposal improves the quality of neural network training, i.e. the approximated probability distribution is closer to the target than could be previously achieved. As a consequence, the variational free energy reaches a value closer to its theoretical expectation and, if applied in a Markov Chain Monte Carlo algorithm, the resulting autocorrelation time is smaller. Finally, the replacement of a single neural network by a hierarchy of smaller networks considerably reduces the memory requirements.
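
The variational free energy mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' hierarchical network: as an assumption for illustration, the trained autoregressive model is replaced by a product distribution in which every spin points up with the same probability `p_up` (the simplest autoregressive case, with constant conditionals). The lattice is a hypothetical $3 \times 3$ Ising system with periodic boundaries and coupling $J = 1$, small enough to enumerate all $2^9$ configurations exactly. All function names are illustrative. The sketch evaluates $F_q = \sum_s q(s)\,[\beta^{-1}\log q(s) + H(s)]$, which by the Gibbs inequality cannot fall below the exact free energy $F = -\beta^{-1}\log Z$.

```python
import itertools
import numpy as np


def ising_energy(spins):
    """Energy of a 2D Ising configuration, periodic boundaries, J = 1 (assumed)."""
    right = np.roll(spins, -1, axis=1)  # right-hand neighbour of every site
    down = np.roll(spins, -1, axis=0)   # lower neighbour of every site
    return -float(np.sum(spins * right + spins * down))


def log_q(spins, p_up):
    """Log-probability under a product distribution.

    This stands in for a trained autoregressive network; it is an
    assumption made for illustration, not the model from the paper.
    """
    n_up = int(np.sum(spins == 1))
    n_down = spins.size - n_up
    return float(n_up * np.log(p_up) + n_down * np.log(1.0 - p_up))


def all_configs(L):
    """Yield every spin configuration of an L x L lattice (small L only)."""
    for bits in itertools.product([-1, 1], repeat=L * L):
        yield np.array(bits).reshape(L, L)


def exact_free_energy(L, beta):
    """F = -(1/beta) log Z, computed by brute-force enumeration."""
    a = np.array([-beta * ising_energy(s) for s in all_configs(L)])
    m = a.max()
    log_z = m + np.log(np.sum(np.exp(a - m)))  # log-sum-exp for stability
    return -log_z / beta


def variational_free_energy(L, beta, p_up):
    """F_q = sum_s q(s) [ log q(s) / beta + H(s) ], evaluated exactly."""
    total = 0.0
    for s in all_configs(L):
        lq = log_q(s, p_up)
        total += np.exp(lq) * (lq / beta + ising_energy(s))
    return total


if __name__ == "__main__":
    L, beta = 3, 0.4
    print("exact F      :", exact_free_energy(L, beta))
    for p_up in (0.3, 0.5, 0.8):
        print(f"F_q (p={p_up}):", variational_free_energy(L, beta, p_up))
```

In practice the sum over configurations is replaced by an average over samples drawn from the network, and the gap $F_q - F$ shrinks as the network approximates the Boltzmann distribution more closely; the exact enumeration here only serves to make the bound checkable on a tiny lattice.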

Comments:	14 pages, 6 figures
Subjects:	Statistical Mechanics (cond-mat.stat-mech); Machine Learning (cs.LG); High Energy Physics - Lattice (hep-lat); Machine Learning (stat.ML)
Cite as:	arXiv:2203.10989 [cond-mat.stat-mech]
	(or arXiv:2203.10989v2 [cond-mat.stat-mech] for this version)
	https://doi.org/10.48550/arXiv.2203.10989
Journal reference:	Comput.Phys.Commun. 281 (2022) 108502
Related DOI:	https://doi.org/10.1016/j.cpc.2022.108502

Submission history

From: Tomasz Stebel
[v1] Mon, 21 Mar 2022 13:55:53 UTC (321 KB)
[v2] Tue, 15 Nov 2022 21:24:19 UTC (1,026 KB)
