Search | arXiv e-print repository

arXiv:2505.20406 [pdf, ps, other]

Leveraging recurrence in neural network wavefunctions for large-scale simulations of Heisenberg antiferromagnets: the triangular lattice

Authors: M. Schuyler Moss, Roeland Wiersema, Mohamed Hibat-Allah, Juan Carrasquilla, Roger G. Melko

Abstract: Variational Monte Carlo simulations have been crucial for understanding quantum many-body systems, especially when the Hamiltonian is frustrated and the ground-state wavefunction has a non-trivial sign structure. In this paper, we use recurrent neural network (RNN) wavefunction ansätze to study the triangular-lattice antiferromagnetic Heisenberg model (TLAHM) for lattice sizes up to $30\times30$.… ▽ More Variational Monte Carlo simulations have been crucial for understanding quantum many-body systems, especially when the Hamiltonian is frustrated and the ground-state wavefunction has a non-trivial sign structure. In this paper, we use recurrent neural network (RNN) wavefunction ansätze to study the triangular-lattice antiferromagnetic Heisenberg model (TLAHM) for lattice sizes up to $30\times30$. In a recent study [M. S. Moss et al. arXiv:2502.17144], the authors demonstrated how RNN wavefunctions can be iteratively retrained in order to obtain variational results for multiple lattice sizes with a reasonable amount of compute. That study, which looked at the sign-free, square-lattice antiferromagnetic Heisenberg model, showed favorable scaling properties, allowing accurate finite-size extrapolations to the thermodynamic limit. In contrast, our present results illustrate in detail the relative difficulty in simulating the sign-problematic TLAHM. We find that the accuracy of our simulations can be significantly improved by transforming the Hamiltonian with a judicious choice of basis rotation. We also show that a similar benefit can be achieved by using variational neural annealing, an alternative optimization technique that minimizes a pseudo free energy. Ultimately, we are able to obtain estimates of the ground-state properties of the TLAHM in the thermodynamic limit that are in close agreement with values in the literature, showing that RNN wavefunctions provide a powerful toolbox for performing finite-size scaling studies for frustrated quantum many-body systems. △ Less

Submitted 16 June, 2025; v1 submitted 26 May, 2025; originally announced May 2025.

Comments: 22 pages, 15 figures, 5 tables

arXiv:2502.20632 [pdf, other]

Lattice Protein Folding with Variational Annealing

Authors: Shoummo Ahsan Khandoker, Estelle M. Inack, Mohamed Hibat-Allah

Abstract: Understanding the principles of protein folding is a cornerstone of computational biology, with implications for drug design, bioengineering, and the understanding of fundamental biological processes. Lattice protein folding models offer a simplified yet powerful framework for studying the complexities of protein folding, enabling the exploration of energetically optimal folds under constrained co… ▽ More Understanding the principles of protein folding is a cornerstone of computational biology, with implications for drug design, bioengineering, and the understanding of fundamental biological processes. Lattice protein folding models offer a simplified yet powerful framework for studying the complexities of protein folding, enabling the exploration of energetically optimal folds under constrained conditions. However, finding these optimal folds is a computationally challenging combinatorial optimization problem. In this work, we introduce a novel upper-bound training scheme that employs masking to identify the lowest-energy folds in two-dimensional Hydrophobic-Polar (HP) lattice protein folding. By leveraging Dilated Recurrent Neural Networks (RNNs) integrated with an annealing process driven by temperature-like fluctuations, our method accurately predicts optimal folds for benchmark systems of up to 60 beads. Our approach also effectively masks invalid folds from being sampled without compromising the autoregressive sampling properties of RNNs. This scheme is generalizable to three spatial dimensions and can be extended to lattice protein models with larger alphabets. Our findings emphasize the potential of advanced machine learning techniques in tackling complex protein folding problems and a broader class of constrained combinatorial optimization challenges. △ Less

Submitted 27 February, 2025; originally announced February 2025.

Comments: Github respository will be provided soon

arXiv:2502.17144 [pdf, ps, other]

Leveraging recurrence in neural network wavefunctions for large-scale simulations of Heisenberg antiferromagnets: the square lattice

Authors: M. Schuyler Moss, Roeland Wiersema, Mohamed Hibat-Allah, Juan Carrasquilla, Roger G. Melko

Abstract: Machine-learning-based variational Monte Carlo simulations are a promising approach for targeting quantum many-body ground states, especially in two dimensions and in cases where the ground state is known to have a non-trivial sign structure. While many state-of-the-art variational energies have been reached with these methods for finite-size systems, little work has been done to use these results… ▽ More Machine-learning-based variational Monte Carlo simulations are a promising approach for targeting quantum many-body ground states, especially in two dimensions and in cases where the ground state is known to have a non-trivial sign structure. While many state-of-the-art variational energies have been reached with these methods for finite-size systems, little work has been done to use these results to extract information about the target state in the thermodynamic limit. In this work, we employ recurrent neural networks (RNNs) as a variational ansätze, and leverage their recurrent nature to simulate the ground states of progressively larger systems through iterative retraining. This transfer learning technique allows us to simulate spin-$\frac{1}{2}$ systems on lattices with more than 1,000 spins without beginning optimization from scratch for each system size, thus reducing the demands for computational resources. In this study, we focus on the square-lattice antiferromagnetic Heisenberg model, where it is possible to carefully benchmark our results. We show that we are able to systematically improve the accuracy of the results from our simulations by increasing the training time, and obtain results for finite-sized lattices that are in good agreement with the literature values. Furthermore, we use these results to extract accurate estimates of the ground-state properties in the thermodynamic limit. This work demonstrates that RNN wavefunctions can be used to accurately study quantum many-body physics in the thermodynamic limit. △ Less

Submitted 16 June, 2025; v1 submitted 24 February, 2025; originally announced February 2025.

Comments: 19 pages, 13 figures, 6 tables

arXiv:2405.20384 [pdf, other]

Recurrent neural network wave functions for Rydberg atom arrays on kagome lattice

Authors: Mohamed Hibat-Allah, Ejaaz Merali, Giacomo Torlai, Roger G Melko, Juan Carrasquilla

Abstract: Rydberg atom array experiments have demonstrated the ability to act as powerful quantum simulators, preparing strongly-correlated phases of matter which are challenging to study for conventional computer simulations. A key direction has been the implementation of interactions on frustrated geometries, in an effort to prepare exotic many-body states such as spin liquids and glasses. In this paper,… ▽ More Rydberg atom array experiments have demonstrated the ability to act as powerful quantum simulators, preparing strongly-correlated phases of matter which are challenging to study for conventional computer simulations. A key direction has been the implementation of interactions on frustrated geometries, in an effort to prepare exotic many-body states such as spin liquids and glasses. In this paper, we apply two-dimensional recurrent neural network (RNN) wave functions to study the ground states of Rydberg atom arrays on the kagome lattice. We implement an annealing scheme to find the RNN variational parameters in regions of the phase diagram where exotic phases may occur, corresponding to rough optimization landscapes. For Rydberg atom array Hamiltonians studied previously on the kagome lattice, our RNN ground states show no evidence of exotic spin liquid or emergent glassy behavior. In the latter case, we argue that the presence of a non-zero Edwards-Anderson order parameter is an artifact of the long autocorrelations times experienced with quantum Monte Carlo simulations. This result emphasizes the utility of autoregressive models, such as RNNs, to explore Rydberg atom array physics on frustrated lattices and beyond. △ Less

Submitted 30 May, 2024; originally announced May 2024.

Comments: 13 pages, 5 figures, 3 tables. Link to GitHub repository: https://github.com/mhibatallah/RNNWavefunctions

arXiv:2303.15626 [pdf, other]

A Framework for Demonstrating Practical Quantum Advantage: Racing Quantum against Classical Generative Models

Authors: Mohamed Hibat-Allah, Marta Mauri, Juan Carrasquilla, Alejandro Perdomo-Ortiz

Abstract: Generative modeling has seen a rising interest in both classical and quantum machine learning, and it represents a promising candidate to obtain a practical quantum advantage in the near term. In this study, we build over a proposed framework for evaluating the generalization performance of generative models, and we establish the first quantitative comparative race towards practical quantum advant… ▽ More Generative modeling has seen a rising interest in both classical and quantum machine learning, and it represents a promising candidate to obtain a practical quantum advantage in the near term. In this study, we build over a proposed framework for evaluating the generalization performance of generative models, and we establish the first quantitative comparative race towards practical quantum advantage (PQA) between classical and quantum generative models, namely Quantum Circuit Born Machines (QCBMs), Transformers (TFs), Recurrent Neural Networks (RNNs), Variational Autoencoders (VAEs), and Wasserstein Generative Adversarial Networks (WGANs). After defining four types of PQAs scenarios, we focus on what we refer to as potential PQA, aiming to compare quantum models with the best-known classical algorithms for the task at hand. We let the models race on a well-defined and application-relevant competition setting, where we illustrate and demonstrate our framework on 20 variables (qubits) generative modeling task. Our results suggest that QCBMs are more efficient in the data-limited regime than the other state-of-the-art classical generative models. Such a feature is highly desirable in a wide range of real-world applications where the available data is scarce. △ Less

Submitted 27 March, 2023; originally announced March 2023.

Comments: 17 pages, 5 figures, 3 tables

arXiv:2303.11207 [pdf, other]

doi 10.1103/PhysRevB.108.075152

Investigating Topological Order using Recurrent Neural Networks

Authors: Mohamed Hibat-Allah, Roger G. Melko, Juan Carrasquilla

Abstract: Recurrent neural networks (RNNs), originally developed for natural language processing, hold great promise for accurately describing strongly correlated quantum many-body systems. Here, we employ 2D RNNs to investigate two prototypical quantum many-body Hamiltonians exhibiting topological order. Specifically, we demonstrate that RNN wave functions can effectively capture the topological order of t… ▽ More Recurrent neural networks (RNNs), originally developed for natural language processing, hold great promise for accurately describing strongly correlated quantum many-body systems. Here, we employ 2D RNNs to investigate two prototypical quantum many-body Hamiltonians exhibiting topological order. Specifically, we demonstrate that RNN wave functions can effectively capture the topological order of the toric code and a Bose-Hubbard spin liquid on the kagome lattice by estimating their topological entanglement entropies. We also find that RNNs favor coherent superpositions of minimally-entangled states over minimally-entangled states themselves. Overall, our findings demonstrate that RNN wave functions constitute a powerful tool to study phases of matter beyond Landau's symmetry-breaking paradigm. △ Less

Submitted 25 October, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

Comments: 15 pages, 6 figures, 2 tables. Published version in Physical Review B

Journal ref: Phys. Rev. B 108, 075152, August 2023

arXiv:2302.04919 [pdf, other]

doi 10.1126/science.adg9774

Variational Benchmarks for Quantum Many-Body Problems

Authors: Dian Wu, Riccardo Rossi, Filippo Vicentini, Nikita Astrakhantsev, Federico Becca, Xiaodong Cao, Juan Carrasquilla, Francesco Ferrari, Antoine Georges, Mohamed Hibat-Allah, Masatoshi Imada, Andreas M. Läuchli, Guglielmo Mazzola, Antonio Mezzacapo, Andrew Millis, Javier Robledo Moreno, Titus Neupert, Yusuke Nomura, Jannes Nys, Olivier Parcollet, Rico Pohle, Imelda Romero, Michael Schmid, J. Maxwell Silvester, Sandro Sorella , et al. (8 additional authors not shown)

Abstract: The continued development of computational approaches to many-body ground-state problems in physics and chemistry calls for a consistent way to assess its overall progress. In this work, we introduce a metric of variational accuracy, the V-score, obtained from the variational energy and its variance. We provide an extensive curated dataset of variational calculations of many-body quantum systems,… ▽ More The continued development of computational approaches to many-body ground-state problems in physics and chemistry calls for a consistent way to assess its overall progress. In this work, we introduce a metric of variational accuracy, the V-score, obtained from the variational energy and its variance. We provide an extensive curated dataset of variational calculations of many-body quantum systems, identifying cases where state-of-the-art numerical approaches show limited accuracy, and future algorithms or computational platforms, such as quantum computing, could provide improved accuracy. The V-score can be used as a metric to assess the progress of quantum variational methods toward a quantum advantage for ground-state problems, especially in regimes where classical verifiability is impossible. △ Less

Submitted 22 October, 2024; v1 submitted 9 February, 2023; originally announced February 2023.

Comments: 27 pages, 6 figures

Journal ref: Science 386, 296-301 (2024)

arXiv:2207.14314 [pdf, other]

Supplementing Recurrent Neural Network Wave Functions with Symmetry and Annealing to Improve Accuracy

Authors: Mohamed Hibat-Allah, Roger G. Melko, Juan Carrasquilla

Abstract: Recurrent neural networks (RNNs) are a class of neural networks that have emerged from the paradigm of artificial intelligence and has enabled lots of interesting advances in the field of natural language processing. Interestingly, these architectures were shown to be powerful ansatze to approximate the ground state of quantum systems. Here, we build over the results of [Phys. Rev. Research 2, 023… ▽ More Recurrent neural networks (RNNs) are a class of neural networks that have emerged from the paradigm of artificial intelligence and has enabled lots of interesting advances in the field of natural language processing. Interestingly, these architectures were shown to be powerful ansatze to approximate the ground state of quantum systems. Here, we build over the results of [Phys. Rev. Research 2, 023358 (2020)] and construct a more powerful RNN wave function ansatz in two dimensions. We use symmetry and annealing to obtain accurate estimates of ground state energies of the two-dimensional (2D) Heisenberg model, on the square lattice and on the triangular lattice. We show that our method is superior to Density Matrix Renormalisation Group (DMRG) for system sizes larger than or equal to $14 \times 14$ on the triangular lattice. △ Less

Submitted 12 January, 2024; v1 submitted 28 July, 2022; originally announced July 2022.

Comments: 11 pages, 4 figures, 1 table. Corrected typos. Originally published in Machine Learning and the Physical Sciences Workshop (NeurIPS 2021), see: https://ml4physicalsciences.github.io/2021/files/NeurIPS_ML4PS_2021_92.pdf. Our reproducibility code can be found at https://github.com/mhibatallah/RNNWavefunctions

Journal ref: Machine Learning and the Physical Sciences, NeurIPS 2021

arXiv:2207.08189 [pdf, other]

doi 10.1088/2632-2153/acb895

Supplementing Recurrent Neural Networks with Annealing to Solve Combinatorial Optimization Problems

Authors: Shoummo Ahsan Khandoker, Jawaril Munshad Abedin, Mohamed Hibat-Allah

Abstract: Combinatorial optimization problems can be solved by heuristic algorithms such as simulated annealing (SA) which aims to find the optimal solution within a large search space through thermal fluctuations. The algorithm generates new solutions through Markov-chain Monte Carlo techniques. This sampling scheme can result in severe limitations, such as slow convergence and a tendency to stay within th… ▽ More Combinatorial optimization problems can be solved by heuristic algorithms such as simulated annealing (SA) which aims to find the optimal solution within a large search space through thermal fluctuations. The algorithm generates new solutions through Markov-chain Monte Carlo techniques. This sampling scheme can result in severe limitations, such as slow convergence and a tendency to stay within the same local search space at small temperatures. To overcome these shortcomings, we use the variational classical annealing (VCA) framework that combines autoregressive recurrent neural networks (RNNs) with traditional annealing to sample solutions that are uncorrelated. In this paper, we demonstrate the potential of using VCA as an approach to solving real-world optimization problems. We explore VCA's performance in comparison with SA at solving three popular optimization problems: the maximum cut problem (Max-Cut), the nurse scheduling problem (NSP), and the traveling salesman problem (TSP). For all three problems, we find that VCA outperforms SA on average in the asymptotic limit by one or more orders of magnitude in terms of relative error. Interestingly, we reach large system sizes of up to $256$ cities for the TSP. We also conclude that in the best case scenario, VCA can serve as a great alternative when SA fails to find the optimal solution. △ Less

Submitted 26 October, 2023; v1 submitted 17 July, 2022; originally announced July 2022.

Comments: 14 pages, 3 figures, 4 tables. Github code: https://github.com/RNN-VCA-CO/RNN-VCA-CO. Published version

Journal ref: Mach. Learn.: Sci. Technol. 4 015026, Feb 2023

arXiv:2101.10154 [pdf, other]

doi 10.1038/s42256-021-00401-3

Variational Neural Annealing

Authors: Mohamed Hibat-Allah, Estelle M. Inack, Roeland Wiersema, Roger G. Melko, Juan Carrasquilla

Abstract: Many important challenges in science and technology can be cast as optimization problems. When viewed in a statistical physics framework, these can be tackled by simulated annealing, where a gradual cooling procedure helps search for groundstate solutions of a target Hamiltonian. While powerful, simulated annealing is known to have prohibitively slow sampling dynamics when the optimization landsca… ▽ More Many important challenges in science and technology can be cast as optimization problems. When viewed in a statistical physics framework, these can be tackled by simulated annealing, where a gradual cooling procedure helps search for groundstate solutions of a target Hamiltonian. While powerful, simulated annealing is known to have prohibitively slow sampling dynamics when the optimization landscape is rough or glassy. Here we show that by generalizing the target distribution with a parameterized model, an analogous annealing framework based on the variational principle can be used to search for groundstate solutions. Modern autoregressive models such as recurrent neural networks provide ideal parameterizations since they can be exactly sampled without slow dynamics even when the model encodes a rough landscape. We implement this procedure in the classical and quantum settings on several prototypical spin glass Hamiltonians, and find that it significantly outperforms traditional simulated annealing in the asymptotic limit, illustrating the potential power of this yet unexplored route to optimization. △ Less

Submitted 25 January, 2021; originally announced January 2021.

Comments: 19 pages, 9 figures, 1 table

arXiv:2002.02973 [pdf, other]

doi 10.1103/PhysRevResearch.2.023358

Recurrent Neural Network Wave Functions

Authors: Mohamed Hibat-Allah, Martin Ganahl, Lauren E. Hayward, Roger G. Melko, Juan Carrasquilla

Abstract: A core technology that has emerged from the artificial intelligence revolution is the recurrent neural network (RNN). Its unique sequence-based architecture provides a tractable likelihood estimate with stable training paradigms, a combination that has precipitated many spectacular advances in natural language processing and neural machine translation. This architecture also makes a good candidate… ▽ More A core technology that has emerged from the artificial intelligence revolution is the recurrent neural network (RNN). Its unique sequence-based architecture provides a tractable likelihood estimate with stable training paradigms, a combination that has precipitated many spectacular advances in natural language processing and neural machine translation. This architecture also makes a good candidate for a variational wave function, where the RNN parameters are tuned to learn the approximate ground state of a quantum Hamiltonian. In this paper, we demonstrate the ability of RNNs to represent several many-body wave functions, optimizing the variational parameters using a stochastic approach. Among other attractive features of these variational wave functions, their autoregressive nature allows for the efficient calculation of physical estimators by providing independent samples. We demonstrate the effectiveness of RNN wave functions by calculating ground state energies, correlation functions, and entanglement entropies for several quantum spin models of interest to condensed matter physicists in one and two spatial dimensions. △ Less

Submitted 20 June, 2020; v1 submitted 7 February, 2020; originally announced February 2020.

Comments: The GitHub link to the open-source code is fixed

Journal ref: Phys. Rev. Research 2, 023358 (2020)

Showing 1–11 of 11 results for author: Hibat-Allah, M