Search | arXiv e-print repository

Symmetry Breaking in Neural Network Optimization: Insights from Input Dimension Expansion

Authors: Jun-Jie Zhang, Nan Cheng, Fu-Peng Li, Xiu-Cheng Wang, Jian-Nan Chen, Long-Gang Pang, Deyu Meng

Abstract: Understanding the mechanisms behind neural network optimization is crucial for improving network design and performance. While various optimization techniques have been developed, a comprehensive understanding of the underlying principles that govern these techniques remains elusive. Specifically, the role of symmetry breaking, a fundamental concept in physics, has not been fully explored in neural network optimization. This gap in knowledge limits our ability to design networks that are both efficient and effective. Here, we propose the symmetry breaking hypothesis to elucidate the significance of symmetry breaking in enhancing neural network optimization. We demonstrate that a simple input expansion can significantly improve network performance across various tasks, and we show that this improvement can be attributed to the underlying symmetry breaking mechanism. We further develop a metric to quantify the degree of symmetry breaking in neural networks, providing a practical approach to evaluate and guide network design. Our findings confirm that symmetry breaking is a fundamental principle that underpins various optimization techniques, including dropout, batch normalization, and equivariance. By quantifying the degree of symmetry breaking, our work offers a practical technique for performance enhancement and a metric to guide network design without the need for complete datasets and extensive training processes.

Submitted 12 September, 2024; v1 submitted 10 September, 2024; originally announced September 2024.

Comments: 29 pages, 8 figures

arXiv:2403.14920

3d Modularity Revisited

Authors: Miranda C. N. Cheng, Ioana Coman, Piotr Kucharski, Davide Passaro, Gabriele Sgroi

Abstract: The three-manifold topological invariants $\hat Z$ capture the half-index of the three-dimensional theory with ${\cal N}=2$ supersymmetry obtained by compactifying the M5 brane theory on the closed three-manifold. In 2019, surprising general relations between the $\hat Z$-invariants, quantum modular forms, and vertex algebras, have been proposed. In the meanwhile, an extensive array of examples have been studied, but several general important structural questions remain. First, for many three-manifolds it was observed that the different $\hat Z$-invariants for the same three-manifolds are quantum modular forms that span a subspace of a Weil representation for the modular group $SL_2(Z)$, corresponding to the structure of vector-valued quantum modular forms. We elucidate the meaning of this vector-valued quantum modular form structure by first proposing the analogue $\hat Z$-invariants with supersymmetric defects, and subsequently showing that the full vector-valued quantum modular form is precisely the object capturing all the $\hat Z$-invariants, with and without defects, of a given three-manifold. Second, it was expected that matching radial limits is a key feature of $\hat Z$-invariants when changing the orientation of the plumbed three-manifold, suggesting the relevance of mock modularity. We substantiate the conjecture by providing explicit proposals for such $\hat Z$-invariants for an infinite family of three-manifolds and verify their mock modularity and limits. Third, we initiate the study of the vertex algebra structure of the mock type invariants by showcasing a systematic way to construct cone vertex operator algebras associated to these invariants, which can be viewed as the partner of logarithmic vertex operator algebras in this context.

Submitted 25 March, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

Comments: 59 pages, typos corrected

arXiv:2304.03934

doi 10.3842/SIGMA.2024.018

Quantum Modular $\widehat Z{}^G$-Invariants

Authors: Miranda C. N. Cheng, Ioana Coman, Davide Passaro, Gabriele Sgroi

Abstract: We study the quantum modular properties of $\widehat Z{}^G$-invariants of closed three-manifolds. Higher depth quantum modular forms are expected to play a central role for general three-manifolds and gauge groups $G$. In particular, we conjecture that for plumbed three-manifolds whose plumbing graphs have $n$ junction nodes with definite signature and for rank $r$ gauge group $G$, that $\widehat Z{}^G$ is related to a quantum modular form of depth $nr$. We prove this for $G={\rm SU}(3)$ and for an infinite class of three-manifolds (weakly negative Seifert with three exceptional fibers). We also investigate the relation between the quantum modularity of $\widehat Z{}^G$-invariants of the same three-manifold with different gauge group $G$. We conjecture a recursive relation among the iterated Eichler integrals relevant for $\widehat Z{}^G$ with $G={\rm SU}(2)$ and ${\rm SU}(3)$, for negative Seifert manifolds with three exceptional fibers. This is reminiscent of the recursive structure among mock modular forms playing the role of Vafa-Witten invariants for ${\rm SU}(N)$. We prove the conjecture when the three-manifold is moreover an integral homological sphere.

Submitted 9 March, 2024; v1 submitted 8 April, 2023; originally announced April 2023.

Journal ref: SIGMA 20 (2024), 018, 52 pages

arXiv:2206.10482

Random tensor networks with nontrivial links

Authors: Newton Cheng, Cécilia Lancien, Geoff Penington, Michael Walter, Freek Witteveen

Abstract: Random tensor networks are a powerful toy model for understanding the entanglement structure of holographic quantum gravity. However, unlike holographic quantum gravity, their entanglement spectra are flat. It has therefore been argued that a better model consists of random tensor networks with link states that are not maximally entangled, i.e., have nontrivial spectra. In this work, we initiate a systematic study of the entanglement properties of these networks. We employ tools from free probability, random matrix theory, and one-shot quantum information theory to study random tensor networks with bounded and unbounded variation in link spectra, and in cases where a subsystem has one or multiple minimal cuts. If the link states have bounded spectral variation, the limiting entanglement spectrum of a subsystem with two minimal cuts can be expressed as a free product of the entanglement spectra of each cut, along with a Marchenko-Pastur distribution. For a class of states with unbounded spectral variation, analogous to semiclassical states in quantum gravity, we relate the limiting entanglement spectrum of a subsystem with two minimal cuts to the distribution of the minimal entanglement across the two cuts. In doing so, we draw connections to previous work on split transfer protocols, entanglement negativity in random tensor networks, and Euclidean path integrals in quantum gravity.

Submitted 11 August, 2024; v1 submitted 21 June, 2022; originally announced June 2022.

Comments: 85 pages, 7 figures

Journal ref: Annales Henri Poincare, Vol. 25, No. 4 (2024) pp. 2107-2212

arXiv:2203.15208

doi 10.1103/PhysRevLett.129.088002

Band theory and boundary modes of high-dimensional representations of infinite hyperbolic lattices

Authors: Nan Cheng, Francesco Serafin, James McInerney, Zeb Rocklin, Kai Sun, Xiaoming Mao

Abstract: Periodic lattices in hyperbolic space are characterized by symmetries beyond Euclidean crystallographic groups, offering a new platform for classical and quantum waves, demonstrating great potentials for a new class of topological metamaterials. One important feature of hyperbolic lattices is that their translation group is nonabelian, permitting high-dimensional irreducible representations (irreps), in contrast to abelian translation groups in Euclidean lattices. Here we introduce a general framework to construct wave eigenstates of high-dimensional irreps of infinite hyperbolic lattices, thereby generalizing Bloch's theorem, and discuss its implications on unusual mode-counting and degeneracy, as well as bulk-edge correspondence in hyperbolic lattices. We apply this method to a mechanical hyperbolic lattice, and characterize its band structure and zero modes of high-dimensional irreps.

Submitted 28 March, 2022; originally announced March 2022.

Comments: 10 pages, 4 figures

arXiv:2009.00186

doi 10.1093/ptep/ptab095

Vertex operator superalgebra/sigma model correspondences: The four-torus case

Authors: Vassilis Anagiannis, Miranda C. N. Cheng, John Duncan, Roberto Volpato

Abstract: We propose a correspondence between vertex operator superalgebras and families of sigma models in which the two structures are related by symmetry properties and a certain reflection procedure. The existence of such a correspondence is motivated by previous work on N=(4,4) supersymmetric non-linear sigma models on K3 surfaces and on a vertex operator superalgebra with Conway group symmetry. Here we present an example of the correspondence for N=(4,4) supersymmetric non-linear sigma models on four-tori, and compare it to the K3 case.

Submitted 31 August, 2020; originally announced September 2020.

Comments: 31 pages including three appendices

Journal ref: Prog Theor Exp Phys (2021)

arXiv:1912.07997

doi 10.1098/rsta.2018.0439

Three-Manifold Quantum Invariants and Mock Theta Functions

Authors: Miranda C. N. Cheng, Francesca Ferrari, Gabriele Sgroi

Abstract: Mock modular forms have found applications in numerous branches of mathematical sciences since they were first introduced by Ramanujan nearly a century ago. In this proceeding we highlight a new area where mock modular forms start to play an important role, namely the study of three-manifold invariants. For a certain class of Seifert three-manifolds, we describe a conjecture on the mock modular properties of a recently proposed quantum invariant. As an illustration, we include concrete computations for a specific three-manifold, the Brieskorn sphere $Σ(2,3,7)$. This note is partially based on the talk by the first author in the conference "Srinivasa Ramanujan: in celebration of the centenary of his election as FRS" held at the Royal Society in 2018.

Submitted 16 March, 2020; v1 submitted 17 December, 2019; originally announced December 2019.

Comments: 19 pages

Journal ref: Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences. Volume 378, Issue 2163 (2019)

arXiv:0806.2337

doi 10.3842/SIGMA.2008.068

Wall Crossing, Discrete Attractor Flow and Borcherds Algebra

Authors: Miranda C. N. Cheng, Erik P. Verlinde

Abstract: The appearance of a generalized (or Borcherds-) Kac-Moody algebra in the spectrum of BPS dyons in N=4, d=4 string theory is elucidated. From the low-energy supergravity analysis, we identify its root lattice as the lattice of the T-duality invariants of the dyonic charges, the symmetry group of the root system as the extended S-duality group PGL(2,Z) of the theory, and the walls of Weyl chambers as the walls of marginal stability for the relevant two-centered solutions. This leads to an interpretation for the Weyl group as the group of wall-crossing, or the group of discrete attractor flows. Furthermore we propose an equivalence between a "second-quantized multiplicity" of a charge- and moduli-dependent highest weight vector and the dyon degeneracy, and show that the wall-crossing formula following from our proposal agrees with the wall-crossing formula obtained from the supergravity analysis. This can be thought of as providing a microscopic derivation of the wall-crossing formula of this theory.

Submitted 7 October, 2008; v1 submitted 16 June, 2008; originally announced June 2008.

Comments: This is a contribution to the Special Issue on Kac-Moody Algebras and Applications, published in SIGMA (Symmetry, Integrability and Geometry: Methods and Applications) at http://www.emis.de/journals/SIGMA/

Report number: ITFA-2008-20

Journal ref: SIGMA 4:068,2008

Showing 1–8 of 8 results for author: Cheng, N