-
Symmetry Breaking in Neural Network Optimization: Insights from Input Dimension Expansion
Authors:
Jun-Jie Zhang,
Nan Cheng,
Fu-Peng Li,
Xiu-Cheng Wang,
Jian-Nan Chen,
Long-Gang Pang,
Deyu Meng
Abstract:
Understanding the mechanisms behind neural network optimization is crucial for improving network design and performance. While various optimization techniques have been developed, a comprehensive understanding of the underlying principles that govern these techniques remains elusive. Specifically, the role of symmetry breaking, a fundamental concept in physics, has not been fully explored in neura…
▽ More
Understanding the mechanisms behind neural network optimization is crucial for improving network design and performance. While various optimization techniques have been developed, a comprehensive understanding of the underlying principles that govern these techniques remains elusive. Specifically, the role of symmetry breaking, a fundamental concept in physics, has not been fully explored in neural network optimization. This gap in knowledge limits our ability to design networks that are both efficient and effective. Here, we propose the symmetry breaking hypothesis to elucidate the significance of symmetry breaking in enhancing neural network optimization. We demonstrate that a simple input expansion can significantly improve network performance across various tasks, and we show that this improvement can be attributed to the underlying symmetry breaking mechanism. We further develop a metric to quantify the degree of symmetry breaking in neural networks, providing a practical approach to evaluate and guide network design. Our findings confirm that symmetry breaking is a fundamental principle that underpins various optimization techniques, including dropout, batch normalization, and equivariance. By quantifying the degree of symmetry breaking, our work offers a practical technique for performance enhancement and a metric to guide network design without the need for complete datasets and extensive training processes.
△ Less
Submitted 12 September, 2024; v1 submitted 10 September, 2024;
originally announced September 2024.
-
3d Modularity Revisited
Authors:
Miranda C. N. Cheng,
Ioana Coman,
Piotr Kucharski,
Davide Passaro,
Gabriele Sgroi
Abstract:
The three-manifold topological invariants $\hat Z$ capture the half-index of the three-dimensional theory with ${\cal N}=2$ supersymmetry obtained by compactifying the M5 brane theory on the closed three-manifold. In 2019, surprising general relations between the $\hat Z$-invariants, quantum modular forms, and vertex algebras, have been proposed. In the meanwhile, an extensive array of examples ha…
▽ More
The three-manifold topological invariants $\hat Z$ capture the half-index of the three-dimensional theory with ${\cal N}=2$ supersymmetry obtained by compactifying the M5 brane theory on the closed three-manifold. In 2019, surprising general relations between the $\hat Z$-invariants, quantum modular forms, and vertex algebras, have been proposed. In the meanwhile, an extensive array of examples have been studied, but several general important structural questions remain. First, for many three-manifolds it was observed that the different $\hat Z$-invariants for the same three-manifolds are quantum modular forms that span a subspace of a Weil representation for the modular group $SL_2(Z)$, corresponding to the structure of vector-valued quantum modular forms. We elucidate the meaning of this vector-valued quantum modular form structure by first proposing the analogue $\hat Z$-invariants with supersymmetric defects, and subsequently showing that the full vector-valued quantum modular form is precisely the object capturing all the $\hat Z$-invariants, with and without defects, of a given three-manifold. Second, it was expected that matching radial limits is a key feature of $\hat Z$-invariants when changing the orientation of the plumbed three-manifold, suggesting the relevance of mock modularity. We substantiate the conjecture by providing explicit proposals for such $\hat Z$-invariants for an infinite family of three-manifolds and verify their mock modularity and limits. Third, we initiate the study of the vertex algebra structure of the mock type invariants by showcasing a systematic way to construct cone vertex operator algebras associated to these invariants, which can be viewed as the partner of logarithmic vertex operator algebras in this context.
△ Less
Submitted 25 March, 2024; v1 submitted 21 March, 2024;
originally announced March 2024.
-
Quantum Modular $\widehat Z{}^G$-Invariants
Authors:
Miranda C. N. Cheng,
Ioana Coman,
Davide Passaro,
Gabriele Sgroi
Abstract:
We study the quantum modular properties of $\widehat Z{}^G$-invariants of closed three-manifolds. Higher depth quantum modular forms are expected to play a central role for general three-manifolds and gauge groups $G$. In particular, we conjecture that for plumbed three-manifolds whose plumbing graphs have $n$ junction nodes with definite signature and for rank $r$ gauge group $G$, that…
▽ More
We study the quantum modular properties of $\widehat Z{}^G$-invariants of closed three-manifolds. Higher depth quantum modular forms are expected to play a central role for general three-manifolds and gauge groups $G$. In particular, we conjecture that for plumbed three-manifolds whose plumbing graphs have $n$ junction nodes with definite signature and for rank $r$ gauge group $G$, that $\widehat Z{}^G$ is related to a quantum modular form of depth $nr$. We prove this for $G={\rm SU}(3)$ and for an infinite class of three-manifolds (weakly negative Seifert with three exceptional fibers). We also investigate the relation between the quantum modularity of $\widehat Z{}^G$-invariants of the same three-manifold with different gauge group $G$. We conjecture a recursive relation among the iterated Eichler integrals relevant for $\widehat Z{}^G$ with $G={\rm SU}(2)$ and ${\rm SU}(3)$, for negative Seifert manifolds with three exceptional fibers. This is reminiscent of the recursive structure among mock modular forms playing the role of Vafa-Witten invariants for ${\rm SU}(N)$. We prove the conjecture when the three-manifold is moreover an integral homological sphere.
△ Less
Submitted 9 March, 2024; v1 submitted 8 April, 2023;
originally announced April 2023.
-
Random tensor networks with nontrivial links
Authors:
Newton Cheng,
Cécilia Lancien,
Geoff Penington,
Michael Walter,
Freek Witteveen
Abstract:
Random tensor networks are a powerful toy model for understanding the entanglement structure of holographic quantum gravity. However, unlike holographic quantum gravity, their entanglement spectra are flat. It has therefore been argued that a better model consists of random tensor networks with link states that are not maximally entangled, i.e., have nontrivial spectra. In this work, we initiate a…
▽ More
Random tensor networks are a powerful toy model for understanding the entanglement structure of holographic quantum gravity. However, unlike holographic quantum gravity, their entanglement spectra are flat. It has therefore been argued that a better model consists of random tensor networks with link states that are not maximally entangled, i.e., have nontrivial spectra. In this work, we initiate a systematic study of the entanglement properties of these networks. We employ tools from free probability, random matrix theory, and one-shot quantum information theory to study random tensor networks with bounded and unbounded variation in link spectra, and in cases where a subsystem has one or multiple minimal cuts. If the link states have bounded spectral variation, the limiting entanglement spectrum of a subsystem with two minimal cuts can be expressed as a free product of the entanglement spectra of each cut, along with a Marchenko-Pastur distribution. For a class of states with unbounded spectral variation, analogous to semiclassical states in quantum gravity, we relate the limiting entanglement spectrum of a subsystem with two minimal cuts to the distribution of the minimal entanglement across the two cuts. In doing so, we draw connections to previous work on split transfer protocols, entanglement negativity in random tensor networks, and Euclidean path integrals in quantum gravity.
△ Less
Submitted 11 August, 2024; v1 submitted 21 June, 2022;
originally announced June 2022.
-
Band theory and boundary modes of high-dimensional representations of infinite hyperbolic lattices
Authors:
Nan Cheng,
Francesco Serafin,
James McInerney,
Zeb Rocklin,
Kai Sun,
Xiaoming Mao
Abstract:
Periodic lattices in hyperbolic space are characterized by symmetries beyond Euclidean crystallographic groups, offering a new platform for classical and quantum waves, demonstrating great potentials for a new class of topological metamaterials. One important feature of hyperbolic lattices is that their translation group is nonabelian, permitting high-dimensional irreducible representations (irrep…
▽ More
Periodic lattices in hyperbolic space are characterized by symmetries beyond Euclidean crystallographic groups, offering a new platform for classical and quantum waves, demonstrating great potentials for a new class of topological metamaterials. One important feature of hyperbolic lattices is that their translation group is nonabelian, permitting high-dimensional irreducible representations (irreps), in contrast to abelian translation groups in Euclidean lattices. Here we introduce a general framework to construct wave eigenstates of high-dimensional irreps of infinite hyperbolic lattices, thereby generalizing Bloch's theorem, and discuss its implications on unusual mode-counting and degeneracy, as well as bulk-edge correspondence in hyperbolic lattices. We apply this method to a mechanical hyperbolic lattice, and characterize its band structure and zero modes of high-dimensional irreps.
△ Less
Submitted 28 March, 2022;
originally announced March 2022.
-
Vertex operator superalgebra/sigma model correspondences: The four-torus case
Authors:
Vassilis Anagiannis,
Miranda C. N. Cheng,
John Duncan,
Roberto Volpato
Abstract:
We propose a correspondence between vertex operator superalgebras and families of sigma models in which the two structures are related by symmetry properties and a certain reflection procedure. The existence of such a correspondence is motivated by previous work on N=(4,4) supersymmetric non-linear sigma models on K3 surfaces and on a vertex operator superalgebra with Conway group symmetry. Here w…
▽ More
We propose a correspondence between vertex operator superalgebras and families of sigma models in which the two structures are related by symmetry properties and a certain reflection procedure. The existence of such a correspondence is motivated by previous work on N=(4,4) supersymmetric non-linear sigma models on K3 surfaces and on a vertex operator superalgebra with Conway group symmetry. Here we present an example of the correspondence for N=(4,4) supersymmetric non-linear sigma models on four-tori, and compare it to the K3 case.
△ Less
Submitted 31 August, 2020;
originally announced September 2020.
-
Three-Manifold Quantum Invariants and Mock Theta Functions
Authors:
Miranda C. N. Cheng,
Francesca Ferrari,
Gabriele Sgroi
Abstract:
Mock modular forms have found applications in numerous branches of mathematical sciences since they were first introduced by Ramanujan nearly a century ago. In this proceeding we highlight a new area where mock modular forms start to play an important role, namely the study of three-manifold invariants. For a certain class of Seifert three-manifolds, we describe a conjecture on the mock modular pr…
▽ More
Mock modular forms have found applications in numerous branches of mathematical sciences since they were first introduced by Ramanujan nearly a century ago. In this proceeding we highlight a new area where mock modular forms start to play an important role, namely the study of three-manifold invariants. For a certain class of Seifert three-manifolds, we describe a conjecture on the mock modular properties of a recently proposed quantum invariant. As an illustration, we include concrete computations for a specific three-manifold, the Brieskorn sphere $Σ(2,3,7)$. This note is partially based on the talk by the first author in the conference "Srinivasa Ramanujan: in celebration of the centenary of his election as FRS" held at the Royal Society in 2018.
△ Less
Submitted 16 March, 2020; v1 submitted 17 December, 2019;
originally announced December 2019.
-
Wall Crossing, Discrete Attractor Flow and Borcherds Algebra
Authors:
Miranda C. N. Cheng,
Erik P. Verlinde
Abstract:
The appearance of a generalized (or Borcherds-) Kac-Moody algebra in the spectrum of BPS dyons in N=4, d=4 string theory is elucidated. From the low-energy supergravity analysis, we identify its root lattice as the lattice of the T-duality invariants of the dyonic charges, the symmetry group of the root system as the extended S-duality group PGL(2,Z) of the theory, and the walls of Weyl chambers…
▽ More
The appearance of a generalized (or Borcherds-) Kac-Moody algebra in the spectrum of BPS dyons in N=4, d=4 string theory is elucidated. From the low-energy supergravity analysis, we identify its root lattice as the lattice of the T-duality invariants of the dyonic charges, the symmetry group of the root system as the extended S-duality group PGL(2,Z) of the theory, and the walls of Weyl chambers as the walls of marginal stability for the relevant two-centered solutions. This leads to an interpretation for the Weyl group as the group of wall-crossing, or the group of discrete attractor flows. Furthermore we propose an equivalence between a "second-quantized multiplicity" of a charge- and moduli-dependent highest weight vector and the dyon degeneracy, and show that the wall-crossing formula following from our proposal agrees with the wall-crossing formula obtained from the supergravity analysis. This can be thought of as providing a microscopic derivation of the wall-crossing formula of this theory.
△ Less
Submitted 7 October, 2008; v1 submitted 16 June, 2008;
originally announced June 2008.