-
Asymptotics of Wide Convolutional Neural Networks
Authors:
Anders Andreassen,
Ethan Dyer
Abstract:
Wide neural networks have proven to be a rich class of architectures for both theory and practice. Motivated by the observation that finite width convolutional networks appear to outperform infinite width networks, we study scaling laws for wide CNNs and networks with skip connections. Following the approach of (Dyer & Gur-Ari, 2019), we present a simple diagrammatic recipe to derive the asymptoti…
▽ More
Wide neural networks have proven to be a rich class of architectures for both theory and practice. Motivated by the observation that finite width convolutional networks appear to outperform infinite width networks, we study scaling laws for wide CNNs and networks with skip connections. Following the approach of (Dyer & Gur-Ari, 2019), we present a simple diagrammatic recipe to derive the asymptotic width dependence for many quantities of interest. These scaling relationships provide a solvable description for the training dynamics of wide convolutional networks. We test these relations across a broad range of architectures. In particular, we find that the difference in performance between finite and infinite width models vanishes at a definite rate with respect to model width. Nonetheless, this relation is consistent with finite width models generalizing either better or worse than their infinite width counterparts, and we provide examples where the relative performance depends on the optimization details.
△ Less
Submitted 19 August, 2020;
originally announced August 2020.
-
Scale Invariant Instantons and the Complete Lifetime of the Standard Model
Authors:
Anders Andreassen,
William Frost,
Matthew D. Schwartz
Abstract:
In a classically scale-invariant quantum field theory, tunneling rates are infrared divergent due to the existence of instantons of any size. While one expects such divergences to be resolved by quantum effects, it has been unclear how higher-loop corrections can resolve a problem appearing already at one loop. With a careful power counting, we uncover a series of loop contributions that dominate…
▽ More
In a classically scale-invariant quantum field theory, tunneling rates are infrared divergent due to the existence of instantons of any size. While one expects such divergences to be resolved by quantum effects, it has been unclear how higher-loop corrections can resolve a problem appearing already at one loop. With a careful power counting, we uncover a series of loop contributions that dominate over the one-loop result and sum all the necessary terms. We also clarify previously incomplete treatments of related issues pertaining to global symmetries, gauge fixing and finite mass effects. In addition, we produce exact closed-form solutions for the functional determinants over scalars, fermions and vector bosons around the scale-invariant bounce, demonstrating manifest gauge invariance in the vector case.
With these problems solved, we produce the first complete calculation of the lifetime of our universe: 10^139 years. With 95% confidence, we expect our universe to last more than 10^58 years. The uncertainty is part experimental uncertainty on the top quark mass and on $αs$ and part theory uncertainty from electroweak threshold corrections. Using our complete result, we provide phase diagrams in the $mt/mh$ and the $mt/αs$ planes, with uncertainty bands. To rule out absolute stability to $3σ$ confidence, the uncertainty on the top quark pole mass would have to be pushed below 250 MeV or the uncertainty on $αs(mZ)$ pushed below 0.00025.
△ Less
Submitted 2 May, 2018; v1 submitted 25 July, 2017;
originally announced July 2017.
-
Precision decay rate calculations in quantum field theory
Authors:
Anders Andreassen,
David Farhi,
William Frost,
Matthew D. Schwartz
Abstract:
Tunneling in quantum field theory is worth understanding properly, not least because it controls the long term fate of our universe. There are however, a number of features of tunneling rate calculations which lack a desirable transparency, such as the necessity of analytic continuation, the appropriateness of using an effective instead of classical potential, and the sensitivity to short-distance…
▽ More
Tunneling in quantum field theory is worth understanding properly, not least because it controls the long term fate of our universe. There are however, a number of features of tunneling rate calculations which lack a desirable transparency, such as the necessity of analytic continuation, the appropriateness of using an effective instead of classical potential, and the sensitivity to short-distance physics. This paper attempts to review in pedagogical detail the physical origin of tunneling and its connection to the path integral. Both the traditional potential-deformation method and a recent more direct propagator-based method are discussed. Some new insights from using approximate semi-classical solutions are presented. In addition, we explore the sensitivity of the lifetime of our universe to short distance physics, such as quantum gravity, emphasizing a number of important subtleties.
△ Less
Submitted 31 August, 2017; v1 submitted 20 April, 2016;
originally announced April 2016.
-
A direct approach to quantum tunneling
Authors:
Anders Andreassen,
David Farhi,
William Frost,
Matthew D. Schwartz
Abstract:
The decay rates of quasistable states in quantum field theories are usually calculated using instanton methods. Standard derivations of these methods rely in a crucial way upon deformations and analytic continuations of the physical potential, and on the saddle point approximation. While the resulting procedure can be checked against other semi-classical approaches in some one-dimensional cases, i…
▽ More
The decay rates of quasistable states in quantum field theories are usually calculated using instanton methods. Standard derivations of these methods rely in a crucial way upon deformations and analytic continuations of the physical potential, and on the saddle point approximation. While the resulting procedure can be checked against other semi-classical approaches in some one-dimensional cases, it is challenging to trace the role of the relevant physical scales, and any intuitive handle on the precision of the approximations involved are at best obscure. In this paper, we use a physical definition of the tunneling probability to derive a formula for the decay rate in both quantum mechanics and quantum field theory directly from the Minkowski path integral, without reference to unphysical deformations of the potential. There are numerous benefits to this approach, from non-perturbative applications to precision calculations and aesthetic simplicity.
△ Less
Submitted 2 February, 2016;
originally announced February 2016.