Skip to main content

Showing 1–4 of 4 results for author: Cont, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2212.08199  [pdf, other

    cs.LG

    Asymptotic Analysis of Deep Residual Networks

    Authors: Rama Cont, Alain Rossier, Renyuan Xu

    Abstract: We investigate the asymptotic properties of deep Residual networks (ResNets) as the number of layers increases. We first show the existence of scaling regimes for trained weights markedly different from those implicitly assumed in the neural ODE literature. We study the convergence of the hidden state dynamics in these scaling regimes, showing that one may obtain an ODE, a stochastic differential… ▽ More

    Submitted 25 January, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

    Comments: 49 pages, 12 figures. arXiv admin note: substantial text overlap with arXiv:2105.12245

    MSC Class: 60F17; 60F25; 68T05

  2. arXiv:2204.07261  [pdf, other

    cs.LG math.OC

    Convergence and Implicit Regularization Properties of Gradient Descent for Deep Residual Networks

    Authors: Rama Cont, Alain Rossier, RenYuan Xu

    Abstract: We prove linear convergence of gradient descent to a global optimum for the training of deep residual networks with constant layer width and smooth activation function. We show that if the trained weights, as a function of the layer index, admit a scaling limit as the depth increases, then the limit has finite $p-$variation with $p=2$. Proofs are based on non-asymptotic estimates for the loss func… ▽ More

    Submitted 25 January, 2023; v1 submitted 14 April, 2022; originally announced April 2022.

    MSC Class: 65Kxx; 62M45; 68Q32; 68Txx

  3. arXiv:2105.12245  [pdf, other

    cs.LG cs.NE math.NA stat.ML

    Scaling Properties of Deep Residual Networks

    Authors: Alain-Sam Cohen, Rama Cont, Alain Rossier, Renyuan Xu

    Abstract: Residual networks (ResNets) have displayed impressive results in pattern recognition and, recently, have garnered considerable theoretical interest due to a perceived link with neural ordinary differential equations (neural ODEs). This link relies on the convergence of network weights to a smooth function as the number of layers increases. We investigate the properties of weights trained by stocha… ▽ More

    Submitted 10 June, 2021; v1 submitted 25 May, 2021; originally announced May 2021.

    Comments: Published at ICML 2021

  4. arXiv:1501.00014  [pdf, ps, other

    cs.DS math.OC

    Optimal rounding under integer constraints

    Authors: Rama Cont, Massoud Heidari

    Abstract: Given real numbers whose sum is an integer, we study the problem of finding integers which match these real numbers as closely as possible, in the sense of L^p norm, while preserving the sum. We describe the structure of solutions for this integer optimization problem and propose an algorithm with complexity O(N log N) for solving it. In contrast to fractional rounding and randomized rounding, whi… ▽ More

    Submitted 30 December, 2014; originally announced January 2015.

    MSC Class: 90C10; 90C27 ACM Class: G.1.6