Skip to main content

Showing 1–3 of 3 results for author: Rásonyi, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2210.02092  [pdf, ps, other

    math.PR cs.LG math.OC

    Functional Central Limit Theorem and Strong Law of Large Numbers for Stochastic Gradient Langevin Dynamics

    Authors: Attila Lovas, Miklós Rásonyi

    Abstract: We study the mixing properties of an important optimization algorithm of machine learning: the stochastic gradient Langevin dynamics (SGLD) with a fixed step size. The data stream is not assumed to be independent hence the SGLD is not a Markov chain, merely a \emph{Markov chain in a random environment}, which complicates the mathematical treatment considerably. We derive a strong law of large numb… ▽ More

    Submitted 29 July, 2023; v1 submitted 5 October, 2022; originally announced October 2022.

    Comments: 16 pages

    MSC Class: 60J05; 60J20

  2. arXiv:2006.14514  [pdf, other

    cs.LG math.OC math.PR stat.ML

    Taming neural networks with TUSLA: Non-convex learning via adaptive stochastic gradient Langevin algorithms

    Authors: Attila Lovas, Iosif Lytras, Miklós Rásonyi, Sotirios Sabanis

    Abstract: Artificial neural networks (ANNs) are typically highly nonlinear systems which are finely tuned via the optimization of their associated, non-convex loss functions. In many cases, the gradient of any such loss function has superlinear growth, making the use of the widely-accepted (stochastic) gradient descent methods, which are based on Euler numerical schemes, problematic. We offer a new learning… ▽ More

    Submitted 15 January, 2023; v1 submitted 25 June, 2020; originally announced June 2020.

  3. arXiv:1903.10328  [pdf, ps, other

    stat.ML cs.LG

    Stochastic Gradient Hamiltonian Monte Carlo for Non-Convex Learning

    Authors: Huy N. Chau, Miklos Rasonyi

    Abstract: Stochastic Gradient Hamiltonian Monte Carlo (SGHMC) is a momentum version of stochastic gradient descent with properly injected Gaussian noise to find a global minimum. In this paper, non-asymptotic convergence analysis of SGHMC is given in the context of non-convex optimization, where subsampling techniques are used over an i.i.d dataset for gradient updates. Our results complement those of [RRT1… ▽ More

    Submitted 25 February, 2020; v1 submitted 25 March, 2019; originally announced March 2019.