Skip to main content

Showing 1–2 of 2 results for author: Thierry-Mieg, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:1812.07538  [pdf, ps, other

    cs.LG stat.ML

    XOR_p A maximally intertwined p-classes problem used as a benchmark with built-in truth for neural networks gradient descent optimization

    Authors: Danielle Thierry-Mieg, Jean Thierry-Mieg

    Abstract: A natural p-classes generalization of the eXclusive OR problem, the subtraction modulo p, where p is prime, is presented and solved using a single fully connected hidden layer with p-neurons. Although the problem is very simple, the landscape is intricate and challenging and represents an interesting benchmark for gradient descent optimization algorithms. Testing 9 optimizers and 9 activation func… ▽ More

    Submitted 18 December, 2018; originally announced December 2018.

    Comments: 16 pages, 4 figures, 3 tables. The source code is public

  2. arXiv:1811.00576  [pdf, ps, other

    cs.LG hep-th stat.ML

    Connections between physics, mathematics and deep learning

    Authors: Jean Thierry-Mieg

    Abstract: Starting from the Fermat's principle of least action, which governs classical and quantum mechanics and from the theory of exterior differential forms, which governs the geometry of curved manifolds, we show how to derive the equations governing neural networks in an intrinsic, coordinate invariant way, where the loss function plays the role of the Hamiltonian. To be covariant, these equations imp… ▽ More

    Submitted 25 August, 2019; v1 submitted 1 November, 2018; originally announced November 2018.

    Comments: Version 1 and 2 title was: How the fundamental concepts of mathematics and physics explain deep learning. Version 3 with the new title is accepted in LHEP. It is enriched by a new chapter on the Bayesian Information criterion seen as an application of renormalisation theory. 19 pages, 22 references, no figure