-
arXiv:1812.07538
XOR_p A maximally intertwined p-classes problem used as a benchmark with built-in truth for neural networks gradient descent optimization
Abstract: A natural p-classes generalization of the eXclusive OR problem, the subtraction modulo p, where p is prime, is presented and solved using a single fully connected hidden layer with p-neurons. Although the problem is very simple, the landscape is intricate and challenging and represents an interesting benchmark for gradient descent optimization algorithms. Testing 9 optimizers and 9 activation func…
Submitted 18 December, 2018; originally announced December 2018.
Comments: 16 pages, 4 figures, 3 tables. The source code is public
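The benchmark task described in the abstract above, subtraction modulo p, can be sketched in a few lines. This is only an illustration of the problem statement, not the paper's public source code: the function names `xor_p_dataset` and `one_hot` are hypothetical, and the one-hot encoding is an assumption, since the listing does not say how inputs are encoded.

```python
from itertools import product


def xor_p_dataset(p):
    """Return every pair (a, b) in Z_p x Z_p with its class label (a - b) mod p."""
    return [((a, b), (a - b) % p) for a, b in product(range(p), repeat=2)]


def one_hot(k, p):
    """Encode the integer k in [0, p) as a length-p indicator list (assumed encoding)."""
    return [1 if i == k else 0 for i in range(p)]


if __name__ == "__main__":
    p = 3  # any prime; p = 2 reproduces the ordinary XOR truth table
    for (a, b), label in xor_p_dataset(p):
        # Concatenate the two one-hot inputs into a single 2p-dimensional vector.
        print(one_hot(a, p) + one_hot(b, p), "->", label)
```

For p = 2 the labels coincide with XOR, and for any p each of the p classes appears exactly p times among the p² input pairs, so the classes are balanced.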
-
arXiv:1811.00576
Connections between physics, mathematics and deep learning
Abstract: Starting from the Fermat's principle of least action, which governs classical and quantum mechanics and from the theory of exterior differential forms, which governs the geometry of curved manifolds, we show how to derive the equations governing neural networks in an intrinsic, coordinate invariant way, where the loss function plays the role of the Hamiltonian. To be covariant, these equations imp…
Submitted 25 August, 2019; v1 submitted 1 November, 2018; originally announced November 2018.
Comments: The title of versions 1 and 2 was: How the fundamental concepts of mathematics and physics explain deep learning. Version 3, with the new title, is accepted in LHEP. It is enriched by a new chapter on the Bayesian information criterion seen as an application of renormalisation theory. 19 pages, 22 references, no figures