Skip to main content

Showing 1–2 of 2 results for author: Koroko, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2303.18083  [pdf, other

    cs.LG math.OC

    Analysis and Comparison of Two-Level KFAC Methods for Training Deep Neural Networks

    Authors: Abdoulaye Koroko, Ani Anciaux-Sedrakian, Ibtihel Ben Gharbia, Valérie Garès, Mounir Haddou, Quang Huy Tran

    Abstract: As a second-order method, the Natural Gradient Descent (NGD) has the ability to accelerate training of neural networks. However, due to the prohibitive computational and memory costs of computing and inverting the Fisher Information Matrix (FIM), efficient approximations are necessary to make NGD scalable to Deep Neural Networks (DNNs). Many such approximations have been attempted. The most sophis… ▽ More

    Submitted 3 April, 2023; v1 submitted 31 March, 2023; originally announced March 2023.

    Comments: Under Review

  2. arXiv:2201.10285  [pdf, other

    cs.NE math.OC stat.ML

    Efficient Approximations of the Fisher Matrix in Neural Networks using Kronecker Product Singular Value Decomposition

    Authors: Abdoulaye Koroko, Ani Anciaux-Sedrakian, Ibtihel Ben Gharbia, Valérie Garès, Mounir Haddou, Quang Huy Tran

    Abstract: Several studies have shown the ability of natural gradient descent to minimize the objective function more efficiently than ordinary gradient descent based methods. However, the bottleneck of this approach for training deep neural networks lies in the prohibitive cost of solving a large dense linear system corresponding to the Fisher Information Matrix (FIM) at each iteration. This has motivated v… ▽ More

    Submitted 14 October, 2022; v1 submitted 25 January, 2022; originally announced January 2022.