Skip to main content

Showing 1–12 of 12 results for author: Amari, S

Searching in archive cond-mat. Search in all archives.
.
  1. arXiv:1910.05992  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG

    Pathological spectra of the Fisher information metric and its variants in deep neural networks

    Authors: Ryo Karakida, Shotaro Akaho, Shun-ichi Amari

    Abstract: The Fisher information matrix (FIM) plays an essential role in statistics and machine learning as a Riemannian metric tensor or a component of the Hessian matrix of loss functions. Focusing on the FIM and its variants in deep neural networks (DNNs), we reveal their characteristic scale dependence on the network width, depth and sample size when the network has random weights and is sufficiently wi… ▽ More

    Submitted 27 September, 2020; v1 submitted 14 October, 2019; originally announced October 2019.

    Comments: 23 pages, 7 figures; v2: minor improvements, Section 3.4 added

  2. arXiv:1906.02926  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG

    The Normalization Method for Alleviating Pathological Sharpness in Wide Neural Networks

    Authors: Ryo Karakida, Shotaro Akaho, Shun-ichi Amari

    Abstract: Normalization methods play an important role in enhancing the performance of deep learning while their theoretical understandings have been limited. To theoretically elucidate the effectiveness of normalization, we quantify the geometry of the parameter space determined by the Fisher information matrix (FIM), which also corresponds to the local shape of the loss landscape under certain conditions.… ▽ More

    Submitted 28 October, 2019; v1 submitted 7 June, 2019; originally announced June 2019.

    Comments: To appear in NeurIPS 2019

  3. Unified framework for the entropy production and the stochastic interaction based on information geometry

    Authors: Sosuke Ito, Masafumi Oizumi, Shun-ichi Amari

    Abstract: We show a relationship between the entropy production in stochastic thermodynamics and the stochastic interaction in the information integrated theory. To clarify this relationship, we newly introduce an information geometric interpretation of the entropy production for a total system and the partial entropy productions for subsystems. We show that the violation of the additivity of the entropy pr… ▽ More

    Submitted 6 April, 2020; v1 submitted 22 October, 2018; originally announced October 2018.

    Comments: 13pages, 4 figures

    Report number: Phys. Rev. Research 2, 033048 (2020)

    Journal ref: Phys. Rev. Research 2, 033048 (2020)

  4. arXiv:1808.07172  [pdf, ps, other

    cs.LG cond-mat.dis-nn stat.ML

    Fisher Information and Natural Gradient Learning of Random Deep Networks

    Authors: Shun-ichi Amari, Ryo Karakida, Masafumi Oizumi

    Abstract: A deep neural network is a hierarchical nonlinear model transforming input signals to output signals. Its input-output relation is considered to be stochastic, being described for a given input by a parameterized conditional probability distribution of outputs. The space of parameters consisting of weights and biases is a Riemannian manifold, where the metric is defined by the Fisher information m… ▽ More

    Submitted 21 August, 2018; originally announced August 2018.

    Comments: 22 pages, 2 figures

  5. arXiv:1808.07169  [pdf, ps, other

    cond-mat.dis-nn cs.LG stat.ML

    Statistical Neurodynamics of Deep Networks: Geometry of Signal Spaces

    Authors: Shun-ichi Amari, Ryo Karakida, Masafumi Oizumi

    Abstract: Statistical neurodynamics studies macroscopic behaviors of randomly connected neural networks. We consider a deep layered feedforward network where input signals are processed layer by layer. The manifold of input signals is embedded in a higher dimensional manifold of the next layer as a curved submanifold, provided the number of neurons is larger than that of inputs. We show geometrical features… ▽ More

    Submitted 21 August, 2018; originally announced August 2018.

    Comments: 23 pages, 8 figures

  6. arXiv:1806.01316  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG

    Universal Statistics of Fisher Information in Deep Neural Networks: Mean Field Approach

    Authors: Ryo Karakida, Shotaro Akaho, Shun-ichi Amari

    Abstract: The Fisher information matrix (FIM) is a fundamental quantity to represent the characteristics of a stochastic model, including deep neural networks (DNNs). The present study reveals novel statistics of FIM that are universal among a wide class of DNNs. To this end, we use random weights and large width limits, which enables us to utilize mean field theories. We investigate the asymptotic statisti… ▽ More

    Submitted 8 October, 2019; v1 submitted 4 June, 2018; originally announced June 2018.

    Comments: Accepted at AISTATS2019. Main text: 10 pages, 2 figures. Supplementary material: 9 pages, 2 figures, typos corrected

  7. arXiv:1502.00127  [pdf, ps, other

    cond-mat.dis-nn

    Spontaneous Motion on Two-dimensional Continuous Attractors

    Authors: C. C. Alan Fung, S. -I. Amari

    Abstract: Attractor models are simplified models used to describe the dynamics of firing rate profiles of a pool of neurons. The firing rate profile, or the neuronal activity, is thought to carry information. Continuous attractor neural networks (CANNs) describe the neural processing of continuous information such as object position, object orientation and direction of object motion. Recently, it was found… ▽ More

    Submitted 31 January, 2015; originally announced February 2015.

    Comments: 58 pages, 11 figures

    Journal ref: Neural Computation 27 (3) 2015

  8. arXiv:1202.6526  [pdf, ps, other

    cond-mat.dis-nn nlin.AO

    State Concentration Exponent as a Measure of Quickness in Kauffman-type Networks

    Authors: Shun-ichi Amari, Hiroyasu Ando, Taro Toyoizumi, Naoki Masuda

    Abstract: We study the dynamics of randomly connected networks composed of binary Boolean elements and those composed of binary majority vote elements. We elucidate their differences in both sparsely and densely connected cases. The quickness of large network dynamics is usually quantified by the length of transient paths, an analytically intractable measure. For discrete-time dynamics of networks of binary… ▽ More

    Submitted 4 March, 2013; v1 submitted 29 February, 2012; originally announced February 2012.

    Comments: 6 figures

    Journal ref: Physical Review E, 87, 022814 (2013)

  9. arXiv:1010.4965  [pdf, ps, other

    cond-mat.stat-mech cs.IT math.DG

    Dually flat structure with escort probability and its application to alpha-Voronoi diagrams

    Authors: Atsumi Ohara, Hiroshi Matsuzoe, Shun-ichi Amari

    Abstract: This paper studies geometrical structure of the manifold of escort probability distributions and shows its new applicability to information science. In order to realize escort probabilities we use a conformal transformation that flattens so-called alpha-geometry of the space of discrete probability distributions, which well characterizes nonadditive statistics on the space. As a result escort prob… ▽ More

    Submitted 24 October, 2010; originally announced October 2010.

    Comments: Several results in this paper can be found in the conference paper [36] without complete proofs

  10. Efficiency of Energy Transduction in a Molecular Chemical Engine

    Authors: Kazuo Sasaki, Ryo Kanada, Satoshi Amari

    Abstract: A simple model of the two-state ratchet type is proposed for molecular chemical engines that convert chemical free energy into mechanical work and vice versa. The engine works by catalyzing a chemical reaction and turning a rotor. Analytical expressions are obtained for the dependences of rotation and reaction rates on the concentrations of reactant and product molecules, from which the performa… ▽ More

    Submitted 28 December, 2006; v1 submitted 19 July, 2006; originally announced July 2006.

    Comments: 4 pages, 4 fugures; title modified, figures 2 and 3 modified, content changed (pages 1 and 4, mainly), references added

    Journal ref: Journal of the Physical Society of Japan, 76, 023003 (2007)

  11. arXiv:cond-mat/0502017  [pdf, ps, other

    cond-mat.stat-mech

    Diffusion Coefficient and Mobility of a Brownian Particle in a Tilted Periodic Potential

    Authors: Kazuo Sasaki, Satoshi Amari

    Abstract: The Brownian motion of a particle in a one-dimensional periodic potential subjected to a uniform external force F is studied. Using the formula for the diffusion coefficient D obtained by other authors and an alternative one derived from the Fokker-Planck equation in the present work, D is compared with the differential mobility μ= dv/dF where v is the average velocity of the particle. Analytica… ▽ More

    Submitted 1 February, 2005; originally announced February 2005.

    Comments: 7 pages, 4 figures, submitted to J. Phys. Soc. Jpn

  12. arXiv:cond-mat/9806078  [pdf, ps, other

    cond-mat.stat-mech cond-mat.dis-nn q-bio

    Mutual Information of Three-State Low Activity Diluted Neural Networks with Self-Control

    Authors: D. Bolle', D. R. C. Dominguez, S. Amari

    Abstract: The influence of a macroscopic time-dependent threshold on the retrieval process of three-state extremely diluted neural networks is examined. If the threshold is chosen appropriately in function of the noise and the pattern activity of the network, adapting itself in the course of the time evolution, it guarantees an autonomous functioning of the network. It is found that this self-control mech… ▽ More

    Submitted 21 August, 2000; v1 submitted 5 June, 1998; originally announced June 1998.

    Comments: Change of title and small corrections (16 pages and 6 figures)

    Report number: KUL-TF-98/26

    Journal ref: Neural Networks 13, 455-462 (2000)