Skip to main content

Showing 1–3 of 3 results for author: Chen, K K

Searching in archive stat. Search in all archives.
.
  1. arXiv:1811.11152  [pdf, other

    stat.ML cs.LG

    Knots in random neural networks

    Authors: Kevin K. Chen, Anthony C. Gamst, Alden K. Walker

    Abstract: The weights of a neural network are typically initialized at random, and one can think of the functions produced by such a network as having been generated by a prior over some function space. Studying random networks, then, is useful for a Bayesian understanding of the network evolution in early stages of training. In particular, one can investigate why neural networks with huge numbers of parame… ▽ More

    Submitted 27 November, 2018; originally announced November 2018.

    Comments: Presented at the Workshop on Bayesian Deep Learning, NIPS 2016, Barcelona, Spain

  2. arXiv:1611.09448  [pdf, other

    stat.ML cs.LG

    The Upper Bound on Knots in Neural Networks

    Authors: Kevin K. Chen

    Abstract: Neural networks with rectified linear unit activations are essentially multivariate linear splines. As such, one of many ways to measure the "complexity" or "expressivity" of a neural network is to count the number of knots in the spline model. We study the number of knots in fully-connected feedforward neural networks with rectified linear unit activation functions. We intentionally keep the neur… ▽ More

    Submitted 29 November, 2016; v1 submitted 28 November, 2016; originally announced November 2016.

    Comments: 19 pages, 8 figures

  3. arXiv:1611.09444  [pdf, other

    stat.ML cs.LG

    The empirical size of trained neural networks

    Authors: Kevin K. Chen, Anthony Gamst, Alden Walker

    Abstract: ReLU neural networks define piecewise linear functions of their inputs. However, initializing and training a neural network is very different from fitting a linear spline. In this paper, we expand empirically upon previous theoretical work to demonstrate features of trained neural networks. Standard network initialization and training produce networks vastly simpler than a naive parameter count wo… ▽ More

    Submitted 28 November, 2016; originally announced November 2016.

    Comments: 6 pages, 5 figures