Skip to main content

Showing 1–3 of 3 results for author: Oliva, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2112.11776  [pdf, ps, other

    cs.CL cs.LG

    The Importance of the Current Input in Sequence Modeling

    Authors: Christian Oliva, Luis F. Lago-Fernández

    Abstract: The last advances in sequence modeling are mainly based on deep learning approaches. The current state of the art involves the use of variations of the standard LSTM architecture, combined with several tricks that improve the final prediction rates of the trained neural networks. However, in some cases, these adaptations might be too much tuned to the particular problems being addressed. In this a… ▽ More

    Submitted 22 December, 2021; originally announced December 2021.

    Comments: 11 pages, 2 appendix pages

  2. Stability of Internal States in Recurrent Neural Networks Trained on Regular Languages

    Authors: Christian Oliva, Luis F. Lago-Fernández

    Abstract: We provide an empirical study of the stability of recurrent neural networks trained to recognize regular languages. When a small amount of noise is introduced into the activation function, the neurons in the recurrent layer tend to saturate in order to compensate the variability. In this saturated regime, analysis of the network activation shows a set of clusters that resemble discrete states in a… ▽ More

    Submitted 18 June, 2020; originally announced June 2020.

    Comments: 11 pages, 9 figures

  3. arXiv:2005.13971  [pdf, other

    cs.NE cs.FL cs.LG stat.ML

    Separation of Memory and Processing in Dual Recurrent Neural Networks

    Authors: Christian Oliva, Luis F. Lago-Fernández

    Abstract: We explore a neural network architecture that stacks a recurrent layer and a feedforward layer that is also connected to the input, and compare it to standard Elman and LSTM architectures in terms of accuracy and interpretability. When noise is introduced into the activation function of the recurrent units, these neurons are forced into a binary activation regime that makes the networks behave muc… ▽ More

    Submitted 17 May, 2020; originally announced May 2020.

    Comments: 10 pages