Showing 1–9 of 9 results for author: Monfared, Z

  1. arXiv:2410.23467

    cs.LG math.NA

    Gradient-free training of recurrent neural networks

    Authors: Erik Lien Bolager, Ana Cukarska, Iryna Burak, Zahra Monfared, Felix Dietrich

    Abstract: Recurrent neural networks are a successful neural architecture for many time-dependent problems, including time series analysis, forecasting, and modeling of dynamical systems. Training such networks with backpropagation through time is a notoriously difficult problem because their loss gradients tend to explode or vanish. In this contribution, we introduce a computational approach to construct al…

    Submitted 29 January, 2025; v1 submitted 30 October, 2024; originally announced October 2024.

  2. arXiv:2410.14240

    cs.LG cs.AI math.DS nlin.CD physics.data-an

    Almost-Linear RNNs Yield Highly Interpretable Symbolic Codes in Dynamical Systems Reconstruction

    Authors: Manuel Brenner, Christoph Jürgen Hemmer, Zahra Monfared, Daniel Durstewitz

    Abstract: Dynamical systems (DS) theory is fundamental for many areas of science and engineering. It can provide deep insights into the behavior of systems evolving in time, as typically described by differential or recursive equations. A common approach to facilitate mathematical tractability and interpretability of DS models involves decomposing nonlinear DS into multiple linear DS separated by switching…

    Submitted 18 October, 2024; originally announced October 2024.

    Comments: 38th Conference on Neural Information Processing Systems (NeurIPS 2024)

  3. arXiv:2402.18377

    cs.LG cs.AI math.DS nlin.CD

    Out-of-Domain Generalization in Dynamical Systems Reconstruction

    Authors: Niclas Göring, Florian Hess, Manuel Brenner, Zahra Monfared, Daniel Durstewitz

    Abstract: In science we are interested in finding the governing equations, the dynamical rules, underlying empirical phenomena. While traditionally scientific models are derived through cycles of human insight and experimentation, recently deep learning (DL) techniques have been advanced to reconstruct dynamical systems (DS) directly from time series data. State-of-the-art dynamical systems reconstruction (…

    Submitted 7 June, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

  4. arXiv:2310.17561

    cs.LG cs.AI math.DS

    Bifurcations and loss jumps in RNN training

    Authors: Lukas Eisenmann, Zahra Monfared, Niclas Alexander Göring, Daniel Durstewitz

    Abstract: Recurrent neural networks (RNNs) are popular machine learning tools for modeling and forecasting sequential data and for inferring dynamical systems (DS) from observed time series. Concepts from DS theory (DST) have variously been used to further our understanding of both, how trained RNNs solve complex tasks, and the training process itself. Bifurcations are particularly important phenomena in DS…

    Submitted 26 October, 2023; originally announced October 2023.

  5. arXiv:2306.04406

    cs.LG cs.AI math.DS nlin.CD

    Generalized Teacher Forcing for Learning Chaotic Dynamics

    Authors: Florian Hess, Zahra Monfared, Manuel Brenner, Daniel Durstewitz

    Abstract: Chaotic dynamical systems (DS) are ubiquitous in nature and society. Often we are interested in reconstructing such systems from observed time series for prediction or mechanistic insight, where by reconstruction we mean learning geometrical and invariant temporal properties of the system in question (like attractors). However, training reconstruction algorithms like recurrent neural networks (RNN…

    Submitted 27 October, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: Published in the Proceedings of the 40th International Conference on Machine Learning (ICML 2023)

    Journal ref: PMLR 202:13017-13049, 2023

  6. arXiv:2207.02542

    cs.LG math.DS nlin.CD physics.comp-ph

    Tractable Dendritic RNNs for Reconstructing Nonlinear Dynamical Systems

    Authors: Manuel Brenner, Florian Hess, Jonas M. Mikhaeil, Leonard Bereska, Zahra Monfared, Po-Chen Kuo, Daniel Durstewitz

    Abstract: In many scientific disciplines, we are interested in inferring the nonlinear dynamical system underlying a set of observed time series, a challenging task in the face of chaotic behavior and noise. Previous deep learning approaches toward this goal often suffered from a lack of interpretability and tractability. In particular, the high-dimensional latent spaces often required for a faithful embedd…

    Submitted 6 July, 2022; originally announced July 2022.

    Comments: To be published in the Proceedings of the 39th International Conference on Machine Learning (ICML 2022)

  7. arXiv:2110.07238

    cs.LG math.DS stat.ML

    On the difficulty of learning chaotic dynamics with RNNs

    Authors: Jonas M. Mikhaeil, Zahra Monfared, Daniel Durstewitz

    Abstract: Recurrent neural networks (RNNs) are wide-spread machine learning tools for modeling sequential and time series data. They are notoriously hard to train because their loss gradients backpropagated in time tend to saturate or diverge during training. This is known as the exploding and vanishing gradient problem. Previous solutions to this issue either built on rather complicated, purpose-engineered…

    Submitted 6 October, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

  8. arXiv:2007.00321

    math.DS

    Transformation of ReLU-based recurrent neural networks from discrete-time to continuous-time

    Authors: Zahra Monfared, Daniel Durstewitz

    Abstract: Recurrent neural networks (RNN) as used in machine learning are commonly formulated in discrete time, i.e. as recursive maps. This brings a lot of advantages for training models on data, e.g. for the purpose of time series prediction or dynamical systems identification, as powerful and efficient inference algorithms exist for discrete time systems and numerical integration of differential equation…

    Submitted 1 July, 2020; originally announced July 2020.

  9. Existence of n-cycles and border-collision bifurcations in piecewise-linear continuous maps with applications to recurrent neural networks

    Authors: Zahra Monfared, Daniel Durstewitz

    Abstract: Piecewise linear recurrent neural networks (PLRNNs) form the basis of many successful machine learning applications for time series prediction and dynamical systems identification, but rigorous mathematical analysis of their dynamics and properties is lagging behind. Here we contribute to this topic by investigating the existence of n-cycles $(n\geq 3)$ and border-collision bifurcations in a class…

    Submitted 1 July, 2020; v1 submitted 11 November, 2019; originally announced November 2019.