-
A Data-Driven Framework for Koopman Semigroup Estimation in Stochastic Dynamical Systems
Authors:
Yuanchao Xu,
Kaidi Shao,
Isao Ishikawa,
Yuka Hashimoto,
Nikos Logothetis,
Zhongwei Shen
Abstract:
We present Stochastic Dynamic Mode Decomposition (SDMD), a novel data-driven framework for approximating the Koopman semigroup in stochastic dynamical systems. Unlike existing methods, SDMD explicitly incorporates sampling time into its approximation, ensuring numerical stability and precision. By directly approximating the Koopman semigroup instead of the generator, SDMD avoids computationally ex…
▽ More
We present Stochastic Dynamic Mode Decomposition (SDMD), a novel data-driven framework for approximating the Koopman semigroup in stochastic dynamical systems. Unlike existing methods, SDMD explicitly incorporates sampling time into its approximation, ensuring numerical stability and precision. By directly approximating the Koopman semigroup instead of the generator, SDMD avoids computationally expensive matrix exponential computations, which offers a more efficient and practical pathway for analyzing stochastic dynamics. The framework further integrates neural networks to automate basis selection, which reduces the reliance on manual intervention while maintaining computational efficiency. Rigorous theoretical guarantees, including convergence in the large data limit, zero-limit of sampling time, and large dictionary size, establish the method's reliability. Numerical experiments on canonical stochastic systems validate SDMD's effectiveness in approximating eigenvalues and eigenfunctions of the stochastic Koopman operator.
△ Less
Submitted 24 May, 2025; v1 submitted 22 January, 2025;
originally announced January 2025.
-
ResKoopNet: Learning Koopman Representations for Complex Dynamics with Spectral Residuals
Authors:
Yuanchao Xu,
Kaidi Shao,
Nikos Logothetis,
Zhongwei Shen
Abstract:
Analyzing the long-term behavior of high-dimensional nonlinear dynamical systems remains a significant challenge. While the Koopman operator framework provides a powerful global linearization tool, current methods for approximating its spectral components often face theoretical limitations and depend on predefined dictionaries. Residual Dynamic Mode Decomposition (ResDMD) advanced the field by int…
▽ More
Analyzing the long-term behavior of high-dimensional nonlinear dynamical systems remains a significant challenge. While the Koopman operator framework provides a powerful global linearization tool, current methods for approximating its spectral components often face theoretical limitations and depend on predefined dictionaries. Residual Dynamic Mode Decomposition (ResDMD) advanced the field by introducing the \emph{spectral residual} to assess Koopman operator approximation accuracy; however, its approach of only filtering precomputed spectra prevents the discovery of the operator's complete spectral information, a limitation known as the `spectral inclusion' problem. We introduce ResKoopNet (Residual-based Koopman-learning Network), a novel method that directly addresses this by explicitly minimizing the \emph{spectral residual} to compute Koopman eigenpairs. This enables the identification of a more precise and complete Koopman operator spectrum. Using neural networks, our approach provides theoretical guarantees while maintaining computational adaptability. Experiments on a variety of physical and biological systems show that ResKoopNet achieves more accurate spectral approximations than existing methods, particularly for high-dimensional systems and those with continuous spectra, which demonstrates its effectiveness as a tool for analyzing complex dynamical systems.
△ Less
Submitted 27 May, 2025; v1 submitted 31 December, 2024;
originally announced January 2025.
-
Information Theoretic Measures of Causal Influences during Transient Neural Events
Authors:
Kaidi Shao,
Nikos K. Logothetis,
Michel Besserve
Abstract:
Transient phenomena play a key role in coordinating brain activity at multiple scales, however,their underlying mechanisms remain largely unknown. A key challenge for neural data science is thus to characterize the network interactions at play during these events. Using the formalism of Structural Causal Models and their graphical representation, we investigate the theoretical and empirical proper…
▽ More
Transient phenomena play a key role in coordinating brain activity at multiple scales, however,their underlying mechanisms remain largely unknown. A key challenge for neural data science is thus to characterize the network interactions at play during these events. Using the formalism of Structural Causal Models and their graphical representation, we investigate the theoretical and empirical properties of Information Theory based causal strength measures in the context of recurring spontaneous transient events. After showing the limitations of Transfer Entropy and Dynamic Causal Strength in such a setting, we introduce a novel measure, relative Dynamic Causal Strength, and provide theoretical and empirical support for its benefits. These methods are applied to simulated and experimentally recorded neural time series, and provide results in agreement with our current understanding of the underlying brain circuits.
△ Less
Submitted 15 September, 2022;
originally announced September 2022.
-
Bayesian Information Criterion for Event-based Multi-trial Ensemble data
Authors:
Kaidi Shao,
Nikos K. Logothetis,
Michel Besserve
Abstract:
Transient recurring phenomena are ubiquitous in many scientific fields like neuroscience and meteorology. Time inhomogenous Vector Autoregressive Models (VAR) may be used to characterize peri-event system dynamics associated with such phenomena, and can be learned by exploiting multi-dimensional data gathering samples of the evolution of the system in multiple time windows comprising, each associa…
▽ More
Transient recurring phenomena are ubiquitous in many scientific fields like neuroscience and meteorology. Time inhomogenous Vector Autoregressive Models (VAR) may be used to characterize peri-event system dynamics associated with such phenomena, and can be learned by exploiting multi-dimensional data gathering samples of the evolution of the system in multiple time windows comprising, each associated with one occurrence of the transient phenomenon, that we will call "trial". However, optimal VAR model order selection methods, commonly relying on the Akaike or Bayesian Information Criteria (AIC/BIC), are typically not designed for multi-trial data. Here we derive the BIC methods for multi-trial ensemble data which are gathered after the detection of the events. We show using simulated bivariate AR models that the multi-trial BIC is able to recover the real model order. We also demonstrate with simulated transient events and real data that the multi-trial BIC is able to estimate a sufficiently small model order for dynamic system modeling.
△ Less
Submitted 29 April, 2022;
originally announced April 2022.
-
From univariate to multivariate coupling between continuous signals and point processes: a mathematical framework
Authors:
Shervin Safavi,
Nikos K. Logothetis,
Michel Besserve
Abstract:
Time series datasets often contain heterogeneous signals, composed of both continuously changing quantities and discretely occurring events. The coupling between these measurements may provide insights into key underlying mechanisms of the systems under study. To better extract this information, we investigate the asymptotic statistical properties of coupling measures between continuous signals an…
▽ More
Time series datasets often contain heterogeneous signals, composed of both continuously changing quantities and discretely occurring events. The coupling between these measurements may provide insights into key underlying mechanisms of the systems under study. To better extract this information, we investigate the asymptotic statistical properties of coupling measures between continuous signals and point processes. We first introduce martingale stochastic integration theory as a mathematical model for a family of statistical quantities that include the Phase Locking Value, a classical coupling measure to characterize complex dynamics. Based on the martingale Central Limit Theorem, we can then derive the asymptotic Gaussian distribution of estimates of such coupling measure, that can be exploited for statistical testing. Second, based on multivariate extensions of this result and Random Matrix Theory, we establish a principled way to analyze the low rank coupling between a large number of point processes and continuous signals. For a null hypothesis of no coupling, we establish sufficient conditions for the empirical distribution of squared singular values of the matrix to converge, as the number of measured signals increases, to the well-known Marchenko-Pastur (MP) law, and the largest squared singular value converges to the upper end of the MPs support. This justifies a simple thresholding approach to assess the significance of multivariate coupling. Finally, we illustrate with simulations the relevance of our univariate and multivariate results in the context of neural time series, addressing how to reliably quantify the interplay between multi channel Local Field Potential signals and the spiking activity of a large population of neurons.
△ Less
Submitted 8 May, 2020;
originally announced May 2020.
-
Signal detection in extracellular neural ensemble recordings using higher criticism
Authors:
Farzad Fathizadeh,
Ekaterina Mitricheva,
Rui Kimura,
Nikos Logothetis,
Hamid Reza Noori
Abstract:
Information processing in the brain is conducted by a concerted action of multiple neural populations. Gaining insights in the organization and dynamics of such populations can best be studied with broadband intracranial recordings of so-called extracellular field potential, reflecting neuronal spiking as well as mesoscopic activities, such as waves, oscillations, intrinsic large deflections, and…
▽ More
Information processing in the brain is conducted by a concerted action of multiple neural populations. Gaining insights in the organization and dynamics of such populations can best be studied with broadband intracranial recordings of so-called extracellular field potential, reflecting neuronal spiking as well as mesoscopic activities, such as waves, oscillations, intrinsic large deflections, and multiunit spiking activity. Such signals are critical for our understanding of how neuronal ensembles encode sensory information and how such information is integrated in the large networks underlying cognition. The aforementioned principles are now well accepted, yet the efficacy of extracting information out of the complex neural data, and their employment for improving our understanding of neural networks, critically depends on the mathematical processing steps ranging from simple detection of action potentials in noisy traces - to fitting advanced mathematical models to distinct patterns of the neural signal potentially underlying intra-processing of information, e.g. interneuronal interactions. Here, we present a robust strategy for detecting signals in broadband and noisy time series such as spikes, sharp waves and multi-unit activity data that is solely based on the intrinsic statistical distribution of the recorded data. By using so-called higher criticism - a second-level significance testing procedure comparing the fraction of observed significances to an expected fraction under the global null - we are able to detect small signals in correlated noisy time-series without prior filtering, denoising or data regression. Results demonstrate the efficiency and reliability of the method and versatility over a wide range of experimental conditions and suggest the appropriateness of higher criticism to characterize neuronal dynamics without prior manipulation of the data.
△ Less
Submitted 15 May, 2019;
originally announced May 2019.
-
The information content of Local Field Potentials: experiments and models
Authors:
Alberto Mazzoni,
Nikos K. Logothetis,
Stefano Panzeri
Abstract:
The LFPs is a broadband signal that captures variations of neural population activity over a wide range of time scales. The range of time scales available in LFPs is particularly interesting from the neural coding point of view because it opens up the possibility to investigate whether there are privileged time scales for information processing, a question that has been hotly debated over the last…
▽ More
The LFPs is a broadband signal that captures variations of neural population activity over a wide range of time scales. The range of time scales available in LFPs is particularly interesting from the neural coding point of view because it opens up the possibility to investigate whether there are privileged time scales for information processing, a question that has been hotly debated over the last one or two decades.It is possible that information is represented by only a small number of specific frequency ranges, each carrying a separate contribution to the information representation. To shed light on this issue, it is important to quantify the information content of each frequency range of neural activity, and understand which ranges carry complementary or similar information.
△ Less
Submitted 4 June, 2012;
originally announced June 2012.