Skip to main content

Showing 1–8 of 8 results for author: Petreczky, M

Searching in archive stat. Search in all archives.
.
  1. arXiv:2405.20278  [pdf, other

    cs.LG cs.AI stat.ML

    Length independent generalization bounds for deep SSM architectures via Rademacher contraction and stability constraints

    Authors: Dániel Rácz, Mihály Petreczky, Bálint Daróczy

    Abstract: Many state-of-the-art models trained on long-range sequences, for example S4, S5 or LRU, are made of sequential blocks combining State-Space Models (SSMs) with neural networks. In this paper we provide a PAC bound that holds for these kind of architectures with \emph{stable} SSM blocks and does not depend on the length of the input sequence. Imposing stability of the SSM blocks is a standard pract… ▽ More

    Submitted 24 May, 2025; v1 submitted 30 May, 2024; originally announced May 2024.

    Comments: preliminary version accepted at ICML 2024 Next Generation of Sequence Modeling Architectures Workshop

    MSC Class: 68 ACM Class: I.2.6

  2. PAC-Bayes Generalisation Bounds for Dynamical Systems Including Stable RNNs

    Authors: Deividas Eringis, John Leth, Zheng-Hua Tan, Rafal Wisniewski, Mihaly Petreczky

    Abstract: In this paper, we derive a PAC-Bayes bound on the generalisation gap, in a supervised time-series setting for a special class of discrete-time non-linear dynamical systems. This class includes stable recurrent neural networks (RNN), and the motivation for this work was its application to RNNs. In order to achieve the results, we impose some stability constraints, on the allowed models. Here, stabi… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: Accepted to AAAI2024 conference

    Journal ref: AAAI, vol. 38, no. 11, pp. 11901-11909, Mar. 2024

  3. arXiv:2310.09961  [pdf, other

    cs.LG stat.ME

    Theoretical Evaluation of Asymmetric Shapley Values for Root-Cause Analysis

    Authors: Domokos M. Kelen, Mihály Petreczky, Péter Kersch, András A. Benczúr

    Abstract: In this work, we examine Asymmetric Shapley Values (ASV), a variant of the popular SHAP additive local explanation method. ASV proposes a way to improve model explanations incorporating known causal relations between variables, and is also considered as a way to test for unfair discrimination in model predictions. Unexplored in previous literature, relaxing symmetry in Shapley values can have coun… ▽ More

    Submitted 15 October, 2023; originally announced October 2023.

    Comments: 10 pages, 6 figures, to be published in IEEE ICDM 2023

  4. arXiv:2303.16816  [pdf, ps, other

    stat.ML cs.LG

    PAC-Bayesian bounds for learning LTI-ss systems with input from empirical loss

    Authors: Deividas Eringis, John Leth, Zheng-Hua Tan, Rafael Wisniewski, Mihaly Petreczky

    Abstract: In this paper we derive a Probably Approxilmately Correct(PAC)-Bayesian error bound for linear time-invariant (LTI) stochastic dynamical systems with inputs. Such bounds are widespread in machine learning, and they are useful for characterizing the predictive power of models learned from finitely many data points. In particular, with the bound derived in this paper relates future average predictio… ▽ More

    Submitted 29 March, 2023; originally announced March 2023.

    Comments: arXiv admin note: text overlap with arXiv:2212.14838

  5. arXiv:2212.14838  [pdf, ps, other

    stat.ML cs.LG math.DS math.ST

    PAC-Bayesian-Like Error Bound for a Class of Linear Time-Invariant Stochastic State-Space Models

    Authors: Deividas Eringis, John Leth, Zheng-Hua Tan, Rafal Wisniewski, Mihaly Petreczky

    Abstract: In this paper we derive a PAC-Bayesian-Like error bound for a class of stochastic dynamical systems with inputs, namely, for linear time-invariant stochastic state-space models (stochastic LTI systems for short). This class of systems is widely used in control engineering and econometrics, in particular, they represent a special case of recurrent neural networks. In this paper we 1) formalize the… ▽ More

    Submitted 30 December, 2022; originally announced December 2022.

  6. arXiv:2109.02384  [pdf, other

    math.OC cs.LG math.DS stat.ML

    Explicit construction of the minimum error variance estimator for stochastic LTI state-space systems

    Authors: Deividas Eringis, John Leth, Zheng-Hua Tan, Rafal Wisniewski, Mihaly Petreczky

    Abstract: In this short article, we showcase the derivation of the optimal (minimum error variance) estimator, when one part of the stochastic LTI system output is not measured but is able to be predicted from the measured system outputs. Similar derivations have been done before but not using state-space representation.

    Submitted 1 January, 2023; v1 submitted 6 September, 2021; originally announced September 2021.

  7. arXiv:2103.12866  [pdf, other

    stat.ML cs.LG

    PAC-Bayesian theory for stochastic LTI systems

    Authors: Deividas Eringis, John Leth, Zheng-Hua Tan, Rafal Wisniewski, Alireza Fakhrizadeh Esfahani, Mihaly Petreczky

    Abstract: In this paper we derive a PAC-Bayesian error bound for autonomous stochastic LTI state-space models. The motivation for deriving such error bounds is that they will allow deriving similar error bounds for more general dynamical systems, including recurrent neural networks. In turn, PACBayesian error bounds are known to be useful for analyzing machine learning algorithms and for deriving new ones.

    Submitted 25 March, 2021; v1 submitted 23 March, 2021; originally announced March 2021.

  8. arXiv:1912.03036  [pdf, ps, other

    cs.LG stat.ML

    Improved PAC-Bayesian Bounds for Linear Regression

    Authors: Vera Shalaeva, Alireza Fakhrizadeh Esfahani, Pascal Germain, Mihaly Petreczky

    Abstract: In this paper, we improve the PAC-Bayesian error bound for linear regression derived in Germain et al. [10]. The improvements are twofold. First, the proposed error bound is tighter, and converges to the generalization loss with a well-chosen temperature parameter. Second, the error bound also holds for training data that are not independently sampled. In particular, the error bound applies to cer… ▽ More

    Submitted 6 December, 2019; originally announced December 2019.

    Journal ref: Thirty-Fourth AAAI Conference on Artificial Intelligence, Feb 2020, New York, United States