Skip to main content

Showing 1–20 of 20 results for author: Teichmann, J

Searching in archive stat. Search in all archives.
.
  1. arXiv:2510.00087  [pdf, ps, other

    stat.AP cs.LG math.PR q-bio.QM

    Revealing the temporal dynamics of antibiotic anomalies in the infant gut microbiome with neural jump ODEs

    Authors: Anja Adamov, Markus Chardonnet, Florian Krach, Jakob Heiss, Josef Teichmann, Nicholas A. Bokulich

    Abstract: Detecting anomalies in irregularly sampled multi-variate time-series is challenging, especially in data-scarce settings. Here we introduce an anomaly detection framework for irregularly sampled time-series that leverages neural jump ordinary differential equations (NJODEs). The method infers conditional mean and variance trajectories in a fully path dependent way and computes anomaly scores. On sy… ▽ More

    Submitted 30 September, 2025; originally announced October 2025.

  2. arXiv:2503.16696  [pdf, ps, other

    math.PR cs.LG math.FA q-fin.MF stat.ML

    Universal approximation property of neural stochastic differential equations

    Authors: Anna P. Kwossek, David J. Prömel, Josef Teichmann

    Abstract: We identify various classes of neural networks that are able to approximate continuous functions locally uniformly subject to fixed global linear growth constraints. For such neural networks the associated neural stochastic differential equations can approximate general stochastic differential equations, both of Itô diffusion type, arbitrarily well. Moreover, quantitative error estimates are deriv… ▽ More

    Submitted 20 March, 2025; originally announced March 2025.

    Comments: 20 pages

    MSC Class: 41A29; 60H10; 68T07; 91G80

  3. arXiv:2502.03163  [pdf, other

    math.CA cs.LG math.PR stat.ML

    Signature Reconstruction from Randomized Signatures

    Authors: Mie Glückstad, Nicola Muca Cirone, Josef Teichmann

    Abstract: Controlled ordinary differential equations driven by continuous bounded variation curves can be considered a continuous time analogue of recurrent neural networks for the construction of expressive features of the input curves. We ask up to which extent well known signature features of such curves can be reconstructed from controlled ordinary differential equations with (untrained) random vector f… ▽ More

    Submitted 5 February, 2025; originally announced February 2025.

    Comments: 37 pages, 7 figures

    MSC Class: 60L10 (Primary) 60L70; 60L90; 68T07 (Secondary)

  4. arXiv:2407.18808  [pdf, other

    stat.ML cs.AI cs.LG math.DS math.PR

    Learning Chaotic Systems and Long-Term Predictions with Neural Jump ODEs

    Authors: Florian Krach, Josef Teichmann

    Abstract: The Path-dependent Neural Jump ODE (PD-NJ-ODE) is a model for online prediction of generic (possibly non-Markovian) stochastic processes with irregular (in time) and potentially incomplete (with respect to coordinates) observations. It is a model for which convergence to the $L^2$-optimal predictor, which is given by the conditional expectation, is established theoretically. Thereby, the training… ▽ More

    Submitted 26 July, 2024; originally announced July 2024.

  5. arXiv:2307.13147  [pdf, other

    stat.ML cs.LG math.NA math.PR

    Extending Path-Dependent NJ-ODEs to Noisy Observations and a Dependent Observation Framework

    Authors: William Andersson, Jakob Heiss, Florian Krach, Josef Teichmann

    Abstract: The Path-Dependent Neural Jump Ordinary Differential Equation (PD-NJ-ODE) is a model for predicting continuous-time stochastic processes with irregular and incomplete observations. In particular, the method learns optimal forecasts given irregularly sampled time series of incomplete past observations. So far the process itself and the coordinate-wise observation times were assumed to be independen… ▽ More

    Submitted 5 February, 2024; v1 submitted 24 July, 2023; originally announced July 2023.

    Journal ref: Transactions on Machine Learning Research (TMLR) 2024

  6. arXiv:2306.03303  [pdf, other

    stat.ML cs.LG math.FA math.PR q-fin.MF

    Global universal approximation of functional input maps on weighted spaces

    Authors: Christa Cuchiero, Philipp Schmocker, Josef Teichmann

    Abstract: We introduce so-called functional input neural networks defined on a possibly infinite dimensional weighted space with values also in a possibly infinite dimensional output space. To this end, we use an additive family to map the input weighted space to the hidden layer, on which a non-linear scalar activation function is applied to each neuron, and finally return the output via some linear readou… ▽ More

    Submitted 2 February, 2025; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: 67 pages, 4 figures

    MSC Class: 26A16; 26E20; 41A65; 41A81; 46E40; 60L10; 68T07

  7. arXiv:2303.11454  [pdf, other

    cs.LG stat.ML

    How (Implicit) Regularization of ReLU Neural Networks Characterizes the Learned Function -- Part II: the Multi-D Case of Two Layers with Random First Layer

    Authors: Jakob Heiss, Josef Teichmann, Hanna Wutte

    Abstract: Randomized neural networks (randomized NNs), where only the terminal layer's weights are optimized constitute a powerful model class to reduce computational time in training the neural network model. At the same time, these models generalize surprisingly well in various regression and classification tasks. In this paper, we give an exact macroscopic characterization (i.e., a characterization in fu… ▽ More

    Submitted 20 March, 2023; originally announced March 2023.

    Comments: 16 pages + appendix

  8. arXiv:2206.14284  [pdf, other

    stat.ML cs.LG math.NA math.PR

    Optimal Estimation of Generic Dynamics by Path-Dependent Neural Jump ODEs

    Authors: Florian Krach, Marc Nübel, Josef Teichmann

    Abstract: This paper studies the problem of forecasting general stochastic processes using a path-dependent extension of the Neural Jump ODE (NJ-ODE) framework \citep{herrera2021neural}. While NJ-ODE was the first framework to establish convergence guarantees for the prediction of irregularly observed time series, these results were limited to data stemming from Itô-diffusions with complete observations, in… ▽ More

    Submitted 4 July, 2024; v1 submitted 28 June, 2022; originally announced June 2022.

  9. arXiv:2201.02441  [pdf, other

    q-fin.CP cs.LG q-fin.MF stat.ML

    Applications of Signature Methods to Market Anomaly Detection

    Authors: Erdinc Akyildirim, Matteo Gambara, Josef Teichmann, Syang Zhou

    Abstract: Anomaly detection is the process of identifying abnormal instances or events in data sets which deviate from the norm significantly. In this study, we propose a signatures based machine learning algorithm to detect rare or unexpected items in a given data set of time series type. We present applications of signature or randomized signature as feature extractors for anomaly detection algorithms; ad… ▽ More

    Submitted 8 February, 2022; v1 submitted 7 January, 2022; originally announced January 2022.

  10. How Infinitely Wide Neural Networks Can Benefit from Multi-task Learning -- an Exact Macroscopic Characterization

    Authors: Jakob Heiss, Josef Teichmann, Hanna Wutte

    Abstract: In practice, multi-task learning (through learning features shared among tasks) is an essential property of deep neural networks (NNs). While infinite-width limits of NNs can provide good intuition for their generalization behavior, the well-known infinite-width limits of NNs in the literature (e.g., neural tangent kernels) assume specific settings in which wide ReLU-NNs behave like shallow Gaussi… ▽ More

    Submitted 20 October, 2022; v1 submitted 31 December, 2021; originally announced December 2021.

    Comments: 13 pages + appendix

    MSC Class: 68T07; 68Q32 ACM Class: I.2

  11. arXiv:2104.13669  [pdf, other

    stat.ML cs.LG math.NA math.PR q-fin.CP

    Optimal Stopping via Randomized Neural Networks

    Authors: Calypso Herrera, Florian Krach, Pierre Ruyssen, Josef Teichmann

    Abstract: This paper presents the benefits of using randomized neural networks instead of standard basis functions or deep neural networks to approximate the solutions of optimal stopping problems. The key idea is to use neural networks, where the parameters of the hidden layers are generated randomly and only the last layer is trained, in order to approximate the continuation value. Our approaches are appl… ▽ More

    Submitted 1 December, 2023; v1 submitted 28 April, 2021; originally announced April 2021.

    MSC Class: 60G40 (Primary); 68T07 (Secondary)

  12. arXiv:2102.13640  [pdf, other

    cs.LG cs.AI stat.ML

    NOMU: Neural Optimization-based Model Uncertainty

    Authors: Jakob Heiss, Jakob Weissteiner, Hanna Wutte, Sven Seuken, Josef Teichmann

    Abstract: We study methods for estimating model uncertainty for neural networks (NNs) in regression. To isolate the effect of model uncertainty, we focus on a noiseless setting with scarce training data. We introduce five important desiderata regarding model uncertainty that any method should satisfy. However, we find that established benchmarks often fail to reliably capture some of these desiderata, even… ▽ More

    Submitted 11 March, 2023; v1 submitted 26 February, 2021; originally announced February 2021.

    Comments: 9 pages + appendix

    Journal ref: Proceedings of the 39th International Conference on Machine Learning, PMLR 162:8708-8758, 2022

  13. arXiv:2010.14615  [pdf, ps, other

    cs.NE cs.LG math.PR stat.ML

    Discrete-time signatures and randomness in reservoir computing

    Authors: Christa Cuchiero, Lukas Gonon, Lyudmila Grigoryeva, Juan-Pablo Ortega, Josef Teichmann

    Abstract: A new explanation of geometric nature of the reservoir computing phenomenon is presented. Reservoir computing is understood in the literature as the possibility of approximating input/output systems with randomly chosen recurrent neural systems and a trained linear readout layer. Light is shed on this phenomenon by constructing what is called strongly universal reservoir systems as random projecti… ▽ More

    Submitted 17 September, 2020; originally announced October 2020.

    Comments: 14 pages

  14. arXiv:2006.09455  [pdf, other

    q-fin.CP stat.ML

    Consistent Recalibration Models and Deep Calibration

    Authors: Matteo Gambara, Josef Teichmann

    Abstract: Consistent Recalibration models (CRC) have been introduced to capture in necessary generality the dynamic features of term structures of derivatives' prices. Several approaches have been suggested to tackle this problem, but all of them, including CRC models, suffered from numerical intractabilities mainly due to the presence of complicated drift terms or consistency conditions. We overcome this p… ▽ More

    Submitted 1 July, 2021; v1 submitted 16 June, 2020; originally announced June 2020.

  15. arXiv:2006.04727  [pdf, other

    stat.ML cs.LG math.PR q-fin.CP q-fin.ST

    Neural Jump Ordinary Differential Equations: Consistent Continuous-Time Prediction and Filtering

    Authors: Calypso Herrera, Florian Krach, Josef Teichmann

    Abstract: Combinations of neural ODEs with recurrent neural networks (RNN), like GRU-ODE-Bayes or ODE-RNN are well suited to model irregularly observed time series. While those models outperform existing discrete-time approaches, no theoretical guarantees for their predictive capabilities are available. Assuming that the irregularly-sampled time series data originates from a continuous stochastic process, t… ▽ More

    Submitted 16 April, 2021; v1 submitted 8 June, 2020; originally announced June 2020.

    Journal ref: International Conference on Learning Representations (2021)

  16. arXiv:2005.02505  [pdf, other

    q-fin.CP math.OC stat.ML

    A generative adversarial network approach to calibration of local stochastic volatility models

    Authors: Christa Cuchiero, Wahid Khosrawi, Josef Teichmann

    Abstract: We propose a fully data-driven approach to calibrate local stochastic volatility (LSV) models, circumventing in particular the ad hoc interpolation of the volatility surface. To achieve this, we parametrize the leverage function by a family of feed-forward neural networks and learn their parameters directly from the available market option prices. This should be seen in the context of neural SDEs… ▽ More

    Submitted 29 September, 2020; v1 submitted 5 May, 2020; originally announced May 2020.

    Comments: Replacement for previous version: Major update of previous version to match the content of the published version

    Journal ref: Risks 2020, 8, 101

  17. arXiv:2004.13612  [pdf, other

    stat.ML cs.LG math.OC q-fin.CP

    Denise: Deep Robust Principal Component Analysis for Positive Semidefinite Matrices

    Authors: Calypso Herrera, Florian Krach, Anastasis Kratsios, Pierre Ruyssen, Josef Teichmann

    Abstract: The robust PCA of covariance matrices plays an essential role when isolating key explanatory features. The currently available methods for performing such a low-rank plus sparse decomposition are matrix specific, meaning, those algorithms must re-run for every new matrix. Since these algorithms are computationally expensive, it is preferable to learn and store a function that nearly instantaneousl… ▽ More

    Submitted 6 June, 2023; v1 submitted 28 April, 2020; originally announced April 2020.

    Journal ref: Transactions on Machine Learning Research (2023)

  18. arXiv:2004.13135  [pdf, other

    stat.ML cs.LG q-fin.MF

    Local Lipschitz Bounds of Deep Neural Networks

    Authors: Calypso Herrera, Florian Krach, Josef Teichmann

    Abstract: The Lipschitz constant is an important quantity that arises in analysing the convergence of gradient-based optimization methods. It is generally unclear how to estimate the Lipschitz constant of a complex model. Thus, this paper studies an important problem that may be useful to the broader area of non-convex optimization. The main result provides a local upper bound on the Lipschitz constants of… ▽ More

    Submitted 9 February, 2023; v1 submitted 27 April, 2020; originally announced April 2020.

  19. arXiv:1911.02903  [pdf, other

    cs.LG math.NA stat.ML

    How Implicit Regularization of ReLU Neural Networks Characterizes the Learned Function -- Part I: the 1-D Case of Two Layers with Random First Layer

    Authors: Jakob Heiss, Josef Teichmann, Hanna Wutte

    Abstract: In this paper, we consider one dimensional (shallow) ReLU neural networks in which weights are chosen randomly and only the terminal layer is trained. First, we mathematically show that for such networks L2-regularized regression corresponds in function space to regularizing the estimate's second derivative for fairly general loss functionals. For least squares regression, we show that the trained… ▽ More

    Submitted 4 October, 2023; v1 submitted 7 November, 2019; originally announced November 2019.

    Comments: adding Appendix C for more intuition, fixing typos, improving formulations, (moving end of Section 3.1 into Appendix B)

    MSC Class: 41Axx; 93Exx; 68T05; 68Q32 ACM Class: I.2.6; G.3

  20. arXiv:1209.2566  [pdf, other

    stat.ME math.PR

    Generalizations of Matérn's hard-core point processes

    Authors: Jakob Teichmann, Felix Ballani, Karl Gerald van den Boogaart

    Abstract: Matérn's hard-core processes are valuable point process models in spatial statistics. In order to extend their field of application, Matérn's original models are generalized here, both as point processes and particle processes. The thinning rule uses a distance-dependent probability function, which controls deletion of points close together. For this general setting, explicit formulas for first- a… ▽ More

    Submitted 12 September, 2012; originally announced September 2012.

    Comments: 21 pages, 17 figures

    MSC Class: 60D05; 60G55