Skip to main content

Showing 1–9 of 9 results for author: Rusch, T K

Searching in archive math. Search in all archives.
.
  1. arXiv:2508.04926  [pdf, ps, other

    math.NA math.OC

    On the optimization of discrepancy measures

    Authors: François Clément, Nathan Kirk, Art B. Owen, T. Konstantin Rusch

    Abstract: Points in the unit cube with low discrepancy can be constructed using algebra or, more recently, by direct computational optimization of a criterion. The usual $L_\infty$ star discrepancy is a poor criterion for this because it is computationally expensive and lacks differentiability. Its usual replacement, the $L_2$ star discrepancy, is smooth but exhibits other pathologies shown by J. Matoušek.… ▽ More

    Submitted 6 August, 2025; originally announced August 2025.

    Comments: 22 pages, 3 Figures, 4 Tables

  2. arXiv:2503.21103  [pdf, other

    cs.LG math.NA

    Low Stein Discrepancy via Message-Passing Monte Carlo

    Authors: Nathan Kirk, T. Konstantin Rusch, Jakob Zech, Daniela Rus

    Abstract: Message-Passing Monte Carlo (MPMC) was recently introduced as a novel low-discrepancy sampling approach leveraging tools from geometric deep learning. While originally designed for generating uniform point sets, we extend this framework to sample from general multivariate probability distributions with known probability density function. Our proposed method, Stein-Message-Passing Monte Carlo (Stei… ▽ More

    Submitted 26 March, 2025; originally announced March 2025.

    Comments: 8 pages, 2 figures, Accepted at the ICLR 2025 Workshop on Frontiers in Probabilistic Inference

  3. arXiv:2405.15059  [pdf, other

    cs.LG math.NA stat.ML

    Message-Passing Monte Carlo: Generating low-discrepancy point sets via Graph Neural Networks

    Authors: T. Konstantin Rusch, Nathan Kirk, Michael M. Bronstein, Christiane Lemieux, Daniela Rus

    Abstract: Discrepancy is a well-known measure for the irregularity of the distribution of a point set. Point sets with small discrepancy are called low-discrepancy and are known to efficiently fill the space in a uniform manner. Low-discrepancy points play a central role in many problems in science and engineering, including numerical integration, computer vision, machine perception, computer graphics, mach… ▽ More

    Submitted 26 September, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: Published in Proceedings of the National Academy of Sciences (PNAS): https://www.pnas.org/doi/10.1073/pnas.2409913121

  4. arXiv:2302.03580  [pdf, other

    cs.LG math.NA stat.ML

    Multi-Scale Message Passing Neural PDE Solvers

    Authors: Léonard Equer, T. Konstantin Rusch, Siddhartha Mishra

    Abstract: We propose a novel multi-scale message passing neural network algorithm for learning the solutions of time-dependent PDEs. Our algorithm possesses both temporal and spatial multi-scale resolution features by incorporating multi-scale sequence models and graph gating modules in the encoder and processor, respectively. Benchmark numerical experiments are presented to demonstrate that the proposed al… ▽ More

    Submitted 7 February, 2023; originally announced February 2023.

  5. arXiv:2202.02296  [pdf, other

    cs.LG math.DS stat.ML

    Graph-Coupled Oscillator Networks

    Authors: T. Konstantin Rusch, Benjamin P. Chamberlain, James Rowbottom, Siddhartha Mishra, Michael M. Bronstein

    Abstract: We propose Graph-Coupled Oscillator Networks (GraphCON), a novel framework for deep learning on graphs. It is based on discretizations of a second-order system of ordinary differential equations (ODEs), which model a network of nonlinear controlled and damped oscillators, coupled via the adjacency structure of the underlying graph. The flexibility of our framework permits any basic GNN layer (e.g.… ▽ More

    Submitted 23 June, 2022; v1 submitted 4 February, 2022; originally announced February 2022.

    Comments: ICML 2022

  6. arXiv:2110.04744  [pdf, other

    cs.LG math.DS stat.ML

    Long Expressive Memory for Sequence Modeling

    Authors: T. Konstantin Rusch, Siddhartha Mishra, N. Benjamin Erichson, Michael W. Mahoney

    Abstract: We propose a novel method called Long Expressive Memory (LEM) for learning long-term sequential dependencies. LEM is gradient-based, it can efficiently process sequential tasks with very long-term dependencies, and it is sufficiently expressive to be able to learn complicated input-output maps. To derive LEM, we consider a system of multiscale ordinary differential equations, as well as a suitable… ▽ More

    Submitted 25 February, 2022; v1 submitted 10 October, 2021; originally announced October 2021.

    Comments: ICLR 2022

  7. arXiv:2103.05487  [pdf, other

    cs.LG math.DS stat.ML

    UnICORNN: A recurrent model for learning very long time dependencies

    Authors: T. Konstantin Rusch, Siddhartha Mishra

    Abstract: The design of recurrent neural networks (RNNs) to accurately process sequential inputs with long-time dependencies is very challenging on account of the exploding and vanishing gradient problem. To overcome this, we propose a novel RNN architecture which is based on a structure preserving discretization of a Hamiltonian system of second-order ordinary differential equations that models networks of… ▽ More

    Submitted 10 June, 2021; v1 submitted 9 March, 2021; originally announced March 2021.

    Report number: PMLR 139:9168-9178, 2021

  8. arXiv:2009.02713  [pdf, other

    math.NA cs.LG

    Higher-order Quasi-Monte Carlo Training of Deep Neural Networks

    Authors: M. Longo, S. Mishra, T. K. Rusch, Ch. Schwab

    Abstract: We present a novel algorithmic approach and an error analysis leveraging Quasi-Monte Carlo points for training deep neural network (DNN) surrogates of Data-to-Observable (DtO) maps in engineering design. Our analysis reveals higher-order consistent, deterministic choices of training points in the input data space for deep and shallow Neural Networks with holomorphic activation functions such as ta… ▽ More

    Submitted 6 September, 2020; originally announced September 2020.

  9. arXiv:2005.12564  [pdf, other

    cs.LG math.NA physics.flu-dyn stat.ML

    Enhancing accuracy of deep learning algorithms by training with low-discrepancy sequences

    Authors: Siddhartha Mishra, T. Konstantin Rusch

    Abstract: We propose a deep supervised learning algorithm based on low-discrepancy sequences as the training set. By a combination of theoretical arguments and extensive numerical experiments we demonstrate that the proposed algorithm significantly outperforms standard deep learning algorithms that are based on randomly chosen training data, for problems in moderately high dimensions. The proposed algorithm… ▽ More

    Submitted 26 May, 2020; originally announced May 2020.