Out-of-distributional risk bounds for neural operators with applications to the Helmholtz equation

Benitez, J. Antonio Lara; Furuya, Takashi; Faucher, Florian; Kratsios, Anastasis; Tricoche, Xavier; de Hoop, Maarten V.

doi:10.1016/j.jcp.2024.113168

arXiv:2301.11509 (cs)

[Submitted on 27 Jan 2023 (v1), last revised 4 Jul 2023 (this version, v3)]

Title: Out-of-distributional risk bounds for neural operators with applications to the Helmholtz equation

Authors: J. Antonio Lara Benitez, Takashi Furuya, Florian Faucher, Anastasis Kratsios, Xavier Tricoche, Maarten V. de Hoop

Abstract: Despite their remarkable success in approximating a wide range of operators defined by PDEs, existing neural operators (NOs) do not necessarily perform well for all physics problems. We focus here on high-frequency waves to highlight possible shortcomings. To resolve these, we propose a subfamily of NOs enabling an enhanced empirical approximation of the nonlinear operator mapping wave speed to solution, or boundary values for the Helmholtz equation on a bounded domain. The latter operator is commonly referred to as the "forward" operator in the study of inverse problems. Our methodology draws inspiration from transformers and techniques such as stochastic depth. Our experiments reveal certain surprises in the generalization and the relevance of introducing stochastic depth. Our NOs show superior performance as compared with standard NOs, not only for testing within the training distribution but also for out-of-distribution scenarios. To delve into this observation, we offer an in-depth analysis of the Rademacher complexity associated with our modified models and prove an upper bound tied to their stochastic depth that existing NOs do not satisfy. Furthermore, we obtain a novel out-of-distribution risk bound tailored to Gaussian measures on Banach spaces, again relating stochastic depth with the bound. We conclude by proposing a hypernetwork version of the subfamily of NOs as a surrogate model for the mentioned forward operator.

Subjects:	Machine Learning (cs.LG); Numerical Analysis (math.NA); Machine Learning (stat.ML)
Cite as:	arXiv:2301.11509 [cs.LG]
	(or arXiv:2301.11509v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2301.11509
Related DOI:	https://doi.org/10.1016/j.jcp.2024.113168

Submission history

From: Jose Antonio Lara Benitez
[v1] Fri, 27 Jan 2023 03:02:12 UTC (18,892 KB)
[v2] Wed, 19 Apr 2023 03:06:03 UTC (18,257 KB)
[v3] Tue, 4 Jul 2023 22:42:47 UTC (44,122 KB)
