Search | arXiv e-print repository

Digital Twins: McKean-Pontryagin Control for Partially Observed Physical Twins

Abstract: Optimal control for fully observed diffusion processes is well established and has led to numerous numerical implementations based on, for example, Bellman's principle, model free reinforcement learning, Pontryagin's maximum principle, and model predictive control. On the contrary, much fewer algorithms are available for optimal control of partially observed processes. However, this scenario is ce… ▽ More Optimal control for fully observed diffusion processes is well established and has led to numerous numerical implementations based on, for example, Bellman's principle, model free reinforcement learning, Pontryagin's maximum principle, and model predictive control. On the contrary, much fewer algorithms are available for optimal control of partially observed processes. However, this scenario is central to the digital twin paradigm where a physical twin is partially observed and control laws are derived based on a digital twin. In this paper, we contribute to this challenge by combining data assimilation in the form of the ensemble Kalman filter with the recently proposed McKean-Pontryagin approach to stochastic optimal control. We derive forward evolving mean-field evolution equations for states and co-states which simultaneously allow for an online assimilation of data as well as an online computation of control laws. The proposed methodology is therefore perfectly suited for real time applications of digital twins. We present numerical results for a controlled Lorenz-63 system and an inverted pendulum. △ Less

Submitted 20 October, 2025; v1 submitted 1 October, 2025; originally announced October 2025.

arXiv:2510.00785 [pdf, ps, other]

Enhancement of the WS$_2$ A$_{1\text{g}}$ Raman Mode in MoS$_2$/WS$_2$ Heterostructures

Authors: Annika Bergmann-Iwe, Tomasz Woźniak, Mustafa Hemaid, Oisín Garrity, Patryk Kusch, Rico Schwartz, Ziyang Gan, Antony George, Ludger Wirtz, Stephanie Reich, Andrey Turchanin, Tobias Korn

Abstract: When combined into van der Waals heterostructures, transition metal dichalcogenide monolayers enable the exploration of novel physics beyond their unique individual properties. However, for interesting phenomena such as interlayer charge transfer and interlayer excitons to occur, precise control of the interface and ensuring high-quality interlayer contact is crucial. Here, we investigate bilayer… ▽ More When combined into van der Waals heterostructures, transition metal dichalcogenide monolayers enable the exploration of novel physics beyond their unique individual properties. However, for interesting phenomena such as interlayer charge transfer and interlayer excitons to occur, precise control of the interface and ensuring high-quality interlayer contact is crucial. Here, we investigate bilayer heterostructures fabricated by combining chemical-vapor-deposition-grown MoS$_2$ and exfoliated WS$_2$ monolayers, allowing us to form several heterostructures with various twist angles within one preparation step. In case of sufficiently good interfacial contact, evaluated by photoluminescence quenching, we observe a twist-angle-dependent enhancement of the WS$_2$ A$_{1g}$ Raman mode. In contrast, other WS$_2$ and MoS$_2$ Raman modes (in particular, the MoS$_2$ A$_{1g}$ mode) do not show a clear enhancement under the same experimental conditions. We present a systematic study of this mode-selective effect using nonresonant Raman measurements that are complemented with ab-initio calculations of Raman spectra. We find that the selective enhancement of the WS$_2$ A$_{1g}$ mode exhibits a strong dependence on interlayer distance. We show that this selectivity is related to the A$_{1g}$ eigenvectors in the heterolayer: the eigenvectors are predominantly localized on one of the two layers; yet, the intensity of the MoS$_2$ mode is attenuated because the WS$_2$ layer is vibrating (albeit with much lower amplitude) out of phase, while the WS$_2$ mode is amplified because the atoms on the MoS$_2$ layer are vibrating in phase. To separate this eigenmode effect from resonant Raman enhancement, our study is extended with near-resonant Raman measurements. △ Less

Submitted 1 October, 2025; originally announced October 2025.

arXiv:2508.21006 [pdf, ps, other]

Practical indistinguishability in a gene regulatory network inference problem, a case study

Authors: Cody E. FitzGerald, Shelley Reich, Victor Agaba, Arjun Mathur, Michael S. Werner, Niall M. Mangan

Abstract: Computationally inferring mechanistic insights from typical biological data is a challenging pursuit. Even the highest-quality experimental data come with challenges. There are always sources of noise, a limit to how often we can measure the system, and we can rarely measure all the relevant states that participate in the underlying complexity. There are usually sources of uncertainty in model dev… ▽ More Computationally inferring mechanistic insights from typical biological data is a challenging pursuit. Even the highest-quality experimental data come with challenges. There are always sources of noise, a limit to how often we can measure the system, and we can rarely measure all the relevant states that participate in the underlying complexity. There are usually sources of uncertainty in model development, which give rise to multiple competing model structures. To underscore the need for further analysis of structural uncertainty in modeling, we use a meta-analysis across six journals covering mathematical biology and show that a huge number of models for biological systems are developed each year, but model selection and comparison across model structures appear to be less common. We walk through a case study involving inference of regulatory network structure involved in a developmental decision in the nematode, \textit{Pristonchus pacificus}. We use real biological data and compare across 13,824 models--each corresponding to a different regulatory network structure, to determine which regulatory features are supported by the data across three experimental conditions. We find that the best-fitting models for each experimental condition share a combination of features and identify a regulatory network that is common across the model sets for each condition. This model can describe the data across the experimental conditions we considered and exhibits a high degree of positive regulation and interconnectivity between the key regulators, \textit{eud-1}, $textit{sult-1}, and \textit{nhr-40}. While the biological results are specific to the molecular biology of development in \textit{Pristonchus pacificus}, the general modeling framework and underlying challenges we faced doing this analysis are widespread across biology, chemistry, physics, and many other scientific disciplines. △ Less

Submitted 28 August, 2025; originally announced August 2025.

arXiv:2508.15069 [pdf, ps, other]

Sampling by averaging: A multiscale approach to score estimation

Authors: Paula Cordero-Encinar, Andrew B. Duncan, Sebastian Reich, O. Deniz Akyildiz

Abstract: We introduce a novel framework for efficient sampling from complex, unnormalised target distributions by exploiting multiscale dynamics. Traditional score-based sampling methods either rely on learned approximations of the score function or involve computationally expensive nested Markov chain Monte Carlo (MCMC) loops. In contrast, the proposed approach leverages stochastic averaging within a slow… ▽ More We introduce a novel framework for efficient sampling from complex, unnormalised target distributions by exploiting multiscale dynamics. Traditional score-based sampling methods either rely on learned approximations of the score function or involve computationally expensive nested Markov chain Monte Carlo (MCMC) loops. In contrast, the proposed approach leverages stochastic averaging within a slow-fast system of stochastic differential equations (SDEs) to estimate intermediate scores along a diffusion path without training or inner-loop MCMC. Two algorithms are developed under this framework: MultALMC, which uses multiscale annealed Langevin dynamics, and MultCDiff, based on multiscale controlled diffusions for the reverse-time Ornstein-Uhlenbeck process. Both overdamped and underdamped variants are considered, with theoretical guarantees of convergence to the desired diffusion path. The framework is extended to handle heavy-tailed target distributions using Student's t-based noise models and tailored fast-process dynamics. Empirical results across synthetic and real-world benchmarks, including multimodal and high-dimensional distributions, demonstrate that the proposed methods are competitive with existing samplers in terms of accuracy and efficiency, without the need for learned models. △ Less

Submitted 20 August, 2025; originally announced August 2025.

arXiv:2506.22391 [pdf, ps, other]

Regularized Extragradient Methods for Solving Equilibrium Problems on Hadamard Manifolds

Authors: Shikher Sharmaa, Pankaj Gautam, Simeon Reich

Abstract: Employing two distinct types of regularization terms, we propose two regularized extragradient methods for solving equilibrium problems on Hadamard manifolds. The sequences generated by these extragradient algorithms converge to a solution of the equilibrium problem without requiring the Lipschitz continuity of the bifunction or imposing additional conditions on the parameters. We establish conver… ▽ More Employing two distinct types of regularization terms, we propose two regularized extragradient methods for solving equilibrium problems on Hadamard manifolds. The sequences generated by these extragradient algorithms converge to a solution of the equilibrium problem without requiring the Lipschitz continuity of the bifunction or imposing additional conditions on the parameters. We establish convergence results for both algorithms and derive global error bounds along with $R$-linear convergence rates in cases where the bifunction is strongly pseudomonotone. Finally, we present numerical experiments to demonstrate the effectiveness of our methods. △ Less

Submitted 27 June, 2025; originally announced June 2025.

arXiv:2506.10506 [pdf, ps, other]

On a mean-field Pontryagin minimum principle for stochastic optimal control

Authors: Manfred Opper, Sebastian Reich

Abstract: This papers outlines a novel extension of the classical Pontryagin minimum (maximum) principle to stochastic optimal control problems. Contrary to the well-known stochastic Pontryagin minimum principle involving forward-backward stochastic differential equations, the proposed formulation is deterministic and of mean-field type. We denote it the McKean-Pontryagin minimum principle. The Hamiltonian… ▽ More This papers outlines a novel extension of the classical Pontryagin minimum (maximum) principle to stochastic optimal control problems. Contrary to the well-known stochastic Pontryagin minimum principle involving forward-backward stochastic differential equations, the proposed formulation is deterministic and of mean-field type. We denote it the McKean-Pontryagin minimum principle. The Hamiltonian structure of the proposed McKean-Pontryagin minimum principle is achieved via the introduction of an appropriate gauge variable. The gauge freedom can be used to decouple the forward and reverse time equations; hence simplifying the solution of the underlying boundary value problem. We also consider infinite horizon discounted cost optimal control problems. In this case, the mean-field formulation allows converting the computation of the desired optimal control law into solving a pair of forward mean-field ordinary differential equations. The McKean-Pontryagin minimum principle is tested numerically for a controlled inverted pendulum and a controlled Lorenz-63 system. △ Less

Submitted 27 July, 2025; v1 submitted 12 June, 2025; originally announced June 2025.

MSC Class: 35F21; 49M99; 93E20; 70H30; 70H45

arXiv:2505.24610 [pdf, ps, other]

Potential Effects of Loading Terminal Locations on Surface Trajectories of Oil Spill Transport

Authors: Shoshana Reich, Edward Buskey, Clint Dawson, Eirik Valseth

Abstract: We present an investigation comparing the potential impacts of offshore and onshore crude oil loading sites on surface trajectories of spilled oil particles in the regions near the Port of Corpus Christi, Texas. Oil transport is established in a two step procedure. First, the circulation and flow characteristics of seawater throughout the coastal ocean are established for various flow conditions,… ▽ More We present an investigation comparing the potential impacts of offshore and onshore crude oil loading sites on surface trajectories of spilled oil particles in the regions near the Port of Corpus Christi, Texas. Oil transport is established in a two step procedure. First, the circulation and flow characteristics of seawater throughout the coastal ocean are established for various flow conditions, including current and proposed channel depth, seasonality changes, and extreme weather events. Then, spilled oil is modeled as distinct particles released at either the proposed onshore or offshore loading locations. The particle trajectories are tracked and used to assess the spread into diverse coastal ecosystems with extensive plant, sea, and land life. The models indicate that the extent of spread of these simulated oil spills to ecologically significant regions is greater when initiated at the onshore loading site than at the offshore site. △ Less

Submitted 30 May, 2025; originally announced May 2025.

arXiv:2505.10004 [pdf, other]

Topology-driven identification of repetitions in multi-variate time series

Authors: Simon Schindler, Elias Steffen Reich, Saverio Messineo, Simon Hoher, Stefan Huber

Abstract: Many multi-variate time series obtained in the natural sciences and engineering possess a repetitive behavior, as for instance state-space trajectories of industrial machines in discrete automation. Recovering the times of recurrence from such a multi-variate time series is of a fundamental importance for many monitoring and control tasks. For a periodic time series this is equivalent to determini… ▽ More Many multi-variate time series obtained in the natural sciences and engineering possess a repetitive behavior, as for instance state-space trajectories of industrial machines in discrete automation. Recovering the times of recurrence from such a multi-variate time series is of a fundamental importance for many monitoring and control tasks. For a periodic time series this is equivalent to determining its period length. In this work we present a persistent homology framework to estimate recurrence times in multi-variate time series with different generalizations of cyclic behavior (periodic, repetitive, and recurring). To this end, we provide three specialized methods within our framework that are provably stable and validate them using real-world data, including a new benchmark dataset from an injection molding machine. △ Less

Submitted 19 May, 2025; v1 submitted 15 May, 2025; originally announced May 2025.

Comments: Appears at 6th Interdisciplinary Data Science Conference (iDSC'25)

arXiv:2505.06373 [pdf, other]

Ultrastrong Light-Matter Coupling in Materials

Authors: Niclas S. Mueller, Eduardo B. Barros, Stephanie Reich

Abstract: Ultrastrong light-matter coupling has traditionally been studied in optical cavities, where it occurs when the light-matter coupling strength reaches a significant fraction of the transition frequency. This regime fundamentally alters the ground and excited states of the particle-cavity system, unlocking new ways to control its physics and chemistry. However, achieving ultrastrong coupling in engi… ▽ More Ultrastrong light-matter coupling has traditionally been studied in optical cavities, where it occurs when the light-matter coupling strength reaches a significant fraction of the transition frequency. This regime fundamentally alters the ground and excited states of the particle-cavity system, unlocking new ways to control its physics and chemistry. However, achieving ultrastrong coupling in engineered cavities remains a major challenge. Here, we show that ultra- and deep-strong coupling naturally occur in bulk materials without the need for external cavities. By analyzing experimental data from over 70 materials, we demonstrate that phonon-, exciton-, and plasmon-polaritons in many solids exhibit ultrastrong coupling, systematically surpassing the coupling strengths achieved in cavity-based systems. To explain this phenomenon, we introduce a dipole lattice model based on a generalized Hopfield Hamiltonian, which unifies photon-matter, matter-matter, and photon-photon interactions. The complete overlap between the photonic and collective dipole modes in the lattice enables ultrastrong coupling, leading to excited-state mixing, radiative decay suppression, and potential phase transitions into collective ground states. Applying our model to real materials, we show that it reproduces light-matter coupling across broad material classes and may underlie structural phase transitions that give rise to emergent phenomena such as ferroelectricity, insulator-to-metal transitions, and exciton condensation. Recognizing ultrastrong coupling as an intrinsic property of solids reshapes our understanding of light-matter interactions and opens new avenues for exploring quantum materials and exotic phases of matter. △ Less

Submitted 9 May, 2025; originally announced May 2025.

arXiv:2505.05865 [pdf, ps, other]

Directed light emission from monolayers on 2D materials via optical interferences

Authors: Pavel Trofimov, Sabrina Juergensen, Adrián Dewambrechies Fernández, Kirill Bolotin, Stephanie Reich, Hélène Seiler

Abstract: Two-dimensional materials provide a rich platform to explore phenomena such as emerging electronic and excitonic states, strong light-matter coupling and new optoelectronic device concepts. The optical response of monolayers is entangled with the substrate on which they are grown or deposited on, often a two-dimensional material itself. Understanding how the properties of the two-dimensional monol… ▽ More Two-dimensional materials provide a rich platform to explore phenomena such as emerging electronic and excitonic states, strong light-matter coupling and new optoelectronic device concepts. The optical response of monolayers is entangled with the substrate on which they are grown or deposited on, often a two-dimensional material itself. Understanding how the properties of the two-dimensional monolayers can be tuned via the substrate is therefore essential. Here we employ angle-resolved reflectivity and photoluminescence spectroscopy on highly ordered molecular monolayers on hexagonal boron nitride (hBN) to systematically investigate the angle-dependent optical response as a function of the thickness of the hBN flake. We observe that light reflection and emission occur in a strongly directed fashion and that the direction of light reflection and emission is dictated by the hBN flake thickness. Transfer matrix simulations reproduce the experimental data and show that optical interference effects in hBN are at the origin of the angle-dependent optical properties. While our study focuses on molecular monolayers on hBN, our findings are general and relevant for any 2D material placed on top of a substrate. Our findings demonstrate the need to carefully choose substrate parameters for a given experimental geometry but also highlight opportunities in applications such as lighting technology where the direction of light emission can be controlled via substrate thickness. △ Less

Submitted 9 May, 2025; originally announced May 2025.

arXiv:2505.04417 [pdf, ps, other]

Localized Diffusion Models

Authors: Georg A. Gottwald, Shuigen Liu, Youssef Marzouk, Sebastian Reich, Xin T. Tong

Abstract: Diffusion models are state-of-the-art tools for various generative tasks. Yet training these models involves estimating high-dimensional score functions, which in principle suffers from the curse of dimensionality. It is therefore important to understand how low-dimensional structure in the target distribution can be exploited in these models. Here we consider locality structure, which describes c… ▽ More Diffusion models are state-of-the-art tools for various generative tasks. Yet training these models involves estimating high-dimensional score functions, which in principle suffers from the curse of dimensionality. It is therefore important to understand how low-dimensional structure in the target distribution can be exploited in these models. Here we consider locality structure, which describes certain sparse conditional dependencies among the target random variables. Given some locality structure, the score function is effectively low-dimensional, so that it can be estimated by a localized neural network with significantly reduced sample complexity. This observation motivates the localized diffusion model, where a localized score matching loss is used to train the score function within a localized hypothesis space. We prove that such localization enables diffusion models to circumvent the curse of dimensionality, at the price of additional localization error. Under realistic sample size scaling, we then show both theoretically and numerically that a moderate localization radius can balance the statistical and localization errors, yielding better overall performance. Localized structure also facilitates parallel training, making localized diffusion models potentially more efficient for large-scale applications. △ Less

Submitted 27 September, 2025; v1 submitted 7 May, 2025; originally announced May 2025.

arXiv:2505.02590 [pdf, other]

Ensemble Kalman filter for uncertainty in human language comprehension

Authors: Diksha Bhandari, Alessandro Lopopolo, Milena Rabovsky, Sebastian Reich

Abstract: Artificial neural networks (ANNs) are widely used in modeling sentence processing but often exhibit deterministic behavior, contrasting with human sentence comprehension, which manages uncertainty during ambiguous or unexpected inputs. This is exemplified by reversal anomalies-sentences with unexpected role reversals that challenge syntax and semantics-highlighting the limitations of traditional A… ▽ More Artificial neural networks (ANNs) are widely used in modeling sentence processing but often exhibit deterministic behavior, contrasting with human sentence comprehension, which manages uncertainty during ambiguous or unexpected inputs. This is exemplified by reversal anomalies-sentences with unexpected role reversals that challenge syntax and semantics-highlighting the limitations of traditional ANN models, such as the Sentence Gestalt (SG) Model. To address these limitations, we propose a Bayesian framework for sentence comprehension, applying an extension of the ensemble Kalman filter (EnKF) for Bayesian inference to quantify uncertainty. By framing language comprehension as a Bayesian inverse problem, this approach enhances the SG model's ability to reflect human sentence processing with respect to the representation of uncertainty. Numerical experiments and comparisons with maximum likelihood estimation (MLE) demonstrate that Bayesian methods improve uncertainty representation, enabling the model to better approximate human cognitive processing when dealing with linguistic ambiguities. △ Less

Submitted 5 May, 2025; originally announced May 2025.

arXiv:2505.01364 [pdf]

Monitoring morphometric drift in lifelong learning segmentation of the spinal cord

Authors: Enamundram Naga Karthik, Sandrine Bédard, Jan Valošek, Christoph S. Aigner, Elise Bannier, Josef Bednařík, Virginie Callot, Anna Combes, Armin Curt, Gergely David, Falk Eippert, Lynn Farner, Michael G Fehlings, Patrick Freund, Tobias Granberg, Cristina Granziera, RHSCIR Network Imaging Group, Ulrike Horn, Tomáš Horák, Suzanne Humphreys, Markus Hupp, Anne Kerbrat, Nawal Kinany, Shannon Kolind, Petr Kudlička , et al. (31 additional authors not shown)

Abstract: Morphometric measures derived from spinal cord segmentations can serve as diagnostic and prognostic biomarkers in neurological diseases and injuries affecting the spinal cord. While robust, automatic segmentation methods to a wide variety of contrasts and pathologies have been developed over the past few years, whether their predictions are stable as the model is updated using new datasets has not… ▽ More Morphometric measures derived from spinal cord segmentations can serve as diagnostic and prognostic biomarkers in neurological diseases and injuries affecting the spinal cord. While robust, automatic segmentation methods to a wide variety of contrasts and pathologies have been developed over the past few years, whether their predictions are stable as the model is updated using new datasets has not been assessed. This is particularly important for deriving normative values from healthy participants. In this study, we present a spinal cord segmentation model trained on a multisite $(n=75)$ dataset, including 9 different MRI contrasts and several spinal cord pathologies. We also introduce a lifelong learning framework to automatically monitor the morphometric drift as the model is updated using additional datasets. The framework is triggered by an automatic GitHub Actions workflow every time a new model is created, recording the morphometric values derived from the model's predictions over time. As a real-world application of the proposed framework, we employed the spinal cord segmentation model to update a recently-introduced normative database of healthy participants containing commonly used measures of spinal cord morphometry. Results showed that: (i) our model outperforms previous versions and pathology-specific models on challenging lumbar spinal cord cases, achieving an average Dice score of $0.95 \pm 0.03$; (ii) the automatic workflow for monitoring morphometric drift provides a quick feedback loop for developing future segmentation models; and (iii) the scaling factor required to update the database of morphometric measures is nearly constant among slices across the given vertebral levels, showing minimum drift between the current and previous versions of the model monitored by the framework. The code and model are open-source and accessible via Spinal Cord Toolbox v7.0. △ Less

Submitted 20 October, 2025; v1 submitted 2 May, 2025; originally announced May 2025.

Comments: Under review (after 1st round of revision) at Imaging Neuroscience journal

arXiv:2503.12474 [pdf, ps, other]

Ensemble Kalman-Bucy filtering for nonlinear model predictive control

Authors: Sebastian Reich

Abstract: We consider the problem of optimal control for partially observed dynamical systems. Despite its prevalence in practical applications, there are still very few algorithms available, which take uncertainties in the current state estimates and future observations into account. In other words, most current approaches separate state estimation from the optimal control problem. In this paper, we extend… ▽ More We consider the problem of optimal control for partially observed dynamical systems. Despite its prevalence in practical applications, there are still very few algorithms available, which take uncertainties in the current state estimates and future observations into account. In other words, most current approaches separate state estimation from the optimal control problem. In this paper, we extend the popular ensemble Kalman filter to receding horizon optimal control problems in the spirit of nonlinear model predictive control. We provide an interacting particle approximation to the forward-backward stochastic differential equations arising from Pontryagin's maximum principle with the forward stochastic differential equation provided by the time-continuous ensemble Kalman-Bucy filter equations. The receding horizon control laws are approximated as linear and are continuously updated as in nonlinear model predictive control. We illustrate the performance of the proposed methodology for an inverted pendulum example. △ Less

Submitted 16 March, 2025; originally announced March 2025.

MSC Class: 49M05; 93C10; 93C15; 93E11; 62M20; 60G35

arXiv:2503.02529 [pdf]

THz-Driven Coherent Phonon Fingerprints of Hidden Symmetry Breaking in 2D Layered Hybrid Perovskites

Authors: Joanna M. Urban, Michael S. Spencer, Maximilian Frenzel, Gaëlle Trippé- Allard, Marie Cherasse, Charlotte Berrezueta Palacios, Olga Minakova, Eduardo Bedê Barros, Luca Perfetti, Stephanie Reich, Martin Wolf, Emmanuelle Deleporte, Sebastian F. Maehrlein

Abstract: Metal-halide perovskites (MHPs) emerged as a family of novel semiconductors with outstanding optoelectronic properties for applications in photovoltaics and light emission. Recently, they also attract interest as promising candidates for spintronics. In materials lacking inversion symmetry, spin-orbit coupling (SOC) leads to the Rashba-Dresselhaus effect, offering a pathway for spin current contro… ▽ More Metal-halide perovskites (MHPs) emerged as a family of novel semiconductors with outstanding optoelectronic properties for applications in photovoltaics and light emission. Recently, they also attract interest as promising candidates for spintronics. In materials lacking inversion symmetry, spin-orbit coupling (SOC) leads to the Rashba-Dresselhaus effect, offering a pathway for spin current control. Therefore, inversion symmetry breaking in MHPs, which are characterized by strong SOC, has crucial implications. Yet, in complex low-dimensional hybrid organic-inorganic perovskites (HOIPs), the presence of and structural contributions to inversion symmetry breaking remain elusive. Here, employing intense THz fields, we coherently drive lattice dynamics carrying spectroscopic fingerprints of inversion symmetry breaking in Ruddlesden-Popper (PEA)$_2$(MA)$_{n-1}$PbnI${3n+1}$ perovskites, which are globally assigned to a centrosymmetric space group. We demonstrate coherent control by THz pulses over specific phonons, which we assign to either purely inorganic or highly anharmonic hybrid cage-ligand vibrations. By developing a general polarization analysis for THz-driven phonons, we pinpoint linear and nonlinear driving mechanisms. From this, we identify simultaneous IR- and Raman-activity of inorganic cage modes below 1.5 THz, indicating mode-selective inversion symmetry breaking. By exploring the driving pathways of these coherent phonons, we lay the groundwork for simultaneous ultrafast control of optoelectronic and spintronic properties in 2D HOIPs. △ Less

Submitted 4 March, 2025; originally announced March 2025.

Comments: 54 pages, 24 figures

arXiv:2502.06642 [pdf, other]

Regularity of the Product of Two Relaxed Cutters with Relaxation Parameters Beyond Two

Authors: Andrzej Cegielski, Simeon Reich, Rafał Zalas

Abstract: We study the product of two relaxed cutters having a common fixed point. We assume that one of the relaxation parameters is greater than two so that the corresponding relaxed cutter is no longer quasi-nonexpansive, but rather demicontractive. We show that if both of the operators are (weakly/linearly) regular, then under certain conditions, the resulting product inherits the same type of regularit… ▽ More We study the product of two relaxed cutters having a common fixed point. We assume that one of the relaxation parameters is greater than two so that the corresponding relaxed cutter is no longer quasi-nonexpansive, but rather demicontractive. We show that if both of the operators are (weakly/linearly) regular, then under certain conditions, the resulting product inherits the same type of regularity. We then apply these results to proving convergence in the weak, norm and linear sense of algorithms that employ such products. △ Less

Submitted 10 February, 2025; originally announced February 2025.

MSC Class: 47J25; 47J26

arXiv:2409.12537 [pdf, other]

doi 10.1021/acsphotonics.4c01548

Nanocavities for Molecular Optomechanics: their fundamental description and applications

Authors: Philippe Roelli, Huatian Hu, Ewold Verhagen, Stephanie Reich, Christophe Galland

Abstract: Vibrational Raman scattering -- a process where light exchanges energy with a molecular vibration through inelastic scattering -- is most fundamentally described in a quantum framework where both light and vibration are quantized. When the Raman scatterer is embedded inside a plasmonic nanocavity, as in some sufficiently controlled implementations of surface-enhanced Raman scattering (SERS), the c… ▽ More Vibrational Raman scattering -- a process where light exchanges energy with a molecular vibration through inelastic scattering -- is most fundamentally described in a quantum framework where both light and vibration are quantized. When the Raman scatterer is embedded inside a plasmonic nanocavity, as in some sufficiently controlled implementations of surface-enhanced Raman scattering (SERS), the coupled system realizes an optomechanical cavity, where coherent and parametrically amplified light-vibration interaction becomes a resource for vibrational state engineering and nanoscale nonlinear optics. The purpose of this Perspective is to clarify the connection between the languages and parameters used in the fields of molecular cavity optomechanics (McOM) vs. its conventional, `macroscopic' counterpart, and to summarize the main results achieved so far in McOM and the most pressing experimental and theoretical challenges. We aim to make the theoretical framework of molecular cavity optomechanics practically usable for the SERS and nanoplasmonics community at large. While quality factors ($Q$'s) and mode volumes ($V$'s) essentially describe the performance of a nanocavity in enhancing light-matter interaction, we point to the light-cavity coupling efficiencies ($η$'s) and optomechanical cooperativities ($\mathcal{C}$'s) as the key parameters for molecular optomechanics. As an illustration of the significance of these quantities, we investigate the feasibility of observing optomechanically induced transparency with a molecular vibration -- a measurement that would allow for a direct estimate of the optomechanical cooperativity. △ Less

Submitted 28 September, 2024; v1 submitted 19 September, 2024; originally announced September 2024.

Comments: Includes an Appendix

Journal ref: ACS Photonics 2024, 11, 11, 4486-4501

arXiv:2409.07968 [pdf, ps, other]

Localized Schrödinger Bridge Sampler

Authors: Georg A. Gottwald, Sebastian Reich

Abstract: We consider the problem of sampling from an unknown distribution for which only a sufficiently large number of training samples are available. In this paper, we build on previous work combining Schrödinger bridges and plug & play Langevin samplers. A key bottleneck of these approaches is the exponential dependence of the required training samples on the dimension, $d$, of the ambient state space.… ▽ More We consider the problem of sampling from an unknown distribution for which only a sufficiently large number of training samples are available. In this paper, we build on previous work combining Schrödinger bridges and plug & play Langevin samplers. A key bottleneck of these approaches is the exponential dependence of the required training samples on the dimension, $d$, of the ambient state space. We propose a localization strategy which exploits conditional independence of conditional expectation values. Localization thus replaces a single high-dimensional Schrödinger bridge problem by $d$ low-dimensional Schrödinger bridge problems over the available training samples. In this context, a connection to multi-head self attention transformer architectures is established. As for the original Schrödinger bridge sampling approach, the localized sampler is stable and geometric ergodic. The sampler also naturally extends to conditional sampling and to Bayesian inference. We demonstrate the performance of our proposed scheme through experiments on a high-dimensional Gaussian problem, on a temporal stochastic process, and on a stochastic subgrid-scale parametrization conditional sampling problem. We also extend the idea of localization to plug & play Langevin samplers using kernel-based denoising in combination with Tweedie's formula. △ Less

Submitted 17 November, 2024; v1 submitted 12 September, 2024; originally announced September 2024.

MSC Class: 60H10; 62F15; 62F30; 65C05; 65C40

arXiv:2408.17173 [pdf, ps, other]

Existence and approximate controllability results for time-fractional stochastic Navier-Stokes equations

Authors: Renu Chaudhary, Simeon Reich, Juan J. Nieto

Abstract: This paper deals with time-fractional stochastic Navier-Stokes equations, which are characterized by the coexistence of stochastic noise and a fractional power of the Laplacian. We establish sufficient conditions for the existence and approximate controllability of a unique mild solution to time-fractional stochastic Navier-Stokes equations. Using a fixed point technique, we first demonstrate the… ▽ More This paper deals with time-fractional stochastic Navier-Stokes equations, which are characterized by the coexistence of stochastic noise and a fractional power of the Laplacian. We establish sufficient conditions for the existence and approximate controllability of a unique mild solution to time-fractional stochastic Navier-Stokes equations. Using a fixed point technique, we first demonstrate the existence and uniqueness of a mild solution to the equation under consideration. We then establish approximate controllability results by using the concepts of fractional calculus, semigroup theory, functional analysis and stochastic analysis. △ Less

Submitted 10 October, 2025; v1 submitted 30 August, 2024; originally announced August 2024.

arXiv:2408.15885 [pdf, other]

Collective states of α-sexithiophene chains inside boron nitride nanotubes

Authors: Sabrina Juergensen, Jean-Baptiste Marceau, Chantal Mueller, Eduardo B. Barros, Patryk Kusch, Antonio Setaro, Etienne Gaufrès, Stephanie Reich

Abstract: Nanotubes align molecules into one dimensional chains creating collective states through the coupling of the molecular transition dipole moments. These collective excitations have strong fluorescence, narrow bandwidth, and shifted emission/absorption energies. We study the optical properties of α-sexithiophene chains in boron nitride nanotubes by combining fluorescence with far- and near-field abs… ▽ More Nanotubes align molecules into one dimensional chains creating collective states through the coupling of the molecular transition dipole moments. These collective excitations have strong fluorescence, narrow bandwidth, and shifted emission/absorption energies. We study the optical properties of α-sexithiophene chains in boron nitride nanotubes by combining fluorescence with far- and near-field absorption spectroscopy. The inner nanotube diameter determines the number of encapsulated molecular chains. A single chain of α-sexithiophene molecules has an optical absorption and emission spectrum that is red-shifted by almost 300 meV compared to the monomer emission, which is much larger than expected from dipole-dipole coupling. The collective state splits into excitation and emission channels with a Stokes shift of 200 meV for chains with two or more files. Our study emphasises the formation of a delocalized collective state through Coulomb coupling of the transition moments that shows a remarkable tuneability in transition energy. △ Less

Submitted 28 August, 2024; originally announced August 2024.

arXiv:2408.11534 [pdf]

Double tips for in-plane polarized near-field microscopy and spectroscopy

Authors: Patryk Kusch, Jose Pareja Arcos, Aleksei Tsarapkin, Victor Deinhart, Karsten Harbauer, Katja Höflich, Stephanie Reich

Abstract: Near-field optical microscopy and spectroscopy provide high-resolution imaging below the diffraction limit, crucial in physics, chemistry, and biology for studying molecules, nanoparticles, and viruses. These techniques use a sharp metallic tip of an atomic force microscope (AFM) to enhance incoming and scattered light by excited near-fields at the tip apex leading to high sensitivity and a spatia… ▽ More Near-field optical microscopy and spectroscopy provide high-resolution imaging below the diffraction limit, crucial in physics, chemistry, and biology for studying molecules, nanoparticles, and viruses. These techniques use a sharp metallic tip of an atomic force microscope (AFM) to enhance incoming and scattered light by excited near-fields at the tip apex leading to high sensitivity and a spatial resolution of a few nanometers. However, this restricts the near-field orientation to out-of-plane polarization, limiting optical polarization choices. We introduce double tips that offer in-plane polarization for enhanced imaging and spectroscopy. These double tips provide superior enhancement over single tips, although with a slightly lower spatial resolution (~30nm). They enable advanced studies of nanotubes, graphene defects, and transition metal dichalcogenides, benefiting from polarization control. The double tips allow varied polarization in tip-enhanced Raman scattering and selective excitation of transverse-electric and -magnetic polaritons, expanding the range of nanoscale samples that can be studied. △ Less

Submitted 21 August, 2024; originally announced August 2024.

arXiv:2406.17263 [pdf, other]

Efficient, Multimodal, and Derivative-Free Bayesian Inference With Fisher-Rao Gradient Flows

Authors: Yifan Chen, Daniel Zhengyu Huang, Jiaoyang Huang, Sebastian Reich, Andrew M. Stuart

Abstract: In this paper, we study efficient approximate sampling for probability distributions known up to normalization constants. We specifically focus on a problem class arising in Bayesian inference for large-scale inverse problems in science and engineering applications. The computational challenges we address with the proposed methodology are: (i) the need for repeated evaluations of expensive forward… ▽ More In this paper, we study efficient approximate sampling for probability distributions known up to normalization constants. We specifically focus on a problem class arising in Bayesian inference for large-scale inverse problems in science and engineering applications. The computational challenges we address with the proposed methodology are: (i) the need for repeated evaluations of expensive forward models; (ii) the potential existence of multiple modes; and (iii) the fact that gradient of, or adjoint solver for, the forward model might not be feasible. While existing Bayesian inference methods meet some of these challenges individually, we propose a framework that tackles all three systematically. Our approach builds upon the Fisher-Rao gradient flow in probability space, yielding a dynamical system for probability densities that converges towards the target distribution at a uniform exponential rate. This rapid convergence is advantageous for the computational burden outlined in (i). We apply Gaussian mixture approximations with operator splitting techniques to simulate the flow numerically; the resulting approximation can capture multiple modes thus addressing (ii). Furthermore, we employ the Kalman methodology to facilitate a derivative-free update of these Gaussian components and their respective weights, addressing the issue in (iii). The proposed methodology results in an efficient derivative-free sampler flexible enough to handle multi-modal distributions: Gaussian Mixture Kalman Inversion (GMKI). The effectiveness of GMKI is demonstrated both theoretically and numerically in several experiments with multimodal target distributions, including proof-of-concept and two-dimensional examples, as well as a large-scale application: recovering the Navier-Stokes initial condition from solution data at positive times. △ Less

Submitted 11 October, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

Comments: 42 pages, 10 figures

arXiv:2406.14738 [pdf, ps, other]

Parameter estimation for partially observed second-order diffusion processes

Authors: Jan Albrecht, Sebastian Reich

Abstract: Estimating parameters of a diffusion process given continuous-time observations of the process via maximum likelihood approaches or, online, via stochastic gradient descent or Kalman filter formulations constitutes a well-established research area. It has also been established previously that these techniques are, in general, not robust to perturbations in the data in the form of temporal correlat… ▽ More Estimating parameters of a diffusion process given continuous-time observations of the process via maximum likelihood approaches or, online, via stochastic gradient descent or Kalman filter formulations constitutes a well-established research area. It has also been established previously that these techniques are, in general, not robust to perturbations in the data in the form of temporal correlations of the driving noise. While the subject is relatively well understood and appropriate modifications have been suggested in the context of multi-scale diffusion processes and their reduced model equations, we consider here an alternative but related setting where a diffusion process in positions and velocities is only observed via its positions. In this note, we propose a simple modification to standard stochastic gradient descent and Kalman filter formulations, which eliminates the arising systematic estimation biases. The modification can be extended to standard maximum likelihood approaches and avoids computation of previously proposed correction terms. △ Less

Submitted 14 March, 2025; v1 submitted 20 June, 2024; originally announced June 2024.

MSC Class: 65C30; 65L09; 60M20; 62F10; 62F15; 62L20

arXiv:2404.05450 [pdf, other]

Raman scattering by carbon nanotubes coupled to quantum dots via dipolar excitonic interaction

Authors: Anna Wroblewska, Niclas S. Mueller, Mariusz Zdrojek, Stephanie Reich, Georgy Gordeev

Abstract: The dipole-dipole interactions between excitons are of paramount importance in the nanoscale structures. When two excitons are placed together they can exchange the energy can manifest in the resonant Raman cross sections. We provide theoretical framework for such effects by combining the coupled oscillator model and perturbation theory. We apply this theory to a hybrid film comprising semiconduct… ▽ More The dipole-dipole interactions between excitons are of paramount importance in the nanoscale structures. When two excitons are placed together they can exchange the energy can manifest in the resonant Raman cross sections. We provide theoretical framework for such effects by combining the coupled oscillator model and perturbation theory. We apply this theory to a hybrid film comprising semiconducting quantum dots and metallic carbon nanotubes. The quantum dots exciton has a fixed energy, while the nanotube resonances span across a larger range from 1.7 to \SI{1.93}{eV}. We acquire the resonant Raman profiles of the pristine nanotubes and hybrids and find a relative shift between them. The shift direction depends on the relative energies between the CNT and QD exciton energies, as predicted by our theory. △ Less

Submitted 8 April, 2024; originally announced April 2024.

Comments: 19 Pages, 7 Figures

arXiv:2404.04112 [pdf, other]

Resonant Raman signatures of exciton polarons in a transition metal oxide: BiVO$_4$

Authors: Georgy Gordeev, Christina Hill, Angelina Gudima, Stephanie Reich, Mael Guennou

Abstract: In this work we investigate the delocalized excitons and excitons trapped by a polaron formation in \BVO{} by means of resonant Raman spectroscopy. We record Raman spectra with 16 laser lines between 1.9 and \SI{2.6}{\eV} and analyze intensity variations of the Raman peaks for different vibrational modes. The resonant Raman cross sections of the \Ag{} modes contain two types of resonances. The fir… ▽ More In this work we investigate the delocalized excitons and excitons trapped by a polaron formation in \BVO{} by means of resonant Raman spectroscopy. We record Raman spectra with 16 laser lines between 1.9 and \SI{2.6}{\eV} and analyze intensity variations of the Raman peaks for different vibrational modes. The resonant Raman cross sections of the \Ag{} modes contain two types of resonances. The first high-energy resonance near \SI{2.45}{\eV} belongs to a transition between delocalized states; it is close to absorption edge measured at \SI{2.3}{\eV} and exhibits a characteristic \SI{50}{\meV} anisotropy between polarization parallel and perpendicular to the $c$ axis. The high energy Raman resonance occurs inside the gap at \SI{1.94}{\eV} for all crystallographic directions. The in-gap resonance can involve a localized transition. We attribute it to an exciton-polaron, formed by a small localized electron polaron of Holstein type and delocalized holes. It manifests in the vibrations of vanadium and oxygen atoms where polaron localization occurs and the resonance energy matches theoretical predictions. The vibrational modes couple to the polaron with different efficiency determined from resonant Raman profiles. △ Less

Submitted 5 April, 2024; originally announced April 2024.

Comments: main: 8 pages, 4 figures, supporting: 4 pages, 3 figures

arXiv:2404.01146 [pdf, other]

Dielectric Screening Inside Carbon Nanotubes

Authors: Georgy Gordeev, Sören Wasserroth, Han Li, Ado Jorio, Benjamin S. Flavel, Stephanie Reich

Abstract: Dielectric screening plays a vital role for the physical properties in the nanoscale and also alters our ability to detect and characterize nanomaterials by optical techniques. We study the dielectric screening inside of carbon nanotubes and how it changes electromagnetic fields and many-body effects for encapsulated nanostructures. First, we show that the local electric field inside a nanotube is… ▽ More Dielectric screening plays a vital role for the physical properties in the nanoscale and also alters our ability to detect and characterize nanomaterials by optical techniques. We study the dielectric screening inside of carbon nanotubes and how it changes electromagnetic fields and many-body effects for encapsulated nanostructures. First, we show that the local electric field inside a nanotube is altered by one-dimensional screening with dramatic effects on the effective Raman scattering efficiency of the encapsulated species for metallic walls. The scattering intensity of the inner tube is two orders of magnitude weaker than for the tube in air, which is nicely reproduced by local field calculations. Secondly, we find that the optical transition energies of the inner nanotubes shift to lower energies compared to a single-walled carbon nanotubes of the same chirality. The shift is higher if the outer tube is metallic than when it is semiconducting. The magnitude of the shift suggests that the excitons of small diameter inner metallic tubes are thermally dissociated at room temperate if the outer tube is also metallic and in essence we observe band-to-band transitions. △ Less

Submitted 1 April, 2024; originally announced April 2024.

Comments: main: 19 pages, 6 figures supporting: 8 pages, 3 figures

arXiv:2403.18353 [pdf, ps, other]

Early Stopping for Ensemble Kalman-Bucy Inversion

Authors: Maia Tienstra, Sebastian Reich

Abstract: Bayesian linear inverse problems aim to recover an unknown signal from noisy observations, incorporating prior knowledge. This paper analyses a data-dependent method to choose the scale parameter of a Gaussian prior. The method we study arises from early stopping methods, which have been successfully applied to a range of problems, such as statistical inverse problems, in the frequentist setting.… ▽ More Bayesian linear inverse problems aim to recover an unknown signal from noisy observations, incorporating prior knowledge. This paper analyses a data-dependent method to choose the scale parameter of a Gaussian prior. The method we study arises from early stopping methods, which have been successfully applied to a range of problems, such as statistical inverse problems, in the frequentist setting. These results are extended to the Bayesian setting. We study the use of a discrepancy-based stopping rule in the setting of random noise, which allows for adaptation. Our proposed stopping rule results in optimal rates for the reparameterized problem under certain conditions on the prior covariance operator. We furthermore derive for which class of signals this method is adaptive. It is also shown that the associated posterior contracts at the same rate as the MAP estimator and provides a conservative measure of uncertainty. We implement the proposed stopping rule using the continuous-time ensemble Kalman--Bucy filter (EnKBF). The fictitious time parameter replaces the scale parameter, and the ensemble size is appropriately adjusted in order not to lose the statistical optimality of the computed estimator. With this Monte Carlo algorithm, we extend our results numerically to a nonlinear problem. △ Less

Submitted 21 October, 2025; v1 submitted 27 March, 2024; originally announced March 2024.

arXiv:2401.04372 [pdf, ps, other]

Stable generative modeling using Schrödinger bridges

Authors: Georg A. Gottwald, Fengyi Li, Youssef Marzouk, Sebastian Reich

Abstract: We consider the problem of sampling from an unknown distribution for which only a sufficiently large number of training samples are available. Such settings have recently drawn considerable interest in the context of generative modelling and Bayesian inference. In this paper, we propose a generative model combining Schrödinger bridges and Langevin dynamics. Schrödinger bridges over an appropriate… ▽ More We consider the problem of sampling from an unknown distribution for which only a sufficiently large number of training samples are available. Such settings have recently drawn considerable interest in the context of generative modelling and Bayesian inference. In this paper, we propose a generative model combining Schrödinger bridges and Langevin dynamics. Schrödinger bridges over an appropriate reversible reference process are used to approximate the conditional transition probability from the available training samples, which is then implemented in a discrete-time reversible Langevin sampler to generate new samples. By setting the kernel bandwidth in the reference process to match the time step size used in the unadjusted Langevin algorithm, our method effectively circumvents any stability issues typically associated with the time-stepping of stiff stochastic differential equations. Moreover, we introduce a novel split-step scheme, ensuring that the generated samples remain within the convex hull of the training samples. Our framework can be naturally extended to generate conditional samples and to Bayesian inference problems. We demonstrate the performance of our proposed scheme through experiments on synthetic datasets with increasing dimensions and on a stochastic subgrid-scale parametrization conditional sampling problem as well as generating sample trajectories of a dynamical system using conditional sampling. △ Less

Submitted 23 October, 2024; v1 submitted 9 January, 2024; originally announced January 2024.

MSC Class: 60H10; 62F15; 62F30; 65C05; 65C40

arXiv:2312.15975 [pdf, other]

Filtered data based estimators for stochastic processes driven by colored noise

Authors: Grigorios A. Pavliotis, Sebastian Reich, Andrea Zanoni

Abstract: We consider the problem of estimating unknown parameters in stochastic differential equations driven by colored noise, which we model as a sequence of Gaussian stationary processes with decreasing correlation time. We aim to infer parameters in the limit equation, driven by white noise, given observations of the colored noise dynamics. We consider both the maximum likelihood and the stochastic gra… ▽ More We consider the problem of estimating unknown parameters in stochastic differential equations driven by colored noise, which we model as a sequence of Gaussian stationary processes with decreasing correlation time. We aim to infer parameters in the limit equation, driven by white noise, given observations of the colored noise dynamics. We consider both the maximum likelihood and the stochastic gradient descent in continuous time estimators, and we propose to modify them by including filtered data. We provide a convergence analysis for our estimators showing their asymptotic unbiasedness in a general setting and asymptotic normality under a simplified scenario. △ Less

Submitted 27 December, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

arXiv:2311.18060 [pdf, ps, other]

Levitin-Polyak well-posedness of split multivalued variational inequalities

Authors: Soumitra Dey, Simeon Reich

Abstract: We introduce and study the split multivalued variational inequality problem (SMVIP) and the parametric SMVIP. We examine, in particular, Levitin-Polyak well-posedness of SMVIPs and parametric SMVIPs in Hilbert spaces. We provide several examples to illustrate our theoretical results. We also discuss several important special cases. We introduce and study the split multivalued variational inequality problem (SMVIP) and the parametric SMVIP. We examine, in particular, Levitin-Polyak well-posedness of SMVIPs and parametric SMVIPs in Hilbert spaces. We provide several examples to illustrate our theoretical results. We also discuss several important special cases. △ Less

Submitted 29 November, 2023; originally announced November 2023.

Comments: arXiv admin note: text overlap with arXiv:2208.07126

MSC Class: 49K40; 49J40; 90C31; 47H10; 47J20

arXiv:2311.06906 [pdf, ps, other]

Particle-based algorithm for stochastic optimal control

Authors: Sebastian Reich

Abstract: The solution to a stochastic optimal control problem can be determined by computing the value function from a discretization of the associated Hamilton-Jacobi-Bellman equation. Alternatively, the problem can be reformulated in terms of a pair of forward-backward SDEs, which makes Monte-Carlo techniques applicable. More recently, the problem has also been viewed from the perspective of forward and… ▽ More The solution to a stochastic optimal control problem can be determined by computing the value function from a discretization of the associated Hamilton-Jacobi-Bellman equation. Alternatively, the problem can be reformulated in terms of a pair of forward-backward SDEs, which makes Monte-Carlo techniques applicable. More recently, the problem has also been viewed from the perspective of forward and reverse time SDEs and their associated Fokker-Planck equations. This approach is closely related to techniques used in diffusion-based generative models. Forward and reverse time formulations express the value function as the ratio of two probability density functions; one stemming from a forward McKean-Vlasov SDE and another one from a reverse McKean-Vlasov SDE. In this paper, we extend this approach to a more general class of stochastic optimal control problems and combine it with ensemble Kalman filter type and diffusion map approximation techniques in order to obtain efficient and robust particle-based algorithms. △ Less

Submitted 27 February, 2024; v1 submitted 12 November, 2023; originally announced November 2023.

MSC Class: 93E20; 49L12; 65C35; 65M75

arXiv:2311.03107 [pdf, other]

Longitudinal Polaritons in Crystals

Authors: Eduardo B. Barros, Stephanie Reich

Abstract: The collective excitations of solids are classified as longitudinal and transverse depending on their relative polarization and propagation direction. This seemingly formal classification results in surprisingly distinct types of excitations if calculated within the Coulomb gauge. Transverse modes couple to free-space photons and hybridize into polaritons for strong light-matter coupling. Longitud… ▽ More The collective excitations of solids are classified as longitudinal and transverse depending on their relative polarization and propagation direction. This seemingly formal classification results in surprisingly distinct types of excitations if calculated within the Coulomb gauge. Transverse modes couple to free-space photons and hybridize into polaritons for strong light-matter coupling. Longitudinal modes, in contrast, are seen as pure matter excitations that produce a dynamic polarization inside the material without photon coupling. Here we show that both longitudinal and transverse modes become polaritons in the explicitly covariant Lorenz gauge. Longitudinal excitations couple to longitudinal and scalar photons, which have been considered elusive so far. We show that the dipolar excitations become three-fold degenerate in the long-wavelength limit when including all photonic degrees of freedom, as expected from symmetry. Our findings demonstrate how choosing a gauge determines our thinking about materials excitations and how gauge fixing reveals new pathways for tailoring polaritons in crystals, metamaterials, and surfaces. Longitudinal polaritons will interact with longitudinal near fields located at surfaces, which provides additional excitation channels to engineer scanning near-field microscopy and surface-enhanced spectroscopy. △ Less

Submitted 15 April, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

arXiv:2310.10205 [pdf, other]

New iterative algorithms for solving split variational inclusions

Authors: Soumitra Dey, Chinedu Izuchukwu, Adeolu Taiwo, Simeon Reich

Abstract: In this paper we study a class of split variational inclusion (SVI) and regularized split variational inclusion (RSVI) problems in real Hilbert spaces. We discuss various analytical properties of the net generated by the RSVI and establish the existence and uniqueness of the solution to the RSVI. Using analytical properties of this net and under certain assumptions on the parameters and mappings a… ▽ More In this paper we study a class of split variational inclusion (SVI) and regularized split variational inclusion (RSVI) problems in real Hilbert spaces. We discuss various analytical properties of the net generated by the RSVI and establish the existence and uniqueness of the solution to the RSVI. Using analytical properties of this net and under certain assumptions on the parameters and mappings associated with the SVI, we establish the strong convergence of the sequence generated by our proposed iterative algorithm. We also deduce another iterative algorithm by taking the regularization parameters to be zero in our proposed algorithm. We establish the weak convergence of the sequence generated by our new algorithm under certain assumptions. Moreover, we discuss two special cases of the SVI, namely the split convex minimization and the split variational inequality problems, and give several numerical examples. △ Less

Submitted 16 October, 2023; originally announced October 2023.

MSC Class: 65Y05; 65K15; 47H05; 49J53; 47H10

arXiv:2310.06721 [pdf, other]

Tweedie Moment Projected Diffusions For Inverse Problems

Authors: Benjamin Boys, Mark Girolami, Jakiw Pidstrigach, Sebastian Reich, Alan Mosca, O. Deniz Akyildiz

Abstract: Diffusion generative models unlock new possibilities for inverse problems as they allow for the incorporation of strong empirical priors in scientific inference. Recently, diffusion models are repurposed for solving inverse problems using Gaussian approximations to conditional densities of the reverse process via Tweedie's formula to parameterise the mean, complemented with various heuristics. To… ▽ More Diffusion generative models unlock new possibilities for inverse problems as they allow for the incorporation of strong empirical priors in scientific inference. Recently, diffusion models are repurposed for solving inverse problems using Gaussian approximations to conditional densities of the reverse process via Tweedie's formula to parameterise the mean, complemented with various heuristics. To address various challenges arising from these approximations, we leverage higher order information using Tweedie's formula and obtain a statistically principled approximation. We further provide a theoretical guarantee specifically for posterior sampling which can lead to a better theoretical understanding of diffusion-based conditional sampling. Finally, we illustrate the empirical effectiveness of our approach for general linear inverse problems on toy synthetic examples as well as image restoration. We show that our method (i) removes any time-dependent step-size hyperparameters required by earlier methods, (ii) brings stability and better sample quality across multiple noise levels, (iii) is the only method that works in a stable way with variance exploding (VE) forward processes as opposed to earlier works. △ Less

Submitted 25 September, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

Comments: 12 pages, 2 figures, 2 tables when excluding abstract and bibliography; 45 pages, 17 figures, 13 tables when including abstract and bibliography

arXiv:2310.03597 [pdf, other]

Sampling via Gradient Flows in the Space of Probability Measures

Authors: Yifan Chen, Daniel Zhengyu Huang, Jiaoyang Huang, Sebastian Reich, Andrew M Stuart

Abstract: Sampling a target probability distribution with an unknown normalization constant is a fundamental challenge in computational science and engineering. Recent work shows that algorithms derived by considering gradient flows in the space of probability measures open up new avenues for algorithm development. This paper makes three contributions to this sampling approach by scrutinizing the design com… ▽ More Sampling a target probability distribution with an unknown normalization constant is a fundamental challenge in computational science and engineering. Recent work shows that algorithms derived by considering gradient flows in the space of probability measures open up new avenues for algorithm development. This paper makes three contributions to this sampling approach by scrutinizing the design components of such gradient flows. Any instantiation of a gradient flow for sampling needs an energy functional and a metric to determine the flow, as well as numerical approximations of the flow to derive algorithms. Our first contribution is to show that the Kullback-Leibler divergence, as an energy functional, has the unique property (among all f-divergences) that gradient flows resulting from it do not depend on the normalization constant of the target distribution. Our second contribution is to study the choice of metric from the perspective of invariance. The Fisher-Rao metric is known as the unique choice (up to scaling) that is diffeomorphism invariant. As a computationally tractable alternative, we introduce a relaxed, affine invariance property for the metrics and gradient flows. In particular, we construct various affine invariant Wasserstein and Stein gradient flows. Affine invariant gradient flows are shown to behave more favorably than their non-affine-invariant counterparts when sampling highly anisotropic distributions, in theory and by using particle methods. Our third contribution is to study, and develop efficient algorithms based on Gaussian approximations of the gradient flows; this leads to an alternative to particle methods. We establish connections between various Gaussian approximate gradient flows, discuss their relation to gradient methods arising from parametric variational inference, and study their convergence properties both theoretically and numerically. △ Less

Submitted 9 March, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

Comments: Related and text overlap with arXiv:2302.11024

arXiv:2309.09673 [pdf, other]

Strong Coupling of Two-Dimensional Excitons and Plasmonic Photonic Crystals: Microscopic Theory Reveals Triplet Spectra

Authors: Lara Greten, Robert Salzwedel, Tobias Göde, David Greten, Stephanie Reich, Stephen Hughes, Malte Selig, Andreas Knorr

Abstract: Monolayers of transition metal dichalcogenides (TMDC) are direct-gap semiconductors with strong light-matter interactions featuring tightly bound excitons, while plasmonic crystals (PCs), consisting of metal nanoparticles that act as meta-atoms, exhibit collective plasmon modes and allow one to tailor electric fields on the nanoscale. Recent experiments show that TMDC-PC hybrids can reach the stro… ▽ More Monolayers of transition metal dichalcogenides (TMDC) are direct-gap semiconductors with strong light-matter interactions featuring tightly bound excitons, while plasmonic crystals (PCs), consisting of metal nanoparticles that act as meta-atoms, exhibit collective plasmon modes and allow one to tailor electric fields on the nanoscale. Recent experiments show that TMDC-PC hybrids can reach the strong-coupling limit between excitons and plasmons forming new quasiparticles, so-called plexcitons. To describe this coupling theoretically, we develop a self-consistent Maxwell-Bloch theory for TMDC-PC hybrid structures, which allows us to compute the scattered light in the near- and far-field explicitly and provide guidance for experimental studies. Our calculations reveal a spectral splitting signature of strong coupling of more than $100\,$meV in gold-MoSe$_2$ structures with $30\,$nm nanoparticles, manifesting in a hybridization of exciton and plasmon into two effective plexcitonic bands. In addition to the hybridized states, we find a remaining excitonic mode with significantly smaller coupling to the plasmonic near-field, emitting directly into the far-field. Thus, hybrid spectra in the strong coupling regime can contain three emission peaks. △ Less

Submitted 18 September, 2023; originally announced September 2023.

Comments: 16 pages, 6 figures

arXiv:2309.04742 [pdf, other]

Affine Invariant Ensemble Transform Methods to Improve Predictive Uncertainty in Neural Networks

Authors: Diksha Bhandari, Jakiw Pidstrigach, Sebastian Reich

Abstract: We consider the problem of performing Bayesian inference for logistic regression using appropriate extensions of the ensemble Kalman filter. Two interacting particle systems are proposed that sample from an approximate posterior and prove quantitative convergence rates of these interacting particle systems to their mean-field limit as the number of particles tends to infinity. Furthermore, we appl… ▽ More We consider the problem of performing Bayesian inference for logistic regression using appropriate extensions of the ensemble Kalman filter. Two interacting particle systems are proposed that sample from an approximate posterior and prove quantitative convergence rates of these interacting particle systems to their mean-field limit as the number of particles tends to infinity. Furthermore, we apply these techniques and examine their effectiveness as methods of Bayesian approximation for quantifying predictive uncertainty in neural networks. △ Less

Submitted 1 July, 2024; v1 submitted 9 September, 2023; originally announced September 2023.

arXiv:2308.16784 [pdf, other]

doi 10.1137/23M159860X

Dropout Ensemble Kalman inversion for high dimensional inverse problems

Authors: Shuigen Liu, Sebastian Reich, Xin T. Tong

Abstract: Ensemble Kalman inversion (EKI) is an ensemble-based method to solve inverse problems. Its gradient-free formulation makes it an attractive tool for problems with involved formulation. However, EKI suffers from the ''subspace property'', i.e., the EKI solutions are confined in the subspace spanned by the initial ensemble. It implies that the ensemble size should be larger than the problem dimensio… ▽ More Ensemble Kalman inversion (EKI) is an ensemble-based method to solve inverse problems. Its gradient-free formulation makes it an attractive tool for problems with involved formulation. However, EKI suffers from the ''subspace property'', i.e., the EKI solutions are confined in the subspace spanned by the initial ensemble. It implies that the ensemble size should be larger than the problem dimension to ensure EKI's convergence to the correct solution. Such scaling of ensemble size is impractical and prevents the use of EKI in high dimensional problems. To address this issue, we propose a novel approach using dropout regularization to mitigate the subspace problem. We prove that dropout-EKI converges in the small ensemble settings, and the computational cost of the algorithm scales linearly with dimension. We also show that dropout-EKI reaches the optimal query complexity, up to a constant factor. Numerical examples demonstrate the effectiveness of our approach. △ Less

Submitted 30 September, 2024; v1 submitted 31 August, 2023; originally announced August 2023.

MSC Class: 65K10; 90C56; 65M32

arXiv:2306.12219 [pdf, ps, other]

Comparing the Methods of Alternating and Simultaneous Projections for Two Subspaces

Authors: Simeon Reich, Rafał Zalas

Abstract: We study the well-known methods of alternating and simultaneous projections when applied to two nonorthogonal linear subspaces of a real Euclidean space. Assuming that both of the methods have a common starting point chosen from either one of the subspaces, we show that the method of alternating projections converges significantly faster than the method of simultaneous projections. On the other ha… ▽ More We study the well-known methods of alternating and simultaneous projections when applied to two nonorthogonal linear subspaces of a real Euclidean space. Assuming that both of the methods have a common starting point chosen from either one of the subspaces, we show that the method of alternating projections converges significantly faster than the method of simultaneous projections. On the other hand, we provide examples of subspaces and starting points, where the method of simultaneous projections outperforms the method of alternating projections. △ Less

Submitted 14 November, 2023; v1 submitted 21 June, 2023; originally announced June 2023.

arXiv:2306.10630 [pdf, other]

Collective States in Molecular Monolayers on 2D Materials

Authors: Sabrina Juergensen, Moritz Kessens, Charlotte Berrezueta-Palacios, Nikolai Severin, Sumaya Ifland, Jürgen P. Rabe, Niclas S. Mueller, Stephanie Reich

Abstract: Collective excited states form in organic two-dimensional layers through the Coulomb coupling of the molecular transition dipole moments. They manifest as characteristic strong and narrow peaks in the excitation and emission spectra that are shifted to lower energies compared to the monomer transition. We study experimentally and theoretically how robust the collective states are against homogeneo… ▽ More Collective excited states form in organic two-dimensional layers through the Coulomb coupling of the molecular transition dipole moments. They manifest as characteristic strong and narrow peaks in the excitation and emission spectra that are shifted to lower energies compared to the monomer transition. We study experimentally and theoretically how robust the collective states are against homogeneous and inhomogeneous broadening as well as spatial disorder that occur in real molecular monolayers. Using a microscopic model for a two-dimensional dipole lattice in real space we calculate the properties of collective states and their extinction spectra. We find that the collective states persist even for 1-10% random variation in the molecular position and in the transition frequency, with similar peak position and integrated intensity as for the perfectly ordered system. We measure the optical response of a monolayer of the perylene-derivative MePTCDI on two-dimensional materials. On the wide band-gap insulator hexagonal boron nitride it shows strong emission from the collective state with a line width that is dominated by the inhomogeneous broadening of the molecular state. When using the semimetal graphene as a substrate, however, the luminescence is completely quenched. By combining optical absorption, luminescence, and multi-wavelength Raman scattering we verify that the MePTCDI molecules form very similar collective monolayer states on hexagonal boron nitride and graphene substrates, but on graphene the line width is dominated by non-radiative excitation transfer from the molecules to the substrate. Our study highlights the transition from the localized molecular state of the monomer to a delocalized collective state in the two-dimensional molecular lattice that is entirely based on Coulomb coupling between optically active excitations of the electrons or molecular vibrations. △ Less

Submitted 14 August, 2023; v1 submitted 18 June, 2023; originally announced June 2023.

arXiv:2305.18932 [pdf, other]

doi 10.1145/3539618.3591888

The Information Retrieval Experiment Platform

Authors: Maik Fröbe, Jan Heinrich Reimer, Sean MacAvaney, Niklas Deckers, Simon Reich, Janek Bevendorff, Benno Stein, Matthias Hagen, Martin Potthast

Abstract: We integrate ir_datasets, ir_measures, and PyTerrier with TIRA in the Information Retrieval Experiment Platform (TIREx) to promote more standardized, reproducible, scalable, and even blinded retrieval experiments. Standardization is achieved when a retrieval approach implements PyTerrier's interfaces and the input and output of an experiment are compatible with ir_datasets and ir_measures. However… ▽ More We integrate ir_datasets, ir_measures, and PyTerrier with TIRA in the Information Retrieval Experiment Platform (TIREx) to promote more standardized, reproducible, scalable, and even blinded retrieval experiments. Standardization is achieved when a retrieval approach implements PyTerrier's interfaces and the input and output of an experiment are compatible with ir_datasets and ir_measures. However, none of this is a must for reproducibility and scalability, as TIRA can run any dockerized software locally or remotely in a cloud-native execution environment. Version control and caching ensure efficient (re)execution. TIRA allows for blind evaluation when an experiment runs on a remote server or cloud not under the control of the experimenter. The test data and ground truth are then hidden from public access, and the retrieval software has to process them in a sandbox that prevents data leaks. We currently host an instance of TIREx with 15 corpora (1.9 billion documents) on which 32 shared retrieval tasks are based. Using Docker images of 50 standard retrieval approaches, we automatically evaluated all approaches on all tasks (50 $\cdot$ 32 = 1,600~runs) in less than a week on a midsize cluster (1,620 CPU cores and 24 GPUs). This instance of TIREx is open for submissions and will be integrated with the IR Anthology, as well as released open source. △ Less

Submitted 30 May, 2023; originally announced May 2023.

Comments: 11 pages. To be published in the proceedings of SIGIR 2023

arXiv:2304.12727 [pdf, ps, other]

On forward-backward SDE approaches to continuous-time minimum variance estimation

Authors: Jin Won Kim, Sebastian Reich

Abstract: The work of Kalman and Bucy has established a duality between filtering and optimal estimation in the context of time-continuous linear systems. This duality has recently been extended to time-continuous nonlinear systems in terms of an optimization problem constrained by a backward stochastic partial differential equation. Here we revisit this problem from the perspective of appropriate forward-b… ▽ More The work of Kalman and Bucy has established a duality between filtering and optimal estimation in the context of time-continuous linear systems. This duality has recently been extended to time-continuous nonlinear systems in terms of an optimization problem constrained by a backward stochastic partial differential equation. Here we revisit this problem from the perspective of appropriate forward-backward stochastic differential equations. This approach sheds new light on the estimation problem and provides a unifying perspective. It is also demonstrated that certain formulations of the estimation problem lead to deterministic formulations similar to the linear Gaussian case as originally investigated by Kalman and Bucy. Finally, optimal control of partially observed diffusion processes is discussed as an application of the proposed estimators. △ Less

Submitted 14 August, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

MSC Class: 90E10; 90E11; 60G35; 62M20; 93E11; 93E20

arXiv:2303.16494 [pdf, other]

doi 10.1137/23M1561142

EnKSGD: A Class Of Preconditioned Black Box Optimization And Inversion Algorithms

Authors: Brian Irwin, Sebastian Reich

Abstract: In this paper, we introduce the Ensemble Kalman-Stein Gradient Descent (EnKSGD) class of algorithms. The EnKSGD class of algorithms builds on the ensemble Kalman filter (EnKF) line of work, applying techniques from sequential data assimilation to unconstrained optimization and parameter estimation problems. The essential idea is to exploit the EnKF as a black box (i.e. derivative-free, zeroth orde… ▽ More In this paper, we introduce the Ensemble Kalman-Stein Gradient Descent (EnKSGD) class of algorithms. The EnKSGD class of algorithms builds on the ensemble Kalman filter (EnKF) line of work, applying techniques from sequential data assimilation to unconstrained optimization and parameter estimation problems. The essential idea is to exploit the EnKF as a black box (i.e. derivative-free, zeroth order) optimization tool if iterated to convergence. In this paper, we return to the foundations of the EnKF as a sequential data assimilation technique, including its continuous-time and mean-field limits, with the goal of developing faster optimization algorithms suited to noisy black box optimization and inverse problems. The resulting EnKSGD class of algorithms can be designed to both maintain the desirable property of affine-invariance, and employ the well-known backtracking line search. Furthermore, EnKSGD algorithms are designed to not necessitate the subspace restriction property and variance collapse property of previous iterated EnKF approaches to optimization, as both these properties can be undesirable in an optimization context. EnKSGD also generalizes beyond the $L^{2}$ loss, and is thus applicable to a wider class of problems than the standard EnKF. Numerical experiments with both linear and nonlinear least squares problems, as well as maximum likelihood estimation, demonstrate the faster convergence of EnKSGD relative to alternative EnKF approaches to optimization. △ Less

Submitted 29 March, 2023; originally announced March 2023.

Comments: 20 pages, 3 figures

MSC Class: 65K10; 90C56; 65C35; 65C05; 62F10

arXiv:2303.11941 [pdf, other]

Bayesian Dynamical Modeling of Fixational Eye Movements

Authors: Lisa Schwetlick, Sebastian Reich, Ralf Engbert

Abstract: Humans constantly move their eyes, even during visual fixations, where miniature (or fixational) eye movements occur involuntarily. Fixational eye movements comprise slow components (physiological drift and tremor) and fast components (microsaccades). The complex dynamics of physiological drift can be modeled qualitatively as a statistically self-avoiding random walk (SAW model, Engbert, Mergentha… ▽ More Humans constantly move their eyes, even during visual fixations, where miniature (or fixational) eye movements occur involuntarily. Fixational eye movements comprise slow components (physiological drift and tremor) and fast components (microsaccades). The complex dynamics of physiological drift can be modeled qualitatively as a statistically self-avoiding random walk (SAW model, Engbert, Mergenthaler, Sinn, & Pikovsky, 2011). In this study, we implement a data assimilation approach for the SAW model to explain statistics of fixational eye movements and microsaccades in experimental data obtained from high-resolution eye-tracking. We discuss and analyze the likelihood function for the SAW model, which allows us to apply Bayesian parameter estimation at the level of individual human observers. Based on model fitting, we find a relationship between the activation predicted by the SAW model and the occurrence of microsaccades. The model's latent activation relative to microsaccade onsets and offsets using experimental data lends support to the existence of a triggering mechanism for microsaccades. Our findings suggest that the SAW model can capture individual differences and serve as a tool for exploring the relationship between physiological drift and microsaccades as the two most essential components of fixational eye movements. Our results contribute to understanding individual variability in microsaccade behaviors and the role of fixational eye movements in visual information processing. △ Less

Submitted 25 May, 2025; v1 submitted 21 March, 2023; originally announced March 2023.

arXiv:2302.11024 [pdf, other]

Gradient Flows for Sampling: Mean-Field Models, Gaussian Approximations and Affine Invariance

Authors: Yifan Chen, Daniel Zhengyu Huang, Jiaoyang Huang, Sebastian Reich, Andrew M. Stuart

Abstract: Sampling a probability distribution with an unknown normalization constant is a fundamental problem in computational science and engineering. This task may be cast as an optimization problem over all probability measures, and an initial distribution can be evolved to the desired minimizer dynamically via gradient flows. Mean-field models, whose law is governed by the gradient flow in the space of… ▽ More Sampling a probability distribution with an unknown normalization constant is a fundamental problem in computational science and engineering. This task may be cast as an optimization problem over all probability measures, and an initial distribution can be evolved to the desired minimizer dynamically via gradient flows. Mean-field models, whose law is governed by the gradient flow in the space of probability measures, may also be identified; particle approximations of these mean-field models form the basis of algorithms. The gradient flow approach is also the basis of algorithms for variational inference, in which the optimization is performed over a parameterized family of probability distributions such as Gaussians, and the underlying gradient flow is restricted to the parameterized family. By choosing different energy functionals and metrics for the gradient flow, different algorithms with different convergence properties arise. In this paper, we concentrate on the Kullback-Leibler divergence after showing that, up to scaling, it has the unique property that the gradient flows resulting from this choice of energy do not depend on the normalization constant. For the metrics, we focus on variants of the Fisher-Rao, Wasserstein, and Stein metrics; we introduce the affine invariance property for gradient flows, and their corresponding mean-field models, determine whether a given metric leads to affine invariance, and modify it to make it affine invariant if it does not. We study the resulting gradient flows in both probability density space and Gaussian space. The flow in the Gaussian space may be understood as a Gaussian approximation of the flow. We demonstrate that the Gaussian approximation based on the metric and through moment closure coincide, establish connections between them, and study their long-time convergence properties showing the advantages of affine invariance. △ Less

Submitted 10 September, 2024; v1 submitted 21 February, 2023; originally announced February 2023.

Comments: 82 pages, 8 figures (Welcome any feedback!)

arXiv:2302.10130 [pdf, ps, other]

Infinite-Dimensional Diffusion Models

Authors: Jakiw Pidstrigach, Youssef Marzouk, Sebastian Reich, Sven Wang

Abstract: Diffusion models have had a profound impact on many application areas, including those where data are intrinsically infinite-dimensional, such as images or time series. The standard approach is first to discretize and then to apply diffusion models to the discretized data. While such approaches are practically appealing, the performance of the resulting algorithms typically deteriorates as discret… ▽ More Diffusion models have had a profound impact on many application areas, including those where data are intrinsically infinite-dimensional, such as images or time series. The standard approach is first to discretize and then to apply diffusion models to the discretized data. While such approaches are practically appealing, the performance of the resulting algorithms typically deteriorates as discretization parameters are refined. In this paper, we instead directly formulate diffusion-based generative models in infinite dimensions and apply them to the generative modelling of functions. We prove that our formulations are well posed in the infinite-dimensional setting and provide dimension-independent distance bounds from the sample to the target measure. Using our theory, we also develop guidelines for the design of infinite-dimensional diffusion models. For image distributions, these guidelines are in line with current canonical choices. For other distributions, however, we can improve upon these canonical choices. We demonstrate these results both theoretically and empirically, by applying the algorithms to data distributions on manifolds and to distributions arising in Bayesian inverse problems or simulation-based inference. △ Less

Submitted 6 June, 2025; v1 submitted 20 February, 2023; originally announced February 2023.

MSC Class: 68T99; 60Hxx

arXiv:2301.12211 [pdf, other]

Nanomechanical absorption spectroscopy of 2D materials with femtowatt sensitivity

Authors: Jan N. Kirchhof, Yuefeng Yu, Denis Yagodkin, Nele Stetzuhn, Daniel B. de Araújo, Kostas Kanellopulos, Samuel Manas-Valero, Eugenio Coronado, Herre van der Zant, Stephanie Reich, Silvan Schmid, Kirill I. Bolotin

Abstract: Nanomechanical spectroscopy (NMS) is a recently developed approach to determine optical absorption spectra of nanoscale materials via mechanical measurements. It is based on measuring changes in the resonance frequency of a membrane resonator vs. the photon energy of incoming light. This method is a direct measurement of absorption, which has practical advantages compared to common optical spectro… ▽ More Nanomechanical spectroscopy (NMS) is a recently developed approach to determine optical absorption spectra of nanoscale materials via mechanical measurements. It is based on measuring changes in the resonance frequency of a membrane resonator vs. the photon energy of incoming light. This method is a direct measurement of absorption, which has practical advantages compared to common optical spectroscopy approaches. In the case of two-dimensional (2D) materials, NMS overcomes limitations inherent to conventional optical methods, such as the complications associated with measurements at high magnetic fields and low temperatures. In this work, we develop a protocol for NMS of 2D materials that yields two orders of magnitude improved sensitivity compared to previous approaches, while being simpler to use. To this end, we use electrical sample actuation, which simplifies the experiment and provides a reliable calibration for greater accuracy. Additionally, the use of low-stress silicon nitride membranes as our substrate reduces the noise-equivalent power to $NEP = 890 fW/\sqrt{Hz}$, comparable to commercial semiconductor photodetectors. We use our approach to spectroscopically characterize a two-dimensional transition metal dichalcogenide (WS$_2$), a layered magnetic semiconductor (CrPS$_4$), and a plasmonic supercrystal consisting of gold nanoparticles. △ Less

Submitted 28 January, 2023; originally announced January 2023.

arXiv:2212.06727 [pdf, other]

What do Vision Transformers Learn? A Visual Exploration

Authors: Amin Ghiasi, Hamid Kazemi, Eitan Borgnia, Steven Reich, Manli Shu, Micah Goldblum, Andrew Gordon Wilson, Tom Goldstein

Abstract: Vision transformers (ViTs) are quickly becoming the de-facto architecture for computer vision, yet we understand very little about why they work and what they learn. While existing studies visually analyze the mechanisms of convolutional neural networks, an analogous exploration of ViTs remains challenging. In this paper, we first address the obstacles to performing visualizations on ViTs. Assiste… ▽ More Vision transformers (ViTs) are quickly becoming the de-facto architecture for computer vision, yet we understand very little about why they work and what they learn. While existing studies visually analyze the mechanisms of convolutional neural networks, an analogous exploration of ViTs remains challenging. In this paper, we first address the obstacles to performing visualizations on ViTs. Assisted by these solutions, we observe that neurons in ViTs trained with language model supervision (e.g., CLIP) are activated by semantic concepts rather than visual features. We also explore the underlying differences between ViTs and CNNs, and we find that transformers detect image background features, just like their convolutional counterparts, but their predictions depend far less on high-frequency information. On the other hand, both architecture types behave similarly in the way features progress from abstract patterns in early layers to concrete objects in late layers. In addition, we show that ViTs maintain spatial information in all layers except the final layer. In contrast to previous works, we show that the last layer most likely discards the spatial information and behaves as a learned global pooling operation. Finally, we conduct large-scale visualizations on a wide range of ViT variants, including DeiT, CoaT, ConViT, PiT, Swin, and Twin, to validate the effectiveness of our method. △ Less

Submitted 13 December, 2022; originally announced December 2022.

arXiv:2211.04615 [pdf, other]

Observation of multi-directional energy transfer in a hybrid plasmonic-excitonic nanostructure

Authors: Tommaso Pincelli, Thomas Vasileiadis, Shuo Dong, Samuel Beaulieu, Maciej Dendzik, Daniela Zahn, Sang-Eun Lee, Hélène Seiler, Yinpeng Qi, R. Patrick Xian, Julian Maklar, Emerson Coy, Niclas S. Müller, Yu Okamura, Stephanie Reich, Martin Wolf, Laurenz Rettig, Ralph Ernstorfer

Abstract: Hybrid plasmonic devices involve a nanostructured metal supporting localized surface plasmons to amplify light-matter interaction, and a non-plasmonic material to functionalize charge excitations. Application-relevant epitaxial heterostructures, however, give rise to ballistic ultrafast dynamics that challenge the conventional semiclassical understanding of unidirectional nanometal-to-substrate en… ▽ More Hybrid plasmonic devices involve a nanostructured metal supporting localized surface plasmons to amplify light-matter interaction, and a non-plasmonic material to functionalize charge excitations. Application-relevant epitaxial heterostructures, however, give rise to ballistic ultrafast dynamics that challenge the conventional semiclassical understanding of unidirectional nanometal-to-substrate energy transfer. We study epitaxial Au nanoislands on WSe$_2$ with time- and angle-resolved photoemission spectroscopy and femtosecond electron diffraction: this combination of techniques resolves material, energy and momentum of charge-carriers and phonons excited in the heterostructure. We observe a strong non-linear plasmon-exciton interaction that transfers the energy of sub-bandgap photons very efficiently to the semiconductor, leaving the metal cold until non-radiative exciton recombination heats the nanoparticles on hundreds of femtoseconds timescales. Our results resolve a multi-directional energy exchange on timescales shorter than the electronic thermalization of the nanometal. Electron-phonon coupling and diffusive charge-transfer determine the subsequent energy flow. This complex dynamics opens perspectives for optoelectronic and photocatalytic applications, while providing a constraining experimental testbed for state-of-the-art modelling. △ Less

Submitted 29 November, 2022; v1 submitted 8 November, 2022; originally announced November 2022.

arXiv:2210.15091 [pdf, other]

Segmentation of Multiple Sclerosis Lesions across Hospitals: Learn Continually or Train from Scratch?

Authors: Enamundram Naga Karthik, Anne Kerbrat, Pierre Labauge, Tobias Granberg, Jason Talbott, Daniel S. Reich, Massimo Filippi, Rohit Bakshi, Virginie Callot, Sarath Chandar, Julien Cohen-Adad

Abstract: Segmentation of Multiple Sclerosis (MS) lesions is a challenging problem. Several deep-learning-based methods have been proposed in recent years. However, most methods tend to be static, that is, a single model trained on a large, specialized dataset, which does not generalize well. Instead, the model should learn across datasets arriving sequentially from different hospitals by building upon the… ▽ More Segmentation of Multiple Sclerosis (MS) lesions is a challenging problem. Several deep-learning-based methods have been proposed in recent years. However, most methods tend to be static, that is, a single model trained on a large, specialized dataset, which does not generalize well. Instead, the model should learn across datasets arriving sequentially from different hospitals by building upon the characteristics of lesions in a continual manner. In this regard, we explore experience replay, a well-known continual learning method, in the context of MS lesion segmentation across multi-contrast data from 8 different hospitals. Our experiments show that replay is able to achieve positive backward transfer and reduce catastrophic forgetting compared to sequential fine-tuning. Furthermore, replay outperforms the multi-domain training, thereby emerging as a promising solution for the segmentation of MS lesions. The code is available at this link: https://github.com/naga-karthik/continual-learning-ms △ Less

Submitted 26 October, 2022; originally announced October 2022.

Comments: Accepted at the Medical Imaging Meets NeurIPS (MedNeurIPS) Workshop 2022

Showing 1–50 of 204 results for author: Reich, S