
Foundation Models in Medical Imaging -- A Review and Outlook

Authors: Vivien van Veldhuizen, Vanessa Botha, Chunyao Lu, Melis Erdal Cesur, Kevin Groot Lipman, Edwin D. de Jong, Hugo Horlings, Clárisa I. Sanchez, Cees G. M. Snoek, Lodewyk Wessels, Ritse Mann, Eric Marcus, Jonas Teuwen

Abstract: Foundation models (FMs) are changing the way medical images are analyzed by learning from large collections of unlabeled data. Instead of relying on manually annotated examples, FMs are pre-trained to learn general-purpose visual features that can later be adapted to specific clinical tasks with little additional supervision. In this review, we examine how FMs are being developed and applied in pathology, radiology, and ophthalmology, drawing on evidence from over 150 studies. We explain the core components of FM pipelines, including model architectures, self-supervised learning methods, and strategies for downstream adaptation. We also review how FMs are being used in each imaging domain and compare design choices across applications. Finally, we discuss key challenges and open questions to guide future research.

Submitted 16 June, 2025; v1 submitted 10 June, 2025; originally announced June 2025.

arXiv:2408.07688 [pdf, other]

Finite Dimensional Projections of HJB Equations in the Wasserstein Space

Authors: Andrzej Święch, Lukas Wessels

Abstract: This paper continues the study of controlled interacting particle systems with common noise started in [W. Gangbo, S. Mayorga and A. Święch, SIAM J. Math. Anal. 53 (2021), no. 2, 1320--1356] and [S. Mayorga and A. Święch, SIAM J. Control Optim. 61 (2023), no. 2, 820--851]. First, we extend the following results of the previously mentioned works to the case of multiplicative noise: (i) We generalize the convergence of the value functions $u_n$ corresponding to control problems of $n$ particles to the value function $V$ corresponding to an appropriately defined infinite dimensional control problem; (ii) we prove, under certain additional assumptions, $C^{1,1}$ regularity of $V$ in the spatial variable. The second main contribution of the present work is the proof that if $DV$ is continuous (which, in particular, includes the previously proven case of $C^{1,1}$ regularity in the spatial variable), the value function $V$ projects precisely onto the value functions $u_n$. Using this projection property, we show that optimal controls of the finite dimensional problem correspond to optimal controls of the infinite dimensional problem and vice versa. In the case of a linear state equation, we are able to prove that $V$ projects precisely onto the value functions $u_n$ under relaxed assumptions on the coefficients of the cost functional by using approximation techniques in the Wasserstein space, thus covering cases where $V$ may not be differentiable.

Submitted 14 August, 2024; originally announced August 2024.

Comments: 35 pages

MSC Class: 28A33; 35D40; 35R15; 49L12; 49L25; 49N80; 93E20

arXiv:2310.03181 [pdf, other]

doi 10.1214/25-EJP1294

Stochastic optimal control in Hilbert spaces: $C^{1,1}$ regularity of the value function and optimal synthesis via viscosity solutions

Authors: Filippo de Feo, Andrzej Święch, Lukas Wessels

Abstract: We study optimal control problems governed by abstract infinite dimensional stochastic differential equations using the dynamic programming approach. In the first part, we prove Lipschitz continuity, semiconcavity and semiconvexity of the value function under several sets of assumptions, and thus derive its $C^{1,1}$ regularity in the space variable. Based on this regularity result, we construct optimal feedback controls using the notion of the $B$-continuous viscosity solutions for the associated Hamilton--Jacobi--Bellman equation. This is done in the case when the noise coefficient is independent of the control variable. We also discuss applications of our results to optimal control problems governed by stochastic reaction-diffusion equations and, under economic motivations, stochastic delay differential equations.

Submitted 14 February, 2025; v1 submitted 4 October, 2023; originally announced October 2023.

Comments: Accepted for publication in Electron. J. Probab

MSC Class: 93E20; 49L25; 49L12; 49K45; 60H15; 49L20; 49N35; 35R15; 35K57; 34K50

Journal ref: Electron. J. Probab. 30: 1-39, 2025

arXiv:2303.10038 [pdf, other]

doi 10.1080/07362994.2024.2434735

Semilinear Feynman-Kac Formulae for $B$-Continuous Viscosity Solutions

Authors: Lukas Wessels

Abstract: We prove the existence of a $B$-continuous viscosity solution for a class of infinite dimensional semilinear partial differential equations (PDEs) using probabilistic methods. Our approach also yields a stochastic representation formula for the solution in terms of a scalar-valued backward stochastic differential equation. The uniqueness is proved under additional assumptions using a comparison theorem for viscosity solutions. Our results constitute the first nonlinear Feynman-Kac formula using the notion of $B$-continuous viscosity solutions and thus introduce a framework allowing for generalizations to the case of fully nonlinear PDEs.

Submitted 21 November, 2024; v1 submitted 17 March, 2023; originally announced March 2023.

Comments: Accepted for publication in Stoch. Anal. Appl

Journal ref: Stoch. Anal. Appl. 43 (2025), 112-129

arXiv:2301.11926 [pdf, other]

doi 10.1063/5.0143939

Neural Network Approximation of Optimal Controls for Stochastic Reaction-Diffusion Equations

Authors: Wilhelm Stannat, Alexander Vogler, Lukas Wessels

Abstract: We present a numerical algorithm that allows the approximation of optimal controls for stochastic reaction-diffusion equations with additive noise by first reducing the problem to controls of feedback form and then approximating the feedback function using finitely based approximations. Using structural assumptions on the finitely based approximations, rates for the approximation error of the cost can be obtained. Our algorithm significantly reduces the computational complexity of finding controls with asymptotically optimal cost. Numerical experiments using artificial neural networks as well as radial basis function networks illustrate the performance of our algorithm. Our approach can also be applied to stochastic control problems for high dimensional stochastic differential equations and more general stochastic partial differential equations.

Submitted 13 September, 2023; v1 submitted 25 January, 2023; originally announced January 2023.

Comments: 12 figures

Journal ref: Chaos: An Interdisciplinary Journal of Nonlinear Science, Vol. 33, No. 9, 093118, 2023

arXiv:2212.12073 [pdf, ps, other]

doi 10.1109/TASC.2023.3256343

A tabletop x-ray tomography instrument for nanometer-scale imaging: demonstration of the 1,000-element transition-edge sensor subarray

Authors: Paul Szypryt, Nathan Nakamura, Daniel T. Becker, Douglas A. Bennett, Amber L. Dagel, W. Bertrand Doriese, Joseph W. Fowler, Johnathon D. Gard, J. Zachariah Harris, Gene C. Hilton, Jozsef Imrek, Edward S. Jimenez, Kurt W. Larson, Zachary H. Levine, John A. B. Mates, D. McArthur, Luis Miaja-Avila, Kelsey M. Morgan, Galen C. O'Neil, Nathan J. Ortiz, Christine G. Pappas, Daniel R. Schmidt, Kyle R. Thompson, Joel N. Ullom, Leila Vale, et al. (6 additional authors not shown)

Abstract: We report on the 1,000-element transition-edge sensor (TES) x-ray spectrometer implementation of the TOMographic Circuit Analysis Tool (TOMCAT). TOMCAT combines a high spatial resolution scanning electron microscope (SEM) with a highly efficient and pixelated TES spectrometer to reconstruct three-dimensional maps of nanoscale integrated circuits (ICs). A 240-pixel prototype spectrometer was recently used to reconstruct ICs at the 130 nm technology node, but to increase imaging speed to more practical levels, the detector efficiency needs to be improved. For this reason, we are building a spectrometer that will eventually contain 3,000 TES microcalorimeters read out with microwave superconducting quantum interference device (SQUID) multiplexing, and we currently have commissioned a 1,000 TES subarray. This still represents a significant improvement from the 240-pixel system and allows us to begin characterizing the full spectrometer performance. Of the 992 maximum available readout channels, we have yielded 818 devices, representing the largest number of TES x-ray microcalorimeters simultaneously read out to date. These microcalorimeters have been optimized for pulse speed rather than purely energy resolution, and we measure a FWHM energy resolution of 14 eV at the 8.0 keV Cu K$α$ line.

Submitted 22 December, 2022; originally announced December 2022.

Comments: 5 pages, 4 figures, submitted to IEEE Transactions on Applied Superconductivity

Journal ref: IEEE Transactions on Applied Superconductivity, vol. 33, no. 5, pp. 1-5, Aug. 2023, Art no. 2100705

arXiv:2212.07460 [pdf]

doi 10.1109/TASC.2021.3052723

Design of a 3000-pixel transition-edge sensor x-ray spectrometer for microcircuit tomography

Authors: Paul Szypryt, Douglas A. Bennett, William J. Boone, Amber L. Dagel, Gabriella Dalton, W. Bertrand Doriese, Joseph W. Fowler, Edward J. Garboczi, Johnathon D. Gard, Gene C. Hilton, Jozsef Imrek, Edward S. Jimenez, Vincent Y. Kotsubo, Kurt Larson, Zachary H. Levine, John A. B. Mates, Daniel McArthur, Kelsey M. Morgan, Nathan Nakamura, Galen C. O'Neil, Nathan J. Ortiz, Christine G. Pappas, Carl D. Reintsema, Daniel R. Schmidt, Daniel S. Swetz, et al. (6 additional authors not shown)

Abstract: Feature sizes in integrated circuits have decreased substantially over time, and it has become increasingly difficult to three-dimensionally image these complex circuits after fabrication. This can be important for process development, defect analysis, and detection of unexpected structures in externally sourced chips, among other applications. Here, we report on a non-destructive, tabletop approach that addresses this imaging problem through x-ray tomography, which we uniquely realize with an instrument that combines a scanning electron microscope (SEM) with a transition-edge sensor (TES) x-ray spectrometer. Our approach uses the highly focused SEM electron beam to generate a small x-ray generation region in a carefully designed target layer that is placed over the sample being tested. With the high collection efficiency and resolving power of a TES spectrometer, we can isolate x-rays generated in the target from background and trace their paths through regions of interest in the sample layers, providing information about the various materials along the x-ray paths through their attenuation functions. We have recently demonstrated our approach using a 240 Mo/Cu bilayer TES prototype instrument on a simplified test sample containing features with sizes of $\sim$1 $μ$m. Currently, we are designing and building a 3000 Mo/Au bilayer TES spectrometer upgrade, which is expected to improve the imaging speed by a factor of up to 60 through a combination of increased detector number and detector speed.

Submitted 14 December, 2022; originally announced December 2022.

Comments: 5 pages, 3 figures, published in IEEE Transactions on Applied Superconductivity

Journal ref: IEEE Transactions on Applied Superconductivity, vol. 31, no. 5, pp. 1-5, Aug. 2021, Art no. 2100405

arXiv:2207.10845 [pdf]

doi 10.1007/s10909-022-02860-3

Absolute Energy Measurements with Superconducting Transition-Edge Sensors for Muonic X-ray Spectroscopy at 44 keV

Authors: Daikang Yan, Joel C. Weber, Tejas Guruswamy, Kelsey M. Morgan, Galen C. O'Neil, Abigail L. Wessels, Douglas A. Bennett, Christine G. Pappas, John A. Mates, Johnathon D. Gard, Daniel T. Becker, Joseph W. Fowler, Daniel S. Swetz, Daniel R. Schmidt, Joel N. Ullom, Takuma Okumura, Tadaaki Isobe, Toshiyuki Azuma, Shinji Okada, Shinya Yamada, Tadashi Hashimoto, Orlando Quaranta, Antonino Miceli, Lisa M. Gades, Umeshkumar M. Patel, et al. (3 additional authors not shown)

Abstract: Superconducting transition-edge sensor (TES) microcalorimeters have great utility in x-ray applications owing to their high energy resolution, good collecting efficiency and the feasibility of being multiplexed into large arrays. In this work, we develop hard x-ray TESs to measure the absolute energies of muonic-argon ($μ$-Ar) transition lines around 44 keV and 20 keV. TESs with sidecar absorbers of different heat capacities were fabricated and characterized for their energy resolution and calibration uncertainty. We achieved ~ 1 eV absolute energy measurement accuracy at 44 keV, and < 12 eV energy resolution at 17.5 keV.

Submitted 21 July, 2022; originally announced July 2022.

arXiv:2205.07640 [pdf, other]

ecpc: An R-package for generic co-data models for high-dimensional prediction

Authors: Mirrelijn M. van Nee, Lodewyk F. A. Wessels, Mark A. van de Wiel

Abstract: High-dimensional prediction considers data with more variables than samples. Generic research goals are to find the best predictor or to select variables. Results may be improved by exploiting prior information in the form of co-data, providing complementary data not on the samples, but on the variables. We consider adaptive ridge penalised generalised linear and Cox models, in which the variable specific ridge penalties are adapted to the co-data to give a priori more weight to more important variables. The R-package ecpc originally accommodated various and possibly multiple co-data sources, including categorical co-data, i.e. groups of variables, and continuous co-data. Continuous co-data, however, was handled by adaptive discretisation, potentially inefficiently modelling and losing information. Here, we present an extension to the method and software for generic co-data models, particularly for continuous co-data. At the basis lies a classical linear regression model, regressing prior variance weights on the co-data. Co-data variables are then estimated with empirical Bayes moment estimation. After placing the estimation procedure in the classical regression framework, extension to generalised additive and shape constrained co-data models is straightforward. Besides, we show how ridge penalties may be transformed to elastic net penalties with the R-package squeezy. In simulation studies we first compare various co-data models for continuous co-data from the extension to the original method. Secondly, we compare variable selection performance to other variable selection methods. Moreover, we demonstrate use of the package in several examples throughout the paper.

Submitted 16 May, 2022; originally announced May 2022.

arXiv:2202.02933 [pdf]

Quantification of 242Pu with a Microcalorimeter Gamma Spectrometer

Authors: David J. Mercer, Ryan Winkler, Katrina E. Koehler, Daniel T. Becker, Douglas A. Bennett, Matthew H. Carpenter, Mark P. Croce, Krystal I. de Castro, Eric A. Feissle, Joseph W. Fowler, Johnathon D. Gard, John A. B. Mates, Daniel G. McNeel, Nathan J. Ortiz, Daniel Schmidt, Katherine A. Schreiber, Daniel S. Swetz, Joel N. Ullom, Leila R. Vale, Sophie L. Weidenbenner, Abigail L. Wessels

Abstract: We report measurements of the 103-keV and 159-keV gamma ray signatures of 242Pu using microcalorimetry. This is the first observation of these gamma rays in a non-destructive measurement of an unprepared sample, and so represents an important advance in nuclear material accountancy. The measurement campaign also serves as the first demonstration of a field campaign with a portable microcalorimeter gamma-ray spectrometer. For the 103-keV gamma ray we report an improved centroid energy and emission probability.

Submitted 8 July, 2022; v1 submitted 6 February, 2022; originally announced February 2022.

Comments: 6 pages, 7 figures

Report number: LA-UR-22-20077

arXiv:2112.09639 [pdf, other]

doi 10.1214/23-AAP2038

Necessary and Sufficient Conditions for Optimal Control of Semilinear Stochastic Partial Differential Equations

Authors: Wilhelm Stannat, Lukas Wessels

Abstract: Using a recently introduced representation of the second order adjoint state as the solution of a function-valued backward stochastic partial differential equation (SPDE), we calculate the viscosity super- and subdifferential of the value function evaluated along an optimal trajectory for controlled semilinear SPDEs. This establishes the well-known connection between Pontryagin's maximum principle and dynamic programming within the framework of viscosity solutions. As a corollary, we derive that the correction term in the stochastic Hamiltonian arising in non-smooth stochastic control problems is non-positive. These results directly lead us to a stochastic verification theorem for fully nonlinear Hamilton--Jacobi--Bellman equations in the framework of viscosity solutions.

Submitted 8 December, 2023; v1 submitted 17 December, 2021; originally announced December 2021.

Comments: To appear at Ann. Appl. Probab

Journal ref: Ann. Appl. Probab. 34 (3) 3251 - 3287, June 2024

arXiv:2105.05194 [pdf, other]

doi 10.1137/20M1368057

Peng's Maximum Principle for Stochastic Partial Differential Equations

Authors: Wilhelm Stannat, Lukas Wessels

Abstract: We extend Peng's maximum principle for semilinear stochastic partial differential equations (SPDEs) in one space-dimension with non-convex control domains and control-dependent diffusion coefficients to the case of general cost functionals with Nemytskii-type coefficients. Our analysis is based on a new approach to the characterization of the second order adjoint state as the solution of a function-valued backward SPDE.

Submitted 10 August, 2021; v1 submitted 11 May, 2021; originally announced May 2021.

Comments: 19 pages; accepted for publication in SIAM Journal on Control and Optimization

MSC Class: 93E20; 49K45; 60H15

Journal ref: SIAM J. Control Optim., 59 (2021), 3552-3573

arXiv:2103.15893 [pdf, other]

New Experimentally Observable Gamma-ray Emissions from 241Am Nuclear Decay

Authors: Katrina E. Koehler, Michael D. Yoho, Matthew H. Carpenter, Mark P. Croce, David J. Mercer, Chandler M. Smith, Aidan D. Tollefson, Duc T. Vo, Michael A. Famiano, Caroline D. Nesaraja, Daniel T. Becker, Johnathon D. Gard, Abigail L. Wessels, Douglas A. Bennett, J. A. B. Mates, Nathan J. Ortiz, Daniel R. Schmidt, Joel N. Ullom, Leila R. Vale

Abstract: With the high resolution of microcalorimeter detectors, previously unresolvable gamma-ray lines are now clearly resolvable. A careful measurement of Am-241 decay with a large array of gamma-ray microcalorimeters has revealed never before seen or predicted gamma lines at 207.72 +/- 0.02 keV and 208.21 +/- 0.01 keV. These results were made possible by new microwave-multiplexing readout to increase the array size and improved analysis algorithms to eliminate spectral artifacts. We suggest nuclear levels from which these gamma-rays might originate and calculate branching ratios for these transitions from measurements of both mixed Pu-Am standards and a pure Am-241 source. These results have implications for nuclear material safeguards and accounting, particularly for microcalorimeter gamma spectrometers, which are now being adopted in nuclear safeguards analytical laboratories.

Submitted 19 August, 2024; v1 submitted 29 March, 2021; originally announced March 2021.

Comments: 7 pages, 4 figures, 2 tables

Report number: LA-UR-24-28099

arXiv:2005.10304 [pdf, other]

doi 10.1016/j.nima.2020.164307

Improved Plutonium and Americium Photon Branching Ratios from Microcalorimeter Gamma Spectroscopy

Authors: Michael D. Yoho, Katrina E. Koehler, Daniel T. Becker, Douglas A. Bennett, Matthew H. Carpenter, Mark P. Croce, Johnathon D. Gard, J. A. Ben Mates, David J. Mercer, Nathan J. Ortiz, Daniel R. Schmidt, Chandler M. Smith, Daniel S. Swetz, Aidan D. Tollefson, Joel N. Ullom, Leila R. Vale, Abigail L. Wessels, Duc T. Vo

Abstract: Photon branching ratios are critical input data for activities such as nuclear materials protection and accounting because they allow material compositions to be extracted from measurements of gamma-ray intensities. Uncertainties in these branching ratios are often a limiting source of uncertainty in composition determination. Here, we use high statistics, high resolution (~60-70 eV full-width-at-half-maximum at 100 keV) gamma-ray spectra acquired using microcalorimeter sensors to substantially reduce the uncertainties for 11 plutonium (238Pu, 239Pu, 241Pu) and 241Am branching ratios important for material control and accountability and nuclear forensics in the energy range of 125 keV to 208 keV. We show a reduction in uncertainty of over a factor of three for one branching ratio and a factor of 2-3 for four branching ratios.

Submitted 22 June, 2020; v1 submitted 20 May, 2020; originally announced May 2020.

Comments: 25 pages, 6 figures, 7 tables

Report number: LA-UR-19-31885

arXiv:2005.04010 [pdf, other]

Flexible co-data learning for high-dimensional prediction

Authors: Mirrelijn M. van Nee, Lodewyk F. A. Wessels, Mark A. van de Wiel

Abstract: Clinical research often focuses on complex traits in which many variables play a role in mechanisms driving, or curing, diseases. Clinical prediction is hard when data is high-dimensional, but additional information, like domain knowledge and previously published studies, may be helpful to improve predictions. Such complementary data, or co-data, provide information on the covariates, such as genomic location or p-values from external studies. Our method enables exploiting multiple and various co-data sources to improve predictions. We use discrete or continuous co-data to define possibly overlapping or hierarchically structured groups of covariates. These are then used to estimate adaptive multi-group ridge penalties for generalised linear and Cox models. We combine empirical Bayes estimation of group penalty hyperparameters with an extra level of shrinkage. This renders a uniquely flexible framework as any type of shrinkage can be used on the group level. The hyperparameter shrinkage learns how relevant a specific co-data source is, counters overfitting of hyperparameters for many groups, and accounts for structured co-data. We describe various types of co-data and propose suitable forms of hypershrinkage. The method is very versatile, as it allows for integration and weighting of multiple co-data sets, inclusion of unpenalised covariates and posterior variable selection. We demonstrate it on two cancer genomics applications and show that it may improve the performance of other dense and parsimonious prognostic models substantially, and stabilises variable selection.

Submitted 8 May, 2020; originally announced May 2020.

Comments: Document consists of main content (20 pages, 10 figures) and supplementary material (14 pages, 13 figures)

arXiv:1905.09074 [pdf, other]

doi 10.3934/eect.2020087

Deterministic Control of Stochastic Reaction-Diffusion Equations

Authors: Wilhelm Stannat, Lukas Wessels

Abstract: We consider the control of semilinear stochastic partial differential equations (SPDEs) via deterministic controls. In the case of multiplicative noise, existence of optimal controls and necessary conditions for optimality are derived. In the case of additive noise, we obtain a representation for the gradient of the cost functional via adjoint calculus. The restriction to deterministic controls and additive noise avoids the necessity of introducing a backward SPDE. Based on this novel representation, we present a probabilistic nonlinear conjugate gradient descent method to approximate the optimal control, and apply our results to the stochastic Schlögl model. We also present some analysis in the case where the optimal control for the stochastic system differs from the optimal control for the deterministic system.

Submitted 13 July, 2020; v1 submitted 22 May, 2019; originally announced May 2019.

Comments: accepted for publication in Evolution Equations & Control Theory; 21 pages, 10 figures

Journal ref: Evol. Equ. Control Theory, 10 (2021), 701-722

arXiv:1904.10279 [pdf, other]

Heterofusion: Fusing genomics data of different measurement scales

Authors: Age K. Smilde, Yipeng Song, Johan A. Westerhuis, Henk A. L. Kiers, Nanne Aben, Lodewyk F. A. Wessels

Abstract: In systems biology, it is becoming increasingly common to measure biochemical entities at different levels of the same biological system. Hence, data fusion problems are abundant in the life sciences. With the availability of a multitude of measuring techniques, one of the central problems is the heterogeneity of the data. In this paper, we discuss a specific form of heterogeneity, namely that of measurements obtained at different measurement scales, such as binary, ordinal, interval and ratio-scaled variables. Three generic fusion approaches are presented of which two are new to the systems biology community. The methods are presented, put in context and illustrated with a real-life genomics example.

Submitted 23 April, 2019; originally announced April 2019.

arXiv:1807.04982 [pdf, other]

Generalized simultaneous component analysis of binary and quantitative data

Authors: Yipeng Song, Johan A. Westerhuis, Nanne Aben, Lodewyk F. A. Wessels, Patrick J. F. Groenen, Age K. Smilde

Abstract: In the current era of systems biological research there is a need for the integrative analysis of binary and quantitative genomics data sets measured on the same objects. One standard tool of exploring the underlying dependence structure present in multiple quantitative data sets is simultaneous component analysis (SCA) model. However, it does not have any provisions when a part of the data are binary. To this end, we propose the generalized SCA (GSCA) model, which takes into account the distinct mathematical properties of binary and quantitative measurements in the maximum likelihood framework. Like in the SCA model, a common low dimensional subspace is assumed to represent the shared information between these two distinct types of measurements. However, the GSCA model can easily be overfitted when a rank larger than one is used, leading to some of the estimated parameters to become very large. To achieve a low rank solution and combat overfitting, we propose to use a concave variant of the nuclear norm penalty. An efficient majorization algorithm is developed to fit this model with different concave penalties. Realistic simulations (low signal-to-noise ratio and highly imbalanced binary data) are used to evaluate the performance of the proposed model in recovering the underlying structure. Also, a missing value based cross validation procedure is implemented for model selection. We illustrate the usefulness of the GSCA model for exploratory data analysis of quantitative gene expression and binary copy number aberration (CNA) measurements obtained from the GDSC1000 data sets.

Submitted 3 June, 2019; v1 submitted 13 July, 2018; originally announced July 2018.

Comments: 19 pages, 10 figures

arXiv:1712.04200 [pdf, other]

Approximating multivariate posterior distribution functions from Monte Carlo samples for sequential Bayesian inference

Authors: Bram Thijssen, Lodewyk F. A. Wessels

Abstract: An important feature of Bayesian statistics is the opportunity to do sequential inference: the posterior distribution obtained after seeing a dataset can be used as prior for a second inference. However, when Monte Carlo sampling methods are used for inference, we only have a set of samples from the posterior distribution. To do sequential inference, we then either have to evaluate the second posterior at only these locations and reweight the samples accordingly, or we can estimate a functional description of the posterior probability distribution from the samples and use that as prior for the second inference. Here, we investigated to what extent we can obtain an accurate joint posterior from two datasets if the inference is done sequentially rather than jointly, under the condition that each inference step is done using Monte Carlo sampling. To test this, we evaluated the accuracy of kernel density estimates, Gaussian mixtures, vine copulas and Gaussian processes in approximating posterior distributions, and then tested whether these approximations can be used in sequential inference. In low dimensionality, Gaussian processes are more accurate, whereas in higher dimensionality Gaussian mixtures or vine copulas perform better. In our test cases, posterior approximations are preferable over direct sample reweighting, although joint inference is still preferable over sequential inference. Since the performance is case-specific, we provide an R package mvdens with a unified interface for the density approximation methods.

Submitted 21 June, 2019; v1 submitted 12 December, 2017; originally announced December 2017.

arXiv:1110.3717 [pdf, other]

doi 10.1371/journal.pone.0034796

A critical evaluation of network and pathway based classifiers for outcome prediction in breast cancer

Authors: C. Staiger, S. Cadot, R. Kooter, M. Dittrich, T. Mueller, G. W. Klau, L. F. A. Wessels

Abstract: Recently, several classifiers that combine primary tumor data, like gene expression data, and secondary data sources, such as protein-protein interaction networks, have been proposed for predicting outcome in breast cancer. In these approaches, new composite features are typically constructed by aggregating the expression levels of several genes. The secondary data sources are employed to guide this aggregation. Although many studies claim that these approaches improve classification performance over single gene classifiers, the gain in performance is difficult to assess. This stems mainly from the fact that different breast cancer data sets and validation procedures are employed to assess the performance. Here we address these issues by employing a large cohort of six breast cancer data sets as benchmark set and by performing an unbiased evaluation of the classification accuracies of the different approaches. Contrary to previous claims, we find that composite feature classifiers do not outperform simple single gene classifiers. We investigate the effect of (1) the number of selected features; (2) the specific gene set from which features are selected; (3) the size of the training set and (4) the heterogeneity of the data set on the performance of composite feature and single gene classifiers. Strikingly, we find that randomization of secondary data sources, which destroys all biological information in these sources, does not result in a deterioration in performance of composite feature classifiers. Finally, we show that when a proper correction for gene set size is performed, the stability of single gene sets is similar to the stability of composite feature sets. Based on these results there is currently no reason to prefer prognostic classifiers based on composite features over single gene classifiers for predicting outcome in breast cancer.

Submitted 18 October, 2011; v1 submitted 17 October, 2011; originally announced October 2011.
