Showing 1–50 of 132 results for author: Schönlieb, C

  1. arXiv:2506.11732  [pdf, ps, other]

    math.NA cs.LG math.OC

    Data-driven approaches to inverse problems

    Authors: Carola-Bibiane Schönlieb, Zakhar Shumaylov

    Abstract: Inverse problems are concerned with the reconstruction of unknown physical quantities using indirect measurements and are fundamental across diverse fields such as medical imaging, remote sensing, and material sciences. These problems serve as critical tools for visualizing internal structures beyond what is visible to the naked eye, enabling quantification, diagnosis, prediction, and discovery. H…

    Submitted 13 June, 2025; originally announced June 2025.

    Comments: Notes from Machine Learning: From Data to Mathematical Understanding (CIME 2023)

  2. arXiv:2505.19873  [pdf, ps, other]

    cs.CV math.NA

    Deep Spectral Prior

    Authors: Yanqi Cheng, Tieyong Zeng, Pietro Lio, Carola-Bibiane Schönlieb, Angelica I Aviles-Rivero

    Abstract: We introduce Deep Spectral Prior (DSP), a new formulation of Deep Image Prior (DIP) that redefines image reconstruction as a frequency-domain alignment problem. Unlike traditional DIP, which relies on pixel-wise loss and early stopping to mitigate overfitting, DSP directly matches Fourier coefficients between the network output and observed measurements. This shift introduces an explicit inductive…

    Submitted 26 May, 2025; originally announced May 2025.

  3. arXiv:2505.12940  [pdf, other]

    cs.LG math.NA

    Multi-Level Monte Carlo Training of Neural Operators

    Authors: James Rowbottom, Stefania Fresca, Pietro Lio, Carola-Bibiane Schönlieb, Nicolas Boullé

    Abstract: Operator learning is a rapidly growing field that aims to approximate nonlinear operators related to partial differential equations (PDEs) using neural operators. These rely on discretization of input and output functions and are, usually, expensive to train for large-scale problems at high-resolution. Motivated by this, we present a Multi-Level Monte Carlo (MLMC) approach to train neural operator…

    Submitted 19 May, 2025; originally announced May 2025.

    Comments: 18 pages, 8 figures

  4. arXiv:2505.12003  [pdf, ps, other]

    cs.LG math.NA

    Approximation theory for 1-Lipschitz ResNets

    Authors: Davide Murari, Takashi Furuya, Carola-Bibiane Schönlieb

    Abstract: 1-Lipschitz neural networks are fundamental for generative modelling, inverse problems, and robust classifiers. In this paper, we focus on 1-Lipschitz residual networks (ResNets) based on explicit Euler steps of negative gradient flows and study their approximation capabilities. Leveraging the Restricted Stone-Weierstrass Theorem, we first show that these 1-Lipschitz ResNets are dense in the set o…

    Submitted 17 May, 2025; originally announced May 2025.

    MSC Class: 68T07

  5. arXiv:2505.01388  [pdf, ps, other]

    eess.IV math.ST

    Potential Contrast: Properties, Equivalences, and Generalization to Multiple Classes

    Authors: Wallace Peaslee, Anna Breger, Carola-Bibiane Schönlieb

    Abstract: Potential contrast is typically used as an image quality measure and quantifies the maximal possible contrast between samples from two classes of pixels in an image after an arbitrary grayscale transformation. It has been valuable in cultural heritage applications, identifying and visualizing relevant information in multispectral images while requiring a small number of pixels to be manually sampl…

    Submitted 2 May, 2025; originally announced May 2025.

  6. arXiv:2504.04997  [pdf, other]

    stat.ML cs.AI cs.LG math.ST stat.AP

    SurvSurf: a partially monotonic neural network for first-hitting time prediction of intermittently observed discrete and continuous sequential events

    Authors: Yichen Kelly Chen, Sören Dittmer, Kinga Bernatowicz, Josep Arús-Pous, Kamen Bliznashki, John Aston, James H. F. Rudd, Carola-Bibiane Schönlieb, James Jones, Michael Roberts

    Abstract: We propose a neural-network based survival model (SurvSurf) specifically designed for direct and simultaneous probabilistic prediction of the first hitting time of sequential events from baseline. Unlike existing models, SurvSurf is theoretically guaranteed to never violate the monotonic relationship between the cumulative incidence functions of sequential events, while allowing nonlinear influenc…

    Submitted 7 April, 2025; originally announced April 2025.

    Comments: 41 pages, 18 figures (including supplemental information). Submitted to RSS: Data Science and Artificial Intelligence

    MSC Class: 62N01

  7. arXiv:2412.10249  [pdf, other]

    math.NA math.OC

    Stochastic Multiresolution Image Sketching for Inverse Imaging Problems

    Authors: Alessandro Perelli, Carola-Bibiane Schonlieb, Matthias J. Ehrhardt

    Abstract: A challenge in high-dimensional inverse problems is developing iterative solvers to find the accurate solution of regularized optimization problems with low computational cost. An important example is computed tomography (CT) where both image and data sizes are large and therefore the forward model is costly to evaluate. Since several years algorithms from stochastic optimization are used for tomo…

    Submitted 13 December, 2024; originally announced December 2024.

    Comments: 26 pages, 11 figures, submitted to SIAM Journal on Imaging Sciences

    MSC Class: 49M29; 65K05; 65F22

    ACM Class: G.1.6; G.1.3

  8. arXiv:2412.10129  [pdf, other]

    physics.med-ph cs.MS math.OC

    TIGRE v3: Efficient and easy to use iterative computed tomographic reconstruction toolbox for real datasets

    Authors: Ander Biguri, Tomoyuki Sadakane, Reuben Lindroos, Yi Liu, Malena Sabaté Landman, Yi Du, Manasavee Lohvithee, Stefanie Kaser, Sepideh Hatamikia, Robert Bryll, Emilien Valat, Sarinrat Wonglee, Thomas Blumensath, Carola-Bibiane Schönlieb

    Abstract: Computed Tomography (CT) has been widely adopted in medicine and it is increasingly being used in scientific and industrial applications. Parallelly, research in different mathematical areas concerning discrete inverse problems has led to the development of new sophisticated numerical solvers that can be applied in the context of CT. The Tomographic Iterative GPU-based Reconstruction (TIGRE) toolb…

    Submitted 13 December, 2024; originally announced December 2024.

  9. arXiv:2410.18262  [pdf, other]

    cs.LG math.NA physics.comp-ph

    Hamiltonian Matching for Symplectic Neural Integrators

    Authors: Priscilla Canizares, Davide Murari, Carola-Bibiane Schönlieb, Ferdia Sherry, Zakhar Shumaylov

    Abstract: Hamilton's equations of motion form a fundamental framework in various branches of physics, including astronomy, quantum mechanics, particle physics, and climate science. Classical numerical solvers are typically employed to compute the time evolution of these systems. However, when the system spans multiple spatial and temporal scales numerical errors can accumulate, leading to reduced accuracy.…

    Submitted 23 October, 2024; originally announced October 2024.

    Comments: NeurReps 2024

  10. arXiv:2410.04543  [pdf, ps, other]

    cs.LG cs.AI math.DG q-bio.BM

    Pullback Flow Matching on Data Manifolds

    Authors: Friso de Kruiff, Erik Bekkers, Ozan Öktem, Carola-Bibiane Schönlieb, Willem Diepeveen

    Abstract: We propose Pullback Flow Matching (PFM), a novel framework for generative modeling on data manifolds. Unlike existing methods that assume or learn restrictive closed-form manifold mappings for training Riemannian Flow Matching (RFM) models, PFM leverages pullback geometry and isometric learning to preserve the underlying manifold's geometry while enabling efficient generation and precise interpola…

    Submitted 9 July, 2025; v1 submitted 6 October, 2024; originally announced October 2024.

  11. arXiv:2410.02698  [pdf, other]

    cs.LG cs.CV math.NA

    Lie Algebra Canonicalization: Equivariant Neural Operators under arbitrary Lie Groups

    Authors: Zakhar Shumaylov, Peter Zaika, James Rowbottom, Ferdia Sherry, Melanie Weber, Carola-Bibiane Schönlieb

    Abstract: The quest for robust and generalizable machine learning models has driven recent interest in exploiting symmetries through equivariant neural networks. In the context of PDE solvers, recent works have shown that Lie point symmetries can be a useful inductive bias for Physics-Informed Neural Networks (PINNs) through data and loss augmentation. Despite this, directly enforcing equivariance within th…

    Submitted 4 March, 2025; v1 submitted 3 October, 2024; originally announced October 2024.

    Comments: 44 pages; accepted at ICLR 2025

  12. arXiv:2410.02113  [pdf, other]

    cs.LG math.NA

    Mamba Neural Operator: Who Wins? Transformers vs. State-Space Models for PDEs

    Authors: Chun-Wun Cheng, Jiahao Huang, Yi Zhang, Guang Yang, Carola-Bibiane Schönlieb, Angelica I Aviles-Rivero

    Abstract: Partial differential equations (PDEs) are widely used to model complex physical systems, but solving them efficiently remains a significant challenge. Recently, Transformers have emerged as the preferred architecture for PDEs due to their ability to capture intricate dependencies. However, they struggle with representing continuous dynamics and long-range interactions. To overcome these limitation…

    Submitted 9 April, 2025; v1 submitted 2 October, 2024; originally announced October 2024.

  13. arXiv:2410.01950  [pdf, other]

    cs.LG math.DG stat.ML

    Score-based Pullback Riemannian Geometry: Extracting the Data Manifold Geometry using Anisotropic Flows

    Authors: Willem Diepeveen, Georgios Batzolis, Zakhar Shumaylov, Carola-Bibiane Schönlieb

    Abstract: Data-driven Riemannian geometry has emerged as a powerful tool for interpretable representation learning, offering improved efficiency in downstream tasks. Moving forward, it is crucial to balance cheap manifold mappings with efficient training algorithms. In this work, we integrate concepts from pullback Riemannian geometry and generative models to propose a framework for data-driven Riemannian g…

    Submitted 22 May, 2025; v1 submitted 2 October, 2024; originally announced October 2024.

  14. arXiv:2409.01097  [pdf, other]

    math.NA

    Nested Bregman Iterations for Decomposition Problems

    Authors: Tobias Wolf, Derek Driggs, Kostas Papafitsoros, Elena Resmerita, Carola-Bibiane Schönlieb

    Abstract: We consider the task of image reconstruction while simultaneously decomposing the reconstructed image into components with different features. A commonly used tool for this is a variational approach with an infimal convolution of appropriate functions as a regularizer. Especially for noise corrupted observations, incorporating these functionals into the classical method of Bregman iterations provi…

    Submitted 15 April, 2025; v1 submitted 2 September, 2024; originally announced September 2024.

  15. arXiv:2408.06996  [pdf, other]

    cs.LG math.ST

    Blessing of Dimensionality for Approximating Sobolev Classes on Manifolds

    Authors: Hong Ye Tan, Subhadip Mukherjee, Junqi Tang, Carola-Bibiane Schönlieb

    Abstract: The manifold hypothesis says that natural high-dimensional data lie on or around a low-dimensional manifold. The recent success of statistical and learning-based methods in very high dimensions empirically supports this hypothesis, suggesting that typical worst-case analysis does not provide practical guarantees. A natural step for analysis is thus to assume the manifold hypothesis and derive boun…

    Submitted 3 May, 2025; v1 submitted 13 August, 2024; originally announced August 2024.

    MSC Class: 41A25; 41A46; 53Z50

  16. arXiv:2407.04516  [pdf, ps, other]

    cs.LG math.NA

    G-Adaptivity: optimised graph-based mesh relocation for finite element methods

    Authors: James Rowbottom, Georg Maierhofer, Teo Deveney, Eike Mueller, Alberto Paganini, Katharina Schratz, Pietro Liò, Carola-Bibiane Schönlieb, Chris Budd

    Abstract: We present a novel, and effective, approach to achieve optimal mesh relocation in finite element methods (FEMs). The cost and accuracy of FEMs is critically dependent on the choice of mesh points. Mesh relocation (r-adaptivity) seeks to optimise the mesh geometry to obtain the best solution accuracy at given computational budget. Classical r-adaptivity relies on the solution of a separate nonlinea…

    Submitted 21 June, 2025; v1 submitted 5 July, 2024; originally announced July 2024.

    Journal ref: Proceedings of the 42nd International Conference on Machine Learning, 2025

  17. arXiv:2406.02458  [pdf, other]

    math.NA

    Deep Block Proximal Linearised Minimisation Algorithm for Non-convex Inverse Problems

    Authors: Chaoyan Huang, Zhongming Wu, Yanqi Cheng, Tieyong Zeng, Carola-Bibiane Schönlieb, Angelica I. Aviles-Rivero

    Abstract: Image restoration is typically addressed through non-convex inverse problems, which are often solved using first-order block-wise splitting methods. In this paper, we consider a general type of non-convex optimisation model that captures many inverse image problems and present an inertial block proximal linearised minimisation (iBPLM) algorithm. Our new method unifies the Jacobi-type parallel and…

    Submitted 13 November, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: 6 figures, 3 tables

  18. arXiv:2403.17100  [pdf, ps, other]

    math.OC

    Practical Acceleration of the Condat-Vũ Algorithm

    Authors: Derek Driggs, Matthias J. Ehrhardt, Carola-Bibiane Schönlieb, Junqi Tang

    Abstract: The Condat-Vũ algorithm is a widely used primal-dual method for optimizing composite objectives of three functions. Several algorithms for optimizing composite objectives of two functions are special cases of Condat-Vũ, including proximal gradient descent (PGD). It is well-known that PGD exhibits suboptimal performance, and a simple adjustment to PGD can accelerate its convergence rate from…

    Submitted 25 March, 2024; originally announced March 2024.

  19. arXiv:2402.03541  [pdf, other]

    cs.LG math.NA

    HAMLET: Graph Transformer Neural Operator for Partial Differential Equations

    Authors: Andrey Bryutkin, Jiahao Huang, Zhongying Deng, Guang Yang, Carola-Bibiane Schönlieb, Angelica Aviles-Rivero

    Abstract: We present a novel graph transformer framework, HAMLET, designed to address the challenges in solving partial differential equations (PDEs) using neural networks. The framework uses graph transformers with modular input encoders to directly incorporate differential equation information into the solution process. This modularity enhances parameter correspondence control, making HAMLET adaptable to…

    Submitted 2 October, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: 18 pages, 7 figures, 6 tables

    Journal ref: Proceedings of Machine Learning Research, Vol. 235, pp. 4624-4641, 2024

  20. arXiv:2402.01052  [pdf, other]

    math.OC cs.CV cs.LG stat.ML

    Weakly Convex Regularisers for Inverse Problems: Convergence of Critical Points and Primal-Dual Optimisation

    Authors: Zakhar Shumaylov, Jeremy Budd, Subhadip Mukherjee, Carola-Bibiane Schönlieb

    Abstract: Variational regularisation is the primary method for solving inverse problems, and recently there has been considerable work leveraging deeply learned regularisation for enhanced performance. However, few results exist addressing the convergence of such regularisation, particularly within the context of critical points as opposed to global minimisers. In this paper, we present a generalised formul…

    Submitted 15 June, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: 26 pages, 4 figures; https://openreview.net/forum?id=E8FpcUyPuS

  21. arXiv:2311.15996  [pdf, other]

    cs.LG math.NA stat.ML

    Closing the ODE-SDE gap in score-based diffusion models through the Fokker-Planck equation

    Authors: Teo Deveney, Jan Stanczuk, Lisa Maria Kreusser, Chris Budd, Carola-Bibiane Schönlieb

    Abstract: Score-based diffusion models have emerged as one of the most promising frameworks for deep generative modelling, due to their state-of-the art performance in many generation tasks while relying on mathematical foundations such as stochastic differential equations (SDEs) and ordinary differential equations (ODEs). Empirically, it has been reported that ODE based samples are inferior to SDE based sa…

    Submitted 27 November, 2023; originally announced November 2023.

  22. arXiv:2308.07818  [pdf, other]

    q-bio.BM math.DG math.NA

    Riemannian geometry for efficient analysis of protein dynamics data

    Authors: Willem Diepeveen, Carlos Esteve-Yagüe, Jan Lellmann, Ozan Öktem, Carola-Bibiane Schönlieb

    Abstract: An increasingly common viewpoint is that protein dynamics data sets reside in a non-linear subspace of low conformational energy. Ideal data analysis tools for such data sets should therefore account for such non-linear geometry. The Riemannian geometry setting can be suitable for a variety of reasons. First, it comes with a rich structure to account for a wide range of geometries that can be mode…

    Submitted 26 October, 2023; v1 submitted 15 August, 2023; originally announced August 2023.

    MSC Class: 49Q10; 53C22; 53Z10; 53Z50; 65D18; 92-08; 92-10

  23. arXiv:2308.05045  [pdf, other]

    math.OC

    Boosting Data-Driven Mirror Descent with Randomization, Equivariance, and Acceleration

    Authors: Hong Ye Tan, Subhadip Mukherjee, Junqi Tang, Carola-Bibiane Schönlieb

    Abstract: Learning-to-optimize (L2O) is an emerging research area in large-scale optimization with applications in data science. Recently, researchers have proposed a novel L2O framework called learned mirror descent (LMD), based on the classical mirror descent (MD) algorithm with learnable mirror maps parameterized by input-convex neural networks. The LMD approach has been shown to significantly accelerate…

    Submitted 10 May, 2024; v1 submitted 9 August, 2023; originally announced August 2023.

    MSC Class: 46N10; 65K10; 65G50

  24. arXiv:2307.13579  [pdf, other]

    cs.LG cs.AI math.ST

    Reinterpreting survival analysis in the universal approximator age

    Authors: Sören Dittmer, Michael Roberts, Jacobus Preller, AIX COVNET, James H. F. Rudd, John A. D. Aston, Carola-Bibiane Schönlieb

    Abstract: Survival analysis is an integral part of the statistical toolbox. However, while most domains of classical statistics have embraced deep learning, survival analysis only recently gained some minor attention from the deep learning community. This recent development is likely in part motivated by the COVID-19 pandemic. We aim to provide the tools needed to fully harness the potential of survival ana…

    Submitted 25 July, 2023; originally announced July 2023.

  25. arXiv:2307.09441  [pdf, other]

    math.NA cs.LG

    Convergent regularization in inverse problems and linear plug-and-play denoisers

    Authors: Andreas Hauptmann, Subhadip Mukherjee, Carola-Bibiane Schönlieb, Ferdia Sherry

    Abstract: Plug-and-play (PnP) denoising is a popular iterative framework for solving imaging inverse problems using off-the-shelf image denoisers. Their empirical success has motivated a line of research that seeks to understand the convergence of PnP iterates under various assumptions on the denoiser. While a significant amount of research has gone into establishing the convergence of the PnP iteration for…

    Submitted 18 July, 2023; originally announced July 2023.

  26. arXiv:2307.07344  [pdf, other]

    cs.LG math.NA

    Inverse Evolution Layers: Physics-informed Regularizers for Deep Neural Networks

    Authors: Chaoyu Liu, Zhonghua Qiao, Chao Li, Carola-Bibiane Schönlieb

    Abstract: Traditional image processing methods employing partial differential equations (PDEs) offer a multitude of meaningful regularizers, along with valuable theoretical foundations for a wide range of image-related tasks. This makes their integration into neural networks a promising avenue. In this paper, we introduce a novel regularization approach inspired by the reverse process of PDE-based evolution…

    Submitted 1 July, 2024; v1 submitted 14 July, 2023; originally announced July 2023.

  27. arXiv:2306.17737  [pdf, other]

    stat.CO math.NA

    Proximal Langevin Sampling With Inexact Proximal Mapping

    Authors: Matthias J. Ehrhardt, Lorenz Kuger, Carola-Bibiane Schönlieb

    Abstract: In order to solve tasks like uncertainty quantification or hypothesis tests in Bayesian imaging inverse problems, we often have to draw samples from the arising posterior distribution. For the usually log-concave but high-dimensional posteriors, Markov chain Monte Carlo methods based on time discretizations of Langevin diffusion are a popular tool. If the potential defining the distribution is non…

    Submitted 13 May, 2024; v1 submitted 30 June, 2023; originally announced June 2023.

    Comments: 29 pages, 11 figures

    Journal ref: SIAM Journal on Imaging Sciences 2024 17:3, 1729-1760

  28. arXiv:2304.08628  [pdf, other]

    math.OC

    A sparse optimization approach to infinite infimal convolution regularization

    Authors: Kristian Bredies, Marcello Carioni, Martin Holler, Yury Korolev, Carola-Bibiane Schönlieb

    Abstract: In this paper we introduce the class of infinite infimal convolution functionals and apply these functionals to the regularization of ill-posed inverse problems. The proposed regularization involves an infimal convolution of a continuously parametrized family of convex, positively one-homogeneous functionals defined on a common Banach space $X$. We show that, under mild assumptions, this functiona…

    Submitted 15 December, 2024; v1 submitted 17 April, 2023; originally announced April 2023.

  29. arXiv:2304.08342  [pdf, other]

    math.NA cs.CV stat.ML

    NF-ULA: Langevin Monte Carlo with Normalizing Flow Prior for Imaging Inverse Problems

    Authors: Ziruo Cai, Junqi Tang, Subhadip Mukherjee, Jinglai Li, Carola Bibiane Schönlieb, Xiaoqun Zhang

    Abstract: Bayesian methods for solving inverse problems are a powerful alternative to classical methods since the Bayesian approach offers the ability to quantify the uncertainty in the solution. In recent years, data-driven techniques for solving inverse problems have also been remarkably successful, due to their superior representation ability. In this work, we incorporate data-based models into a class o…

    Submitted 14 October, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

  30. arXiv:2303.07271  [pdf, other]

    math.OC cs.CV cs.LG stat.ML

    Provably Convergent Plug-and-Play Quasi-Newton Methods

    Authors: Hong Ye Tan, Subhadip Mukherjee, Junqi Tang, Carola-Bibiane Schönlieb

    Abstract: Plug-and-Play (PnP) methods are a class of efficient iterative methods that aim to combine data fidelity terms and deep denoisers using classical optimization algorithms, such as ISTA or ADMM, with applications in inverse problems and imaging. Provable PnP methods are a subclass of PnP methods with convergence guarantees, such as fixed point convergence or convergence to critical points of some en…

    Submitted 13 November, 2023; v1 submitted 9 March, 2023; originally announced March 2023.

    MSC Class: 49M15; 49J52; 65K15

  31. arXiv:2303.03794  [pdf, other]

    cs.CV eess.IV math.NA

    Hidden Knowledge: Mathematical Methods for the Extraction of the Fingerprint of Medieval Paper from Digital Images

    Authors: Tamara G. Grossmann, Carola-Bibiane Schönlieb, Orietta Da Rold

    Abstract: Medieval paper, a handmade product, is made with a mould which leaves an indelible imprint on the sheet of paper. This imprint includes chain lines, laid lines and watermarks which are often visible on the sheet. Extracting these features allows the identification of paper stock and gives information about chronology, localisation and movement of books and people. Most computational work for featu…

    Submitted 7 March, 2023; originally announced March 2023.

  32. arXiv:2302.04107  [pdf, other]

    math.NA cs.LG

    Can Physics-Informed Neural Networks beat the Finite Element Method?

    Authors: Tamara G. Grossmann, Urszula Julia Komorowska, Jonas Latz, Carola-Bibiane Schönlieb

    Abstract: Partial differential equations play a fundamental role in the mathematical modelling of many processes and systems in physical, biological and other sciences. To simulate such processes and systems, the solutions of PDEs often need to be approximated numerically. The finite element method, for instance, is a usual standard methodology to do so. The recent success of deep neural networks at various…

    Submitted 8 February, 2023; originally announced February 2023.

  33. arXiv:2301.02511  [pdf, ps, other]

    math.OC

    Stochastic Primal Dual Hybrid Gradient Algorithm with Adaptive Step-Sizes

    Authors: Antonin Chambolle, Claire Delplancke, Matthias J. Ehrhardt, Carola-Bibiane Schönlieb, Junqi Tang

    Abstract: In this work we propose a new primal-dual algorithm with adaptive step-sizes. The stochastic primal-dual hybrid gradient (SPDHG) algorithm with constant step-sizes has become widely applied in large-scale convex optimization across many scientific fields due to its scalability. While the product of the primal and dual step-sizes is subject to an upper-bound in order to ensure convergence, the sele…

    Submitted 4 December, 2023; v1 submitted 6 January, 2023; originally announced January 2023.

    Comments: 31 pages, 9 figures

    MSC Class: 47N10; 49J40; 65D18; 65K10; 90C06; 90C15; 90C25; 92C55; 94A08

  34. On Krylov Methods for Large Scale CBCT Reconstruction

    Authors: Malena Sabate Landman, Ander Biguri, Sepideh Hatamikia, Richard Boardman, John Aston, Carola-Bibiane Schonlieb

    Abstract: Krylov subspace methods are a powerful family of iterative solvers for linear systems of equations, which are commonly used for inverse problems due to their intrinsic regularization properties. Moreover, these methods are naturally suited to solve large-scale problems, as they only require matrix-vector products with the system matrix (and its adjoint) to compute approximate solutions, and they d…

    Submitted 25 November, 2022; originally announced November 2022.

    Comments: submitted

  35. arXiv:2211.05205  [pdf, ps, other]

    math.ST cs.IT math.OC math.PR

    Maximum Entropy on the Mean and the Cramér Rate Function in Statistical Estimation and Inverse Problems: Properties, Models, and Algorithms

    Authors: Yakov Vaisbourd, Rustum Choksi, Ariel Goodwin, Tim Hoheisel, Carola-Bibiane Schönlieb

    Abstract: We explore a method of statistical estimation called Maximum Entropy on the Mean (MEM) which is based on an information-driven criterion that quantifies the compliance of a given point with a reference prior probability measure. At the core of this approach lies the MEM function which is a partial minimization of the Kullback-Leibler divergence over a linear constraint. In many cases, it is known…

    Submitted 16 December, 2022; v1 submitted 9 November, 2022; originally announced November 2022.

  36. Robust Data-Driven Accelerated Mirror Descent

    Authors: Hong Ye Tan, Subhadip Mukherjee, Junqi Tang, Andreas Hauptmann, Carola-Bibiane Schönlieb

    Abstract: Learning-to-optimize is an emerging framework that leverages training data to speed up the solution of certain optimization problems. One such approach is based on the classical mirror descent algorithm, where the mirror map is modelled using input-convex neural networks. In this work, we extend this functional parameterization approach by introducing momentum into the iterations, based on the cla…

    Submitted 2 June, 2023; v1 submitted 21 October, 2022; originally announced October 2022.

    Comments: Note inconsistency with ICASSP paper for step-size choice in (4c) and associated Alg. 1, this version is correct with step-size kt/r

    Journal ref: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

  37. arXiv:2210.02373  [pdf, other]

    cs.LG math.DS math.NA

    Dynamical systems' based neural networks

    Authors: Elena Celledoni, Davide Murari, Brynjulf Owren, Carola-Bibiane Schönlieb, Ferdia Sherry

    Abstract: Neural networks have gained much interest because of their effectiveness in many applications. However, their mathematical properties are generally not well understood. If there is some underlying geometric structure inherent to the data or to the function to approximate, it is often desirable to take this into account in the design of the neural network. In this work, we start with a non-autonomo…

    Submitted 31 August, 2023; v1 submitted 5 October, 2022; originally announced October 2022.

    MSC Class: 65L05; 65L06; 37M15

  38. arXiv:2209.05546  [pdf, other]

    eess.IV math.OC q-bio.QM stat.AP

    Spectral decomposition of atomic structures in heterogeneous cryo-EM

    Authors: Carlos Esteve-Yagüe, Willem Diepeveen, Ozan Öktem, Carola-Bibiane Schönlieb

    Abstract: We consider the problem of recovering the three-dimensional atomic structure of a flexible macromolecule from a heterogeneous cryo-EM dataset. The dataset contains noisy tomographic projections of the electrostatic potential of the macromolecule, taken from different viewing directions, and in the heterogeneous case, each image corresponds to a different conformation of the macromolecule. Under th…

    Submitted 27 December, 2022; v1 submitted 12 September, 2022; originally announced September 2022.

    Comments: 35 pages, 20 figures

  39. arXiv:2209.03045  [pdf, other]

    math.OC math.DG

    Regularising orientation estimation in Cryo-EM 3D map refinement through measure-based lifting over Riemannian manifolds

    Authors: Willem Diepeveen, Jan Lellmann, Ozan Öktem, Carola-Bibiane Schönlieb

    Abstract: Motivated by the trade-off between noise-robustness and data-consistency for joint 3D map reconstruction and rotation estimation in single particle cryogenic-electron microscopy (Cryo-EM), we propose ellipsoidal support lifting (ESL), a measure-based lifting scheme for regularising and approximating the global minimiser of a smooth function over a Riemannian manifold. Under a uniqueness assumption…

    Submitted 31 January, 2023; v1 submitted 7 September, 2022; originally announced September 2022.

    MSC Class: 90C26; 92E10; 68U10

  40. arXiv:2208.14784  [pdf, other]

    eess.IV cs.CV cs.LG math.OC

    Practical Operator Sketching Framework for Accelerating Iterative Data-Driven Solutions in Inverse Problems

    Authors: Junqi Tang, Guixian Xu, Subhadip Mukherjee, Carola-Bibiane Schönlieb

    Abstract: We propose a new operator-sketching paradigm for designing efficient iterative data-driven reconstruction (IDR) schemes, e.g. Plug-and-Play algorithms and deep unrolling networks. These IDR schemes are currently the state-of-the-art solutions for imaging inverse problems. However, for high-dimensional imaging tasks, especially X-ray CT and MRI imaging, these IDR schemes typically become inefficien…

    Submitted 5 December, 2024; v1 submitted 31 August, 2022; originally announced August 2022.

  41. arXiv:2208.01631  [pdf, ps, other]

    math.OC cs.LG eess.IV

    Stochastic Primal-Dual Three Operator Splitting Algorithm with Extension to Equivariant Regularization-by-Denoising

    Authors: Junqi Tang, Matthias Ehrhardt, Carola-Bibiane Schönlieb

    Abstract: In this work we propose a stochastic primal-dual three-operator splitting algorithm (TOS-SPDHG) for solving a class of convex three-composite optimization problems. Our proposed scheme is a direct three-operator splitting extension of the SPDHG algorithm [Chambolle et al. 2018]. We provide theoretical convergence analysis showing ergodic $O(1/K)$ convergence rate, and demonstrate the effectiveness…

    Submitted 15 March, 2025; v1 submitted 2 August, 2022; originally announced August 2022.

    Comments: SSVM-2025

  42. arXiv:2206.06733  [pdf, other]

    math.OC

    Data-Driven Mirror Descent with Input-Convex Neural Networks

    Authors: Hong Ye Tan, Subhadip Mukherjee, Junqi Tang, Carola-Bibiane Schönlieb

    Abstract: Learning-to-optimize is an emerging framework that seeks to speed up the solution of certain optimization problems by leveraging training data. Learned optimization solvers have been shown to outperform classical optimization algorithms in terms of convergence speed, especially for convex problems. Many existing data-driven optimization methods are based on parameterizing the update step and learn…

    Submitted 24 February, 2023; v1 submitted 14 June, 2022; originally announced June 2022.

    MSC Class: 46N10 (Primary) 65K10; 65G50 (Secondary)

  43. arXiv:2203.11156  [pdf, other]

    cs.CV cs.LG eess.IV math.OC

    Operator Sketching for Deep Unrolling Networks

    Authors: Junqi Tang, Subhadip Mukherjee, Carola-Bibiane Schönlieb

    Abstract: In this work we propose a new paradigm for designing efficient deep unrolling networks using operator sketching. The deep unrolling networks are currently the state-of-the-art solutions for imaging inverse problems. However, for high-dimensional imaging tasks, especially the 3D cone-beam X-ray CT and 4D MRI imaging, the deep unrolling schemes typically become inefficient both in terms of memory an…

    Submitted 6 June, 2022; v1 submitted 21 March, 2022; originally announced March 2022.

  44. arXiv:2203.04650  [pdf, ps, other]

    math.PR math.FA

    Gaussian random fields on non-separable Banach spaces

    Authors: Yury Korolev, Jonas Latz, Carola-Bibiane Schönlieb

    Abstract: We study Gaussian random fields on certain Banach spaces and investigate conditions for their existence. Our results apply inter alia to spaces of Radon measures and Hölder functions. In the former case, we are able to define Gaussian white noise on the space of measures directly, avoiding, e.g., an embedding into a negative-order Sobolev space. In the latter case, we demonstrate how Hölder regula…

    Submitted 9 March, 2022; originally announced March 2022.

    MSC Class: 60G15; 46N30; 46B26

  45. arXiv:2202.04965  [pdf, other]

    math.OC math.AP math.NA

    $Γ$-Convergence of an Ambrosio-Tortorelli approximation scheme for image segmentation

    Authors: Irene Fonseca, Lisa Maria Kreusser, Carola-Bibiane Schönlieb, Matthew Thorpe

    Abstract: Given an image $u_0$, the aim of minimising the Mumford-Shah functional is to find a decomposition of the image domain into sub-domains and a piecewise smooth approximation $u$ of $u_0$ such that $u$ varies smoothly within each sub-domain. Since the Mumford-Shah functional is highly non-smooth, regularizations such as the Ambrosio-Tortorelli approximation can be considered which is one of the most…

    Submitted 4 September, 2023; v1 submitted 10 February, 2022; originally announced February 2022.

  46. arXiv:2112.03754  [pdf, other]

    cs.LG math.PR math.ST

    A Continuous-time Stochastic Gradient Descent Method for Continuous Data

    Authors: Kexin Jin, Jonas Latz, Chenguang Liu, Carola-Bibiane Schönlieb

    Abstract: Optimization problems with continuous data appear in, e.g., robust machine learning, functional data analysis, and variational inference. Here, the target function is given as an integral over a family of (continuously) indexed target functions - integrated with respect to a probability measure. Such problems can often be solved by stochastic optimization methods: performing optimization steps wit…

    Submitted 7 December, 2021; originally announced December 2021.

    Journal ref: Journal of Machine Learning Research 24(274), pp. 1-48, 2023

  47. arXiv:2111.00534  [pdf, other]

    eess.IV cs.AI cs.CV math.OC

    Focal Attention Networks: optimising attention for biomedical image segmentation

    Authors: Michael Yeung, Leonardo Rundo, Evis Sala, Carola-Bibiane Schönlieb, Guang Yang

    Abstract: In recent years, there has been increasing interest to incorporate attention into deep learning architectures for biomedical image segmentation. The modular design of attention mechanisms enables flexible integration into convolutional neural network architectures, such as the U-Net. Whether attention is appropriate to use, what type of attention to use, and where in the network to incorporate att…

    Submitted 31 October, 2021; originally announced November 2021.

  48. arXiv:2111.00533  [pdf, other]

    eess.IV cs.AI cs.CV math.OC

    Incorporating Boundary Uncertainty into loss functions for biomedical image segmentation

    Authors: Michael Yeung, Guang Yang, Evis Sala, Carola-Bibiane Schönlieb, Leonardo Rundo

    Abstract: Manual segmentation is used as the gold-standard for evaluating neural networks on automated image segmentation tasks. Due to considerable heterogeneity in shapes, colours and textures, demarcating object boundaries is particularly difficult in biomedical images, resulting in significant inter and intra-rater variability. Approaches, such as soft labelling and distance penalty term, apply a global…

    Submitted 31 October, 2021; originally announced November 2021.

  49. arXiv:2111.00528  [pdf, other]

    eess.IV cs.AI cs.CV math.OC

    Calibrating the Dice loss to handle neural network overconfidence for biomedical image segmentation

    Authors: Michael Yeung, Leonardo Rundo, Yang Nan, Evis Sala, Carola-Bibiane Schönlieb, Guang Yang

    Abstract: The Dice similarity coefficient (DSC) is both a widely used metric and loss function for biomedical image segmentation due to its robustness to class imbalance. However, it is well known that the DSC loss is poorly calibrated, resulting in overconfident predictions that cannot be usefully interpreted in biomedical and clinical practice. Performance is often the only metric used to evaluate segment…

    Submitted 1 November, 2022; v1 submitted 31 October, 2021; originally announced November 2021.

  50. arXiv:2110.10093  [pdf, other]

    eess.IV cs.CV math.OC

    Stochastic Primal-Dual Deep Unrolling

    Authors: Junqi Tang, Subhadip Mukherjee, Carola-Bibiane Schönlieb

    Abstract: We propose a new type of efficient deep-unrolling networks for solving imaging inverse problems. Conventional deep-unrolling methods require full forward operator and its adjoint across each layer, and hence can be significantly more expensive computationally as compared with other end-to-end methods that are based on post-processing of model-based reconstructions, especially for 3D image reconstr…

    Submitted 15 February, 2022; v1 submitted 19 October, 2021; originally announced October 2021.