Showing 1–18 of 18 results for author: Benning, M

  1. arXiv:2503.09630

    cs.GR

    CASteer: Steering Diffusion Models for Controllable Generation

    Authors: Tatiana Gaintseva, Chengcheng Ma, Ziquan Liu, Martin Benning, Gregory Slabaugh, Jiankang Deng, Ismail Elezi

    Abstract: Diffusion models have transformed image generation, yet controlling their outputs for diverse applications, including content moderation and creative customization, remains challenging. Existing approaches usually require task-specific training and struggle to generalize across both concrete (e.g., objects) and abstract (e.g., styles) concepts. We propose CASteer (Cross-Attention Steering) a train…

    Submitted 11 March, 2025; originally announced March 2025.

  2. arXiv:2501.19386

    stat.ME cs.CV eess.SP

    Multi-Frame Blind Manifold Deconvolution for Rotating Synthetic Aperture Imaging

    Authors: Dao Lin, Jian Zhang, Martin Benning

    Abstract: Rotating synthetic aperture (RSA) imaging system captures images of the target scene at different rotation angles by rotating a rectangular aperture. Deblurring acquired RSA images plays a critical role in reconstructing a latent sharp image underlying the scene. In the past decade, the emergence of blind convolution technology has revolutionised this field by its ability to model complex features…

    Submitted 31 January, 2025; originally announced January 2025.

    Comments: 39 pages, 9 figures

    MSC Class: 62P30

  3. arXiv:2410.23130

    eess.IV cs.CV

    Compositional Segmentation of Cardiac Images Leveraging Metadata

    Authors: Abbas Khan, Muhammad Asad, Martin Benning, Caroline Roney, Gregory Slabaugh

    Abstract: Cardiac image segmentation is essential for automated cardiac function assessment and monitoring of changes in cardiac structures over time. Inspired by coarse-to-fine approaches in image analysis, we propose a novel multitask compositional segmentation approach that can simultaneously localize the heart in a cardiac image and perform part-based segmentation of different regions of interest. We de…

    Submitted 30 October, 2024; originally announced October 2024.

    Comments: IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2025

  4. arXiv:2408.08742

    math.OC cs.CV

    A lifted Bregman strategy for training unfolded proximal neural network Gaussian denoisers

    Authors: Xiaoyu Wang, Martin Benning, Audrey Repetti

    Abstract: Unfolded proximal neural networks (PNNs) form a family of methods that combines deep learning and proximal optimization approaches. They consist in designing a neural network for a specific task by unrolling a proximal algorithm for a fixed number of iterations, where linearities can be learned from prior training procedure. PNNs have shown to be more robust than traditional deep learning approach…

    Submitted 16 August, 2024; originally announced August 2024.

    Comments: 2024 IEEE International Workshop on Machine Learning for Signal Processing, Sept. 22–25, 2024, London, UK

    MSC Class: 65K10; 68T01

  5. arXiv:2406.15035

    cs.CV

    Improving Interpretability and Robustness for the Detection of AI-Generated Images

    Authors: Tatiana Gaintseva, Laida Kushnareva, German Magai, Irina Piontkovskaya, Sergey Nikolenko, Martin Benning, Serguei Barannikov, Gregory Slabaugh

    Abstract: With growing abilities of generative models, artificial content detection becomes an increasingly important and difficult task. However, all popular approaches to this problem suffer from poor generalization across domains and generative models. In this work, we focus on the robustness of AI-generated image (AIGI) detectors. We analyze existing state-of-the-art AIGI detection methods based on froz…

    Submitted 21 June, 2024; originally announced June 2024.

  6. arXiv:2406.05786

    cs.CV

    CAMS: Convolution and Attention-Free Mamba-based Cardiac Image Segmentation

    Authors: Abbas Khan, Muhammad Asad, Martin Benning, Caroline Roney, Gregory Slabaugh

    Abstract: Convolutional Neural Networks (CNNs) and Transformer-based self-attention models have become the standard for medical image segmentation. This paper demonstrates that convolution and self-attention, while widely used, are not the only effective methods for segmentation. Breaking with convention, we present a Convolution and self-Attention-free Mamba-based semantic Segmentation Network named CAMS-N…

    Submitted 29 October, 2024; v1 submitted 9 June, 2024; originally announced June 2024.

    Comments: This paper has been accepted for the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2025

  7. arXiv:2404.16708

    eess.IV cs.CV

    Multi-view Cardiac Image Segmentation via Trans-Dimensional Priors

    Authors: Abbas Khan, Muhammad Asad, Martin Benning, Caroline Roney, Gregory Slabaugh

    Abstract: We propose a novel multi-stage trans-dimensional architecture for multi-view cardiac image segmentation. Our method exploits the relationship between long-axis (2D) and short-axis (3D) magnetic resonance (MR) images to perform a sequential 3D-to-2D-to-3D segmentation, segmenting the long-axis and short-axis images. In the first stage, 3D segmentation is performed using the short-axis image, and th…

    Submitted 25 April, 2024; originally announced April 2024.

  8. arXiv:2404.01889

    cs.CV cs.AI

    RAVE: Residual Vector Embedding for CLIP-Guided Backlit Image Enhancement

    Authors: Tatiana Gaintseva, Martin Benning, Gregory Slabaugh

    Abstract: In this paper we propose a novel modification of Contrastive Language-Image Pre-Training (CLIP) guidance for the task of unsupervised backlit image enhancement. Our work builds on the state-of-the-art CLIP-LIT approach, which learns a prompt pair by constraining the text-image similarity between a prompt (negative/positive sample) and a corresponding image (backlit image/well-lit image) in the CLI…

    Submitted 20 July, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

  9. arXiv:2402.09156

    eess.IV cs.CV

    Crop and Couple: cardiac image segmentation using interlinked specialist networks

    Authors: Abbas Khan, Muhammad Asad, Martin Benning, Caroline Roney, Gregory Slabaugh

    Abstract: Diagnosis of cardiovascular disease using automated methods often relies on the critical task of cardiac image segmentation. We propose a novel strategy that performs segmentation using specialist networks that focus on a single anatomy (left ventricle, right ventricle, or myocardium). Given an input long-axis cardiac MR image, our method performs a ternary segmentation in the first stage to ident…

    Submitted 14 February, 2024; originally announced February 2024.

  10. arXiv:2303.01965

    math.NA cs.LG

    A Lifted Bregman Formulation for the Inversion of Deep Neural Networks

    Authors: Xiaoyu Wang, Martin Benning

    Abstract: We propose a novel framework for the regularised inversion of deep neural networks. The framework is based on the authors' recent work on training feed-forward neural networks without the differentiation of activation functions. The framework lifts the parameter space into a higher dimensional space by introducing auxiliary variables, and penalises these variables with tailored Bregman distances.…

    Submitted 1 March, 2023; originally announced March 2023.

    Comments: 21 pages, 9 figures

    MSC Class: 47A52; 47J30; 65J20; 65J22; 65K10; 68T07; 94A08

  11. arXiv:2212.07786

    math.NA cs.CV cs.LG eess.IV

    Convergent Data-driven Regularizations for CT Reconstruction

    Authors: Samira Kabri, Alexander Auras, Danilo Riccio, Hartmut Bauermeister, Martin Benning, Michael Moeller, Martin Burger

    Abstract: The reconstruction of images from their corresponding noisy Radon transform is a typical example of an ill-posed linear inverse problem as arising in the application of computerized tomography (CT). As the (naive) solution does not depend on the measured data continuously, regularization is needed to re-establish a continuous dependence. In this work, we investigate simple, but yet still provably…

    Submitted 15 December, 2023; v1 submitted 14 December, 2022; originally announced December 2022.

  12. arXiv:2208.08772

    math.OC cs.LG

    Lifted Bregman Training of Neural Networks

    Authors: Xiaoyu Wang, Martin Benning

    Abstract: We introduce a novel mathematical formulation for the training of feed-forward neural networks with (potentially non-smooth) proximal maps as activation functions. This formulation is based on Bregman distances and a key advantage is that its partial derivatives with respect to the network's parameters do not require the computation of derivatives of the network's activation functions. Instead of…

    Submitted 18 August, 2022; originally announced August 2022.

    Comments: 48 pages, 16 figures

    MSC Class: 47A52; 65K10; 68T01; 68W15

  13. arXiv:2109.02096

    cs.SD cs.AI cs.CV cs.LG eess.AS

    Timbre Transfer with Variational Auto Encoding and Cycle-Consistent Adversarial Networks

    Authors: Russell Sammut Bonnici, Charalampos Saitis, Martin Benning

    Abstract: This research project investigates the application of deep learning to timbre transfer, where the timbre of a source audio can be converted to the timbre of a target audio with minimal loss in quality. The adopted approach combines Variational Autoencoders with Generative Adversarial Networks to construct meaningful representations of the source audio and produce realistic generations of the targe…

    Submitted 10 October, 2021; v1 submitted 5 September, 2021; originally announced September 2021.

    Comments: 12 pages, 3 main figures, 4 tables

  14. arXiv:2012.03642

    cs.LG math.OC

    Generalised Perceptron Learning

    Authors: Xiaoyu Wang, Martin Benning

    Abstract: We present a generalisation of Rosenblatt's traditional perceptron learning algorithm to the class of proximal activation functions and demonstrate how this generalisation can be interpreted as an incremental gradient method applied to a novel energy function. This novel energy function is based on a generalised Bregman distance, for which the gradient with respect to the weights and biases does n…

    Submitted 7 December, 2020; originally announced December 2020.

    Comments: 8 pages, 2 figures, accepted at the 12th Annual Workshop on Optimization for Machine Learning

    MSC Class: 68T07; 65K05; 49M37; 90C30

  15. arXiv:1906.08754

    eess.IV cs.CV math.NA math.OC

    Learning the Sampling Pattern for MRI

    Authors: Ferdia Sherry, Martin Benning, Juan Carlos De los Reyes, Martin J. Graves, Georg Maierhofer, Guy Williams, Carola-Bibiane Schönlieb, Matthias J. Ehrhardt

    Abstract: The discovery of the theory of compressed sensing brought the realisation that many inverse problems can be solved even when measurements are "incomplete". This is particularly interesting in magnetic resonance imaging (MRI), where long acquisition times can limit its use. In this work, we consider the problem of learning a sparse sampling pattern that can be used to optimally balance acquisition…

    Submitted 21 June, 2020; v1 submitted 20 June, 2019; originally announced June 2019.

    Comments: The main document is 12 pages, the supporting document is 2 pages and attached at the end of the main document

  16. arXiv:1904.05657

    math.OC cs.LG math.NA

    Deep learning as optimal control problems: models and numerical methods

    Authors: Martin Benning, Elena Celledoni, Matthias J. Ehrhardt, Brynjulf Owren, Carola-Bibiane Schönlieb

    Abstract: We consider recent work of Haber and Ruthotto 2017 and Chang et al. 2018, where deep learning neural networks have been interpreted as discretisations of an optimal control problem subject to an ordinary differential equation constraint. We review the first order conditions for optimality, and the conditions ensuring optimality after discretisation. This leads to a class of algorithms for solving…

    Submitted 30 September, 2019; v1 submitted 11 April, 2019; originally announced April 2019.

  17. arXiv:1703.08001

    cs.CV math.NA

    Nonlinear Spectral Image Fusion

    Authors: Martin Benning, Michael Möller, Raz Z. Nossek, Martin Burger, Daniel Cremers, Guy Gilboa, Carola-Bibiane Schönlieb

    Abstract: In this paper we demonstrate that the framework of nonlinear spectral decompositions based on total variation (TV) regularization is very well suited for image fusion as well as more general image manipulation tasks. The well-localized and edge-preserving spectral TV decomposition allows to select frequencies of a certain image to transfer particular features, such as wrinkles in a face, from one…

    Submitted 23 March, 2017; originally announced March 2017.

    Comments: 13 pages, 9 figures, submitted to SSVM conference proceedings 2017

    MSC Class: 35P30; 62H35; 65M70; 94A08

    ACM Class: G.1.3; G.1.6; G.1.8; I.4.0; I.4.5

  18. Variational Depth from Focus Reconstruction

    Authors: Michael Moeller, Martin Benning, Carola Schönlieb, Daniel Cremers

    Abstract: This paper deals with the problem of reconstructing a depth map from a sequence of differently focused images, also known as depth from focus or shape from focus. We propose to state the depth from focus problem as a variational problem including a smooth but nonconvex data fidelity term, and a convex nonsmooth regularization, which makes the method robust to noise and leads to more realistic dept…

    Submitted 5 November, 2014; v1 submitted 1 August, 2014; originally announced August 2014.