Skip to main content

Showing 1–50 of 56 results for author: Aviles-Rivero, A I

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.00627  [pdf, other

    cs.CV

    Brain Foundation Models with Hypergraph Dynamic Adapter for Brain Disease Analysis

    Authors: Zhongying Deng, Haoyu Wang, Ziyan Huang, Lipei Zhang, Angelica I. Aviles-Rivero, Chaoyu Liu, Junjun He, Zoe Kourtzi, Carola-Bibiane Schönlieb

    Abstract: Brain diseases, such as Alzheimer's disease and brain tumors, present profound challenges due to their complexity and societal impact. Recent advancements in brain foundation models have shown significant promise in addressing a range of brain-related tasks. However, current brain foundation models are limited by task and data homogeneity, restricted generalization beyond segmentation or classific… ▽ More

    Submitted 1 May, 2025; originally announced May 2025.

    Comments: 35 pages, 4 figures

  2. arXiv:2504.18520  [pdf, other

    eess.IV cs.CV

    RSFR: A Coarse-to-Fine Reconstruction Framework for Diffusion Tensor Cardiac MRI with Semantic-Aware Refinement

    Authors: Jiahao Huang, Fanwen Wang, Pedro F. Ferreira, Haosen Zhang, Yinzhe Wu, Zhifan Gao, Lei Zhu, Angelica I. Aviles-Rivero, Carola-Bibiane Schonlieb, Andrew D. Scott, Zohya Khalique, Maria Dwornik, Ramyah Rajakulasingam, Ranil De Silva, Dudley J. Pennell, Guang Yang, Sonia Nielles-Vallespin

    Abstract: Cardiac diffusion tensor imaging (DTI) offers unique insights into cardiomyocyte arrangements, bridging the gap between microscopic and macroscopic cardiac function. However, its clinical utility is limited by technical challenges, including a low signal-to-noise ratio, aliasing artefacts, and the need for accurate quantitative fidelity. To address these limitations, we introduce RSFR (Reconstruct… ▽ More

    Submitted 25 April, 2025; originally announced April 2025.

  3. arXiv:2503.03141  [pdf, other

    eess.IV cs.CV cs.LG

    Implicit U-KAN2.0: Dynamic, Efficient and Interpretable Medical Image Segmentation

    Authors: Chun-Wun Cheng, Yining Zhao, Yanqi Cheng, Javier Montoya, Carola-Bibiane Schönlieb, Angelica I Aviles-Rivero

    Abstract: Image segmentation is a fundamental task in both image analysis and medical applications. State-of-the-art methods predominantly rely on encoder-decoder architectures with a U-shaped design, commonly referred to as U-Net. Recent advancements integrating transformers and MLPs improve performance but still face key limitations, such as poor interpretability, difficulty handling intrinsic noise, and… ▽ More

    Submitted 4 March, 2025; originally announced March 2025.

  4. arXiv:2502.19159  [pdf, other

    cs.CV

    A Sliding Layer Merging Method for Efficient Depth-Wise Pruning in LLMs

    Authors: Xuan Ding, Rui Sun, Yunjian Zhang, Xiu Yan, Yueqi Zhou, Kaihao Huang, Suzhong Fu, Angelica I Aviles-Rivero, Chuanlong Xie, Yao Zhu

    Abstract: Compared to width-wise pruning, depth-wise pruning can significantly accelerate inference in resource-constrained scenarios. However, treating the entire Transformer layer as the minimum pruning unit may degrade model performance by indiscriminately discarding the entire information of the layer. This paper reveals the ``Patch-like'' feature relationship between layers in large language models by… ▽ More

    Submitted 15 May, 2025; v1 submitted 26 February, 2025; originally announced February 2025.

  5. arXiv:2502.16890  [pdf, other

    cs.LG cs.AI

    ReFocus: Reinforcing Mid-Frequency and Key-Frequency Modeling for Multivariate Time Series Forecasting

    Authors: Guoqi Yu, Yaoming Li, Juncheng Wang, Xiaoyu Guo, Angelica I. Aviles-Rivero, Tong Yang, Shujun Wang

    Abstract: Recent advancements have progressively incorporated frequency-based techniques into deep learning models, leading to notable improvements in accuracy and efficiency for time series analysis tasks. However, the Mid-Frequency Spectrum Gap in the real-world time series, where the energy is concentrated at the low-frequency region while the middle-frequency band is negligible, hinders the ability of e… ▽ More

    Submitted 3 March, 2025; v1 submitted 24 February, 2025; originally announced February 2025.

    Comments: Under Review

  6. arXiv:2412.15813  [pdf, other

    cs.CV

    Cross-Modal Few-Shot Learning with Second-Order Neural Ordinary Differential Equations

    Authors: Yi Zhang, Chun-Wun Cheng, Junyi He, Zhihai He, Carola-Bibiane Schönlieb, Yuyan Chen, Angelica I Aviles-Rivero

    Abstract: We introduce SONO, a novel method leveraging Second-Order Neural Ordinary Differential Equations (Second-Order NODEs) to enhance cross-modal few-shot learning. By employing a simple yet effective architecture consisting of a Second-Order NODEs model paired with a cross-modal classifier, SONO addresses the significant challenge of overfitting, which is common in few-shot scenarios due to limited tr… ▽ More

    Submitted 20 December, 2024; originally announced December 2024.

  7. arXiv:2412.06204  [pdf, other

    cs.CV

    You KAN Do It in a Single Shot: Plug-and-Play Methods with Single-Instance Priors

    Authors: Yanqi Cheng, Carola-Bibiane Schönlieb, Angelica I Aviles-Rivero

    Abstract: The use of Plug-and-Play (PnP) methods has become a central approach for solving inverse problems, with denoisers serving as regularising priors that guide optimisation towards a clean solution. In this work, we introduce KAN-PnP, an optimisation framework that incorporates Kolmogorov-Arnold Networks (KANs) as denoisers within the Plug-and-Play (PnP) paradigm. KAN-PnP is specifically designed to s… ▽ More

    Submitted 2 May, 2025; v1 submitted 8 December, 2024; originally announced December 2024.

  8. arXiv:2411.03688  [pdf, other

    cs.CV

    Where Do We Stand with Implicit Neural Representations? A Technical and Performance Survey

    Authors: Amer Essakine, Yanqi Cheng, Chun-Wun Cheng, Lipei Zhang, Zhongying Deng, Lei Zhu, Carola-Bibiane Schönlieb, Angelica I Aviles-Rivero

    Abstract: Implicit Neural Representations (INRs) have emerged as a paradigm in knowledge representation, offering exceptional flexibility and performance across a diverse range of applications. INRs leverage multilayer perceptrons (MLPs) to model data as continuous implicit functions, providing critical advantages such as resolution independence, memory efficiency, and generalisation beyond discretised data… ▽ More

    Submitted 18 February, 2025; v1 submitted 6 November, 2024; originally announced November 2024.

    Journal ref: Published in Transactions on Machine Learning Research, 2025

  9. arXiv:2410.07901  [pdf, other

    cs.CV

    Semi-Supervised Video Desnowing Network via Temporal Decoupling Experts and Distribution-Driven Contrastive Regularization

    Authors: Hongtao Wu, Yijun Yang, Angelica I Aviles-Rivero, Jingjing Ren, Sixiang Chen, Haoyu Chen, Lei Zhu

    Abstract: Snow degradations present formidable challenges to the advancement of computer vision tasks by the undesirable corruption in outdoor scenarios. While current deep learning-based desnowing approaches achieve success on synthetic benchmark datasets, they struggle to restore out-of-distribution real-world snowy videos due to the deficiency of paired real-world training data. To address this bottlenec… ▽ More

    Submitted 10 October, 2024; originally announced October 2024.

  10. arXiv:2410.02113  [pdf, other

    cs.LG math.NA

    Mamba Neural Operator: Who Wins? Transformers vs. State-Space Models for PDEs

    Authors: Chun-Wun Cheng, Jiahao Huang, Yi Zhang, Guang Yang, Carola-Bibiane Schönlieb, Angelica I Aviles-Rivero

    Abstract: Partial differential equations (PDEs) are widely used to model complex physical systems, but solving them efficiently remains a significant challenge. Recently, Transformers have emerged as the preferred architecture for PDEs due to their ability to capture intricate dependencies. However, they struggle with representing continuous dynamics and long-range interactions. To overcome these limitation… ▽ More

    Submitted 9 April, 2025; v1 submitted 2 October, 2024; originally announced October 2024.

  11. arXiv:2409.01544  [pdf, other

    eess.IV cs.CV

    Learning Task-Specific Sampling Strategy for Sparse-View CT Reconstruction

    Authors: Liutao Yang, Jiahao Huang, Yingying Fang, Angelica I Aviles-Rivero, Carola-Bibiane Schonlieb, Daoqiang Zhang, Guang Yang

    Abstract: Sparse-View Computed Tomography (SVCT) offers low-dose and fast imaging but suffers from severe artifacts. Optimizing the sampling strategy is an essential approach to improving the imaging quality of SVCT. However, current methods typically optimize a universal sampling strategy for all types of scans, overlooking the fact that the optimal strategy may vary depending on the specific scanning task… ▽ More

    Submitted 2 September, 2024; originally announced September 2024.

  12. arXiv:2407.08672  [pdf, other

    cs.CV

    NODE-Adapter: Neural Ordinary Differential Equations for Better Vision-Language Reasoning

    Authors: Yi Zhang, Chun-Wun Cheng, Ke Yu, Zhihai He, Carola-Bibiane Schönlieb, Angelica I. Aviles-Rivero

    Abstract: In this paper, we consider the problem of prototype-based vision-language reasoning problem. We observe that existing methods encounter three major challenges: 1) escalating resource demands and prolonging training times, 2) contending with excessive learnable parameters, and 3) fine-tuning based only on a single modality. These challenges will hinder their capability to adapt Vision-Language Mode… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

  13. arXiv:2407.05703  [pdf, other

    cs.CV

    LGRNet: Local-Global Reciprocal Network for Uterine Fibroid Segmentation in Ultrasound Videos

    Authors: Huihui Xu, Yijun Yang, Angelica I Aviles-Rivero, Guang Yang, Jing Qin, Lei Zhu

    Abstract: Regular screening and early discovery of uterine fibroid are crucial for preventing potential malignant transformations and ensuring timely, life-saving interventions. To this end, we collect and annotate the first ultrasound video dataset with 100 videos for uterine fibroid segmentation (UFUV). We also present Local-Global Reciprocal Network (LGRNet) to efficiently and effectively propagate the l… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: MICCAI2024 Early Accept

  14. arXiv:2406.02287  [pdf, other

    cs.CV

    Optimised ProPainter for Video Diminished Reality Inpainting

    Authors: Pengze Li, Lihao Liu, Carola-Bibiane Schönlieb, Angelica I Aviles-Rivero

    Abstract: In this paper, part of the DREAMING Challenge - Diminished Reality for Emerging Applications in Medicine through Inpainting, we introduce a refined video inpainting technique optimised from the ProPainter method to meet the specialised demands of medical imaging, specifically in the context of oral and maxillofacial surgery. Our enhanced algorithm employs the zero-shot ProPainter, featuring optimi… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted to ISBI 2024

  15. arXiv:2405.17659  [pdf, other

    eess.IV cs.CV

    Enhancing Global Sensitivity and Uncertainty Quantification in Medical Image Reconstruction with Monte Carlo Arbitrary-Masked Mamba

    Authors: Jiahao Huang, Liutao Yang, Fanwen Wang, Yang Nan, Weiwen Wu, Chengyan Wang, Kuangyu Shi, Angelica I. Aviles-Rivero, Carola-Bibiane Schönlieb, Daoqiang Zhang, Guang Yang

    Abstract: Deep learning has been extensively applied in medical image reconstruction, where Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) represent the predominant paradigms, each possessing distinct advantages and inherent limitations: CNNs exhibit linear complexity with local sensitivity, whereas ViTs demonstrate quadratic complexity with global sensitivity. The emerging Mamba has sh… ▽ More

    Submitted 25 June, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

  16. arXiv:2405.14338  [pdf, other

    cs.CV

    MAMBA4D: Efficient Long-Sequence Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models

    Authors: Jiuming Liu, Jinru Han, Lihao Liu, Angelica I. Aviles-Rivero, Chaokang Jiang, Zhe Liu, Hesheng Wang

    Abstract: Point cloud videos can faithfully capture real-world spatial geometries and temporal dynamics, which are essential for enabling intelligent agents to understand the dynamically changing world. However, designing an effective 4D backbone remains challenging, mainly due to the irregular and unordered distribution of points and temporal inconsistencies across frames. Also, recent transformer-based 4D… ▽ More

    Submitted 26 February, 2025; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: Accepted by CVPR 2025. The first two authors contribute equally

  17. arXiv:2403.12719  [pdf, other

    cs.LG

    Bilevel Hypergraph Networks for Multi-Modal Alzheimer's Diagnosis

    Authors: Angelica I. Aviles-Rivero, Chun-Wun Cheng, Zhongying Deng, Zoe Kourtzi, Carola-Bibiane Schönlieb

    Abstract: Early detection of Alzheimer's disease's precursor stages is imperative for significantly enhancing patient outcomes and quality of life. This challenge is tackled through a semi-supervised multi-modal diagnosis framework. In particular, we introduce a new hypergraph framework that enables higher-order relations between multi-modal data, while utilising minimal labels. We first introduce a bilevel… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  18. arXiv:2403.09136  [pdf, other

    eess.IV cs.CV

    Biophysics Informed Pathological Regularisation for Brain Tumour Segmentation

    Authors: Lipei Zhang, Yanqi Cheng, Lihao Liu, Carola-Bibiane Schönlieb, Angelica I Aviles-Rivero

    Abstract: Recent advances in deep learning have significantly improved brain tumour segmentation techniques; however, the results still lack confidence and robustness as they solely consider image data without biophysical priors or pathological information. Integrating biophysics-informed regularisation is one effective way to change this situation, as it provides an prior regularisation for automated end-t… ▽ More

    Submitted 8 October, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

    Comments: 11 pages, 4 figures and 1 table. Accepted by MICCAI2024

  19. arXiv:2403.07684  [pdf, other

    cs.CV

    Genuine Knowledge from Practice: Diffusion Test-Time Adaptation for Video Adverse Weather Removal

    Authors: Yijun Yang, Hongtao Wu, Angelica I. Aviles-Rivero, Yulun Zhang, Jing Qin, Lei Zhu

    Abstract: Real-world vision tasks frequently suffer from the appearance of unexpected adverse weather conditions, including rain, haze, snow, and raindrops. In the last decade, convolutional neural networks and vision transformers have yielded outstanding results in single-weather video removal. However, due to the absence of appropriate adaptation, most of them fail to generalize to other weather condition… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  20. arXiv:2402.18451  [pdf, other

    eess.IV cs.CV

    MambaMIR: An Arbitrary-Masked Mamba for Joint Medical Image Reconstruction and Uncertainty Estimation

    Authors: Jiahao Huang, Liutao Yang, Fanwen Wang, Yang Nan, Angelica I. Aviles-Rivero, Carola-Bibiane Schönlieb, Daoqiang Zhang, Guang Yang

    Abstract: The recent Mamba model has shown remarkable adaptability for visual representation learning, including in medical imaging tasks. This study introduces MambaMIR, a Mamba-based model for medical image reconstruction, as well as its Generative Adversarial Network-based variant, MambaMIR-GAN. Our proposed MambaMIR inherits several advantages, such as linear complexity, global receptive fields, and dyn… ▽ More

    Submitted 25 June, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

  21. arXiv:2402.12694  [pdf, other

    cs.LG

    Revitalizing Multivariate Time Series Forecasting: Learnable Decomposition with Inter-Series Dependencies and Intra-Series Variations Modeling

    Authors: Guoqi Yu, Jing Zou, Xiaowei Hu, Angelica I. Aviles-Rivero, Jing Qin, Shujun Wang

    Abstract: Predicting multivariate time series is crucial, demanding precise modeling of intricate patterns, including inter-series dependencies and intra-series variations. Distinctive trend characteristics in each time series pose challenges, and existing methods, relying on basic moving average kernels, may struggle with the non-linear structure and complex trends in real-world data. Given that, we introd… ▽ More

    Submitted 5 July, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

  22. arXiv:2311.13682  [pdf, other

    cs.CV eess.IV

    Single-Shot Plug-and-Play Methods for Inverse Problems

    Authors: Yanqi Cheng, Lipei Zhang, Zhenda Shen, Shujun Wang, Lequan Yu, Raymond H. Chan, Carola-Bibiane Schönlieb, Angelica I Aviles-Rivero

    Abstract: The utilisation of Plug-and-Play (PnP) priors in inverse problems has become increasingly prominent in recent years. This preference is based on the mathematical equivalence between the general proximal operator and the regularised denoiser, facilitating the adaptation of various off-the-shelf denoiser priors to a wide range of inverse problems. However, existing PnP models predominantly rely on p… ▽ More

    Submitted 11 November, 2024; v1 submitted 22 November, 2023; originally announced November 2023.

    Journal ref: Published in Transactions on Machine Learning Research, 2024

  23. arXiv:2311.13610  [pdf, other

    cs.CV eess.IV

    TRIDENT: The Nonlinear Trilogy for Implicit Neural Representations

    Authors: Zhenda Shen, Yanqi Cheng, Raymond H. Chan, Pietro Liò, Carola-Bibiane Schönlieb, Angelica I Aviles-Rivero

    Abstract: Implicit neural representations (INRs) have garnered significant interest recently for their ability to model complex, high-dimensional data without explicit parameterisation. In this work, we introduce TRIDENT, a novel function for implicit neural representations characterised by a trilogy of nonlinearities. Firstly, it is designed to represent high-order features through order compactness. Secon… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

  24. arXiv:2311.10092  [pdf, other

    cs.CV

    Traffic Video Object Detection using Motion Prior

    Authors: Lihao Liu, Yanqi Cheng, Dongdong Chen, Jing He, Pietro Liò, Carola-Bibiane Schönlieb, Angelica I Aviles-Rivero

    Abstract: Traffic videos inherently differ from generic videos in their stationary camera setup, thus providing a strong motion prior where objects often move in a specific direction over a short time interval. Existing works predominantly employ generic video object detection framework for traffic video object detection, which yield certain advantages such as broad applicability and robustness to diverse s… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: 11 pages, 4 figures

  25. arXiv:2310.20092  [pdf, other

    cs.LG cs.CV

    The Missing U for Efficient Diffusion Models

    Authors: Sergio Calvo-Ordonez, Chun-Wun Cheng, Jiahao Huang, Lipei Zhang, Guang Yang, Carola-Bibiane Schonlieb, Angelica I Aviles-Rivero

    Abstract: Diffusion Probabilistic Models stand as a critical tool in generative modelling, enabling the generation of complex data distributions. This family of generative models yields record-breaking performance in tasks such as image synthesis, video generation, and molecule design. Despite their capabilities, their efficiency, especially in the reverse process, remains a challenge due to slow convergenc… ▽ More

    Submitted 5 April, 2024; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: 23 pages, 14 figures, Accepted at Transactions of Machine Learning Research (04/2024)

  26. arXiv:2309.13700  [pdf, other

    cs.CV

    Video Adverse-Weather-Component Suppression Network via Weather Messenger and Adversarial Backpropagation

    Authors: Yijun Yang, Angelica I. Aviles-Rivero, Huazhu Fu, Ye Liu, Weiming Wang, Lei Zhu

    Abstract: Although convolutional neural networks (CNNs) have been proposed to remove adverse weather conditions in single images using a single set of pre-trained weights, they fail to restore weather videos due to the absence of temporal information. Furthermore, existing methods for removing adverse weather conditions (e.g., rain, fog, and snow) from videos can only handle one type of adverse weather. In… ▽ More

    Submitted 24 September, 2023; originally announced September 2023.

  27. arXiv:2308.01057  [pdf, other

    cs.CV

    MammoDG: Generalisable Deep Learning Breaks the Limits of Cross-Domain Multi-Center Breast Cancer Screening

    Authors: Yijun Yang, Shujun Wang, Lihao Liu, Sarah Hickman, Fiona J Gilbert, Carola-Bibiane Schönlieb, Angelica I. Aviles-Rivero

    Abstract: Breast cancer is a major cause of cancer death among women, emphasising the importance of early detection for improved treatment outcomes and quality of life. Mammography, the primary diagnostic imaging test, poses challenges due to the high variability and patterns in mammograms. Double reading of mammograms is recommended in many screening programs to improve diagnostic accuracy but increases ra… ▽ More

    Submitted 2 August, 2023; originally announced August 2023.

  28. arXiv:2304.00996  [pdf, other

    physics.med-ph cs.CV eess.IV

    Deep Learning-based Diffusion Tensor Cardiac Magnetic Resonance Reconstruction: A Comparison Study

    Authors: Jiahao Huang, Pedro F. Ferreira, Lichao Wang, Yinzhe Wu, Angelica I. Aviles-Rivero, Carola-Bibiane Schonlieb, Andrew D. Scott, Zohya Khalique, Maria Dwornik, Ramyah Rajakulasingam, Ranil De Silva, Dudley J. Pennell, Sonia Nielles-Vallespin, Guang Yang

    Abstract: In vivo cardiac diffusion tensor imaging (cDTI) is a promising Magnetic Resonance Imaging (MRI) technique for evaluating the micro-structure of myocardial tissue in the living heart, providing insights into cardiac function and enabling the development of innovative therapeutic strategies. However, the integration of cDTI into routine clinical practice is challenging due to the technical obstacles… ▽ More

    Submitted 4 April, 2023; v1 submitted 31 March, 2023; originally announced April 2023.

    Comments: 15 pages, 8 figures

  29. arXiv:2303.10610  [pdf, other

    cs.CV

    DiffMIC: Dual-Guidance Diffusion Network for Medical Image Classification

    Authors: Yijun Yang, Huazhu Fu, Angelica I. Aviles-Rivero, Carola-Bibiane Schönlieb, Lei Zhu

    Abstract: Diffusion Probabilistic Models have recently shown remarkable performance in generative image modeling, attracting significant attention in the computer vision community. However, while a substantial amount of diffusion-based research has focused on generative tasks, few studies have applied diffusion models to general medical image classification. In this paper, we propose the first diffusion-bas… ▽ More

    Submitted 11 July, 2023; v1 submitted 19 March, 2023; originally announced March 2023.

  30. arXiv:2303.10390  [pdf, other

    cs.CV

    HGIB: Prognosis for Alzheimer's Disease via Hypergraph Information Bottleneck

    Authors: Shujun Wang, Angelica I Aviles-Rivero, Zoe Kourtzi, Carola-Bibiane Schönlieb

    Abstract: Alzheimer's disease prognosis is critical for early Mild Cognitive Impairment patients for timely treatment to improve the patient's quality of life. Whilst existing prognosis techniques demonstrate potential results, they are highly limited in terms of using a single modality. Most importantly, they fail in considering a key element for prognosis: not all features extracted at the current moment… ▽ More

    Submitted 18 March, 2023; originally announced March 2023.

  31. arXiv:2303.08113  [pdf, other

    eess.IV cs.CV

    Learning Homeomorphic Image Registration via Conformal-Invariant Hyperelastic Regularisation

    Authors: Jing Zou, Noémie Debroux, Lihao Liu, Jing Qin, Carola-Bibiane Schönlieb, Angelica I Aviles-Rivero

    Abstract: Deformable image registration is a fundamental task in medical image analysis and plays a crucial role in a wide range of clinical applications. Recently, deep learning-based approaches have been widely studied for deformable medical image registration and achieved promising results. However, existing deep learning image registration techniques do not theoretically guarantee topology-preserving tr… ▽ More

    Submitted 30 June, 2023; v1 submitted 14 March, 2023; originally announced March 2023.

    Comments: 13 pages, 3 figures

  32. arXiv:2303.06274  [pdf

    cs.CV cs.LG

    CoNIC Challenge: Pushing the Frontiers of Nuclear Detection, Segmentation, Classification and Counting

    Authors: Simon Graham, Quoc Dang Vu, Mostafa Jahanifar, Martin Weigert, Uwe Schmidt, Wenhua Zhang, Jun Zhang, Sen Yang, Jinxi Xiang, Xiyue Wang, Josef Lorenz Rumberger, Elias Baumann, Peter Hirsch, Lihao Liu, Chenyang Hong, Angelica I. Aviles-Rivero, Ayushi Jain, Heeyoung Ahn, Yiyu Hong, Hussam Azzuni, Min Xu, Mohammad Yaqub, Marie-Claire Blache, Benoît Piégu, Bertrand Vernay , et al. (64 additional authors not shown)

    Abstract: Nuclear detection, segmentation and morphometric profiling are essential in helping us further understand the relationship between histology and patient outcome. To drive innovation in this area, we setup a community-wide challenge using the largest available dataset of its kind to assess nuclear segmentation and cellular composition. Our challenge, named CoNIC, stimulated the development of repro… ▽ More

    Submitted 14 March, 2023; v1 submitted 10 March, 2023; originally announced March 2023.

  33. arXiv:2302.00626  [pdf, other

    cs.CV eess.IV

    Continuous U-Net: Faster, Greater and Noiseless

    Authors: Chun-Wun Cheng, Christina Runkel, Lihao Liu, Raymond H Chan, Carola-Bibiane Schönlieb, Angelica I Aviles-Rivero

    Abstract: Image segmentation is a fundamental task in image analysis and clinical practice. The current state-of-the-art techniques are based on U-shape type encoder-decoder networks with skip connections, called U-Net. Despite the powerful performance reported by existing U-Net type networks, they suffer from several major limitations. Issues include the hard coding of the receptive field size, compromisin… ▽ More

    Submitted 1 February, 2023; originally announced February 2023.

  34. arXiv:2211.09620  [pdf, other

    cs.CV

    TrafficCAM: A Versatile Dataset for Traffic Flow Segmentation

    Authors: Zhongying Deng, Yanqi Chen, Lihao Liu, Shujun Wang, Rihuan Ke, Carola-Bibiane Schonlieb, Angelica I Aviles-Rivero

    Abstract: Traffic flow analysis is revolutionising traffic management. Qualifying traffic flow data, traffic control bureaus could provide drivers with real-time alerts, advising the fastest routes and therefore optimising transportation logistics and reducing congestion. The existing traffic flow datasets have two major limitations. They feature a limited number of classes, usually limited to one type of v… ▽ More

    Submitted 17 November, 2022; originally announced November 2022.

  35. arXiv:2211.09593  [pdf, other

    cs.CV

    NorMatch: Matching Normalizing Flows with Discriminative Classifiers for Semi-Supervised Learning

    Authors: Zhongying Deng, Rihuan Ke, Carola-Bibiane Schonlieb, Angelica I Aviles-Rivero

    Abstract: Semi-Supervised Learning (SSL) aims to learn a model using a tiny labeled set and massive amounts of unlabeled data. To better exploit the unlabeled data the latest SSL methods use pseudo-labels predicted from a single discriminative classifier. However, the generated pseudo-labels are inevitably linked to inherent confirmation bias and noise which greatly affects the model performance. In this wo… ▽ More

    Submitted 16 February, 2024; v1 submitted 17 November, 2022; originally announced November 2022.

    Comments: Accepted to Transactions on Machine Learning Research

  36. arXiv:2211.06885  [pdf, other

    cs.CV

    SCOTCH and SODA: A Transformer Video Shadow Detection Framework

    Authors: Lihao Liu, Jean Prost, Lei Zhu, Nicolas Papadakis, Pietro Liò, Carola-Bibiane Schönlieb, Angelica I Aviles-Rivero

    Abstract: Shadows in videos are difficult to detect because of the large shadow deformation between frames. In this work, we argue that accounting for shadow deformation is essential when designing a video shadow detection method. To this end, we introduce the shadow deformation attention trajectory (SODA), a new type of video self-attention module, specially designed to handle the large shadow deformations… ▽ More

    Submitted 26 March, 2023; v1 submitted 13 November, 2022; originally announced November 2022.

    Comments: Accepted to CVPR 2023

  37. arXiv:2209.08647  [pdf, other

    cs.CV

    Why Deep Surgical Models Fail?: Revisiting Surgical Action Triplet Recognition through the Lens of Robustness

    Authors: Yanqi Cheng, Lihao Liu, Shujun Wang, Yueming Jin, Carola-Bibiane Schönlieb, Angelica I. Aviles-Rivero

    Abstract: Surgical action triplet recognition provides a better understanding of the surgical scene. This task is of high relevance as it provides the surgeon with context-aware support and safety. The current go-to strategy for improving performance is the development of new network mechanisms. However, the performance of current state-of-the-art techniques is substantially lower than other surgical tasks.… ▽ More

    Submitted 20 February, 2023; v1 submitted 18 September, 2022; originally announced September 2022.

  38. arXiv:2204.02399  [pdf, other

    cs.LG cs.CV eess.IV

    Multi-Modal Hypergraph Diffusion Network with Dual Prior for Alzheimer Classification

    Authors: Angelica I. Aviles-Rivero, Christina Runkel, Nicolas Papadakis, Zoe Kourtzi, Carola-Bibiane Schönlieb

    Abstract: The automatic early diagnosis of prodromal stages of Alzheimer's disease is of great relevance for patient treatment to improve quality of life. We address this problem as a multi-modal classification task. Multi-modal data provides richer and complementary information. However, existing techniques only consider either lower order relations between the data and single/multi-modal imaging data. In… ▽ More

    Submitted 6 September, 2022; v1 submitted 4 April, 2022; originally announced April 2022.

    Journal ref: MICCAI 2022

  39. arXiv:2203.05684  [pdf, other

    cs.CV

    PC-SwinMorph: Patch Representation for Unsupervised Medical Image Registration and Segmentation

    Authors: Lihao Liu, Zhening Huang, Pietro Liò, Carola-Bibiane Schönlieb, Angelica I. Aviles-Rivero

    Abstract: Medical image registration and segmentation are critical tasks for several clinical procedures. Manual realisation of those tasks is time-consuming and the quality is highly dependent on the level of expertise of the physician. To mitigate that laborious task, automatic tools have been developed where the majority of solutions are supervised techniques. However, in medical domain, the strong assum… ▽ More

    Submitted 20 July, 2022; v1 submitted 10 March, 2022; originally announced March 2022.

    Comments: 10 pages, 7 figures, 2 tables

  40. arXiv:2203.00157  [pdf, other

    cs.CV

    Simultaneous Semantic and Instance Segmentation for Colon Nuclei Identification and Counting

    Authors: Lihao Liu, Chenyang Hong, Angelica I. Aviles-Rivero, Carola-Bibiane Schönlieb

    Abstract: We address the problem of automated nuclear segmentation, classification, and quantification from Haematoxylin and Eosin stained histology images, which is of great relevance for several downstream computational pathology applications. In this work, we present a solution framed as a simultaneous semantic and instance segmentation framework. Our solution is part of the Colon Nuclei Identification a… ▽ More

    Submitted 15 April, 2022; v1 submitted 28 February, 2022; originally announced March 2022.

    Comments: 9 pages; 4 figures

  41. arXiv:2107.10014  [pdf, other

    stat.ML cs.LG math.PR

    Delving Into Deep Walkers: A Convergence Analysis of Random-Walk-Based Vertex Embeddings

    Authors: Dominik Kloepfer, Angelica I. Aviles-Rivero, Daniel Heydecker

    Abstract: Graph vertex embeddings based on random walks have become increasingly influential in recent years, showing good performance in several tasks as they efficiently transform a graph into a more computationally digestible format while preserving relevant information. However, the theoretical properties of such algorithms, in particular the influence of hyperparameters and of the graph structure on th… ▽ More

    Submitted 21 July, 2021; originally announced July 2021.

  42. LaplaceNet: A Hybrid Graph-Energy Neural Network for Deep Semi-Supervised Classification

    Authors: Philip Sellars, Angelica I. Aviles-Rivero, Carola-Bibiane Schönlieb

    Abstract: Semi-supervised learning has received a lot of recent attention as it alleviates the need for large amounts of labelled data which can often be expensive, requires expert knowledge and be time consuming to collect. Recent developments in deep semi-supervised classification have reached unprecedented performance and the gap between supervised and semi-supervised learning is ever-decreasing. This im… ▽ More

    Submitted 28 September, 2022; v1 submitted 8 June, 2021; originally announced June 2021.

    Comments: https://ieeexplore.ieee.org/document/9900300

    Journal ref: IEEE Transactions on Neural Networks and Learning Systems 2022

  43. arXiv:2106.03755  [pdf, other

    cs.CV stat.AP stat.ML

    HERS Superpixels: Deep Affinity Learning for Hierarchical Entropy Rate Segmentation

    Authors: Hankui Peng, Angelica I. Aviles-Rivero, Carola-Bibiane Schönlieb

    Abstract: Superpixels serve as a powerful preprocessing tool in numerous computer vision tasks. By using superpixel representation, the number of image primitives can be largely reduced by orders of magnitudes. With the rise of deep learning in recent years, a few works have attempted to feed deeply learned features / graphs into existing classical superpixel techniques. However, none of them are able to pr… ▽ More

    Submitted 18 November, 2021; v1 submitted 7 June, 2021; originally announced June 2021.

    ACM Class: I.4; I.5

  44. arXiv:2101.07945  [pdf, other

    cs.CV

    Beyond Fine-tuning: Classifying High Resolution Mammograms using Function-Preserving Transformations

    Authors: Tao Wei, Angelica I Aviles-Rivero, Shuo Wang, Yuan Huang, Fiona J Gilbert, Carola-Bibiane Schönlieb, Chang Wen Chen

    Abstract: The task of classifying mammograms is very challenging because the lesion is usually small in the high resolution image. The current state-of-the-art approaches for medical image classification rely on using the de-facto method for ConvNets - fine-tuning. However, there are fundamental differences between natural images and medical images, which based on existing evidence from the literature, limi… ▽ More

    Submitted 19 January, 2021; originally announced January 2021.

    Comments: 10 pages, 5 figures

  45. arXiv:2011.08894  [pdf, other

    cs.CV

    Contrastive Registration for Unsupervised Medical Image Segmentation

    Authors: Lihao Liu, Angelica I Aviles-Rivero, Carola-Bibiane Schönlieb

    Abstract: Medical image segmentation is a relevant task as it serves as the first step for several diagnosis processes, thus it is indispensable in clinical usage. Whilst major success has been reported using supervised techniques, they assume a large and well-representative labelled set. This is a strong assumption in the medical domain where annotations are expensive, time-consuming, and inherent to human… ▽ More

    Submitted 20 July, 2022; v1 submitted 17 November, 2020; originally announced November 2020.

    Comments: 12 pages, 8 figures, 4 tables

  46. arXiv:2010.00378  [pdf, other

    cs.LG cs.CV stat.ML

    GraphXCOVID: Explainable Deep Graph Diffusion Pseudo-Labelling for Identifying COVID-19 on Chest X-rays

    Authors: Angelica I Aviles-Rivero, Philip Sellars, Carola-Bibiane Schönlieb, Nicolas Papadakis

    Abstract: Can one learn to diagnose COVID-19 under extreme minimal supervision? Since the outbreak of the novel COVID-19 there has been a rush for developing Artificial Intelligence techniques for expert-level disease identification on Chest X-ray data. In particular, the use of deep supervised learning has become the go-to paradigm. However, the performance of such models is heavily dependent on the availa… ▽ More

    Submitted 4 July, 2021; v1 submitted 30 September, 2020; originally announced October 2020.

  47. arXiv:2008.06388  [pdf

    cs.LG cs.CV eess.IV stat.ML

    Common pitfalls and recommendations for using machine learning to detect and prognosticate for COVID-19 using chest radiographs and CT scans

    Authors: Michael Roberts, Derek Driggs, Matthew Thorpe, Julian Gilbey, Michael Yeung, Stephan Ursprung, Angelica I. Aviles-Rivero, Christian Etmann, Cathal McCague, Lucian Beer, Jonathan R. Weir-McCall, Zhongzhao Teng, Effrossyni Gkrania-Klotsas, James H. F. Rudd, Evis Sala, Carola-Bibiane Schönlieb

    Abstract: Machine learning methods offer great promise for fast and accurate detection and prognostication of COVID-19 from standard-of-care chest radiographs (CXR) and computed tomography (CT) images. Many articles have been published in 2020 describing new machine learning-based models for both of these tasks, but it is unclear which are of potential clinical utility. In this systematic review, we search… ▽ More

    Submitted 5 January, 2021; v1 submitted 14 August, 2020; originally announced August 2020.

    Comments: 35 pages, 3 figures, 2 tables, updated to the period 1 January 2020 - 3 October 2020

    Journal ref: Nature Machine Intelligence 3, 199-217 (2021)

  48. arXiv:2003.06451  [pdf, other

    cs.CV

    The GraphNet Zoo: An All-in-One Graph Based Deep Semi-Supervised Framework for Medical Image Classification

    Authors: Marianne de Vriendt, Philip Sellars, Angelica I Aviles-Rivero

    Abstract: We consider the problem of classifying a medical image dataset when we have a limited amounts of labels. This is very common yet challenging setting as labelled data is expensive, time consuming to collect and may require expert knowledge. The current classification go-to of deep supervised learning is unable to cope with such a problem setup. However, using semi-supervised learning, one can produ… ▽ More

    Submitted 26 June, 2020; v1 submitted 13 March, 2020; originally announced March 2020.

  49. arXiv:1912.07764  [pdf, other

    cs.CV eess.IV

    Dim the Lights! -- Low-Rank Prior Temporal Data for Specular-Free Video Recovery

    Authors: Samar M. Alsaleh, Angelica I. Aviles-Rivero, Noemie Debroux, James K. Hahn

    Abstract: The appearance of an object is significantly affected by the illumination conditions in the environment. This is more evident with strong reflective objects as they suffer from more dominant specular reflections, causing information loss and discontinuity in the image domain. In this paper, we present a novel framework for specular-free video recovery with special emphasis on dealing with complex… ▽ More

    Submitted 16 December, 2019; originally announced December 2019.

    Comments: 22 pages, 6 figures

  50. arXiv:1912.07648  [pdf, other

    eess.IV cs.CV cs.LG

    Rethinking Medical Image Reconstruction via Shape Prior, Going Deeper and Faster: Deep Joint Indirect Registration and Reconstruction

    Authors: Jiulong Liu, Angelica I. Aviles-Rivero, Hui Ji, Carola-Bibiane Schönlieb

    Abstract: Indirect image registration is a promising technique to improve image reconstruction quality by providing a shape prior for the reconstruction task. In this paper, we propose a novel hybrid method that seeks to reconstruct high quality images from few measurements whilst requiring low computational cost. With this purpose, our framework intertwines indirect registration and reconstruction tasks is… ▽ More

    Submitted 16 December, 2019; originally announced December 2019.