Skip to main content

Showing 1–6 of 6 results for author: Zaiane, O R

Searching in archive eess. Search in all archives.
.
  1. arXiv:2506.22628  [pdf, ps, other

    cs.SD eess.AS

    Evaluating Sound Similarity Metrics for Differentiable, Iterative Sound-Matching

    Authors: Amir Salimi, Abram Hindle, Osmar R. Zaiane

    Abstract: Manual sound design with a synthesizer is inherently iterative: an artist compares the synthesized output to a mental target, adjusts parameters, and repeats until satisfied. Iterative sound-matching automates this workflow by continually programming a synthesizer under the guidance of a loss function (or similarity measure) toward a target sound. Prior comparisons of loss functions have typically… ▽ More

    Submitted 27 June, 2025; originally announced June 2025.

  2. arXiv:2506.10675  [pdf, ps, other

    eess.IV cs.CV

    ConStyX: Content Style Augmentation for Generalizable Medical Image Segmentation

    Authors: Xi Chen, Zhiqiang Shen, Peng Cao, Jinzhu Yang, Osmar R. Zaiane

    Abstract: Medical images are usually collected from multiple domains, leading to domain shifts that impair the performance of medical image segmentation models. Domain Generalization (DG) aims to address this issue by training a robust model with strong generalizability. Recently, numerous domain randomization-based DG methods have been proposed. However, these methods suffer from the following limitations:… ▽ More

    Submitted 12 June, 2025; originally announced June 2025.

  3. arXiv:2503.17831  [pdf, other

    eess.IV cs.AI cs.CV

    FundusGAN: A Hierarchical Feature-Aware Generative Framework for High-Fidelity Fundus Image Generation

    Authors: Qingshan Hou, Meng Wang, Peng Cao, Zou Ke, Xiaoli Liu, Huazhu Fu, Osmar R. Zaiane

    Abstract: Recent advancements in ophthalmology foundation models such as RetFound have demonstrated remarkable diagnostic capabilities but require massive datasets for effective pre-training, creating significant barriers for development and deployment. To address this critical challenge, we propose FundusGAN, a novel hierarchical feature-aware generative framework specifically designed for high-fidelity fu… ▽ More

    Submitted 22 March, 2025; originally announced March 2025.

  4. arXiv:2502.20619  [pdf, other

    eess.IV cs.CV

    Style Content Decomposition-based Data Augmentation for Domain Generalizable Medical Image Segmentation

    Authors: Zhiqiang Shen, Peng Cao, Jinzhu Yang, Osmar R. Zaiane, Zhaolin Chen

    Abstract: Due to the domain shifts between training and testing medical images, learned segmentation models often experience significant performance degradation during deployment. In this paper, we first decompose an image into its style code and content map and reveal that domain shifts in medical images involve: \textbf{style shifts} (\emph{i.e.}, differences in image appearance) and \textbf{content shift… ▽ More

    Submitted 27 February, 2025; originally announced February 2025.

  5. arXiv:2301.06943  [pdf, other

    eess.IV cs.AI cs.CV

    Self-supervised Domain Adaptation for Breaking the Limits of Low-quality Fundus Image Quality Enhancement

    Authors: Qingshan Hou, Peng Cao, Jiaqi Wang, Xiaoli Liu, Jinzhu Yang, Osmar R. Zaiane

    Abstract: Retinal fundus images have been applied for the diagnosis and screening of eye diseases, such as Diabetic Retinopathy (DR) or Diabetic Macular Edema (DME). However, both low-quality fundus images and style inconsistency potentially increase uncertainty in the diagnosis of fundus disease and even lead to misdiagnosis by ophthalmologists. Most of the existing image enhancement methods mainly focus o… ▽ More

    Submitted 17 January, 2023; originally announced January 2023.

  6. arXiv:2109.04335  [pdf, other

    cs.CV cs.LG eess.IV

    UCTransNet: Rethinking the Skip Connections in U-Net from a Channel-wise Perspective with Transformer

    Authors: Haonan Wang, Peng Cao, Jiaqi Wang, Osmar R. Zaiane

    Abstract: Most recent semantic segmentation methods adopt a U-Net framework with an encoder-decoder architecture. It is still challenging for U-Net with a simple skip connection scheme to model the global multi-scale context: 1) Not each skip connection setting is effective due to the issue of incompatible feature sets of encoder and decoder stage, even some skip connection negatively influence the segmenta… ▽ More

    Submitted 24 January, 2022; v1 submitted 9 September, 2021; originally announced September 2021.

    Comments: Accepted by AAAI 2022. Code is available at https://github.com/McGregorWwww/UCTransNet