Showing 1–6 of 6 results for author: Shao, F

Searching in archive eess.
  1. arXiv:2509.14302

    eess.IV

    D4PM: A Dual-branch Driven Denoising Diffusion Probabilistic Model with Joint Posterior Diffusion Sampling for EEG Artifacts Removal

    Authors: Feixue Shao, Xueyu Liu, Yongfei Wu, Jianbo Lu, Guiying Yan, Weihua Yang

    Abstract: Artifact removal is critical for accurate analysis and interpretation of Electroencephalogram (EEG) signals. Traditional methods perform poorly with strong artifact-EEG correlations or single-channel data. Recent advances in diffusion-based generative models have demonstrated strong potential for EEG denoising, notably improving fine-grained noise suppression and reducing over-smoothing. However,…

    Submitted 17 September, 2025; originally announced September 2025.

  2. arXiv:2412.04746

    cs.SD cs.IR cs.MM eess.AS

    Diff4Steer: Steerable Diffusion Prior for Generative Music Retrieval with Semantic Guidance

    Authors: Xuchan Bao, Judith Yue Li, Zhong Yi Wan, Kun Su, Timo Denk, Joonseok Lee, Dima Kuzmin, Fei Sha

    Abstract: Modern music retrieval systems often rely on fixed representations of user preferences, limiting their ability to capture users' diverse and uncertain retrieval needs. To address this limitation, we introduce Diff4Steer, a novel generative retrieval framework that employs lightweight diffusion models to synthesize diverse seed embeddings from user queries that represent potential directions for mu…

    Submitted 5 December, 2024; originally announced December 2024.

    Comments: NeurIPS 2024 Creative AI Track

    Journal ref: Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025

  3. arXiv:2411.07728

    cs.CV cs.AI eess.IV

    No-Reference Point Cloud Quality Assessment via Graph Convolutional Network

    Authors: Wu Chen, Qiuping Jiang, Wei Zhou, Feng Shao, Guangtao Zhai, Weisi Lin

    Abstract: Three-dimensional (3D) point cloud, as an emerging visual media format, is increasingly favored by consumers as it can provide more realistic visual information than two-dimensional (2D) data. Similar to 2D plane images and videos, point clouds inevitably suffer from quality degradation and information loss through multimedia communication systems. Therefore, automatic point cloud quality assessme…

    Submitted 12 November, 2024; originally announced November 2024.

    Comments: Accepted by IEEE Transactions on Multimedia

  4. arXiv:2305.06594

    cs.SD cs.CV cs.LG cs.MM eess.AS

    V2Meow: Meowing to the Visual Beat via Video-to-Music Generation

    Authors: Kun Su, Judith Yue Li, Qingqing Huang, Dima Kuzmin, Joonseok Lee, Chris Donahue, Fei Sha, Aren Jansen, Yu Wang, Mauro Verzetti, Timo I. Denk

    Abstract: Video-to-music generation demands both a temporally localized high-quality listening experience and globally aligned video-acoustic signatures. While recent music generation models excel at the former through advanced audio codecs, the exploration of video-acoustic signatures has been confined to specific visual scenarios. In contrast, our research confronts the challenge of learning globally alig…

    Submitted 22 February, 2024; v1 submitted 11 May, 2023; originally announced May 2023.

    Comments: accepted at AAAI 2024, music samples available at https://tinyurl.com/v2meow

  5. arXiv:2208.00623

    cs.CV cs.MM eess.IV

    Quality Evaluation of Arbitrary Style Transfer: Subjective Study and Objective Metric

    Authors: Hangwei Chen, Feng Shao, Xiongli Chai, Yuese Gu, Qiuping Jiang, Xiangchao Meng, Yo-Sung Ho

    Abstract: Arbitrary neural style transfer is a vital topic with great research value and wide industrial application, which strives to render the structure of one image using the style of another. Recent researches have devoted great efforts on the task of arbitrary style transfer (AST) for improving the stylization quality. However, there are very few explorations about the quality evaluation of AST images…

    Submitted 29 January, 2023; v1 submitted 1 August, 2022; originally announced August 2022.

    Comments: Accepted by IEEE Transactions on Circuits and Systems for Video Technology 2022, Code and Dataset: https://github.com/Hangwei-Chen/AST-IQAD-SRQE

  6. Towards Top-Down Just Noticeable Difference Estimation of Natural Images

    Authors: Qiuping Jiang, Zhentao Liu, Shiqi Wang, Feng Shao, Weisi Lin

    Abstract: Just noticeable difference (JND) of natural images refers to the maximum pixel intensity change magnitude that typical human visual system (HVS) cannot perceive. Existing efforts on JND estimation mainly dedicate to modeling the diverse masking effects in either/both spatial or/and frequency domains, and then fusing them into an overall JND estimate. In this work, we turn to a dramatically differe…

    Submitted 24 May, 2022; v1 submitted 11 August, 2021; originally announced August 2021.

    Comments: 16 pages, 16 figures

    Journal ref: IEEE Transactions on Image Processing, 2022