Showing 1–21 of 21 results for author: Men, A

  1. arXiv:2503.21818

    eess.IV cs.CV

    Deep Learning-Based Quantitative Assessment of Renal Chronicity Indices in Lupus Nephritis

    Authors: Tianqi Tu, Hui Wang, Jiangbo Pei, Xiaojuan Yu, Aidong Men, Suxia Wang, Qingchao Chen, Ying Tan, Feng Yu, Minghui Zhao

    Abstract: Background: Renal chronicity indices (CI) have been identified as strong predictors of long-term outcomes in lupus nephritis (LN) patients. However, assessment by pathologists is hindered by challenges such as substantial time requirements, high interobserver variation, and susceptibility to fatigue. This study aims to develop an effective deep learning (DL) pipeline that automates the assessment…

    Submitted 26 March, 2025; originally announced March 2025.

  2. arXiv:2412.10115

    cs.CV

    Filter or Compensate: Towards Invariant Representation from Distribution Shift for Anomaly Detection

    Authors: Zining Chen, Xingshuang Luo, Weiqiu Wang, Zhicheng Zhao, Fei Su, Aidong Men

    Abstract: Recent Anomaly Detection (AD) methods have achieved great success with In-Distribution (ID) data. However, real-world data often exhibits distribution shift, causing huge performance decay on traditional AD methods. From this perspective, few previous works have explored AD with distribution shift, and the distribution-invariant normality learning has been proposed based on the Reverse Distillation…

    Submitted 13 December, 2024; originally announced December 2024.

    Comments: Accepted by AAAI2025

  3. arXiv:2411.19459

    cs.CV cs.LG

    Fleximo: Towards Flexible Text-to-Human Motion Video Generation

    Authors: Yuhang Zhang, Yuan Zhou, Zeyu Liu, Yuxuan Cai, Qiuyue Wang, Aidong Men, Huan Yang

    Abstract: Current methods for generating human motion videos rely on extracting pose sequences from reference videos, which restricts flexibility and control. Additionally, due to the limitations of pose detection techniques, the extracted pose sequences can sometimes be inaccurate, leading to low-quality video outputs. We introduce a novel task aimed at generating human motion videos solely from reference…

    Submitted 28 November, 2024; originally announced November 2024.

  4. arXiv:2407.05796

    eess.IV cs.CV

    Poisson Ordinal Network for Gleason Group Estimation Using Bi-Parametric MRI

    Authors: Yinsong Xu, Yipei Wang, Ziyi Shen, Iani J. M. B. Gayo, Natasha Thorley, Shonit Punwani, Aidong Men, Dean Barratt, Qingchao Chen, Yipeng Hu

    Abstract: The Gleason groups serve as the primary histological grading system for prostate cancer, providing crucial insights into the cancer's potential for growth and metastasis. In clinical practice, pathologists determine the Gleason groups based on specimens obtained from ultrasound-guided biopsies. In this study, we investigate the feasibility of directly estimating the Gleason groups from MRI scans t…

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: MICCAI 2024

  5. arXiv:2406.14797

    cs.CV cs.AI

    Camera-Invariant Meta-Learning Network for Single-Camera-Training Person Re-identification

    Authors: Jiangbo Pei, Zhuqing Jiang, Aidong Men, Haiying Wang, Haiyong Luo, Shiping Wen

    Abstract: Single-camera-training person re-identification (SCT re-ID) aims to train a re-ID model using SCT datasets where each person appears in only one camera. The main challenge of SCT re-ID is to learn camera-invariant feature representations without cross-camera same-person (CCSP) data as supervision. Previous methods address it by assuming that the most similar person should be found in another camer…

    Submitted 20 June, 2024; originally announced June 2024.

  6. arXiv:2404.09011

    cs.CV cs.LG

    PracticalDG: Perturbation Distillation on Vision-Language Models for Hybrid Domain Generalization

    Authors: Zining Chen, Weiqiu Wang, Zhicheng Zhao, Fei Su, Aidong Men, Hongying Meng

    Abstract: Domain Generalization (DG) aims to resolve distribution shifts between source and target domains, and current DG methods default to the setting that data from source and target domains share identical categories. Nevertheless, there exist unseen classes from target domains in practical scenarios. To address this issue, Open Set Domain Generalization (OSDG) has emerged and several methods have…

    Submitted 13 April, 2024; originally announced April 2024.

    Comments: Accepted to CVPR2024

  7. arXiv:2403.01582

    cs.LG

    Selection, Ensemble, and Adaptation: Advancing Multi-Source-Free Domain Adaptation via Architecture Zoo

    Authors: Jiangbo Pei, Ruizhe Li, Aidong Men, Yang Liu, Xiahai Zhuang, Qingchao Chen

    Abstract: Conventional Multi-Source Free Domain Adaptation (MSFDA) assumes that each source domain provides a single source model, and all source models adopt a uniform architecture. This paper introduces Zoo-MSFDA, a more general setting that allows each source domain to offer a zoo of multiple source models with different architectures. While it enriches the source knowledge, Zoo-MSFDA risks being dominat…

    Submitted 23 May, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

  8. arXiv:2311.06400

    cs.CV

    EviPrompt: A Training-Free Evidential Prompt Generation Method for Segment Anything Model in Medical Images

    Authors: Yinsong Xu, Jiaqi Tang, Aidong Men, Qingchao Chen

    Abstract: Medical image segmentation has immense clinical applicability but remains a challenge despite advancements in deep learning. The Segment Anything Model (SAM) exhibits potential in this field, yet the requirement for expert intervention and the domain gap between natural and medical images pose significant obstacles. This paper introduces a novel training-free evidential prompt generation metho…

    Submitted 10 November, 2023; originally announced November 2023.

  9. arXiv:2308.03097

    cs.CV

    Incorporating Pre-training Data Matters in Unsupervised Domain Adaptation

    Authors: Yinsong Xu, Aidong Men, Yang Liu, Xiahai Zhuang, Qingchao Chen

    Abstract: In deep learning, initializing models with pre-trained weights has become the de facto practice for various downstream tasks. Many unsupervised domain adaptation (UDA) methods typically adopt a backbone pre-trained on ImageNet, and focus on reducing the source-target domain discrepancy. However, the impact of pre-training on adaptation has received little attention. In this study, we delve into UDA fr…

    Submitted 18 June, 2025; v1 submitted 6 August, 2023; originally announced August 2023.

  10. arXiv:2301.09120   

    cs.CV

    Causality-based Dual-Contrastive Learning Framework for Domain Generalization

    Authors: Zining Chen, Weiqiu Wang, Zhicheng Zhao, Aidong Men

    Abstract: Domain Generalization (DG) is essentially a sub-branch of out-of-distribution generalization, which trains models from multiple source domains and generalizes to unseen target domains. Recently, some domain generalization algorithms have emerged, but most of them were designed with non-transferable complex architecture. Additionally, contrastive learning has become a promising solution for simplic…

    Submitted 22 March, 2023; v1 submitted 22 January, 2023; originally announced January 2023.

    Comments: Inadequate proof of the effectiveness of the method

  11. Uncertainty-Induced Transferability Representation for Source-Free Unsupervised Domain Adaptation

    Authors: Jiangbo Pei, Zhuqing Jiang, Aidong Men, Liang Chen, Yang Liu, Qingchao Chen

    Abstract: Source-free unsupervised domain adaptation (SFUDA) aims to learn a target domain model using unlabeled target data and the knowledge of a well-trained source domain model. Most previous SFUDA works focus on inferring semantics of target data based on the source knowledge. Without measuring the transferability of the source knowledge, these methods insufficiently exploit the source knowledge, and f…

    Submitted 30 August, 2022; originally announced August 2022.

  12. Delving into the Continuous Domain Adaptation

    Authors: Yinsong Xu, Zhuqing Jiang, Aidong Men, Yang Liu, Qingchao Chen

    Abstract: Existing domain adaptation methods assume that domain discrepancies are caused by a few discrete attributes and variations, e.g., art, real, painting, quickdraw, etc. We argue that this is not realistic as it is implausible to define the real-world datasets using a few discrete attributes. Therefore, we propose to investigate a new problem namely the Continuous Domain Adaptation (CDA) through the…

    Submitted 27 August, 2022; originally announced August 2022.

    Comments: Accepted by ACM MM22

  13. arXiv:2208.10722

    cs.CV cs.LG

    Bag of Tricks for Out-of-Distribution Generalization

    Authors: Zining Chen, Weiqiu Wang, Zhicheng Zhao, Aidong Men, Hong Chen

    Abstract: Recently, out-of-distribution (OOD) generalization has attracted attention to the robustness and generalization ability of deep learning based models, and accordingly, many strategies have been made to address different aspects related to this issue. However, most existing algorithms for OOD generalization are complicated and specifically designed for certain datasets. To alleviate this problem, ni…

    Submitted 23 August, 2022; originally announced August 2022.

    Journal ref: ECCV 2022 Workshop

  14. arXiv:2208.05814

    cs.CV cs.AI cs.MM

    Seeing your sleep stage: cross-modal distillation from EEG to infrared video

    Authors: Jianan Han, Shaoxing Zhang, Aidong Men, Yang Liu, Ziming Yao, Yan Yan, Qingchao Chen

    Abstract: It is inevitably crucial to classify sleep stage for the diagnosis of various diseases. However, existing automated diagnosis methods mostly adopt the "gold-standard" electroencephalogram (EEG) or other uni-modal sensing signal of the PolySomnoGraphy (PSG) machine in hospital, that are expensive, importable and therefore unsuitable for point-of-care monitoring at home. To enable the sleep stage mon…

    Submitted 11 August, 2022; originally announced August 2022.

    Comments: We have submitted this paper to an academic journal

  15. arXiv:2105.14430

    cs.CV

    Rethinking the constraints of multimodal fusion: case study in Weakly-Supervised Audio-Visual Video Parsing

    Authors: Jianning Wu, Zhuqing Jiang, Shiping Wen, Aidong Men, Haiying Wang

    Abstract: For multimodal tasks, a good feature extraction network should extract as much information as possible and ensure that the extracted feature embedding and other modal feature embedding have an excellent mutual understanding. The latter is often more critical in feature fusion than the former. Therefore, selecting the optimal feature extraction network collocation is a very important subproblem in…

    Submitted 30 May, 2021; originally announced May 2021.

  16. arXiv:2105.11062

    cs.CV cs.LG

    Taylor saves for later: disentanglement for video prediction using Taylor representation

    Authors: Ting Pan, Zhuqing Jiang, Jianan Han, Shiping Wen, Aidong Men, Haiying Wang

    Abstract: Video prediction is a challenging task with wide application prospects in meteorology and robot systems. Existing works fail to trade off short-term and long-term prediction performances and extract robust latent dynamics laws in video frames. We propose a two-branch seq-to-seq deep model to disentangle the Taylor feature and the residual feature in video frames by a novel recurrent prediction mod…

    Submitted 23 May, 2021; originally announced May 2021.

  17. arXiv:2101.08039

    eess.IV cs.CV cs.LG

    Bridge the Vision Gap from Field to Command: A Deep Learning Network Enhancing Illumination and Details

    Authors: Zhuqing Jiang, Chang Liu, Ya'nan Wang, Kai Li, Aidong Men, Haiying Wang, Haiyong Luo

    Abstract: With the goal of tuning up the brightness, low-light image enhancement enjoys numerous applications, such as surveillance, remote sensing and computational photography. Images captured under low-light conditions often suffer from poor visibility and blur. Solely brightening the dark regions will inevitably amplify the blur, and thus may lead to detail loss. In this paper, we propose a simple yet effec…

    Submitted 20 January, 2021; originally announced January 2021.

  18. arXiv:2101.00813

    cs.CV cs.LG eess.IV

    Shed Various Lights on a Low-Light Image: Multi-Level Enhancement Guided by Arbitrary References

    Authors: Ya'nan Wang, Zhuqing Jiang, Chang Liu, Kai Li, Aidong Men, Haiying Wang

    Abstract: It is suggested that low-light image enhancement realizes one-to-many mapping since we have different definitions of NORMAL-light given application scenarios or users' aesthetic. However, most existing methods ignore subjectivity of the task, and simply produce one result with fixed brightness. This paper proposes a neural network for multi-level low-light image enhancement, which is user-friendly…

    Submitted 4 January, 2021; originally announced January 2021.

  19. A Switched View of Retinex: Deep Self-Regularized Low-Light Image Enhancement

    Authors: Zhuqing Jiang, Haotian Li, Liangjie Liu, Aidong Men, Haiying Wang

    Abstract: Self-regularized low-light image enhancement does not require any normal-light image in training, thereby freeing from the chains on paired or unpaired low-/normal-images. However, existing methods suffer color deviation and fail to generalize to various lighting conditions. This paper presents a novel self-regularized method based on Retinex, which, inspired by HSV, preserves all colors (Hue, Sat…

    Submitted 3 January, 2021; originally announced January 2021.

    Journal ref: Neurocomputing 454 (2021): 361-372

  20. arXiv:2006.12085

    cs.CV

    Split to Be Slim: An Overlooked Redundancy in Vanilla Convolution

    Authors: Qiulin Zhang, Zhuqing Jiang, Qishuo Lu, Jia'nan Han, Zhengxin Zeng, Shang-hua Gao, Aidong Men

    Abstract: Many effective solutions have been proposed to reduce the redundancy of models for inference acceleration. Nevertheless, common approaches mostly focus on eliminating less important filters or constructing efficient operations, while ignoring the pattern redundancy in feature maps. We reveal that many feature maps within a layer share similar but not identical patterns. However, it is difficult to…

    Submitted 22 June, 2020; originally announced June 2020.

    Comments: Preprint version. The final version has been accepted to appear at IJCAI20

  21. arXiv:1908.00273

    cs.CV cs.LG eess.IV

    Pyramid Real Image Denoising Network

    Authors: Yiyun Zhao, Zhuqing Jiang, Aidong Men, Guodong Ju

    Abstract: While deep Convolutional Neural Networks (CNNs) have shown extraordinary capability of modelling specific noise and denoising, they still perform poorly on real-world noisy images. The main reason is that the real-world noise is more sophisticated and diverse. To tackle the issue of blind denoising, in this paper, we propose a novel pyramid real image denoising network (PRIDNet), which contains th…

    Submitted 21 October, 2019; v1 submitted 1 August, 2019; originally announced August 2019.