
Showing 1–22 of 22 results for author: Xuan, W

Searching in archive cs.
  1. arXiv:2504.12324  [pdf, other]

    cs.CL cs.AI

    Cross-Document Cross-Lingual Natural Language Inference via RST-enhanced Graph Fusion and Interpretability Prediction

    Authors: Mengying Yuan, Wangzi Xuan, Fei Li

    Abstract: Natural Language Inference (NLI) is a fundamental task in both natural language processing and information retrieval. While NLI has developed many sub-directions such as sentence-level NLI, document-level NLI and cross-lingual NLI, Cross-Document Cross-Lingual NLI (CDCL-NLI) remains largely unexplored. In this paper, we propose a novel paradigm for CDCL-NLI that extends traditional NLI capabilitie…

    Submitted 11 April, 2025; originally announced April 2025.

  2. arXiv:2504.06564  [pdf, other]

    cs.CL

    Do Reasoning Models Show Better Verbalized Calibration?

    Authors: Qingcheng Zeng, Weihao Xuan, Leyang Cui, Rob Voigt

    Abstract: Large reasoning models (LRMs) have recently shown impressive capabilities in complex reasoning by leveraging increased test-time computation and exhibiting behaviors akin to human-like deliberation. Despite these advances, it remains an open question whether LRMs are better calibrated - particularly in their verbalized confidence - compared to instruction-tuned counterparts. In this paper, we inve…

    Submitted 8 April, 2025; originally announced April 2025.

    Comments: Work in Progress

  3. arXiv:2503.10497  [pdf, ps, other]

    cs.CL

    MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation

    Authors: Weihao Xuan, Rui Yang, Heli Qi, Qingcheng Zeng, Yunze Xiao, Yun Xing, Junjue Wang, Huitao Li, Xin Li, Kunyu Yu, Nan Liu, Qingyu Chen, Douglas Teodoro, Edison Marrese-Taylor, Shijian Lu, Yusuke Iwasawa, Yutaka Matsuo, Irene Li

    Abstract: Traditional benchmarks struggle to evaluate increasingly sophisticated language models in multilingual and culturally diverse contexts. To address this gap, we introduce MMLU-ProX, a comprehensive multilingual benchmark covering 13 typologically diverse languages with approximately 11,829 questions per language. Building on the challenging reasoning-focused design of MMLU-Pro, our framework employ…

    Submitted 13 March, 2025; originally announced March 2025.

  4. Predicting and Understanding College Student Mental Health with Interpretable Machine Learning

    Authors: Meghna Roy Chowdhury, Wei Xuan, Shreyas Sen, Yixue Zhao, Yi Ding

    Abstract: Mental health issues among college students have reached critical levels, significantly impacting academic performance and overall wellbeing. Predicting and understanding mental health status among college students is challenging due to three main factors: the necessity for large-scale longitudinal datasets, the prevalence of black-box machine learning models lacking transparency, and the tendency…

    Submitted 10 March, 2025; originally announced March 2025.

    Comments: 12 pages, 10 figures, ACM/IEEE International Conference on Connected Health: Applications, Systems and Engineering Technologies (CHASE '25), June 24–26, 2025, New York, NY, USA

  5. arXiv:2503.07637  [pdf, other]

    cs.LG

    Is Pre-training Applicable to the Decoder for Dense Prediction?

    Authors: Chao Ning, Wanshui Gan, Weihao Xuan, Naoto Yokoya

    Abstract: Pre-trained encoders are widely employed in dense prediction tasks for their capability to effectively extract visual features from images. The decoder subsequently processes these features to generate pixel-level predictions. However, due to structural differences and variations in input data, only encoders benefit from pre-learned representations from vision benchmarks such as image classificati…

    Submitted 15 March, 2025; v1 submitted 5 March, 2025; originally announced March 2025.

  6. arXiv:2502.08766  [pdf, other]

    cs.CY cs.HC cs.LG cs.SE

    Unlocking Mental Health: Exploring College Students' Well-being through Smartphone Behaviors

    Authors: Wei Xuan, Meghna Roy Chowdhury, Yi Ding, Yixue Zhao

    Abstract: The global mental health crisis is a pressing concern, with college students particularly vulnerable to rising mental health disorders. The widespread use of smartphones among young adults, while offering numerous benefits, has also been linked to negative outcomes such as addiction and regret, significantly impacting well-being. Leveraging the longest longitudinal dataset collected over four coll…

    Submitted 12 February, 2025; originally announced February 2025.

    Comments: Published at International Conference on Mobile Software Engineering and Systems (MOBILESoft 2025)

  7. arXiv:2501.06019  [pdf, other]

    cs.CV cs.AI eess.IV eess.SP

    BRIGHT: A globally distributed multimodal building damage assessment dataset with very-high-resolution for all-weather disaster response

    Authors: Hongruixuan Chen, Jian Song, Olivier Dietrich, Clifford Broni-Bediako, Weihao Xuan, Junjue Wang, Xinlei Shao, Yimin Wei, Junshi Xia, Cuiling Lan, Konrad Schindler, Naoto Yokoya

    Abstract: Disaster events occur around the world and cause significant damage to human life and property. Earth observation (EO) data enables rapid and comprehensive building damage assessment (BDA), an essential capability in the aftermath of a disaster to reduce human casualties and to inform disaster relief efforts. Recent research focuses on the development of AI models to achieve accurate mapping of un…

    Submitted 18 April, 2025; v1 submitted 10 January, 2025; originally announced January 2025.

  8. arXiv:2412.16918  [pdf, other]

    cs.CV

    Detect Changes like Humans: Incorporating Semantic Priors for Improved Change Detection

    Authors: Yuhang Gan, Wenjie Xuan, Zhiming Luo, Lei Fang, Zengmao Wang, Juhua Liu, Bo Du

    Abstract: When given two similar images, humans identify their differences by comparing the appearance (e.g., color, texture) with the help of semantics (e.g., objects, relations). However, mainstream change detection models adopt a supervised training paradigm, where the annotated binary change map is the main constraint. Thus, these methods primarily emphasize the difference-aware features bet…

    Submitted 22 December, 2024; originally announced December 2024.

  9. arXiv:2410.16602  [pdf, other]

    cs.CV

    Foundation Models for Remote Sensing and Earth Observation: A Survey

    Authors: Aoran Xiao, Weihao Xuan, Junjue Wang, Jiaxing Huang, Dacheng Tao, Shijian Lu, Naoto Yokoya

    Abstract: Remote Sensing (RS) is a crucial technology for observing, monitoring, and interpreting our planet, with broad applications across geoscience, economics, humanitarian fields, etc. While artificial intelligence (AI), particularly deep learning, has achieved significant advances in RS, unique challenges persist in developing more intelligent RS systems, including the complexity of Earth's environmen…

    Submitted 25 October, 2024; v1 submitted 21 October, 2024; originally announced October 2024.

    Comments: Project: https://github.com/xiaoaoran/awesome-RSFMs

  10. arXiv:2408.09085  [pdf, other]

    cs.CV

    Segment Anything with Multiple Modalities

    Authors: Aoran Xiao, Weihao Xuan, Heli Qi, Yun Xing, Naoto Yokoya, Shijian Lu

    Abstract: Robust and accurate segmentation of scenes has become one core functionality in various visual recognition and navigation tasks. This has inspired the recent development of Segment Anything Model (SAM), a foundation model for general mask segmentation. However, SAM is largely tailored for single-modal RGB images, limiting its applicability to multi-modal data captured with widely-adopted sensor su…

    Submitted 16 August, 2024; originally announced August 2024.

    Comments: Project page: https://xiaoaoran.github.io/projects/MM-SAM

  11. arXiv:2406.18151  [pdf, other]

    cs.CV

    SynRS3D: A Synthetic Dataset for Global 3D Semantic Understanding from Monocular Remote Sensing Imagery

    Authors: Jian Song, Hongruixuan Chen, Weihao Xuan, Junshi Xia, Naoto Yokoya

    Abstract: Global semantic 3D understanding from single-view high-resolution remote sensing (RS) imagery is crucial for Earth Observation (EO). However, this task faces significant challenges due to the high costs of annotations and data collection, as well as geographically restricted data availability. To address these challenges, synthetic data offer a promising solution by being easily accessible and thu…

    Submitted 26 September, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

    Comments: Accepted at NeurIPS 2024 as a Spotlight

  12. arXiv:2405.20680  [pdf, other]

    cs.AI cs.CL

    Unraveling and Mitigating Retriever Inconsistencies in Retrieval-Augmented Large Language Models

    Authors: Mingda Li, Xinyu Li, Yifan Chen, Wenfeng Xuan, Weinan Zhang

    Abstract: Although Retrieval-Augmented Large Language Models (RALMs) demonstrate their superiority in terms of factuality, they do not consistently outperform the original retrieval-free Language Models (LMs). Our experiments reveal that this example-level performance inconsistency exists not only between retrieval-augmented and retrieval-free LM but also among different retrievers. To understand this pheno…

    Submitted 6 March, 2025; v1 submitted 31 May, 2024; originally announced May 2024.

    Comments: ACL 2024 (findings)

  13. arXiv:2405.00308  [pdf]

    cs.CR stat.AP

    FPGA Digital Dice using Pseudo Random Number Generator

    Authors: Michael Lim Kee Hian, Ten Wei Lin, Zachary Wu Xuan, Stephanie-Ann Loy, Maoyang Xiang, T. Hui Teo

    Abstract: The goal of this project is to design a digital dice that displays dice numbers in real-time. The number is generated by a pseudo-random number generator (PRNG) using XORshift algorithm that is implemented in Verilog HDL on an FPGA. The digital dice is equipped with tilt sensor, display, power management circuit, and rechargeable battery hosted in a 3D printed dice casing. By shaking the digital d…

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 15 pages, 5 figures

  14. arXiv:2404.17765  [pdf]

    cs.CV

    RFL-CDNet: Towards Accurate Change Detection via Richer Feature Learning

    Authors: Yuhang Gan, Wenjie Xuan, Hang Chen, Juhua Liu, Bo Du

    Abstract: Change Detection is a crucial but extremely challenging task of remote sensing image analysis, and much progress has been made with the rapid development of deep learning. However, most existing deep learning-based change detection methods mainly focus on intricate feature extraction and multi-scale feature fusion, while ignoring the insufficient utilization of features in the intermediate stages,…

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: Accepted by PR, volume 153

  15. When ControlNet Meets Inexplicit Masks: A Case Study of ControlNet on its Contour-following Ability

    Authors: Wenjie Xuan, Yufei Xu, Shanshan Zhao, Chaoyue Wang, Juhua Liu, Bo Du, Dacheng Tao

    Abstract: ControlNet excels at creating content that closely matches precise contours in user-provided masks. However, when these masks contain noise, as a frequent occurrence with non-expert users, the output would include unwanted artifacts. This paper first highlights the crucial role of controlling the impact of these inexplicit masks with diverse deterioration levels through in-depth analysis. Subseque…

    Submitted 14 October, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

    Comments: Accepted by ACM-MM 2024

  16. arXiv:2402.03631  [pdf, other]

    cs.CV

    CAT-SAM: Conditional Tuning for Few-Shot Adaptation of Segment Anything Model

    Authors: Aoran Xiao, Weihao Xuan, Heli Qi, Yun Xing, Ruijie Ren, Xiaoqin Zhang, Ling Shao, Shijian Lu

    Abstract: The recent Segment Anything Model (SAM) has demonstrated remarkable zero-shot capability and flexible geometric prompting in general image segmentation. However, SAM often struggles when handling various unconventional images, such as aerial, medical, and non-RGB images. This paper presents CAT-SAM, a ConditionAl Tuning network that adapts SAM toward various unconventional target tasks with just f…

    Submitted 15 July, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: ECCV 2024

  17. PNT-Edge: Towards Robust Edge Detection with Noisy Labels by Learning Pixel-level Noise Transitions

    Authors: Wenjie Xuan, Shanshan Zhao, Yu Yao, Juhua Liu, Tongliang Liu, Yixin Chen, Bo Du, Dacheng Tao

    Abstract: Relying on large-scale training data with pixel-level labels, previous edge detection methods have achieved high performance. However, it is hard to manually label edges accurately, especially for large datasets, and thus the datasets inevitably contain noisy labels. This label-noise issue has been studied extensively for classification, while still remaining under-explored for edge detection. To…

    Submitted 15 October, 2023; v1 submitted 26 July, 2023; originally announced July 2023.

    Comments: Accepted by ACM-MM 2023

  18. arXiv:2304.00690  [pdf, other]

    cs.CV

    3D Semantic Segmentation in the Wild: Learning Generalized Models for Adverse-Condition Point Clouds

    Authors: Aoran Xiao, Jiaxing Huang, Weihao Xuan, Ruijie Ren, Kangcheng Liu, Dayan Guan, Abdulmotaleb El Saddik, Shijian Lu, Eric Xing

    Abstract: Robust point cloud parsing under all-weather conditions is crucial to level-5 autonomy in autonomous driving. However, how to learn a universal 3D semantic segmentation (3DSS) model is largely neglected as most existing benchmarks are dominated by point clouds captured under normal weather. We introduce SemanticSTF, an adverse-weather point cloud dataset that provides dense point-level annotations…

    Submitted 2 April, 2023; originally announced April 2023.

    Comments: CVPR2023

  19. arXiv:2204.00154  [pdf]

    cs.CV

    An End-to-end Supervised Domain Adaptation Framework for Cross-Domain Change Detection

    Authors: Jia Liu, Wenjie Xuan, Yuhang Gan, Juhua Liu, Bo Du

    Abstract: Existing deep learning-based change detection methods try to elaborately design complicated neural networks with powerful feature representations, but ignore the universal domain shift induced by time-varying land cover changes, including luminance fluctuations and season changes between pre-event and post-event images, thereby producing sub-optimal results. In this paper, we propose an end-to-end…

    Submitted 7 August, 2022; v1 submitted 31 March, 2022; originally announced April 2022.

    Comments: Accepted by Pattern Recognition

  20. arXiv:2106.10707  [pdf, other]

    eess.SY cs.DC math.OC

    Minimizing Delay in Network Function Visualization with Quantum Computing

    Authors: Wenlu Xuan, Zhongqi Zhao, Lei Fan, Zhu Han

    Abstract: Network function virtualization (NFV) is a crucial technology for the 5G network development because it can improve the flexibility of employing hardware and reduce the construction of base stations. There are vast service chains in NFV to meet users' requests, which are composed of a sequence of network functions. These virtual network functions (VNFs) are implemented in virtual machines by softw…

    Submitted 20 June, 2021; originally announced June 2021.

    Comments: Invited Paper by IEEE MASS 2021

  21. arXiv:1909.10737  [pdf]

    cs.RO cs.LG cs.MA

    Multi-agent Interactive Prediction under Challenging Driving Scenarios

    Authors: Weihao Xuan, Ruijie Ren

    Abstract: In order to drive safely on the road, autonomous vehicle is expected to predict future outcomes of its surrounding environment and react properly. In fact, many researchers have been focused on solving behavioral prediction problems for autonomous vehicles. However, very few of them consider multi-agent prediction under challenging driving scenarios such as urban environment. In this paper, we pro…

    Submitted 10 November, 2020; v1 submitted 24 September, 2019; originally announced September 2019.

  22. arXiv:1709.10154  [pdf, other]

    eess.SY cs.DC math.OC

    Finite-Time Distributed Linear Equation Solver for Minimum $l_1$ Norm Solutions

    Authors: Jingqiu Zhou, Wang Xuan, Shaoshuai Mou, Brian D. O. Anderson

    Abstract: This paper proposes distributed algorithms for multi-agent networks to achieve a solution in finite time to a linear equation $Ax=b$ where $A$ has full row rank, and with the minimum $l_1$-norm in the underdetermined case (where $A$ has more columns than rows). The underlying network is assumed to be undirected and fixed, and an analytical proof is provided for the proposed algorithm to drive all…

    Submitted 28 September, 2017; originally announced September 2017.