Skip to main content

Showing 1–50 of 56 results for author: Guan, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.03950  [pdf, ps, other

    cs.NI cs.AI cs.LG eess.SY

    Optimizing Age of Trust and Throughput in Multi-Hop UAV-Aided IoT Networks

    Authors: Yizhou Luo, Kwan-Wu Chin, Ruyi Guan, Xi Xiao, Caimeng Wang, Jingyin Feng, Tengjiao He

    Abstract: Devices operating in Internet of Things (IoT) networks may be deployed across vast geographical areas and interconnected via multi-hop communications. Further, they may be unguarded. This makes them vulnerable to attacks and motivates operators to check on devices frequently. To this end, we propose and study an Unmanned Aerial Vehicle (UAV)-aided attestation framework for use in IoT networks with… ▽ More

    Submitted 5 July, 2025; originally announced July 2025.

  2. arXiv:2506.19288  [pdf, ps, other

    cs.CV cs.RO

    Da Yu: Towards USV-Based Image Captioning for Waterway Surveillance and Scene Understanding

    Authors: Runwei Guan, Ningwei Ouyang, Tianhao Xu, Shaofeng Liang, Wei Dai, Yafeng Sun, Shang Gao, Songning Lai, Shanliang Yao, Xuming Hu, Ryan Wen Liu, Yutao Yue, Hui Xiong

    Abstract: Automated waterway environment perception is crucial for enabling unmanned surface vessels (USVs) to understand their surroundings and make informed decisions. Most existing waterway perception models primarily focus on instance-level object perception paradigms (e.g., detection, segmentation). However, due to the complexity of waterway environments, current perception datasets and models fail to… ▽ More

    Submitted 30 June, 2025; v1 submitted 23 June, 2025; originally announced June 2025.

    Comments: 14 pages, 13 figures

  3. arXiv:2506.18737  [pdf, ps, other

    cs.CV cs.RO

    USVTrack: USV-Based 4D Radar-Camera Tracking Dataset for Autonomous Driving in Inland Waterways

    Authors: Shanliang Yao, Runwei Guan, Yi Ni, Sen Xu, Yong Yue, Xiaohui Zhu, Ryan Wen Liu

    Abstract: Object tracking in inland waterways plays a crucial role in safe and cost-effective applications, including waterborne transportation, sightseeing tours, environmental monitoring and surface rescue. Our Unmanned Surface Vehicle (USV), equipped with a 4D radar, a monocular camera, a GPS, and an IMU, delivers robust tracking capabilities in complex waterborne environments. By leveraging these sensor… ▽ More

    Submitted 23 June, 2025; originally announced June 2025.

    Comments: Accepted by IROS

  4. arXiv:2506.00241  [pdf, other

    cs.HC cs.AI

    Designing AI Tools for Clinical Care Teams to Support Serious Illness Conversations with Older Adults in the Emergency Department

    Authors: Menglin Zhao, Zhuorui Yong, Ruijia Guan, Kai-Wei Chang, Adrian Haimovich, Kei Ouchi, Timothy Bickmore, Bingsheng Yao, Dakuo Wang, Smit Desai

    Abstract: Serious illness conversations (SICs), discussions between clinical care teams and patients with serious, life-limiting illnesses about their values, goals, and care preferences, are critical for patient-centered care. Without these conversations, patients often receive aggressive interventions that may not align with their goals. Clinical care teams face significant barriers when conducting seriou… ▽ More

    Submitted 30 May, 2025; originally announced June 2025.

  5. arXiv:2505.00941  [pdf, other

    cs.LG

    FreCT: Frequency-augmented Convolutional Transformer for Robust Time Series Anomaly Detection

    Authors: Wenxin Zhang, Ding Xu, Guangzhen Yao, Xiaojian Lin, Renxiang Guan, Chengze Du, Renda Han, Xi Xuan, Cuicui Luo

    Abstract: Time series anomaly detection is critical for system monitoring and risk identification, across various domains, such as finance and healthcare. However, for most reconstruction-based approaches, detecting anomalies remains a challenge due to the complexity of sequential patterns in time series data. On the one hand, reconstruction-based techniques are susceptible to computational deviation stemmi… ▽ More

    Submitted 10 May, 2025; v1 submitted 1 May, 2025; originally announced May 2025.

  6. arXiv:2504.21604  [pdf, other

    cs.CL cs.CY

    Robust Misinformation Detection by Visiting Potential Commonsense Conflict

    Authors: Bing Wang, Ximing Li, Changchun Li, Bingrui Zhao, Bo Fu, Renchu Guan, Shengsheng Wang

    Abstract: The development of Internet technology has led to an increased prevalence of misinformation, causing severe negative effects across diverse domains. To mitigate this challenge, Misinformation Detection (MD), aiming to detect online misinformation automatically, emerges as a rapidly growing research topic in the community. In this paper, we propose a novel plug-and-play augmentation method for the… ▽ More

    Submitted 30 April, 2025; originally announced April 2025.

    Comments: 11 pages, 2 figures. Accepted by IJCAI 2025. Code: https://github.com/wangbing1416/MD-PCC

  7. arXiv:2504.14779  [pdf, other

    cs.HC cs.AI

    Exploring Collaborative GenAI Agents in Synchronous Group Settings: Eliciting Team Perceptions and Design Considerations for the Future of Work

    Authors: Janet G. Johnson, Macarena Peralta, Mansanjam Kaur, Ruijie Sophia Huang, Sheng Zhao, Ruijia Guan, Shwetha Rajaram, Michael Nebeling

    Abstract: While generative artificial intelligence (GenAI) is finding increased adoption in workplaces, current tools are primarily designed for individual use. Prior work established the potential for these tools to enhance personal creativity and productivity towards shared goals; however, we don't know yet how to best take into account the nuances of group work and team dynamics when deploying GenAI in w… ▽ More

    Submitted 20 April, 2025; originally announced April 2025.

    Comments: To be published in ACM Conference on Computer-Supported Cooperative Work and Social Computing (CSCW 2025). 33 pages, 11 figures, 1 table

  8. arXiv:2504.02464  [pdf, other

    cs.CV cs.AI

    CornerPoint3D: Look at the Nearest Corner Instead of the Center

    Authors: Ruixiao Zhang, Runwei Guan, Xiangyu Chen, Adam Prugel-Bennett, Xiaohao Cai

    Abstract: 3D object detection aims to predict object centers, dimensions, and rotations from LiDAR point clouds. Despite its simplicity, LiDAR captures only the near side of objects, making center-based detectors prone to poor localization accuracy in cross-domain tasks with varying point distributions. Meanwhile, existing evaluation metrics designed for single-domain assessment also suffer from overfitting… ▽ More

    Submitted 3 April, 2025; originally announced April 2025.

    Comments: arXiv admin note: substantial text overlap with arXiv:2407.04061

  9. arXiv:2503.12968  [pdf, other

    cs.CV cs.RO

    OptiPMB: Enhancing 3D Multi-Object Tracking with Optimized Poisson Multi-Bernoulli Filtering

    Authors: Guanhua Ding, Yuxuan Xia, Runwei Guan, Qinchen Wu, Tao Huang, Weiping Ding, Jinping Sun, Guoqiang Mao

    Abstract: Accurate 3D multi-object tracking (MOT) is crucial for autonomous driving, as it enables robust perception, navigation, and planning in complex environments. While deep learning-based solutions have demonstrated impressive 3D MOT performance, model-based approaches remain appealing for their simplicity, interpretability, and data efficiency. Conventional model-based trackers typically rely on rand… ▽ More

    Submitted 17 March, 2025; originally announced March 2025.

  10. arXiv:2503.11496  [pdf, other

    cs.CV

    Cognitive Disentanglement for Referring Multi-Object Tracking

    Authors: Shaofeng Liang, Runwei Guan, Wangwang Lian, Daizong Liu, Xiaolou Sun, Dongming Wu, Yutao Yue, Weiping Ding, Hui Xiong

    Abstract: As a significant application of multi-source information fusion in intelligent transportation perception systems, Referring Multi-Object Tracking (RMOT) involves localizing and tracking specific objects in video sequences based on language references. However, existing RMOT approaches often treat language descriptions as holistic embeddings and struggle to effectively integrate the rich semantic i… ▽ More

    Submitted 27 May, 2025; v1 submitted 14 March, 2025; originally announced March 2025.

    Comments: 27 pages, 12 figures

  11. arXiv:2503.08336  [pdf, other

    cs.CV

    Talk2PC: Enhancing 3D Visual Grounding through LiDAR and Radar Point Clouds Fusion for Autonomous Driving

    Authors: Runwei Guan, Jianan Liu, Ningwei Ouyang, Daizong Liu, Xiaolou Sun, Lianqing Zheng, Ming Xu, Yutao Yue, Hui Xiong

    Abstract: Embodied outdoor scene understanding forms the foundation for autonomous agents to perceive, analyze, and react to dynamic driving environments. However, existing 3D understanding is predominantly based on 2D Vision-Language Models (VLMs), collecting and processing limited scene-aware contexts. Instead, compared to the 2D planar visual information, point cloud sensors like LiDAR offer rich depth i… ▽ More

    Submitted 11 March, 2025; originally announced March 2025.

    Comments: 14 pages, 11 figures

  12. arXiv:2502.18755  [pdf, other

    cs.AR

    M-ANT: Efficient Low-bit Group Quantization for LLMs via Mathematically Adaptive Numerical Type

    Authors: Weiming Hu, Haoyan Zhang, Cong Guo, Yu Feng, Renyang Guan, Zhendong Hua, Zihan Liu, Yue Guan, Minyi Guo, Jingwen Leng

    Abstract: Large language models (LLMs) are one of the most important killer computer applications. The recent algorithmic advancement proposes a fine-grained group-wise quantization for LLMs, which treats a small set (e.g., 64) of values in a tensor as a compression unit. It effectively preserves the model accuracy without retraining, and has become the standard approach to efficiently deploy LLMs. On the o… ▽ More

    Submitted 25 February, 2025; originally announced February 2025.

  13. Dual-level Mixup for Graph Few-shot Learning with Fewer Tasks

    Authors: Yonghao Liu, Mengyu Li, Fausto Giunchiglia, Lan Huang, Ximing Li, Xiaoyue Feng, Renchu Guan

    Abstract: Graph neural networks have been demonstrated as a powerful paradigm for effectively learning graph-structured data on the web and mining content from it.Current leading graph models require a large number of labeled samples for training, which unavoidably leads to overfitting in few-shot scenarios. Recent research has sought to alleviate this issue by simultaneously leveraging graph learning and m… ▽ More

    Submitted 19 February, 2025; originally announced February 2025.

    Comments: WWW25

  14. arXiv:2502.13366  [pdf, other

    cs.RO eess.SY

    Low-Complexity Cooperative Payload Transportation for Nonholonomic Mobile Robots Under Scalable Constraints

    Authors: Renhe Guan, Yuanzhe Wang, Tao Liu, Yan Wang

    Abstract: Cooperative transportation, a key aspect of logistics cyber-physical systems (CPS), is typically approached using dis tributed control and optimization-based methods. The distributed control methods consume less time, but poorly handle and extend to multiple constraints. Instead, optimization-based methods handle constraints effectively, but they are usually centralized, time-consuming a… ▽ More

    Submitted 18 February, 2025; originally announced February 2025.

  15. arXiv:2501.15394  [pdf, other

    cs.CV

    Doracamom: Joint 3D Detection and Occupancy Prediction with Multi-view 4D Radars and Cameras for Omnidirectional Perception

    Authors: Lianqing Zheng, Jianan Liu, Runwei Guan, Long Yang, Shouyi Lu, Yuanzhe Li, Xiaokai Bai, Jie Bai, Zhixiong Ma, Hui-Liang Shen, Xichan Zhu

    Abstract: 3D object detection and occupancy prediction are critical tasks in autonomous driving, attracting significant attention. Despite the potential of recent vision-based methods, they encounter challenges under adverse conditions. Thus, integrating cameras with next-generation 4D imaging radar to achieve unified multi-task perception is highly significant, though research in this domain remains limite… ▽ More

    Submitted 3 March, 2025; v1 submitted 25 January, 2025; originally announced January 2025.

  16. arXiv:2501.10343  [pdf, other

    cs.CV cs.AI

    3rd Workshop on Maritime Computer Vision (MaCVi) 2025: Challenge Results

    Authors: Benjamin Kiefer, Lojze Žust, Jon Muhovič, Matej Kristan, Janez Perš, Matija Teršek, Uma Mudenagudi Chaitra Desai, Arnold Wiliem, Marten Kreis, Nikhil Akalwadi, Yitong Quan, Zhiqiang Zhong, Zhe Zhang, Sujie Liu, Xuran Chen, Yang Yang, Matej Fabijanić, Fausto Ferreira, Seongju Lee, Junseok Lee, Kyoobin Lee, Shanliang Yao, Runwei Guan, Xiaoyu Huang, Yi Ni , et al. (23 additional authors not shown)

    Abstract: The 3rd Workshop on Maritime Computer Vision (MaCVi) 2025 addresses maritime computer vision for Unmanned Surface Vehicles (USV) and underwater. This report offers a comprehensive overview of the findings from the challenges. We provide both statistical and qualitative analyses, evaluating trends from over 700 submissions. All datasets, evaluation code, and the leaderboard are available to the pub… ▽ More

    Submitted 17 January, 2025; originally announced January 2025.

    Comments: Part of the MaCVi 2025 workshop

  17. arXiv:2501.09400  [pdf, ps, other

    cs.IT eess.SP

    Joint Antenna Selection and Beamforming Design for Active RIS-aided ISAC Systems

    Authors: Wei Ma, Peichang Zhang, Junjie Ye, Rouyang Guan, Xiao-Peng Li, Lei Huang

    Abstract: Active reconfigurable intelligent surface (A-RIS) aided integrated sensing and communications (ISAC) system has been considered as a promising paradigm to improve spectrum efficiency. However, massive energy-hungry radio frequency (RF) chains hinder its large-scale deployment. To address this issue, an A-RIS-aided ISAC system with antenna selection (AS) is proposed in this work, where a target is… ▽ More

    Submitted 16 January, 2025; originally announced January 2025.

  18. arXiv:2501.09219  [pdf, other

    cs.CL

    A Simple Graph Contrastive Learning Framework for Short Text Classification

    Authors: Yonghao Liu, Fausto Giunchiglia, Lan Huang, Ximing Li, Xiaoyue Feng, Renchu Guan

    Abstract: Short text classification has gained significant attention in the information age due to its prevalence and real-world applications. Recent advancements in graph learning combined with contrastive learning have shown promising results in addressing the challenges of semantic sparsity and limited labeled data in short text classification. However, existing models have certain limitations. They rely… ▽ More

    Submitted 15 January, 2025; originally announced January 2025.

    Comments: AAAI2025

  19. arXiv:2501.09214  [pdf, other

    cs.CL

    Boosting Short Text Classification with Multi-Source Information Exploration and Dual-Level Contrastive Learning

    Authors: Yonghao Liu, Mengyu Li, Wei Pang, Fausto Giunchiglia, Lan Huang, Xiaoyue Feng, Renchu Guan

    Abstract: Short text classification, as a research subtopic in natural language processing, is more challenging due to its semantic sparsity and insufficient labeled samples in practical scenarios. We propose a novel model named MI-DELIGHT for short text classification in this work. Specifically, it first performs multi-source information (i.e., statistical information, linguistic information, and factual i… ▽ More

    Submitted 15 January, 2025; originally announced January 2025.

    Comments: AAAI2025

  20. Enhancing Unsupervised Graph Few-shot Learning via Set Functions and Optimal Transport

    Authors: Yonghao Liu, Fausto Giunchiglia, Ximing Li, Lan Huang, Xiaoyue Feng, Renchu Guan

    Abstract: Graph few-shot learning has garnered significant attention for its ability to rapidly adapt to downstream tasks with limited labeled data, sparking considerable interest among researchers. Recent advancements in graph few-shot learning models have exhibited superior performance across diverse applications. Despite their successes, several limitations still exist. First, existing models in the meta… ▽ More

    Submitted 9 January, 2025; originally announced January 2025.

    Comments: KDD2025

  21. arXiv:2501.02314  [pdf, ps, other

    cs.CV

    RadarNeXt: Real-Time and Reliable 3D Object Detector Based On 4D mmWave Imaging Radar

    Authors: Liye Jia, Runwei Guan, Haocheng Zhao, Qiuchi Zhao, Ka Lok Man, Jeremy Smith, Limin Yu, Yutao Yue

    Abstract: 3D object detection is crucial for Autonomous Driving (AD) and Advanced Driver Assistance Systems (ADAS). However, most 3D detectors prioritize detection accuracy, often overlooking network inference speed in practical applications. In this paper, we propose RadarNeXt, a real-time and reliable 3D object detector based on the 4D mmWave radar point clouds. It leverages the re-parameterizable neural… ▽ More

    Submitted 4 January, 2025; originally announced January 2025.

    Comments: 8 pages, 5 figures, 3 tables. Code: https://github.com/Pay246-git468/RadarNeXt

  22. arXiv:2412.17629  [pdf, other

    cs.NE cs.AI

    Learn from Global Correlations: Enhancing Evolutionary Algorithm via Spectral GNN

    Authors: Kaichen Ouyang, Shengwei Fu, Zong Ke, Renxiang Guan, Ke Liang, Dayu Hu

    Abstract: Evolutionary algorithms (EAs) simulate natural selection but have two main limitations: (1) they rarely update individuals based on global correlations, limiting comprehensive learning; (2) they struggle with balancing exploration and exploitation, where excessive exploitation causes premature convergence, and excessive exploration slows down the search. Moreover, EAs often depend on manual parame… ▽ More

    Submitted 27 May, 2025; v1 submitted 23 December, 2024; originally announced December 2024.

    Comments: 22 pages, 9 figures

  23. arXiv:2412.16674  [pdf, other

    cs.AI

    STAMPsy: Towards SpatioTemporal-Aware Mixed-Type Dialogues for Psychological Counseling

    Authors: Jieyi Wang, Yue Huang, Zeming Liu, Dexuan Xu, Chuan Wang, Xiaoming Shi, Ruiyuan Guan, Hongxing Wang, Weihua Yue, Yu Huang

    Abstract: Online psychological counseling dialogue systems are trending, offering a convenient and accessible alternative to traditional in-person therapy. However, existing psychological counseling dialogue systems mainly focus on basic empathetic dialogue or QA with minimal professional knowledge and without goal guidance. In many real-world counseling scenarios, clients often seek multi-type help, such a… ▽ More

    Submitted 21 December, 2024; originally announced December 2024.

  24. Self-regulated Learning Processes in Secondary Education: A Network Analysis of Trace-based Measures

    Authors: Yixin Cheng, Rui Guan, Tongguang Li, Mladen Raković, Xinyu Li, Yizhou Fan, Flora Jin, Yi-Shan Tsai, Dragan Gašević, Zachari Swiecki

    Abstract: While the capacity to self-regulate has been found to be crucial for secondary school students, prior studies often rely on self-report surveys and think-aloud protocols that present notable limitations in capturing self-regulated learning (SRL) processes. This study advances the understanding of SRL in secondary education by using trace data to examine SRL processes during multi-source writing ta… ▽ More

    Submitted 11 December, 2024; originally announced December 2024.

  25. arXiv:2409.20441  [pdf, other

    cs.CL

    Instance-adaptive Zero-shot Chain-of-Thought Prompting

    Authors: Xiaosong Yuan, Chen Shen, Shaotian Yan, Xiaofeng Zhang, Liang Xie, Wenxiao Wang, Renchu Guan, Ying Wang, Jieping Ye

    Abstract: Zero-shot Chain-of-Thought (CoT) prompting emerges as a simple and effective strategy for enhancing the performance of large language models (LLMs) in real-world reasoning tasks. Nonetheless, the efficacy of a singular, task-level prompt uniformly applied across the whole of instances is inherently limited since one prompt cannot be a good partner for all, a more appropriate approach should consid… ▽ More

    Submitted 30 October, 2024; v1 submitted 30 September, 2024; originally announced September 2024.

    Comments: Accepted by NeurIPS 2024

  26. arXiv:2409.14751  [pdf, other

    cs.CV cs.AI

    UniBEVFusion: Unified Radar-Vision BEVFusion for 3D Object Detection

    Authors: Haocheng Zhao, Runwei Guan, Taoyu Wu, Ka Lok Man, Limin Yu, Yutao Yue

    Abstract: 4D millimeter-wave (MMW) radar, which provides both height information and dense point cloud data over 3D MMW radar, has become increasingly popular in 3D object detection. In recent years, radar-vision fusion models have demonstrated performance close to that of LiDAR-based models, offering advantages in terms of lower hardware costs and better resilience in extreme conditions. However, many rada… ▽ More

    Submitted 23 September, 2024; originally announced September 2024.

    Comments: 6 pages, 4 figues, conference

  27. arXiv:2409.10330  [pdf, other

    cs.RO cs.CV

    DRIVE: Dependable Robust Interpretable Visionary Ensemble Framework in Autonomous Driving

    Authors: Songning Lai, Tianlang Xue, Hongru Xiao, Lijie Hu, Jiemin Wu, Ninghui Feng, Runwei Guan, Haicheng Liao, Zhenning Li, Yutao Yue

    Abstract: Recent advancements in autonomous driving have seen a paradigm shift towards end-to-end learning paradigms, which map sensory inputs directly to driving actions, thereby enhancing the robustness and adaptability of autonomous vehicles. However, these models often sacrifice interpretability, posing significant challenges to trust, safety, and regulatory compliance. To address these issues, we intro… ▽ More

    Submitted 16 September, 2024; originally announced September 2024.

  28. arXiv:2409.03192  [pdf, other

    cs.CV

    PEPL: Precision-Enhanced Pseudo-Labeling for Fine-Grained Image Classification in Semi-Supervised Learning

    Authors: Bowen Tian, Songning Lai, Lujundong Li, Zhihao Shuai, Runwei Guan, Tian Wu, Yutao Yue

    Abstract: Fine-grained image classification has witnessed significant advancements with the advent of deep learning and computer vision technologies. However, the scarcity of detailed annotations remains a major challenge, especially in scenarios where obtaining high-quality labeled data is costly or time-consuming. To address this limitation, we introduce Precision-Enhanced Pseudo-Labeling(PEPL) approach s… ▽ More

    Submitted 4 September, 2024; originally announced September 2024.

    Comments: Under review

  29. arXiv:2408.17207  [pdf, other

    cs.CV cs.RO

    NanoMVG: USV-Centric Low-Power Multi-Task Visual Grounding based on Prompt-Guided Camera and 4D mmWave Radar

    Authors: Runwei Guan, Jianan Liu, Liye Jia, Haocheng Zhao, Shanliang Yao, Xiaohui Zhu, Ka Lok Man, Eng Gee Lim, Jeremy Smith, Yutao Yue

    Abstract: Recently, visual grounding and multi-sensors setting have been incorporated into perception system for terrestrial autonomous driving systems and Unmanned Surface Vehicles (USVs), yet the high complexity of modern learning-based visual grounding model using multi-sensors prevents such model to be deployed on USVs in the real-life. To this end, we design a low-power multi-task model named NanoMVG f… ▽ More

    Submitted 11 February, 2025; v1 submitted 30 August, 2024; originally announced August 2024.

    Comments: 8 pages, 6 figures

  30. radarODE: An ODE-Embedded Deep Learning Model for Contactless ECG Reconstruction from Millimeter-Wave Radar

    Authors: Yuanyuan Zhang, Runwei Guan, Lingxiao Li, Rui Yang, Yutao Yue, Eng Gee Lim

    Abstract: Radar-based contactless cardiac monitoring has become a popular research direction recently, but the fine-grained electrocardiogram (ECG) signal is still hard to reconstruct from millimeter-wave radar signal. The key obstacle is to decouple the cardiac activities in the electrical domain (i.e., ECG) from that in the mechanical domain (i.e., heartbeat), and most existing research only uses pure dat… ▽ More

    Submitted 6 May, 2025; v1 submitted 3 August, 2024; originally announced August 2024.

    Journal ref: IEEE Transactions on Mobile Computing, 2025

  31. arXiv:2407.19192  [pdf, other

    cs.CL cs.CV cs.MM

    Harmfully Manipulated Images Matter in Multimodal Misinformation Detection

    Authors: Bing Wang, Shengsheng Wang, Changchun Li, Renchu Guan, Ximing Li

    Abstract: Nowadays, misinformation is widely spreading over various social media platforms and causes extremely negative impacts on society. To combat this issue, automatically identifying misinformation, especially those containing multimodal content, has attracted growing attention from the academic and industrial communities, and induced an active research topic named Multimodal Misinformation Detection… ▽ More

    Submitted 27 July, 2024; originally announced July 2024.

    Comments: Accepted by ACM MM 2024. Code: https://github.com/wangbing1416/HAMI-M3D

  32. arXiv:2407.14732  [pdf, other

    cs.LG cs.SI

    Meta-GPS++: Enhancing Graph Meta-Learning with Contrastive Learning and Self-Training

    Authors: Yonghao Liu, Mengyu Li, Ximing Li, Lan Huang, Fausto Giunchiglia, Yanchun Liang, Xiaoyue Feng, Renchu Guan

    Abstract: Node classification is an essential problem in graph learning. However, many models typically obtain unsatisfactory performance when applied to few-shot scenarios. Some studies have attempted to combine meta-learning with graph neural networks to solve few-shot node classification on graphs. Despite their promising performance, some limitations remain. First, they employ the node encoding mechanis… ▽ More

    Submitted 19 July, 2024; originally announced July 2024.

    Comments: ACM Transactions on Knowledge Discovery from Data (TKDD)

  33. arXiv:2407.04183  [pdf, other

    cs.CL cs.AI cs.CY cs.HC

    Seeing Like an AI: How LLMs Apply (and Misapply) Wikipedia Neutrality Norms

    Authors: Joshua Ashkinaze, Ruijia Guan, Laura Kurek, Eytan Adar, Ceren Budak, Eric Gilbert

    Abstract: Large language models (LLMs) are trained on broad corpora and then used in communities with specialized norms. Is providing LLMs with community rules enough for models to follow these norms? We evaluate LLMs' capacity to detect (Task 1) and correct (Task 2) biased Wikipedia edits according to Wikipedia's Neutral Point of View (NPOV) policy. LLMs struggled with bias detection, achieving only 64% ac… ▽ More

    Submitted 14 September, 2024; v1 submitted 4 July, 2024; originally announced July 2024.

  34. arXiv:2405.12821  [pdf, other

    cs.RO cs.CV

    Talk2Radar: Bridging Natural Language with 4D mmWave Radar for 3D Referring Expression Comprehension

    Authors: Runwei Guan, Ruixiao Zhang, Ningwei Ouyang, Jianan Liu, Ka Lok Man, Xiaohao Cai, Ming Xu, Jeremy Smith, Eng Gee Lim, Yutao Yue, Hui Xiong

    Abstract: Embodied perception is essential for intelligent vehicles and robots in interactive environmental understanding. However, these advancements primarily focus on vision, with limited attention given to using 3D modeling sensors, restricting a comprehensive understanding of objects in response to prompts containing qualitative and quantitative queries. Recently, as a promising automotive sensor with… ▽ More

    Submitted 9 February, 2025; v1 submitted 21 May, 2024; originally announced May 2024.

    Comments: Accepted by ICRA 2025

  35. arXiv:2405.12434  [pdf, other

    cs.CL

    Resolving Word Vagueness with Scenario-guided Adapter for Natural Language Inference

    Authors: Yonghao Liu, Mengyu Li, Di Liang, Ximing Li, Fausto Giunchiglia, Lan Huang, Xiaoyue Feng, Renchu Guan

    Abstract: Natural Language Inference (NLI) is a crucial task in natural language processing that involves determining the relationship between two sentences, typically referred to as the premise and the hypothesis. However, traditional NLI models solely rely on the semantic information inherent in independent sentences and lack relevant situational visual information, which can hinder a complete understandi… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: IJCAI24

  36. arXiv:2405.11524  [pdf, other

    cs.CL

    Simple-Sampling and Hard-Mixup with Prototypes to Rebalance Contrastive Learning for Text Classification

    Authors: Mengyu Li, Yonghao Liu, Fausto Giunchiglia, Xiaoyue Feng, Renchu Guan

    Abstract: Text classification is a crucial and fundamental task in natural language processing. Compared with the previous learning paradigm of pre-training and fine-tuning by cross entropy loss, the recently proposed supervised contrastive learning approach has received tremendous attention due to its powerful feature learning capability and robustness. Although several studies have incorporated this techn… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

    Comments: 12 pages, 9 figures

  37. arXiv:2404.10342  [pdf, other

    cs.CV cs.MM

    Referring Flexible Image Restoration

    Authors: Runwei Guan, Rongsheng Hu, Zhuhao Zhou, Tianlang Xue, Ka Lok Man, Jeremy Smith, Eng Gee Lim, Weiping Ding, Yutao Yue

    Abstract: In reality, images often exhibit multiple degradations, such as rain and fog at night (triple degradations). However, in many cases, individuals may not want to remove all degradations, for instance, a blurry lens revealing a beautiful snowy landscape (double degradations). In such scenarios, people may only desire to deblur. These situations and requirements shed light on a new challenge in image… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: 15 pages, 19 figures

  38. arXiv:2404.09790  [pdf, other

    cs.CV

    NTIRE 2024 Challenge on Image Super-Resolution ($\times$4): Methods and Results

    Authors: Zheng Chen, Zongwei Wu, Eduard Zamfir, Kai Zhang, Yulun Zhang, Radu Timofte, Xiaokang Yang, Hongyuan Yu, Cheng Wan, Yuxin Hong, Zhijuan Huang, Yajun Zou, Yuan Huang, Jiamin Lin, Bingnan Han, Xianyu Guan, Yongsheng Yu, Daoan Zhang, Xuanwu Yin, Kunlong Zuo, Jinhua Hao, Kai Zhao, Kun Yuan, Ming Sun, Chao Zhou , et al. (63 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2024 challenge on image super-resolution ($\times$4), highlighting the solutions proposed and the outcomes obtained. The challenge involves generating corresponding high-resolution (HR) images, magnified by a factor of four, from low-resolution (LR) inputs using prior information. The LR images originate from bicubic downsampling degradation. The aim of the challenge i… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: NTIRE 2024 webpage: https://cvlai.net/ntire/2024. Code: https://github.com/zhengchen1999/NTIRE2024_ImageSR_x4

  39. arXiv:2404.05211  [pdf, other

    cs.CV

    Multi-level Graph Subspace Contrastive Learning for Hyperspectral Image Clustering

    Authors: Jingxin Wang, Renxiang Guan, Kainan Gao, Zihao Li, Hao Li, Xianju Li, Chang Tang

    Abstract: Hyperspectral image (HSI) clustering is a challenging task due to its high complexity. Despite subspace clustering shows impressive performance for HSI, traditional methods tend to ignore the global-local interaction in HSI data. In this study, we proposed a multi-level graph subspace contrastive learning (MLGSC) for HSI clustering. The model is divided into the following main parts. Graph convolu… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: IJCNN 2024

  40. arXiv:2404.00964  [pdf, other

    cs.CV

    S2RC-GCN: A Spatial-Spectral Reliable Contrastive Graph Convolutional Network for Complex Land Cover Classification Using Hyperspectral Images

    Authors: Renxiang Guan, Zihao Li, Chujia Song, Guo Yu, Xianju Li, Ruyi Feng

    Abstract: Spatial correlations between different ground objects are an important feature of mining land cover research. Graph Convolutional Networks (GCNs) can effectively capture such spatial feature representations and have demonstrated promising results in performing hyperspectral imagery (HSI) classification tasks of complex land. However, the existing GCN-based HSI classification methods are prone to i… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: Accepted to IJCNN 2024 (International Joint Conference on Neural Networks)

  41. arXiv:2403.12686  [pdf, other

    cs.CV cs.MM cs.RO

    WaterVG: Waterway Visual Grounding based on Text-Guided Vision and mmWave Radar

    Authors: Runwei Guan, Liye Jia, Fengyufan Yang, Shanliang Yao, Erick Purwanto, Xiaohui Zhu, Eng Gee Lim, Jeremy Smith, Ka Lok Man, Xuming Hu, Yutao Yue

    Abstract: The perception of waterways based on human intent is significant for autonomous navigation and operations of Unmanned Surface Vehicles (USVs) in water environments. Inspired by visual grounding, we introduce WaterVG, the first visual grounding dataset designed for USV-based waterway perception based on human prompts. WaterVG encompasses prompts describing multiple targets, with annotations at the… ▽ More

    Submitted 4 April, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: 10 pages, 10 figures

  42. arXiv:2403.01465  [pdf

    cs.CV

    Multiview Subspace Clustering of Hyperspectral Images based on Graph Convolutional Networks

    Authors: Xianju Li, Renxiang Guan, Zihao Li, Hao Liu, Jing Yang

    Abstract: High-dimensional and complex spectral structures make clustering of hy-perspectral images (HSI) a challenging task. Subspace clustering has been shown to be an effective approach for addressing this problem. However, current subspace clustering algorithms are mainly designed for a single view and do not fully exploit spatial or texture feature information in HSI. This study proposed a multiview su… ▽ More

    Submitted 3 March, 2024; originally announced March 2024.

    Comments: This paper was accepted by APWEB-WAIM 2024

  43. arXiv:2312.09630  [pdf, other

    cs.CV cs.AI

    Pixel-Superpixel Contrastive Learning and Pseudo-Label Correction for Hyperspectral Image Clustering

    Authors: Renxiang Guan, Zihao Li, Xianju Li, Chang Tang

    Abstract: Hyperspectral image (HSI) clustering is gaining considerable attention owing to recent methods that overcome the inefficiency and misleading results from the absence of supervised information. Contrastive learning methods excel at existing pixel level and super pixel level HSI clustering tasks. The pixel-level contrastive learning method can effectively improve the ability of the model to capture… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: Accepted at IEEE ICASSP 2024

  44. arXiv:2312.08851  [pdf, other

    cs.CV cs.CE cs.RO

    Achelous++: Power-Oriented Water-Surface Panoptic Perception Framework on Edge Devices based on Vision-Radar Fusion and Pruning of Heterogeneous Modalities

    Authors: Runwei Guan, Haocheng Zhao, Shanliang Yao, Ka Lok Man, Xiaohui Zhu, Limin Yu, Yong Yue, Jeremy Smith, Eng Gee Lim, Weiping Ding, Yutao Yue

    Abstract: Urban water-surface robust perception serves as the foundation for intelligent monitoring of aquatic environments and the autonomous navigation and operation of unmanned vessels, especially in the context of waterway safety. It is worth noting that current multi-sensor fusion and multi-task learning models consume substantial power and heavily rely on high-power GPUs for inference. This contribute… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: 18 pages, 9 figures

  45. arXiv:2312.06068  [pdf, other

    cs.CV cs.AI

    Contrastive Multi-view Subspace Clustering of Hyperspectral Images based on Graph Convolutional Networks

    Authors: Renxiang Guan, Zihao Li, Xianju Li, Chang Tang, Ruyi Feng

    Abstract: High-dimensional and complex spectral structures make the clustering of hyperspectral images (HSI) a challenging task. Subspace clustering is an effective approach for addressing this problem. However, current subspace clustering algorithms are primarily designed for a single view and do not fully exploit the spatial or textural feature information in HSI. In this study, contrastive multi-view sub… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

  46. Exploring Radar Data Representations in Autonomous Driving: A Comprehensive Review

    Authors: Shanliang Yao, Runwei Guan, Zitian Peng, Chenhang Xu, Yilu Shi, Weiping Ding, Eng Gee Lim, Yong Yue, Hyungjoon Seo, Ka Lok Man, Jieming Ma, Xiaohui Zhu, Yutao Yue

    Abstract: With the rapid advancements of sensor technology and deep learning, autonomous driving systems are providing safe and efficient access to intelligent vehicles as well as intelligent transportation. Among these equipped sensors, the radar sensor plays a crucial role in providing robust perception information in diverse environmental conditions. This review focuses on exploring different radar data… ▽ More

    Submitted 21 April, 2025; v1 submitted 8 December, 2023; originally announced December 2023.

    Comments: Accepted by TITS

    Journal ref: IEEE Transactions on Intelligent Transportation Systems 2025

  47. arXiv:2308.10287  [pdf, other

    cs.CV cs.RO

    ASY-VRNet: Waterway Panoptic Driving Perception Model based on Asymmetric Fair Fusion of Vision and 4D mmWave Radar

    Authors: Runwei Guan, Shanliang Yao, Xiaohui Zhu, Ka Lok Man, Yong Yue, Jeremy Smith, Eng Gee Lim, Yutao Yue

    Abstract: Panoptic Driving Perception (PDP) is critical for the autonomous navigation of Unmanned Surface Vehicles (USVs). A PDP model typically integrates multiple tasks, necessitating the simultaneous and robust execution of various perception tasks to facilitate downstream path planning. The fusion of visual and radar sensors is currently acknowledged as a robust and cost-effective approach. However, mos… ▽ More

    Submitted 4 July, 2024; v1 submitted 20 August, 2023; originally announced August 2023.

    Comments: Accepted by IROS 2024

  48. arXiv:2307.07102  [pdf, other

    cs.CV cs.RO

    Achelous: A Fast Unified Water-surface Panoptic Perception Framework based on Fusion of Monocular Camera and 4D mmWave Radar

    Authors: Runwei Guan, Shanliang Yao, Xiaohui Zhu, Ka Lok Man, Eng Gee Lim, Jeremy Smith, Yong Yue, Yutao Yue

    Abstract: Current perception models for different tasks usually exist in modular forms on Unmanned Surface Vehicles (USVs), which infer extremely slowly in parallel on edge devices, causing the asynchrony between perception results and USV position, and leading to error decisions of autonomous navigation. Compared with Unmanned Ground Vehicles (UGVs), the robust perception of USVs develops relatively slowly… ▽ More

    Submitted 13 July, 2023; originally announced July 2023.

    Comments: Accepted by ITSC 2023

  49. arXiv:2307.06505  [pdf, other

    cs.CV cs.RO

    WaterScenes: A Multi-Task 4D Radar-Camera Fusion Dataset and Benchmarks for Autonomous Driving on Water Surfaces

    Authors: Shanliang Yao, Runwei Guan, Zhaodong Wu, Yi Ni, Zile Huang, Ryan Wen Liu, Yong Yue, Weiping Ding, Eng Gee Lim, Hyungjoon Seo, Ka Lok Man, Jieming Ma, Xiaohui Zhu, Yutao Yue

    Abstract: Autonomous driving on water surfaces plays an essential role in executing hazardous and time-consuming missions, such as maritime surveillance, survivors rescue, environmental monitoring, hydrography mapping and waste cleaning. This work presents WaterScenes, the first multi-task 4D radar-camera fusion dataset for autonomous driving on water surfaces. Equipped with a 4D radar and a monocular camer… ▽ More

    Submitted 15 June, 2024; v1 submitted 12 July, 2023; originally announced July 2023.

    Comments: Accepted by IEEE Transactions on Intelligent Transportation Systems

  50. arXiv:2304.10893  [pdf, other

    cs.CV cs.MM

    FindVehicle and VehicleFinder: A NER dataset for natural language-based vehicle retrieval and a keyword-based cross-modal vehicle retrieval system

    Authors: Runwei Guan, Ka Lok Man, Feifan Chen, Shanliang Yao, Rongsheng Hu, Xiaohui Zhu, Jeremy Smith, Eng Gee Lim, Yutao Yue

    Abstract: Natural language (NL) based vehicle retrieval is a task aiming to retrieve a vehicle that is most consistent with a given NL query from among all candidate vehicles. Because NL query can be easily obtained, such a task has a promising prospect in building an interactive intelligent traffic system (ITS). Current solutions mainly focus on extracting both text and image features and mapping them to t… ▽ More

    Submitted 21 April, 2023; originally announced April 2023.