Skip to main content

Showing 1–50 of 130 results for author: Qiao, Z

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.01417  [pdf, ps, other

    cs.CV cs.LG

    Gradient Short-Circuit: Efficient Out-of-Distribution Detection via Feature Intervention

    Authors: Jiawei Gu, Ziyue Qiao, Zechao Li

    Abstract: Out-of-Distribution (OOD) detection is critical for safely deploying deep models in open-world environments, where inputs may lie outside the training distribution. During inference on a model trained exclusively with In-Distribution (ID) data, we observe a salient gradient phenomenon: around an ID sample, the local gradient directions for "enhancing" that sample's predicted class remain relativel… ▽ More

    Submitted 4 July, 2025; v1 submitted 2 July, 2025; originally announced July 2025.

    Comments: Accepted to ICCV 2025

  2. arXiv:2506.21343  [pdf, ps, other

    cs.LG

    DynamicBench: Evaluating Real-Time Report Generation in Large Language Models

    Authors: Jingyao Li, Hao Sun, Zile Qiao, Yong Jiang, Pengjun Xie, Fei Huang, Hong Xu, Jiaya Jia

    Abstract: Traditional benchmarks for large language models (LLMs) typically rely on static evaluations through storytelling or opinion expression, which fail to capture the dynamic requirements of real-time information processing in contemporary applications. To address this limitation, we present DynamicBench, a benchmark designed to evaluate the proficiency of LLMs in storing and processing up-to-the-minu… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

  3. arXiv:2506.14087  [pdf, ps, other

    cs.LG

    Multi-Scale Finetuning for Encoder-based Time Series Foundation Models

    Authors: Zhongzheng Qiao, Chenghao Liu, Yiming Zhang, Ming Jin, Quang Pham, Qingsong Wen, P. N. Suganthan, Xudong Jiang, Savitha Ramasamy

    Abstract: Time series foundation models (TSFMs) demonstrate impressive zero-shot performance for time series forecasting. However, an important yet underexplored challenge is how to effectively finetune TSFMs on specific downstream tasks. While naive finetuning can yield performance gains, we argue that it falls short of fully leveraging TSFMs' capabilities, often resulting in overfitting and suboptimal per… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

  4. arXiv:2506.06887  [pdf, ps, other

    cs.CL

    Mixture of Small and Large Models for Chinese Spelling Check

    Authors: Ziheng Qiao, Houquan Zhou, Zhenghua Li

    Abstract: In the era of large language models (LLMs), the Chinese Spelling Check (CSC) task has seen various LLM methods developed, yet their performance remains unsatisfactory. In contrast, fine-tuned BERT-based models, relying on high-quality in-domain data, show excellent performance but suffer from edit pattern overfitting. This paper proposes a novel dynamic mixture approach that effectively combines t… ▽ More

    Submitted 7 June, 2025; originally announced June 2025.

  5. arXiv:2506.03674  [pdf, other

    cs.LG

    Out-of-Distribution Graph Models Merging

    Authors: Yidi Wang, Jiawei Gu, pei Xiaobing, Xubin Zheng, Xiao Luo, Pengyang Wang, Ziyue Qiao

    Abstract: This paper studies a novel problem of out-of-distribution graph models merging, which aims to construct a generalized model from multiple graph models pre-trained on different domains with distribution discrepancy. This problem is challenging because of the difficulty in learning domain-invariant knowledge implicitly in model parameters and consolidating expertise from potentially heterogeneous GN… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

  6. arXiv:2505.22389  [pdf, ps, other

    cs.LG cs.AI

    Train with Perturbation, Infer after Merging: A Two-Stage Framework for Continual Learning

    Authors: Haomiao Qiu, Miao Zhang, Ziyue Qiao, Liqiang Nie

    Abstract: Continual Learning (CL) aims to enable models to continuously acquire new knowledge from a sequence of tasks with avoiding the forgetting of learned information. However, existing CL methods only rely on the parameters of the most recent task for inference, which makes them susceptible to catastrophic forgetting. Inspired by the recent success of model merging techniques, we propose \textbf{Pertur… ▽ More

    Submitted 16 June, 2025; v1 submitted 28 May, 2025; originally announced May 2025.

    Comments: 17 pages, 3 figures

  7. arXiv:2505.22370  [pdf, ps, other

    cs.LG cs.AI

    SplitLoRA: Balancing Stability and Plasticity in Continual Learning Through Gradient Space Splitting

    Authors: Haomiao Qiu, Miao Zhang, Ziyue Qiao, Weili Guan, Min Zhang, Liqiang Nie

    Abstract: Continual Learning requires a model to learn multiple tasks in sequence while maintaining both stability:preserving knowledge from previously learned tasks, and plasticity:effectively learning new tasks. Gradient projection has emerged as an effective and popular paradigm in CL, where it partitions the gradient space of previously learned tasks into two orthogonal subspaces: a primary subspace and… ▽ More

    Submitted 11 June, 2025; v1 submitted 28 May, 2025; originally announced May 2025.

    Comments: 18 pages, 4 figures

  8. arXiv:2505.20246  [pdf, ps, other

    cs.AI cs.CL

    On Path to Multimodal Historical Reasoning: HistBench and HistAgent

    Authors: Jiahao Qiu, Fulian Xiao, Yimin Wang, Yuchen Mao, Yijia Chen, Xinzhe Juan, Shu Zhang, Siran Wang, Xuan Qi, Tongcheng Zhang, Zixin Yao, Jiacheng Guo, Yifu Lu, Charles Argon, Jundi Cui, Daixin Chen, Junran Zhou, Shuyao Zhou, Zhanpeng Zhou, Ling Yang, Shilong Liu, Hongru Wang, Kaixuan Huang, Xun Jiang, Yuming Cao , et al. (74 additional authors not shown)

    Abstract: Recent advances in large language models (LLMs) have led to remarkable progress across domains, yet their capabilities in the humanities, particularly history, remain underexplored. Historical reasoning poses unique challenges for AI, involving multimodal source interpretation, temporal inference, and cross-linguistic analysis. While general-purpose agents perform well on many existing benchmarks,… ▽ More

    Submitted 19 June, 2025; v1 submitted 26 May, 2025; originally announced May 2025.

    Comments: 17 pages, 7 figures

  9. arXiv:2505.17421  [pdf, ps, other

    cs.IT eess.SP

    Adaptive Implicit-Based Deep Learning Channel Estimation for 6G Communications

    Authors: Zhen Qiao, Jiang Xue, Junkai Zhang, Guanzhang Liu, Xiaoqin Ma, Runhua Li, Faheem A. Khan, John S. Thompson, Zongben Xu

    Abstract: With the widespread deployment of fifth-generation (5G) wireless networks, research on sixth-generation (6G) technology is gaining momentum. Artificial Intelligence (AI) is anticipated to play a significant role in 6G, particularly through integration with the physical layer for tasks such as channel estimation. Considering resource limitations in real systems, the AI algorithm should be designed… ▽ More

    Submitted 22 May, 2025; originally announced May 2025.

  10. arXiv:2505.16860  [pdf, ps, other

    cs.LG cs.AI

    GCAL: Adapting Graph Models to Evolving Domain Shifts

    Authors: Ziyue Qiao, Qianyi Cai, Hao Dong, Jiawei Gu, Pengyang Wang, Meng Xiao, Xiao Luo, Hui Xiong

    Abstract: This paper addresses the challenge of graph domain adaptation on evolving, multiple out-of-distribution (OOD) graphs. Conventional graph domain adaptation methods are confined to single-step adaptation, making them ineffective in handling continuous domain shifts and prone to catastrophic forgetting. This paper introduces the Graph Continual Adaptive Learning (GCAL) method, designed to enhance mod… ▽ More

    Submitted 22 May, 2025; originally announced May 2025.

    Comments: Accepted to ICML 2025

  11. arXiv:2505.16314  [pdf, ps, other

    cs.CV cs.AI

    NTIRE 2025 challenge on Text to Image Generation Model Quality Assessment

    Authors: Shuhao Han, Haotian Fan, Fangyuan Kong, Wenjie Liao, Chunle Guo, Chongyi Li, Radu Timofte, Liang Li, Tao Li, Junhui Cui, Yunqiu Wang, Yang Tai, Jingwei Sun, Jianhui Sun, Xinli Yue, Tianyi Wang, Huan Hou, Junda Lu, Xinyang Huang, Zitang Zhou, Zijian Zhang, Xuhui Zheng, Xuecheng Wu, Chong Peng, Xuezhi Cao , et al. (90 additional authors not shown)

    Abstract: This paper reports on the NTIRE 2025 challenge on Text to Image (T2I) generation model quality assessment, which will be held in conjunction with the New Trends in Image Restoration and Enhancement Workshop (NTIRE) at CVPR 2025. The aim of this challenge is to address the fine-grained quality assessment of text-to-image generation models. This challenge evaluates text-to-image models from two aspe… ▽ More

    Submitted 22 May, 2025; originally announced May 2025.

  12. arXiv:2505.16214  [pdf

    cs.RO cs.SE

    Behavioral Safety Assessment towards Large-scale Deployment of Autonomous Vehicles

    Authors: Henry X. Liu, Xintao Yan, Haowei Sun, Tinghan Wang, Zhijie Qiao, Haojie Zhu, Shengyin Shen, Shuo Feng, Greg Stevens, Greg McGuire

    Abstract: Autonomous vehicles (AVs) have significantly advanced in real-world deployment in recent years, yet safety continues to be a critical barrier to widespread adoption. Traditional functional safety approaches, which primarily verify the reliability, robustness, and adequacy of AV hardware and software systems from a vehicle-centric perspective, do not sufficiently address the AV's broader interactio… ▽ More

    Submitted 30 May, 2025; v1 submitted 22 May, 2025; originally announced May 2025.

    Comments: Code and Supplementary Materials available at: https://github.com/michigan-traffic-lab/Behavioral-Safety-Assessment

  13. arXiv:2505.15180  [pdf, other

    cs.LG

    NeuBM: Mitigating Model Bias in Graph Neural Networks through Neutral Input Calibration

    Authors: Jiawei Gu, Ziyue Qiao, Xiao Luo

    Abstract: Graph Neural Networks (GNNs) have shown remarkable performance across various domains, yet they often struggle with model bias, particularly in the presence of class imbalance. This bias can lead to suboptimal performance and unfair predictions, especially for underrepresented classes. We introduce NeuBM (Neutral Bias Mitigation), a novel approach to mitigate model bias in GNNs through neutral inp… ▽ More

    Submitted 23 May, 2025; v1 submitted 21 May, 2025; originally announced May 2025.

    Comments: Accepted to IJCAI 2025

  14. arXiv:2505.15177  [pdf, other

    cs.LG

    SpectralGap: Graph-Level Out-of-Distribution Detection via Laplacian Eigenvalue Gaps

    Authors: Jiawei Gu, Ziyue Qiao, Zechao Li

    Abstract: The task of graph-level out-of-distribution (OOD) detection is crucial for deploying graph neural networks in real-world settings. In this paper, we observe a significant difference in the relationship between the largest and second-largest eigenvalues of the Laplacian matrix for in-distribution (ID) and OOD graph samples: \textit{OOD samples often exhibit anomalous spectral gaps (the difference b… ▽ More

    Submitted 23 May, 2025; v1 submitted 21 May, 2025; originally announced May 2025.

    Comments: Accepted to IJCAI 2025

  15. arXiv:2505.14020  [pdf, ps, other

    cs.AI cs.IR cs.LG

    Disentangled Multi-span Evolutionary Network against Temporal Knowledge Graph Reasoning

    Authors: Hao Dong, Ziyue Qiao, Zhiyuan Ning, Qi Hao, Yi Du, Pengyang Wang, Yuanchun Zhou

    Abstract: Temporal Knowledge Graphs (TKGs), as an extension of static Knowledge Graphs (KGs), incorporate the temporal feature to express the transience of knowledge by describing when facts occur. TKG extrapolation aims to infer possible future facts based on known history, which has garnered significant attention in recent years. Some existing methods treat TKG as a sequence of independent subgraphs to mo… ▽ More

    Submitted 29 May, 2025; v1 submitted 20 May, 2025; originally announced May 2025.

    Comments: Accepted to ACL 2025 Findings

  16. arXiv:2505.13812  [pdf, other

    cs.CV

    Physics-Driven Local-Whole Elastic Deformation Modeling for Point Cloud Representation Learning

    Authors: Zhongyu Chen, Rong Zhao, Xie Han, Xindong Guo, Song Wang, Zherui Qiao

    Abstract: Existing point cloud representation learning tend to learning the geometric distribution of objects through data-driven approaches, emphasizing structural features while overlooking the relationship between the local information and the whole structure. Local features reflect the fine-grained variations of an object, while the whole structure is determined by the interaction and combination of the… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

  17. arXiv:2505.05533  [pdf, other

    cs.LG cs.AI

    Rethinking Graph Contrastive Learning through Relative Similarity Preservation

    Authors: Zhiyuan Ning, Pengfei Wang, Ziyue Qiao, Pengyang Wang, Yuanchun Zhou

    Abstract: Graph contrastive learning (GCL) has achieved remarkable success by following the computer vision paradigm of preserving absolute similarity between augmented views. However, this approach faces fundamental challenges in graphs due to their discrete, non-Euclidean nature -- view generation often breaks semantic validity and similarity verification becomes unreliable. Through analyzing 11 real-worl… ▽ More

    Submitted 12 May, 2025; v1 submitted 8 May, 2025; originally announced May 2025.

    Comments: Accepted by IJCAI2025; full version including appendix

  18. arXiv:2505.04881  [pdf, other

    cs.LG cs.AI cs.CL

    ConCISE: Confidence-guided Compression in Step-by-step Efficient Reasoning

    Authors: Ziqing Qiao, Yongheng Deng, Jiali Zeng, Dong Wang, Lai Wei, Fandong Meng, Jie Zhou, Ju Ren, Yaoxue Zhang

    Abstract: Large Reasoning Models (LRMs) perform strongly in complex reasoning tasks via Chain-of-Thought (CoT) prompting, but often suffer from verbose outputs caused by redundant content, increasing computational overhead, and degrading user experience. Existing compression methods either operate post-hoc pruning, risking disruption to reasoning coherence, or rely on sampling-based selection, which fails t… ▽ More

    Submitted 7 May, 2025; originally announced May 2025.

  19. arXiv:2505.04588  [pdf, other

    cs.CL

    ZeroSearch: Incentivize the Search Capability of LLMs without Searching

    Authors: Hao Sun, Zile Qiao, Jiayan Guo, Xuanbo Fan, Yingyan Hou, Yong Jiang, Pengjun Xie, Yan Zhang, Fei Huang, Jingren Zhou

    Abstract: Effective information searching is essential for enhancing the reasoning and generation capabilities of large language models (LLMs). Recent research has explored using reinforcement learning (RL) to improve LLMs' search capabilities by interacting with live search engines in real-world environments. While these approaches show promising results, they face two major challenges: (1) Uncontrolled Do… ▽ More

    Submitted 16 May, 2025; v1 submitted 7 May, 2025; originally announced May 2025.

  20. arXiv:2505.00284  [pdf, ps, other

    cs.RO cs.AI

    LightEMMA: Lightweight End-to-End Multimodal Model for Autonomous Driving

    Authors: Zhijie Qiao, Haowei Li, Zhong Cao, Henry X. Liu

    Abstract: Vision-Language Models (VLMs) have demonstrated significant potential for end-to-end autonomous driving. However, fully exploiting their capabilities for safe and reliable vehicle control remains an open research challenge. To systematically examine advances and limitations of VLMs in driving tasks, we introduce LightEMMA, a Lightweight End-to-End Multimodal Model for Autonomous driving. LightEMMA… ▽ More

    Submitted 1 May, 2025; originally announced May 2025.

  21. arXiv:2504.17356  [pdf, other

    cs.AI cs.LG

    Comprehend, Divide, and Conquer: Feature Subspace Exploration via Multi-Agent Hierarchical Reinforcement Learning

    Authors: Weiliang Zhang, Xiaohan Huang, Yi Du, Ziyue Qiao, Qingqing Long, Zhen Meng, Yuanchun Zhou, Meng Xiao

    Abstract: Feature selection aims to preprocess the target dataset, find an optimal and most streamlined feature subset, and enhance the downstream machine learning task. Among filter, wrapper, and embedded-based approaches, the reinforcement learning (RL)-based subspace exploration strategy provides a novel objective optimization-directed perspective and promising performance. Nevertheless, even with improv… ▽ More

    Submitted 24 April, 2025; originally announced April 2025.

    Comments: 20 pages, keywords: Automated Feature Engineering, Tabular Dataset, Multi-Agent Reinforcement Learning, Feature Selection

  22. arXiv:2504.17355  [pdf, other

    cs.LG cs.AI

    Collaborative Multi-Agent Reinforcement Learning for Automated Feature Transformation with Graph-Driven Path Optimization

    Authors: Xiaohan Huang, Dongjie Wang, Zhiyuan Ning, Ziyue Qiao, Qingqing Long, Haowei Zhu, Yi Du, Min Wu, Yuanchun Zhou, Meng Xiao

    Abstract: Feature transformation methods aim to find an optimal mathematical feature-feature crossing process that generates high-value features and improves the performance of downstream machine learning tasks. Existing frameworks, though designed to mitigate manual costs, often treat feature transformations as isolated operations, ignoring dynamic dependencies between transformation steps. To address the… ▽ More

    Submitted 24 April, 2025; originally announced April 2025.

    Comments: 13 pages, Keywords: Automated Feature Transformation, Tabular Dataset, Reinforcement Learning

  23. arXiv:2504.14440  [pdf, other

    cs.RO cs.CV

    SG-Reg: Generalizable and Efficient Scene Graph Registration

    Authors: Chuhao Liu, Zhijian Qiao, Jieqi Shi, Ke Wang, Peize Liu, Shaojie Shen

    Abstract: This paper addresses the challenges of registering two rigid semantic scene graphs, an essential capability when an autonomous agent needs to register its map against a remote agent, or against a prior map. The hand-crafted descriptors in classical semantic-aided registration, or the ground-truth annotation reliance in learning-based scene graph registration, impede their application in practical… ▽ More

    Submitted 20 May, 2025; v1 submitted 19 April, 2025; originally announced April 2025.

    Comments: IEEE Transactions Robotics Regular Paper

  24. arXiv:2504.10273  [pdf, other

    cs.LG math.NA

    Sidecar: A Structure-Preserving Framework for Solving Partial Differential Equations with Neural Networks

    Authors: Gaohang Chen, Zhonghua Qiao

    Abstract: Solving partial differential equations (PDEs) with neural networks (NNs) has shown great potential in various scientific and engineering fields. However, most existing NN solvers mainly focus on satisfying the given PDEs, without explicitly considering intrinsic physical properties such as mass conservation or energy dissipation. This limitation can result in unstable or nonphysical solutions, par… ▽ More

    Submitted 14 April, 2025; originally announced April 2025.

    MSC Class: 65M99; 68T07; 35L65

  25. arXiv:2503.22655  [pdf, other

    cs.AI cs.CV cs.MM

    Unicorn: Text-Only Data Synthesis for Vision Language Model Training

    Authors: Xiaomin Yu, Pengxiang Ding, Wenjie Zhang, Siteng Huang, Songyang Gao, Chengwei Qin, Kejian Wu, Zhaoxin Fan, Ziyue Qiao, Donglin Wang

    Abstract: Training vision-language models (VLMs) typically requires large-scale, high-quality image-text pairs, but collecting or synthesizing such data is costly. In contrast, text data is abundant and inexpensive, prompting the question: can high-quality multimodal training data be synthesized purely from text? To tackle this, we propose a cross-integrated three-stage multimodal data synthesis framework,… ▽ More

    Submitted 28 March, 2025; originally announced March 2025.

  26. arXiv:2503.21460  [pdf, other

    cs.CL

    Large Language Model Agent: A Survey on Methodology, Applications and Challenges

    Authors: Junyu Luo, Weizhi Zhang, Ye Yuan, Yusheng Zhao, Junwei Yang, Yiyang Gu, Bohan Wu, Binqi Chen, Ziyue Qiao, Qingqing Long, Rongcheng Tu, Xiao Luo, Wei Ju, Zhiping Xiao, Yifan Wang, Meng Xiao, Chenwu Liu, Jingyang Yuan, Shichang Zhang, Yiqiao Jin, Fan Zhang, Xian Wu, Hanqing Zhao, Dacheng Tao, Philip S. Yu , et al. (1 additional authors not shown)

    Abstract: The era of intelligent agents is upon us, driven by revolutionary advancements in large language models. Large Language Model (LLM) agents, with goal-driven behaviors and dynamic adaptation capabilities, potentially represent a critical pathway toward artificial general intelligence. This survey systematically deconstructs LLM agent systems through a methodology-centered taxonomy, linking architec… ▽ More

    Submitted 27 March, 2025; originally announced March 2025.

    Comments: 329 papers surveyed, resources are at https://github.com/luo-junyu/Awesome-Agent-Papers

  27. arXiv:2503.20394  [pdf, other

    cs.LG cs.AI

    FastFT: Accelerating Reinforced Feature Transformation via Advanced Exploration Strategies

    Authors: Tianqi He, Xiaohan Huang, Yi Du, Qingqing Long, Ziyue Qiao, Min Wu, Yanjie Fu, Yuanchun Zhou, Meng Xiao

    Abstract: Feature Transformation is crucial for classic machine learning that aims to generate feature combinations to enhance the performance of downstream tasks from a data-centric perspective. Current methodologies, such as manual expert-driven processes, iterative-feedback techniques, and exploration-generative tactics, have shown promise in automating such data engineering workflow by minimizing human… ▽ More

    Submitted 26 March, 2025; originally announced March 2025.

    Comments: 14 pages, Accepted by ICDE 2025

  28. arXiv:2503.03629  [pdf, other

    cs.RO eess.SY

    TeraSim: Uncovering Unknown Unsafe Events for Autonomous Vehicles through Generative Simulation

    Authors: Haowei Sun, Xintao Yan, Zhijie Qiao, Haojie Zhu, Yihao Sun, Jiawei Wang, Shengyin Shen, Darian Hogue, Rajanikant Ananta, Derek Johnson, Greg Stevens, Greg McGuire, Yifan Wei, Wei Zheng, Yong Sun, Yasuo Fukai, Henry X. Liu

    Abstract: Traffic simulation is essential for autonomous vehicle (AV) development, enabling comprehensive safety evaluation across diverse driving conditions. However, traditional rule-based simulators struggle to capture complex human interactions, while data-driven approaches often fail to maintain long-term behavioral realism or generate diverse safety-critical events. To address these challenges, we pro… ▽ More

    Submitted 1 April, 2025; v1 submitted 5 March, 2025; originally announced March 2025.

  29. arXiv:2502.16856  [pdf, other

    cs.RO

    SLABIM: A SLAM-BIM Coupled Dataset in HKUST Main Building

    Authors: Haoming Huang, Zhijian Qiao, Zehuan Yu, Chuhao Liu, Shaojie Shen, Fumin Zhang, Huan Yin

    Abstract: Existing indoor SLAM datasets primarily focus on robot sensing, often lacking building architectures. To address this gap, we design and construct the first dataset to couple the SLAM and BIM, named SLABIM. This dataset provides BIM and SLAM-oriented sensor data, both modeling a university building at HKUST. The as-designed BIM is decomposed and converted for ease of use. We employ a multi-sensor… ▽ More

    Submitted 24 February, 2025; originally announced February 2025.

    Comments: Accepted by ICRA 2025. Dataset aviliable at https://github.com/HKUST-Aerial-Robotics/SLABIM . Video attachment at https://youtu.be/7NckgY15ABQ

  30. arXiv:2502.06205  [pdf, other

    cs.CL cs.AI cs.LG

    C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Generation

    Authors: Guoxin Chen, Minpeng Liao, Peiying Yu, Dingmin Wang, Zile Qiao, Chao Yang, Xin Zhao, Kai Fan

    Abstract: Retrieval-augmented generation (RAG) systems face a fundamental challenge in aligning independently developed retrievers and large language models (LLMs). Existing approaches typically involve modifying either component or introducing simple intermediate modules, resulting in practical limitations and sub-optimal performance. Inspired by human search behavior -- typically involving a back-and-fort… ▽ More

    Submitted 22 May, 2025; v1 submitted 10 February, 2025; originally announced February 2025.

    Comments: Camera ready version for ICML 2025

  31. arXiv:2501.14600  [pdf, other

    cs.SI

    On the Homophily of Heterogeneous Graphs: Understanding and Unleashing

    Authors: Zhen Tao, Ziyue Qiao, Chaoqi Chen, Zhengyi Yang, Lun Du, Qingqiang Sun

    Abstract: Homophily, the tendency of similar nodes to connect, is a fundamental phenomenon in network science and a critical factor in the performance of graph neural networks (GNNs). While existing studies primarily explore homophily in homogeneous graphs, where nodes share the same type, real-world networks are often more accurately modeled as heterogeneous graphs (HGs) with diverse node types and intrica… ▽ More

    Submitted 24 January, 2025; originally announced January 2025.

  32. arXiv:2501.06225  [pdf, other

    cs.CV cs.LG

    A Distributed Hybrid Quantum Convolutional Neural Network for Medical Image Classification

    Authors: Yangyang Li, Zhengya Qia, Yuelin Lia, Haorui Yanga, Ronghua Shanga, Licheng Jiaoa

    Abstract: Medical images are characterized by intricate and complex features, requiring interpretation by physicians with medical knowledge and experience. Classical neural networks can reduce the workload of physicians, but can only handle these complex features to a limited extent. Theoretically, quantum computing can explore a broader parameter space with fewer parameters, but it is currently limited by… ▽ More

    Submitted 7 January, 2025; originally announced January 2025.

  33. arXiv:2412.12984  [pdf, other

    cs.LG cs.AI cs.IR cs.SI

    Cluster-guided Contrastive Class-imbalanced Graph Classification

    Authors: Wei Ju, Zhengyang Mao, Siyu Yi, Yifang Qin, Yiyang Gu, Zhiping Xiao, Jianhao Shen, Ziyue Qiao, Ming Zhang

    Abstract: This paper studies the problem of class-imbalanced graph classification, which aims at effectively classifying the graph categories in scenarios with imbalanced class distributions. While graph neural networks (GNNs) have achieved remarkable success, their modeling ability on imbalanced graph-structured data remains suboptimal, which typically leads to predictions biased towards the majority class… ▽ More

    Submitted 30 December, 2024; v1 submitted 17 December, 2024; originally announced December 2024.

    Comments: Accepted by Proceedings of the Thirty-Ninth AAAI Conference on Artificial Intelligence (AAAI-25)

  34. arXiv:2412.12863  [pdf, ps, other

    cs.CL cs.AI

    DISC: Plug-and-Play Decoding Intervention with Similarity of Characters for Chinese Spelling Check

    Authors: Ziheng Qiao, Houquan Zhou, Yumeng Liu, Zhenghua Li, Min Zhang, Bo Zhang, Chen Li, Ji Zhang, Fei Huang

    Abstract: One key characteristic of the Chinese spelling check (CSC) task is that incorrect characters are usually similar to the correct ones in either phonetics or glyph. To accommodate this, previous works usually leverage confusion sets, which suffer from two problems, i.e., difficulty in determining which character pairs to include and lack of probabilities to distinguish items in the set. In this pape… ▽ More

    Submitted 7 June, 2025; v1 submitted 17 December, 2024; originally announced December 2024.

  35. arXiv:2412.10743  [pdf, other

    cs.LG physics.chem-ph q-bio.BM

    NeuralPLexer3: Accurate Biomolecular Complex Structure Prediction with Flow Models

    Authors: Zhuoran Qiao, Feizhi Ding, Thomas Dresselhaus, Mia A. Rosenfeld, Xiaotian Han, Owen Howell, Aniketh Iyengar, Stephen Opalenski, Anders S. Christensen, Sai Krishna Sirumalla, Frederick R. Manby, Thomas F. Miller III, Matthew Welborn

    Abstract: Structure determination is essential to a mechanistic understanding of diseases and the development of novel therapeutics. Machine-learning-based structure prediction methods have made significant advancements by computationally predicting protein and bioassembly structures from sequences and molecular topology alone. Despite substantial progress in the field, challenges remain to deliver structur… ▽ More

    Submitted 18 December, 2024; v1 submitted 14 December, 2024; originally announced December 2024.

  36. arXiv:2412.09261  [pdf, other

    cs.LG

    Single-View Graph Contrastive Learning with Soft Neighborhood Awareness

    Authors: Qingqiang Sun, Chaoqi Chen, Ziyue Qiao, Xubin Zheng, Kai Wang

    Abstract: Most graph contrastive learning (GCL) methods heavily rely on cross-view contrast, thus facing several concomitant challenges, such as the complexity of designing effective augmentations, the potential for information loss between views, and increased computational costs. To mitigate reliance on cross-view contrasts, we propose \ttt{SIGNA}, a novel single-view graph contrastive learning framework.… ▽ More

    Submitted 12 December, 2024; originally announced December 2024.

    Comments: Accepted by AAAI2025; full version including appendix

  37. arXiv:2411.13917  [pdf, other

    cs.MM

    SpikEmo: Enhancing Emotion Recognition With Spiking Temporal Dynamics in Conversations

    Authors: Xiaomin Yu, Feiyang Wang, Ziyue Qiao

    Abstract: In affective computing, the task of Emotion Recognition in Conversations (ERC) has emerged as a focal area of research. The primary objective of this task is to predict emotional states within conversations by analyzing multimodal data including text, audio, and video. While existing studies have progressed in extracting and fusing representations from multimodal data, they often overlook the temp… ▽ More

    Submitted 21 November, 2024; originally announced November 2024.

  38. arXiv:2411.07309  [pdf, other

    cs.RO

    Proprioceptive and Exteroceptive Information Perception in a Fabric Soft Robotic Arm via Physical Reservoir Computing with minimal training data

    Authors: Jun Wang, Zhi Qiao, Wenlong Zhang, Suyi Li

    Abstract: Over the past decades, we have witnessed a rapid emergence of soft and reconfigurable robots thanks to their capability to interact safely with humans and adapt to complex environments. However, their softness makes accurate control very challenging. High-fidelity sensing is critical in improving control performance, especially posture and contact estimation. To this end, traditional camera-based… ▽ More

    Submitted 11 November, 2024; originally announced November 2024.

  39. arXiv:2411.02779  [pdf

    cs.CV

    Advancing Recycling Efficiency: A Comparative Analysis of Deep Learning Models in Waste Classification

    Authors: Zhanshan Qiao

    Abstract: With the ongoing increase in the worldwide population and escalating consumption habits,there's a surge in the amount of waste produced.The situation poses considerable challenges for waste management and the optimization of recycling operations.The research tackles the pressing issue of waste classification for recycling by analyzing various deep learning models,including Convolutional Neural Net… ▽ More

    Submitted 4 November, 2024; originally announced November 2024.

    Comments: Accepted by the 6th International Conference on Computing and Data Science (CONF-CDS 2024), 12 pages, 8 figures, references added

  40. arXiv:2409.12728  [pdf, other

    q-bio.GN cs.LG

    PRAGA: Prototype-aware Graph Adaptive Aggregation for Spatial Multi-modal Omics Analysis

    Authors: Xinlei Huang, Zhiqi Ma, Dian Meng, Yanran Liu, Shiwei Ruan, Qingqiang Sun, Xubin Zheng, Ziyue Qiao

    Abstract: Spatial multi-modal omics technology, highlighted by Nature Methods as an advanced biological technique in 2023, plays a critical role in resolving biological regulatory processes with spatial context. Recently, graph neural networks based on K-nearest neighbor (KNN) graphs have gained prominence in spatial multi-modal omics methods due to their ability to model semantic relations between sequenci… ▽ More

    Submitted 18 December, 2024; v1 submitted 19 September, 2024; originally announced September 2024.

    Comments: Accepted by AAAl2025; full version including appendix

  41. arXiv:2409.08681  [pdf, other

    cs.RO

    SLIM: Scalable and Lightweight LiDAR Mapping in Urban Environments

    Authors: Zehuan Yu, Zhijian Qiao, Wenyi Liu, Huan Yin, Shaojie Shen

    Abstract: LiDAR point cloud maps are extensively utilized on roads for robot navigation due to their high consistency. However, dense point clouds face challenges of high memory consumption and reduced maintainability for long-term operations. In this study, we introduce SLIM, a scalable and lightweight mapping system for long-term LiDAR mapping in urban environments. The system begins by parameterizing str… ▽ More

    Submitted 26 March, 2025; v1 submitted 13 September, 2024; originally announced September 2024.

    Comments: Accepted for publication in IEEE Transactions on Robotics. Video: https://youtu.be/8HQnYMf_BWI Code: https://github.com/HKUST-Aerial-Robotics/SLIM

  42. arXiv:2408.13750  [pdf, other

    cs.AI cs.MA

    Multi-Agent Target Assignment and Path Finding for Intelligent Warehouse: A Cooperative Multi-Agent Deep Reinforcement Learning Perspective

    Authors: Qi Liu, Jianqi Gao, Dongjie Zhu, Zhongjian Qiao, Pengbin Chen, Jingxiang Guo, Yanjie Li

    Abstract: Multi-agent target assignment and path planning (TAPF) are two key problems in intelligent warehouse. However, most literature only addresses one of these two problems separately. In this study, we propose a method to simultaneously solve target assignment and path planning from a perspective of cooperative multi-agent deep reinforcement learning (RL). To the best of our knowledge, this is the fir… ▽ More

    Submitted 27 October, 2024; v1 submitted 25 August, 2024; originally announced August 2024.

  43. arXiv:2408.12970  [pdf, other

    cs.LG

    SUMO: Search-Based Uncertainty Estimation for Model-Based Offline Reinforcement Learning

    Authors: Zhongjian Qiao, Jiafei Lyu, Kechen Jiao, Qi Liu, Xiu Li

    Abstract: The performance of offline reinforcement learning (RL) suffers from the limited size and quality of static datasets. Model-based offline RL addresses this issue by generating synthetic samples through a dynamics model to enhance overall performance. To evaluate the reliability of the generated samples, uncertainty estimation methods are often employed. However, model ensemble, the most commonly us… ▽ More

    Submitted 12 November, 2024; v1 submitted 23 August, 2024; originally announced August 2024.

    Comments: Submitted to AAAI2025

  44. arXiv:2408.09736  [pdf, other

    eess.IV cs.CV

    Coarse-Fine View Attention Alignment-Based GAN for CT Reconstruction from Biplanar X-Rays

    Authors: Zhi Qiao, Hanqiang Ouyang, Dongheng Chu, Huishu Yuan, Xiantong Zhen, Pei Dong, Zhen Qian

    Abstract: For surgical planning and intra-operation imaging, CT reconstruction using X-ray images can potentially be an important alternative when CT imaging is not available or not feasible. In this paper, we aim to use biplanar X-rays to reconstruct a 3D CT image, because biplanar X-rays convey richer information than single-view X-rays and are more commonly used by surgeons. Different from previous studi… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

  45. arXiv:2408.09731  [pdf, other

    eess.IV cs.CV

    Reconstruct Spine CT from Biplanar X-Rays via Diffusion Learning

    Authors: Zhi Qiao, Xuhui Liu, Xiaopeng Wang, Runkun Liu, Xiantong Zhen, Pei Dong, Zhen Qian

    Abstract: Intraoperative CT imaging serves as a crucial resource for surgical guidance; however, it may not always be readily accessible or practical to implement. In scenarios where CT imaging is not an option, reconstructing CT scans from X-rays can offer a viable alternative. In this paper, we introduce an innovative method for 3D CT reconstruction utilizing biplanar X-rays. Distinct from previous resear… ▽ More

    Submitted 20 August, 2024; v1 submitted 19 August, 2024; originally announced August 2024.

  46. arXiv:2408.09715  [pdf, other

    cs.AI cs.CV cs.LG eess.IV

    HYDEN: Hyperbolic Density Representations for Medical Images and Reports

    Authors: Zhi Qiao, Linbin Han, Xiantong Zhen, Jia-Hong Gao, Zhen Qian

    Abstract: In light of the inherent entailment relations between images and text, hyperbolic point vector embeddings, leveraging the hierarchical modeling advantages of hyperbolic space, have been utilized for visual semantic representation learning. However, point vector embedding approaches fail to address the issue of semantic uncertainty, where an image may have multiple interpretations, and text may ref… ▽ More

    Submitted 19 August, 2024; v1 submitted 19 August, 2024; originally announced August 2024.

  47. arXiv:2407.13545  [pdf, other

    eess.IV cs.CV

    DiffuX2CT: Diffusion Learning to Reconstruct CT Images from Biplanar X-Rays

    Authors: Xuhui Liu, Zhi Qiao, Runkun Liu, Hong Li, Juan Zhang, Xiantong Zhen, Zhen Qian, Baochang Zhang

    Abstract: Computed tomography (CT) is widely utilized in clinical settings because it delivers detailed 3D images of the human body. However, performing CT scans is not always feasible due to radiation exposure and limitations in certain surgical environments. As an alternative, reconstructing CT images from ultra-sparse X-rays offers a valuable solution and has gained significant interest in scientific res… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

  48. Neural Networks Trained by Weight Permutation are Universal Approximators

    Authors: Yongqiang Cai, Gaohang Chen, Zhonghua Qiao

    Abstract: The universal approximation property is fundamental to the success of neural networks, and has traditionally been achieved by training networks without any constraints on their parameters. However, recent experimental research proposed a novel permutation-based training method, which exhibited a desired classification performance without modifying the exact weight values. In this paper, we provide… ▽ More

    Submitted 20 March, 2025; v1 submitted 1 July, 2024; originally announced July 2024.

    MSC Class: 41A30; 68T05; 68T07

    Journal ref: Neural Networks (2025). https://www.sciencedirect.com/science/article/pii/S089360802500156X

  49. arXiv:2406.08116  [pdf, other

    cs.CL cs.AI

    Supportiveness-based Knowledge Rewriting for Retrieval-augmented Language Modeling

    Authors: Zile Qiao, Wei Ye, Yong Jiang, Tong Mo, Pengjun Xie, Weiping Li, Fei Huang, Shikun Zhang

    Abstract: Retrieval-augmented language models (RALMs) have recently shown great potential in mitigating the limitations of implicit knowledge in LLMs, such as untimely updating of the latest expertise and unreliable retention of long-tail knowledge. However, since the external knowledge base, as well as the retriever, can not guarantee reliability, potentially leading to the knowledge retrieved not being he… ▽ More

    Submitted 3 October, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

  50. arXiv:2406.07413  [pdf, other

    cs.LG

    Towards Continuous Reuse of Graph Models via Holistic Memory Diversification

    Authors: Ziyue Qiao, Junren Xiao, Qingqiang Sun, Meng Xiao, Xiao Luo, Hui Xiong

    Abstract: This paper addresses the challenge of incremental learning in growing graphs with increasingly complex tasks. The goal is to continuously train a graph model to handle new tasks while retaining proficiency in previous tasks via memory replay. Existing methods usually overlook the importance of memory diversity, limiting in selecting high-quality memory from previous tasks and remembering broad pre… ▽ More

    Submitted 1 March, 2025; v1 submitted 11 June, 2024; originally announced June 2024.

    Comments: Accepted by ICLR 2025