Showing 1–50 of 1,738 results for author: Xu, B

  1. arXiv:2507.05227  [pdf, ps, other]

    cs.RO cs.CV cs.LG cs.MM eess.SY

    NavigScene: Bridging Local Perception and Global Navigation for Beyond-Visual-Range Autonomous Driving

    Authors: Qucheng Peng, Chen Bai, Guoxiang Zhang, Bo Xu, Xiaotong Liu, Xiaoyin Zheng, Chen Chen, Cheng Lu

    Abstract: Autonomous driving systems have made significant advances in Q&A, perception, prediction, and planning based on local visual information, yet they struggle to incorporate broader navigational context that human drivers routinely utilize. We address this critical gap between local sensor data and global navigation information by proposing NavigScene, an auxiliary navigation-guided natural language…

    Submitted 7 July, 2025; originally announced July 2025.

    Comments: Accepted by ACM Multimedia 2025

  2. arXiv:2507.05113  [pdf, ps, other]

    cs.MM cs.CR cs.LG

    CLIP-Guided Backdoor Defense through Entropy-Based Poisoned Dataset Separation

    Authors: Binyan Xu, Fan Yang, Xilin Dai, Di Tang, Kehuan Zhang

    Abstract: Deep Neural Networks (DNNs) are susceptible to backdoor attacks, where adversaries poison training data to implant backdoor into the victim model. Current backdoor defenses on poisoned data often suffer from high computational costs or low effectiveness against advanced attacks like clean-label and clean-image backdoors. To address them, we introduce CLIP-Guided backdoor Defense (CGD), an efficien…

    Submitted 7 July, 2025; originally announced July 2025.

    Comments: 15 pages, 9 figures, 15 tables. To appear in the Proceedings of the 32nd ACM International Conference on Multimedia (MM '25)

    MSC Class: 68T07 ACM Class: I.2.6

  3. arXiv:2507.04574  [pdf, ps, other]

    cond-mat.mtrl-sci physics.app-ph

    Deciphering the interplay between wetting and chemo-mechanical fracture in lithium-ion battery cathode materials

    Authors: Wan-Xin Chen, Luis J. Carrillo, Arnab Maji, Xiang-Long Peng, Joseph Handy, Sarbajit Banerjee, Bai-Xiang Xu

    Abstract: Crack growth in lithium-ion battery electrodes is typically detrimental and undesirable. However, recent experiments suggest that stabilized fracture of cathode active materials in liquid electrolytes can increase electrochemically active surfaces, shorten diffusion pathway, enhance (de)lithiation and improve overall capacity. To decipher the fundamental couplings between electrolyte wetting and f…

    Submitted 6 July, 2025; originally announced July 2025.

  4. arXiv:2507.04487  [pdf, ps, other]

    cs.LG cs.AI

    LoSiA: Efficient High-Rank Fine-Tuning via Subnet Localization and Optimization

    Authors: Xujia Wang, Yunjia Qi, Bin Xu

    Abstract: Parameter-Efficient Fine-Tuning (PEFT) methods, such as LoRA, significantly reduce the number of trainable parameters by introducing low-rank decomposition matrices. However, existing methods perform extensive matrix multiplications in domain specialization tasks, resulting in computational inefficiency and sub-optimal fine-tuning performance. Hence, we propose LoSiA (Low-Resources Subnet Integrati…

    Submitted 6 July, 2025; originally announced July 2025.

    Comments: 18 pages, 12 figures

  5. arXiv:2507.03917  [pdf, ps, other]

    cs.LG cs.CV

    Consistency-Aware Padding for Incomplete Multi-Modal Alignment Clustering Based on Self-Repellent Greedy Anchor Search

    Authors: Shubin Ma, Liang Zhao, Mingdong Lu, Yifan Guo, Bo Xu

    Abstract: Multimodal representation is faithful and highly effective in describing real-world data samples' characteristics by describing their complementary information. However, the collected data often exhibits incomplete and misaligned characteristics due to factors such as inconsistent sensor frequencies and device malfunctions. Existing research has not effectively addressed the issue of filling missi…

    Submitted 5 July, 2025; originally announced July 2025.

    Comments: Accepted at IJCAI 2025. 9 pages, 3 figures

    ACM Class: I.2.6; I.5.3

  6. arXiv:2507.03133  [pdf, ps, other]

    cs.CL

    ReliableMath: Benchmark of Reliable Mathematical Reasoning on Large Language Models

    Authors: Boyang Xue, Qi Zhu, Rui Wang, Sheng Wang, Hongru Wang, Fei Mi, Yasheng Wang, Lifeng Shang, Qun Liu, Kam-Fai Wong

    Abstract: Although demonstrating remarkable performance on reasoning tasks, Large Language Models (LLMs) still tend to fabricate unreliable responses when confronted with problems that are unsolvable or beyond their capability, severely undermining the reliability. Prior studies of LLM reliability have primarily focused on knowledge tasks to identify unanswerable questions, while mathematical reasoning task…

    Submitted 3 July, 2025; originally announced July 2025.

    Comments: under review

  7. arXiv:2507.03122  [pdf, ps, other]

    cs.IR cs.CL cs.LG

    Federated Learning for ICD Classification with Lightweight Models and Pretrained Embeddings

    Authors: Binbin Xu, Gérard Dray

    Abstract: This study investigates the feasibility and performance of federated learning (FL) for multi-label ICD code classification using clinical notes from the MIMIC-IV dataset. Unlike previous approaches that rely on centralized training or fine-tuned large language models, we propose a lightweight and scalable pipeline combining frozen text embeddings with simple multilayer perceptron (MLP) classifiers…

    Submitted 3 July, 2025; originally announced July 2025.

    Comments: 20 pages

  8. arXiv:2507.02636  [pdf, ps, other]

    math.OC eess.SY

    Online Convex Optimization for Coordinated Long-Term and Short-Term Isolated Microgrid Dispatch

    Authors: Ning Qi, Yousuf Baker, Bolun Xu

    Abstract: This paper proposes a novel non-anticipatory long-short-term coordinated dispatch framework for isolated microgrid with hybrid short-long-duration energy storages (LDES). We introduce a convex hull approximation model for nonconvex LDES electrochemical dynamics, facilitating computational tractability and accuracy. To address temporal coupling in SoC dynamics and long-term contracts, we generate h…

    Submitted 4 July, 2025; v1 submitted 3 July, 2025; originally announced July 2025.

  9. arXiv:2507.01889  [pdf, ps, other]

    cond-mat.dis-nn cond-mat.mtrl-sci cs.LG

    STEM Diffraction Pattern Analysis with Deep Learning Networks

    Authors: Sebastian Wissel, Jonas Scheunert, Aaron Dextre, Shamail Ahmed, Andreas Bayer, Kerstin Volz, Bai-Xiang Xu

    Abstract: Accurate grain orientation mapping is essential for understanding and optimizing the performance of polycrystalline materials, particularly in energy-related applications. Lithium nickel oxide (LiNiO$_{2}$) is a promising cathode material for next-generation lithium-ion batteries, and its electrochemical behaviour is closely linked to microstructural features such as grain size and crystallographi…

    Submitted 2 July, 2025; originally announced July 2025.

  10. arXiv:2507.01299  [pdf, ps, other]

    cs.CL

    La RoSA: Enhancing LLM Efficiency via Layerwise Rotated Sparse Activation

    Authors: Kai Liu, Bowen Xu, Shaoyu Wu, Xin Chen, Hao Zhou, Yongliang Tao, Lulu Hu

    Abstract: Activation sparsity can reduce the computational overhead and memory transfers during the forward pass of Large Language Model (LLM) inference. Existing methods face limitations, either demanding time-consuming recovery training that hinders real-world adoption, or relying on empirical magnitude-based pruning, which causes fluctuating sparsity and unstable inference speed-up. This paper introduces…

    Submitted 1 July, 2025; originally announced July 2025.

    Comments: ICML 2025 Acceptance

  11. arXiv:2507.01006  [pdf, ps, other]

    cs.CV cs.AI cs.LG

    GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

    Authors: GLM-V Team: Wenyi Hong, Wenmeng Yu, Xiaotao Gu, Guo Wang, Guobing Gan, Haomiao Tang, Jiale Cheng, Ji Qi, Junhui Ji, Lihang Pan, Shuaiqi Duan, Weihan Wang, Yan Wang, Yean Cheng, Zehai He, Zhe Su, Zhen Yang, Ziyang Pan, Aohan Zeng, Baoxu Wang, Boyan Shi, Changyu Pang, Chenhui Zhang, et al. (54 additional authors not shown)

    Abstract: We present GLM-4.1V-Thinking, a vision-language model (VLM) designed to advance general-purpose multimodal understanding and reasoning. In this report, we share our key findings in the development of the reasoning-centric training framework. We first develop a capable vision foundation model with significant potential through large-scale pre-training, which arguably sets the upper bound for the fi…

    Submitted 2 July, 2025; v1 submitted 1 July, 2025; originally announced July 2025.

  12. arXiv:2506.23972  [pdf, ps, other]

    cs.CV

    Visual and Memory Dual Adapter for Multi-Modal Object Tracking

    Authors: Boyue Xu, Ruichao Hou, Tongwei Ren, Gangshan Wu

    Abstract: Prompt-learning-based multi-modal trackers have achieved promising progress by employing lightweight visual adapters to incorporate auxiliary modality features into frozen foundation models. However, existing approaches often struggle to learn reliable prompts due to limited exploitation of critical cues across frequency and temporal domains. In this paper, we propose a novel visual and memory dua…

    Submitted 30 June, 2025; originally announced June 2025.

  13. arXiv:2506.22776  [pdf, ps, other]

    cs.SE cs.AI cs.PL

    Smaller = Weaker? Benchmarking Robustness of Quantized LLMs in Code Generation

    Authors: Sen Fang, Weiyuan Ding, Antonio Mastropaolo, Bowen Xu

    Abstract: Quantization has emerged as a mainstream method for compressing Large Language Models (LLMs), reducing memory requirements and accelerating inference without architectural modifications. While existing research primarily focuses on evaluating the effectiveness of quantized LLMs compared to their original counterparts, the impact on robustness remains largely unexplored. In this paper, we present th…

    Submitted 28 June, 2025; originally announced June 2025.

    Comments: 13 pages, 6 figures

  14. arXiv:2506.21270  [pdf, ps, other]

    cs.CV

    Video Virtual Try-on with Conditional Diffusion Transformer Inpainter

    Authors: Cheng Zou, Senlin Cheng, Bolei Xu, Dandan Zheng, Xiaobo Li, Jingdong Chen, Ming Yang

    Abstract: Video virtual try-on aims to naturally fit a garment to a target person in consecutive video frames. It is a challenging task, on the one hand, the output video should be in good spatial-temporal consistency, on the other hand, the details of the given garment need to be preserved well in all the frames. Naively using image-based try-on methods frame by frame can get poor results due to severe inc…

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: 10 pages, 6 figures

  15. arXiv:2506.20444  [pdf, ps, other]

    cs.SE

    Smart Cuts: Enhance Active Learning for Vulnerability Detection by Pruning Bad Seeds

    Authors: Xiang Lan, Tim Menzies, Bowen Xu

    Abstract: Vulnerability detection is crucial for identifying security weaknesses in software systems. However, the effectiveness of machine learning models in this domain is often hindered by low-quality training datasets, which contain noisy, mislabeled, or imbalanced samples. This paper proposes a novel dataset maps-empowered approach that systematically identifies and mitigates hard-to-learn outliers, re…

    Submitted 25 June, 2025; originally announced June 2025.

  16. arXiv:2506.19580  [pdf, ps, other]

    math.CO

    The optimal binding function for (cap, even hole)-free graphs

    Authors: Ran Chen, Baogang Xu, Yian Xu

    Abstract: A {\em hole} is an induced cycle of length at least 4, an {\em even hole} is a hole of even length, and a {\em cap} is a graph obtained from a hole by adding an additional vertex which is adjacent exactly to two adjacent vertices of the hole. A graph $G$ obtained from a graph $H$ by blowing up all the vertices into cliques is said to be a clique blowup of $H$. Let $p, q$ be two positive integers w…

    Submitted 24 June, 2025; originally announced June 2025.

  17. arXiv:2506.18416  [pdf]

    cond-mat.mtrl-sci

    Determining the grain orientations of battery materials from electron diffraction patterns using convolutional neural networks

    Authors: Jonas Scheunert, Shamail Ahmed, Thomas Demuth, Andreas Beyer, Sebastian Wissel, Bai-Xiang Xu, Kerstin Volz

    Abstract: Polycrystalline materials have numerous applications due to their unique properties, which are often determined by the grain boundaries. Hence, quantitative characterization of grain as well as interface orientation is essential to optimize these materials, particularly energy materials. Using scanning transmission electron microscopy, matter can be analysed in an extremely fine grid of scan point…

    Submitted 23 June, 2025; originally announced June 2025.

  18. arXiv:2506.18350  [pdf, ps, other]

    cond-mat.mtrl-sci cond-mat.str-el

    Optical Excitations of Flat Bands Induced by Exciton Condensation in Ta$_2$Pd$_3$Te$_{5}$

    Authors: Shaohui Yi, Zhiyu Liao, Chenhao Liang, Sheng Zhang, Xiutong Deng, Yongjie Xie, Lincong Zheng, Yujie Wang, Yubiao Wu, Zhijun Wang, Youguo Shi, Xianggang Qiu, Bing Xu

    Abstract: We report on the charge dynamics of Ta$_2$Pd$_3$Te$_5$ using temperature-dependent optical spectroscopy with polarized light. We observe a metal-insulator transition characterized by the collapse of Drude response and the emergence of sharp and narrow absorption peaks at low temperatures. Unlike previous excitonic insulator candidates such as TiSe$_2$ and Ta$_2$NiSe$_5$, where the excitonic order…

    Submitted 23 June, 2025; originally announced June 2025.

    Comments: 7 pages, 3 figures

  19. arXiv:2506.17495  [pdf, ps, other]

    q-bio.PE

    Modeling and Inferring Metacommunity Dynamics with Maximum Caliber

    Authors: Zachary Jackson, Mathew A. Leibold, Robert D. Holt, BingKan Xue

    Abstract: A major challenge for community ecology is to use distribution patterns to infer basic parameters of dynamical models without conducting laborious experimental manipulations. We present a novel framework drawn from statistical physics -- Maximum Caliber -- for characterizing the temporal dynamics of complex ecological systems in spatially extended landscapes and inferring parameters from temporal…

    Submitted 20 June, 2025; originally announced June 2025.

  20. arXiv:2506.16445  [pdf, ps, other]

    cs.CL cs.AI

    StoryWriter: A Multi-Agent Framework for Long Story Generation

    Authors: Haotian Xia, Hao Peng, Yunjia Qi, Xiaozhi Wang, Bin Xu, Lei Hou, Juanzi Li

    Abstract: Long story generation remains a challenge for existing large language models (LLMs), primarily due to two main factors: (1) discourse coherence, which requires plot consistency, logical coherence, and completeness in the long-form generation, and (2) narrative complexity, which requires an interwoven and engaging narrative. To address these challenges, we propose StoryWriter, a multi-agent story g…

    Submitted 19 June, 2025; originally announced June 2025.

  21. arXiv:2506.16051  [pdf, ps, other]

    cs.LG cs.DB cs.DL cs.HC

    From Data to Decision: Data-Centric Infrastructure for Reproducible ML in Collaborative eScience

    Authors: Zhiwei Li, Carl Kesselman, Tran Huy Nguyen, Benjamin Yixing Xu, Kyle Bolo, Kimberley Yu

    Abstract: Reproducibility remains a central challenge in machine learning (ML), especially in collaborative eScience projects where teams iterate over data, features, and models. Current ML workflows are often dynamic yet fragmented, relying on informal data sharing, ad hoc scripts, and loosely connected tools. This fragmentation impedes transparency, reproducibility, and the adaptability of experiments ove…

    Submitted 19 June, 2025; originally announced June 2025.

  22. arXiv:2506.14813  [pdf, ps, other]

    cs.LG cs.AI

    Training with Confidence: Catching Silent Errors in Deep Learning Training with Automated Proactive Checks

    Authors: Yuxuan Jiang, Ziming Zhou, Boyu Xu, Beijie Liu, Runhui Xu, Peng Huang

    Abstract: Training deep learning (DL) models is a complex process, making it prone to silent errors that are challenging to detect and diagnose. This paper presents TRAINCHECK, a framework that takes a proactive checking approach to address silent training errors. TRAINCHECK automatically infers invariants tailored for DL training. It uses these invariants to proactively detect silent errors during the trai…

    Submitted 6 June, 2025; originally announced June 2025.

    Comments: 19 pages, to appear in 19th USENIX Symposium on Operating Systems Design and Implementation (OSDI '25)

  23. arXiv:2506.14406  [pdf, ps, other]

    hep-ex

    Search for neutron decay into an antineutrino and a neutral kaon in 0.401 megaton-years exposure of Super-Kamiokande

    Authors: Super-Kamiokande Collaboration: K. Yamauchi, K. Abe, S. Abe, Y. Asaoka, M. Harada, Y. Hayato, K. Hiraide, K. Hosokawa, K. Ieki, M. Ikeda, J. Kameda, Y. Kanemura, Y. Kataoka, S. Miki, S. Mine, M. Miura, S. Moriyama, M. Nakahata, S. Nakayama, Y. Noguchi, G. Pronost, K. Sato, H. Sekiya, et al. (240 additional authors not shown)

    Abstract: We searched for bound neutron decay via $n\to\bar{\nu}+K^0$ predicted by the Grand Unified Theories in 0.401 Mton$\cdot$years exposure of all pure water phases in the Super-Kamiokande detector. About 4.4 times more data than in the previous search have been analyzed by a new method including a spectrum fit to kaon invariant mass distributions. No significant data excess has been observed in the signal…

    Submitted 17 June, 2025; originally announced June 2025.

    Comments: 12 pages, 5 figures

  24. arXiv:2506.12877  [pdf, ps, other]

    cond-mat.mtrl-sci

    Symplectic Spin-Lattice Dynamics with Machine-Learning Potentials

    Authors: Zhengtao Huang, Ben Xu

    Abstract: Accurate atomic-scale simulations of magnetic materials require precise handling of coupled spin-lattice degrees of freedom. Traditional spin-lattice dynamics (SLD), employing Newtonian equation for lattice evolution and the Landau-Lifshitz-Gilbert (LLG) equation for spins, encounters severe limitations with machine-learning potentials, including poor energy conservation and excessive computationa…

    Submitted 15 June, 2025; originally announced June 2025.

  25. arXiv:2506.12446  [pdf, ps, other]

    cs.CL

    From Outcomes to Processes: Guiding PRM Learning from ORM for Inference-Time Alignment

    Authors: Bin Xie, Bingbing Xu, Yige Yuan, Shengmao Zhu, Huawei Shen

    Abstract: Inference-time alignment methods have gained significant attention for their efficiency and effectiveness in aligning large language models (LLMs) with human preferences. However, existing dominant approaches using reward-guided search (RGS) primarily rely on outcome reward models (ORMs), which suffer from a critical granularity mismatch: ORMs are designed to provide outcome rewards for complete r…

    Submitted 28 June, 2025; v1 submitted 14 June, 2025; originally announced June 2025.

  26. arXiv:2506.11763  [pdf, ps, other]

    cs.CL cs.IR

    DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

    Authors: Mingxuan Du, Benfeng Xu, Chiwei Zhu, Xiaorui Wang, Zhendong Mao

    Abstract: Deep Research Agents are a prominent category of LLM-based agents. By autonomously orchestrating multistep web exploration, targeted retrieval, and higher-order synthesis, they transform vast amounts of online information into analyst-grade, citation-rich reports--compressing hours of manual desk research into minutes. However, a comprehensive benchmark for systematically evaluating the capabiliti…

    Submitted 13 June, 2025; originally announced June 2025.

    Comments: 31 pages, 5 figures

  27. arXiv:2506.09942  [pdf, ps, other]

    cs.CL cs.AI

    VerIF: Verification Engineering for Reinforcement Learning in Instruction Following

    Authors: Hao Peng, Yunjia Qi, Xiaozhi Wang, Bin Xu, Lei Hou, Juanzi Li

    Abstract: Reinforcement learning with verifiable rewards (RLVR) has become a key technique for enhancing large language models (LLMs), with verification engineering playing a central role. However, best practices for RL in instruction following remain underexplored. In this work, we explore the verification challenge in RL for instruction following and propose VerIF, a verification method that combines rule…

    Submitted 11 June, 2025; originally announced June 2025.

    Comments: 16 pages, 8 figures

  28. arXiv:2506.09002  [pdf, ps, other]

    cs.SE

    Boosting Rust Unit Test Coverage through Hybrid Program Analysis and Large Language Models

    Authors: Bei Chu, Yang Feng, Kui Liu, Hange Shi, Zifan Nan, Zhaoqiang Guo, Baowen Xu

    Abstract: Unit testing is essential for ensuring software reliability and correctness. Classic Search-Based Software Testing (SBST) methods and concolic execution-based approaches for generating unit tests often fail to achieve high coverage due to difficulties in handling complex program units, such as branching conditions and external dependencies. Recent work has increasingly utilized large language mode…

    Submitted 10 June, 2025; v1 submitted 10 June, 2025; originally announced June 2025.

    Comments: 10 pages, 5 figures

  29. arXiv:2506.07831  [pdf, other]

    quant-ph

    Clock Synchronization for Drone-Based Entanglement Quantum Key Distribution

    Authors: Jinquan Huang, Bangying Tang, Hui Han, JianJi Yi, Bo Xu, Chunqing Wu, Xiangwei Zhu, Wanrong Yu, Huicun Yu, Jiahao Li, Shihai Sun, Bo Liu

    Abstract: Drone-based entanglement distribution provides full spatiotemporal coverage for quantum networks, enabling quantum key distribution (QKD) in dynamic environments. The security of QKD fundamentally depends on high-fidelity quantum state measurements, for which high-precision clock synchronization is indispensable, as timing jitter is inversely correlated with quantum state fidelity. However, drone-…

    Submitted 9 June, 2025; originally announced June 2025.

    Comments: 16 pages, 6 figures

  30. arXiv:2506.07779  [pdf, ps, other]

    cs.CV

    Design and Evaluation of Deep Learning-Based Dual-Spectrum Image Fusion Methods

    Authors: Beining Xu, Junxian Li

    Abstract: Visible images offer rich texture details, while infrared images emphasize salient targets. Fusing these complementary modalities enhances scene understanding, particularly for advanced vision tasks under challenging conditions. Recently, deep learning-based fusion methods have gained attention, but current evaluations primarily rely on general-purpose metrics without standardized benchmarks or do…

    Submitted 9 June, 2025; originally announced June 2025.

    Comments: 11 pages, 13 figures

  31. arXiv:2506.07599  [pdf, ps, other]

    cs.IT eess.SP

    Flexible MIMO for Future Wireless Communications: Which Flexibilities are Possible?

    Authors: Zhe Wang, Jiayi Zhang, Bokai Xu, Wenhui Yi, Emil Björnson, Bo Ai

    Abstract: To enable next-generation wireless communication networks with modest spectrum availability, multiple-input multiple-output (MIMO) technology needs to undergo further evolution. In this paper, we introduce a promising next-generation wireless communication concept: flexible MIMO technology. This technology represents a MIMO technology with flexible physical configurations and integrated applicatio…

    Submitted 9 June, 2025; originally announced June 2025.

    Comments: 9 pages, 5 figures, 1 table

  32. arXiv:2506.07050  [pdf, ps, other]

    cs.CV cs.IR cs.MM

    From Swath to Full-Disc: Advancing Precipitation Retrieval with Multimodal Knowledge Expansion

    Authors: Zheng Wang, Kai Ying, Bin Xu, Chunjiao Wang, Cong Bai

    Abstract: Accurate near-real-time precipitation retrieval has been enhanced by satellite-based technologies. However, infrared-based algorithms have low accuracy due to weak relations with surface precipitation, whereas passive microwave and radar-based methods are more accurate but limited in range. This challenge motivates the Precipitation Retrieval Expansion (PRE) task, which aims to enable accurate, in…

    Submitted 8 June, 2025; originally announced June 2025.

  33. arXiv:2506.06881  [pdf, other]

    cs.AI

    KnowCoder-V2: Deep Knowledge Analysis

    Authors: Zixuan Li, Wenxuan Liu, Long Bai, Chunmao Zhang, Wei Li, Fenghui Zhang, Quanxin Jin, Ruoyun He, Zhuo Chen, Zhilei Hu, Fei Wang, Bingbing Xu, Xuhui Jiang, Xiaolong Jin, Jiafeng Guo, Xueqi Cheng

    Abstract: Deep knowledge analysis tasks always involve the systematic extraction and association of knowledge from large volumes of data, followed by logical reasoning to discover insights. However, to solve such complex tasks, existing deep research frameworks face three major challenges: 1) They lack systematic organization and management of knowledge; 2) They operate purely online, making it inefficient…

    Submitted 7 June, 2025; originally announced June 2025.

  34. arXiv:2506.06679  [pdf, ps, other]

    eess.SY

    Controlled Reach-avoid Set Computation for Discrete-time Polynomial Systems via Convex Optimization

    Authors: Taoran Wu, Yiling Xue, Dejin Ren, Arvind Easwaran, Martin Fränzle, Bai Xue

    Abstract: This paper addresses the computation of controlled reach-avoid sets (CRASs) for discrete-time polynomial systems subject to control inputs. A CRAS is a set encompassing initial states from which there exist control inputs driving the system into a target set while avoiding unsafe sets. However, efficiently computing CRASs remains an open problem, especially for discrete-time systems. In this paper…

    Submitted 7 June, 2025; originally announced June 2025.

  35. arXiv:2506.06481  [pdf, ps, other]

    math.GT

    Ordering curves on surfaces

    Authors: Hugo Parlier, Hanh Vo, Binbin Xu

    Abstract: We study the order of lengths of closed geodesics on hyperbolic surfaces. Our first main result is that the order of lengths of curves determine a point in Teichmüller space. In an opposite direction, we identify classes of curves whose order never changes, independently of the choice of hyperbolic metric. We use this result to identify short curves with small intersections on pairs of pants.

    Submitted 6 June, 2025; originally announced June 2025.

    Comments: 27 pages, 8 figures

  36. arXiv:2506.06392  [pdf]

    astro-ph.IM astro-ph.EP cond-mat.other

    Additive Manufacturing of Lunar Regolith for Reconfigurable Building Blocks toward Lunar Habitation

    Authors: Cole McCallum, Youwen Liang, Nahid Tushar, Ben Xu, Bo Zhao, Hao Zeng, Wan Shou

    Abstract: Utilizing locally available materials is a crucial step towards sustainable planetary habitation. Lunar regolith has gained tremendous interest in additive manufacturing in the past decades. However, due to the constrained manufacturing facilities and materials on the moon, many existing additive manufacturing methods are not suitable for practical on-site manufacturing. Here, we envision that lig…

    Submitted 5 June, 2025; originally announced June 2025.

  37. arXiv:2506.05044  [pdf, ps, other]

    cs.IR

    Rethinking Contrastive Learning in Session-based Recommendation

    Authors: Xiaokun Zhang, Bo Xu, Fenglong Ma, Zhizheng Wang, Liang Yang, Hongfei Lin

    Abstract: Session-based recommendation aims to predict intents of anonymous users based on limited behaviors. With the ability in alleviating data sparsity, contrastive learning is prevailing in the task. However, we spot that existing contrastive learning based methods still suffer from three obstacles: (1) they overlook item-level sparsity and primarily focus on session-level sparsity; (2) they typically…

    Submitted 5 June, 2025; originally announced June 2025.

    Comments: This work has been accepted by Pattern Recognition

  38. arXiv:2506.04699  [pdf, ps, other]

    cs.AI

    Empowering Economic Simulation for Massively Multiplayer Online Games through Generative Agent-Based Modeling

    Authors: Bihan Xu, Shiwei Zhao, Runze Wu, Zhenya Huang, Jiawei Wang, Zhipeng Hu, Kai Wang, Haoyu Liu, Tangjie Lv, Le Li, Changjie Fan, Xin Tong, Jiangze Han

    Abstract: Within the domain of Massively Multiplayer Online (MMO) economy research, Agent-Based Modeling (ABM) has emerged as a robust tool for analyzing game economics, evolving from rule-based agents to decision-making agents enhanced by reinforcement learning. Nevertheless, existing works encounter significant challenges when attempting to emulate human-like economic activities among agents, particularly…

    Submitted 5 June, 2025; originally announced June 2025.

    Comments: KDD2025 Accepted

  39. arXiv:2506.04280  [pdf, ps, other]

    cs.CV cs.AI

    Evaluating MLLMs with Multimodal Multi-image Reasoning Benchmark

    Authors: Ziming Cheng, Binrui Xu, Lisheng Gong, Zuhe Song, Tianshuo Zhou, Shiqi Zhong, Siyu Ren, Mingxiang Chen, Xiangchao Meng, Yuxin Zhang, Yanlin Li, Lei Ren, Wei Chen, Zhiyuan Huang, Mingjie Zhan, Xiaojie Wang, Fangxiang Feng

    Abstract: With enhanced capabilities and widespread applications, Multimodal Large Language Models (MLLMs) are increasingly required to process and reason over multiple images simultaneously. However, existing MLLM benchmarks focus either on single-image visual reasoning or on multi-image understanding tasks with only final-answer evaluation, leaving the reasoning capabilities of MLLMs over multi-image inpu…

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: 18 pages

    MSC Class: 68T50 ACM Class: I.2.7

  40. arXiv:2506.03968  [pdf, ps, other]

    cs.CL

    From Real to Synthetic: Synthesizing Millions of Diversified and Complicated User Instructions with Attributed Grounding

    Authors: Chiwei Zhu, Benfeng Xu, Xiaorui Wang, Zhendong Mao

    Abstract: The pursuit of diverse, complex, and large-scale instruction data is crucial for automatically aligning large language models (LLMs). While there are methods capable of generating synthetic instructions at scale, they either suffer from limited grounding sources, leading to a narrow distribution, or rely on trivial extensions that fail to produce meaningful trajectories in terms of complexity. In…

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: To be published at ACL 2025

  41. arXiv:2506.02244  [pdf, ps, other]

    cs.CV cs.AI

    Motion aware video generative model

    Authors: Bowen Xue, Giuseppe Claudio Guarnera, Shuang Zhao, Zahra Montazeri

    Abstract: Recent advances in diffusion-based video generation have yielded unprecedented quality in visual content and semantic coherence. However, current approaches predominantly rely on statistical learning from vast datasets without explicitly modeling the underlying physics of motion, resulting in subtle yet perceptible non-physical artifacts that diminish the realism of generated videos. This paper in…

    Submitted 2 June, 2025; originally announced June 2025.

  42. arXiv:2506.01048  [pdf, ps, other]

    cs.AI

    IRT-Router: Effective and Interpretable Multi-LLM Routing via Item Response Theory

    Authors: Wei Song, Zhenya Huang, Cheng Cheng, Weibo Gao, Bihan Xu, GuanHao Zhao, Fei Wang, Runze Wu

    Abstract: Large language models (LLMs) have demonstrated exceptional performance across a wide range of natural language tasks. However, selecting the optimal LLM to respond to a user query often necessitates a delicate balance between performance and cost. While powerful models deliver better results, they come at a high cost, whereas smaller models are more cost-effective but less capable. To address this…

    Submitted 20 June, 2025; v1 submitted 1 June, 2025; originally announced June 2025.

    Comments: ACL 2025 Main

  43. arXiv:2506.00886  [pdf, ps, other]

    cs.AI

    Toward a Theory of Agents as Tool-Use Decision-Makers

    Authors: Hongru Wang, Cheng Qian, Manling Li, Jiahao Qiu, Boyang Xue, Mengdi Wang, Heng Ji, Kam-Fai Wong

    Abstract: As Large Language Models (LLMs) evolve into increasingly autonomous agents, fundamental questions about their epistemic foundations remain unresolved: What defines an agent? How should it make decisions? And what objectives should guide its behavior? In this position paper, we argue that true autonomy requires agents to be grounded in a coherent epistemic framework that governs what they know, wha…

    Submitted 1 June, 2025; originally announced June 2025.

  44. arXiv:2506.00388  [pdf, ps, other]

    cs.LG

    CLARIFY: Contrastive Preference Reinforcement Learning for Untangling Ambiguous Queries

    Authors: Ni Mu, Hao Hu, Xiao Hu, Yiqin Yang, Bo Xu, Qing-Shan Jia

    Abstract: Preference-based reinforcement learning (PbRL) bypasses explicit reward engineering by inferring reward functions from human preference comparisons, enabling better alignment with human intentions. However, humans often struggle to label a clear preference between similar segments, reducing label efficiency and limiting PbRL's real-world applicability. To address this, we propose an offline PbRL m…

    Submitted 10 June, 2025; v1 submitted 31 May, 2025; originally announced June 2025.

    Comments: ICML 2025

  45. arXiv:2505.24710  [pdf, ps, other]

    cs.LG cs.AI cs.CL

    Causal-aware Large Language Models: Enhancing Decision-Making Through Learning, Adapting and Acting

    Authors: Wei Chen, Jiahao Zhang, Haipeng Zhu, Boyan Xu, Zhifeng Hao, Keli Zhang, Junjian Ye, Ruichu Cai

    Abstract: Large language models (LLMs) have shown great potential in decision-making due to the vast amount of knowledge stored within the models. However, these pre-trained models are prone to lack reasoning abilities and are difficult to adapt to new environments, further hindering their application to complex real-world tasks. To address these challenges, inspired by the human cognitive process, we propo…

    Submitted 30 May, 2025; originally announced May 2025.

    Comments: Accepted by IJCAI 2025

  46. arXiv:2505.24147  [pdf, other]

    cs.CL

    Rationales Are Not Silver Bullets: Measuring the Impact of Rationales on Model Performance and Reliability

    Authors: Chiwei Zhu, Benfeng Xu, An Yang, Junyang Lin, Quan Wang, Chang Zhou, Zhendong Mao

    Abstract: Training language models with rationales augmentation has been shown to be beneficial in many existing works. In this paper, we identify that such a prevailing view does not hold consistently. We conduct comprehensive investigations to thoroughly inspect the impact of rationales on model performance as well as a novel perspective of model reliability. The results lead to several key findings that…

    Submitted 29 May, 2025; originally announced May 2025.

    Comments: To be published in ACL 2025 Findings. (Work originally done in Jan 2024)

  47. arXiv:2505.24123  [pdf, ps, other]

    physics.soc-ph

    Meta-heuristic Hypergraph-Assisted Robustness Optimization for Higher-order Complex Systems

    Authors: Xilong Qu, Wenbin Pei, Haifang Li, Qiang Zhang, Bing Xue, Mengjie Zhang

    Abstract: In complex systems (e.g., communication, transportation, and biological networks), high robustness ensures sustained functionality and stability even when resisting attacks. However, the inherent structure complexity and the unpredictability of attacks make robustness optimization challenging. Hypergraphs provide a framework for modeling complicated higher-order interactions in complex systems nat…

    Submitted 12 June, 2025; v1 submitted 29 May, 2025; originally announced May 2025.

  48. arXiv:2505.22591  [pdf, ps, other]

    cs.CL cs.AI cs.LG

    Self-Error-Instruct: Generalizing from Errors for LLMs Mathematical Reasoning

    Authors: Erxin Yu, Jing Li, Ming Liao, Qi Zhu, Boyang Xue, Minghui Xu, Baojun Wang, Lanqing Hong, Fei Mi, Lifeng Shang

    Abstract: Although large language models demonstrate strong performance across various domains, they still struggle with numerous bad cases in mathematical reasoning. Previous approaches to learning from errors synthesize training data by solely extrapolating from isolated bad cases, thereby failing to generalize the extensive patterns inherent within these cases. This paper presents Self-Error-Instruct (SE…

    Submitted 28 May, 2025; originally announced May 2025.

    Comments: 16 pages, 9 figures

  49. arXiv:2505.21901  [pdf, ps, other]

    cs.NE

    Symbolically Regressing Fish Biomass Spectral Data: A Linear Genetic Programming Method with Tunable Primitives

    Authors: Zhixing Huang, Bing Xue, Mengjie Zhang, Jeremy S. Ronney, Keith C. Gordon, Daniel P. Killeen

    Abstract: Machine learning techniques play an important role in analyzing spectral data. The spectral data of fish biomass is useful in fish production, as it carries many important chemical properties of fish meat. However, it is challenging for existing machine learning techniques to comprehensively discover hidden patterns from fish biomass spectral data since the spectral data often have a lot of noise…

    Submitted 27 May, 2025; originally announced May 2025.

  50. arXiv:2505.20884  [pdf]

    cs.CV

    YOLO-FireAD: Efficient Fire Detection via Attention-Guided Inverted Residual Learning and Dual-Pooling Feature Preservation

    Authors: Weichao Pan, Bohan Xu, Xu Wang, Chengze Lv, Shuoyang Wang, Zhenke Duan

    Abstract: Fire detection in dynamic environments faces continuous challenges, including the interference of illumination changes, many false detections or missed detections, and it is difficult to achieve both efficiency and accuracy. To address the problem of feature extraction limitation and information loss in the existing YOLO-based models, this study proposes You Only Look Once for Fire Detection with A…

    Submitted 27 May, 2025; originally announced May 2025.