Showing 201–250 of 20,906 results for author: Liang

  1. arXiv:2506.05637  [pdf, ps, other]

    cs.IT eess.SP

    Joint User Association and Beamforming Design for ISAC Networks with Large Language Models

    Authors: Haoyun Li, Ming Xiao, Kezhi Wang, Robert Schober, Dong In Kim, Yong Liang Guan

    Abstract: Integrated sensing and communication (ISAC) has been envisioned to play a more important role in future wireless networks. However, the design of ISAC networks is challenging, especially when there are multiple communication and sensing (C&S) nodes and multiple sensing targets. We investigate a multi-base station (BS) ISAC network in which multiple BSs equipped with multiple antennas simultaneous…

    Submitted 5 June, 2025; originally announced June 2025.

  2. arXiv:2506.05523  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG

    MORSE-500: A Programmatically Controllable Video Benchmark to Stress-Test Multimodal Reasoning

    Authors: Zikui Cai, Andrew Wang, Anirudh Satheesh, Ankit Nakhawa, Hyunwoo Jae, Keenan Powell, Minghui Liu, Neel Jay, Sungbin Oh, Xiyao Wang, Yongyuan Liang, Tom Goldstein, Furong Huang

    Abstract: Despite rapid advances in vision-language models (VLMs), current benchmarks for multimodal reasoning fall short in three key dimensions. First, they overwhelmingly rely on static images, failing to capture the temporal complexity of real-world environments. Second, they narrowly focus on mathematical problem-solving, neglecting the broader spectrum of reasoning skills -- including abstract, physic…

    Submitted 5 June, 2025; originally announced June 2025.

  3. arXiv:2506.05507  [pdf, other]

    hep-ex hep-th nucl-ex quant-ph

    Challenging Spontaneous Quantum Collapse with XENONnT

    Authors: E. Aprile, J. Aalbers, K. Abe, S. Ahmed Maouloud, L. Althueser, B. Andrieu, E. Angelino, D. Antón Martin, S. R. Armbruster, F. Arneodo, L. Baudis, M. Bazyk, L. Bellagamba, R. Biondi, A. Bismark, K. Boese, A. Brown, G. Bruno, R. Budnik, C. Cai, C. Capelli, J. M. R. Cardoso, A. P. Cimental Chávez, A. P. Colijn, J. Conrad, et al. (152 additional authors not shown)

    Abstract: We report on the search for X-ray radiation as predicted from dynamical quantum collapse with low-energy electronic recoil data in the energy range of 1-140 keV from the first science run of the XENONnT dark matter detector. Spontaneous radiation is an unavoidable effect of dynamical collapse models, which were introduced as a possible solution to the long-standing measurement problem in quantum m…

    Submitted 5 June, 2025; originally announced June 2025.

    Comments: 7 pages, 3 figures

  4. arXiv:2506.05454  [pdf, ps, other]

    cs.LG cs.AI math.OC stat.ML

    Zeroth-Order Optimization Finds Flat Minima

    Authors: Liang Zhang, Bingcong Li, Kiran Koshy Thekumparampil, Sewoong Oh, Michael Muehlebach, Niao He

    Abstract: Zeroth-order methods are extensively used in machine learning applications where gradients are infeasible or expensive to compute, such as black-box attacks, reinforcement learning, and language model fine-tuning. Existing optimization theory focuses on convergence to an arbitrary stationary point, but less is known on the implicit regularization that provides a fine-grained characterization on wh…

    Submitted 5 June, 2025; originally announced June 2025.

  5. arXiv:2506.05424  [pdf, ps, other]

    quant-ph cond-mat.mes-hall

    Spin textures in curved paths on a curved surface

    Authors: Guo-Hua Liang, Ai-Guo Mei, Zhi-Hui Yang, Ze-Lin Wei

    Abstract: This study investigates the quantum dynamics of a spin-1/2 particle confined to a curved path from the dynamics of a two-dimensional curved thin-layer system incorporating spin connection contributions. We demonstrate that the geodesic curvature, normal curvature, and geodesic torsion of the curve govern the emergent non-Abelian gauge potential and effective scalar potential in the system's Hamilt…

    Submitted 5 June, 2025; originally announced June 2025.

    Comments: 8 pages, 3 figures

  6. arXiv:2506.05415  [pdf, ps, other]

    cs.CL

    Automatically Detecting Amusing Games in Wordle

    Authors: Ronaldo Luo, Gary Liang, Cindy Liu, Adam Kabbara, Minahil Bakhtawar, Kina Kim, Michael Guerzhoy

    Abstract: We explore automatically predicting which Wordle games Reddit users find amusing. We scrape approximately 80k reactions by Reddit users to Wordle games from Reddit, classify the reactions as expressing amusement or not using OpenAI's GPT-3.5 using few-shot prompting, and verify that GPT-3.5's labels roughly correspond to human labels. We then extract features from Wordle games that can predict…

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: Accepted to the International Conference on Computational Creativity (ICCC) 2025

  7. arXiv:2506.05401  [pdf, ps, other]

    cs.CR cs.CV

    Robust Anti-Backdoor Instruction Tuning in LVLMs

    Authors: Yuan Xun, Siyuan Liang, Xiaojun Jia, Xinwei Liu, Xiaochun Cao

    Abstract: Large visual language models (LVLMs) have demonstrated excellent instruction-following capabilities, yet remain vulnerable to stealthy backdoor attacks when finetuned using contaminated data. Existing backdoor defense techniques are usually developed for single-modal visual or language models under fully parameter-adjustable settings or rely on supervisory knowledge during training. However, in re…

    Submitted 3 June, 2025; originally announced June 2025.

  8. arXiv:2506.05381  [pdf, other]

    cs.CR cs.IT eess.SP

    Heterogeneous Secure Transmissions in IRS-Assisted NOMA Communications: CO-GNN Approach

    Authors: Linlin Liang, Zongkai Tian, Haiyan Huang, Xiaoyan Li, Zhisheng Yin, Dehua Zhang, Nina Zhang, Wenchao Zhai

    Abstract: Intelligent Reflecting Surfaces (IRS) enhance spectral efficiency by adjusting reflection phase shifts, while Non-Orthogonal Multiple Access (NOMA) increases system capacity. Consequently, IRS-assisted NOMA communications have garnered significant research interest. However, the passive nature of the IRS, lacking authentication and security protocols, makes these systems vulnerable to external eav…

    Submitted 3 June, 2025; originally announced June 2025.

  9. arXiv:2506.05318  [pdf, ps, other]

    cs.CV

    Does Your 3D Encoder Really Work? When Pretrain-SFT from 2D VLMs Meets 3D VLMs

    Authors: Haoyuan Li, Yanpeng Zhou, Yufei Gao, Tao Tang, Jianhua Han, Yujie Yuan, Dave Zhenyu Chen, Jiawang Bian, Hang Xu, Xiaodan Liang

    Abstract: Remarkable progress in 2D Vision-Language Models (VLMs) has spurred interest in extending them to 3D settings for tasks like 3D Question Answering, Dense Captioning, and Visual Grounding. Unlike 2D VLMs that typically process images through an image encoder, 3D scenes, with their intricate spatial structures, allow for diverse model architectures. Based on their encoder design, this paper categori…

    Submitted 6 June, 2025; v1 submitted 5 June, 2025; originally announced June 2025.

  10. arXiv:2506.05183  [pdf, ps, other]

    cs.LG cs.AI

    TreeRPO: Tree Relative Policy Optimization

    Authors: Zhicheng Yang, Zhijiang Guo, Yinya Huang, Xiaodan Liang, Yiwei Wang, Jing Tang

    Abstract: Large Language Models (LLMs) have shown remarkable reasoning capabilities through Reinforcement Learning with Verifiable Rewards (RLVR) methods. However, a key limitation of existing approaches is that rewards defined at the full trajectory level provide insufficient guidance for optimizing the intermediate steps of a reasoning process. To address this, we introduce TreeRPO, a novel method…

    Submitted 5 June, 2025; originally announced June 2025.

    Comments: 13 pages, 6 figures

  11. arXiv:2506.05115  [pdf, ps, other]

    cs.RO

    Whole-Body Constrained Learning for Legged Locomotion via Hierarchical Optimization

    Authors: Haoyu Wang, Ruyi Zhou, Liang Ding, Tie Liu, Zhelin Zhang, Peng Xu, Haibo Gao, Zongquan Deng

    Abstract: Reinforcement learning (RL) has demonstrated impressive performance in legged locomotion over various challenging environments. However, due to the sim-to-real gap and lack of explainability, unconstrained RL policies deployed in the real world still suffer from inevitable safety issues, such as joint collisions, excessive torque, or foot slippage in low-friction environments. These problems limit…

    Submitted 5 June, 2025; originally announced June 2025.

  12. arXiv:2506.05055  [pdf, ps, other]

    hep-ex

    Study of $f_1(1420)$ and $η(1405)$ in the decay $J/ψ\to γπ^{0}π^{0}π^{0}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (650 additional authors not shown)

    Abstract: A partial-wave analysis is performed on the decay $J/ψ\toγπ^{0}π^{0}π^{0}$ within the $π^{0}π^{0}π^{0}$ invariant-mass region below 1.6 GeV$/c^{2}$, using $(10.09~\pm~0.04)\times10^{9} ~J/ψ$ events collected with the BESIII detector. Significant isospin-violating decays of $η(1405)$ and $f_1(1420)$ into $f_0(980)π^{0}$ are observed. For the first time, three axial-vectors, $f_1(1285)$,…

    Submitted 7 June, 2025; v1 submitted 5 June, 2025; originally announced June 2025.

  13. arXiv:2506.05044  [pdf, ps, other]

    cs.IR

    Rethinking Contrastive Learning in Session-based Recommendation

    Authors: Xiaokun Zhang, Bo Xu, Fenglong Ma, Zhizheng Wang, Liang Yang, Hongfei Lin

    Abstract: Session-based recommendation aims to predict intents of anonymous users based on limited behaviors. With the ability in alleviating data sparsity, contrastive learning is prevailing in the task. However, we spot that existing contrastive learning based methods still suffer from three obstacles: (1) they overlook item-level sparsity and primarily focus on session-level sparsity; (2) they typically…

    Submitted 5 June, 2025; originally announced June 2025.

    Comments: This work has been accepted by Pattern Recognition

  14. arXiv:2506.05019  [pdf, ps, other]

    cs.CE

    FinMultiTime: A Four-Modal Bilingual Dataset for Financial Time-Series Analysis

    Authors: Wenyan Xu, Dawei Xiang, Yue Liu, Xiyu Wang, Yanxiang Ma, Liang Zhang, Chang Xu, Jiaheng Zhang

    Abstract: Pure time series forecasting tasks typically focus exclusively on numerical features; however, real-world financial decision-making demands the comparison and analysis of heterogeneous sources of information. Recent advances in deep learning and large scale language models (LLMs) have made significant strides in capturing sentiment and other qualitative signals, thereby enhancing the accuracy of f…

    Submitted 5 June, 2025; originally announced June 2025.

    Comments: Under review

  15. arXiv:2506.05007  [pdf, ps, other]

    cs.AR cs.LG

    QiMeng: Fully Automated Hardware and Software Design for Processor Chip

    Authors: Rui Zhang, Yuanbo Wen, Shuyao Cheng, Di Huang, Shaohui Peng, Jiaming Guo, Pengwei Jin, Jiacheng Zhao, Tianrui Ma, Yaoyu Zhu, Yifan Hao, Yongwei Zhao, Shengwen Liang, Ying Wang, Xing Hu, Zidong Du, Huimin Cui, Ling Li, Qi Guo, Yunji Chen

    Abstract: Processor chip design technology serves as a key frontier driving breakthroughs in computer science and related fields. With the rapid advancement of information technology, conventional design paradigms face three major challenges: the physical constraints of fabrication technologies, the escalating demands for design resources, and the increasing diversity of ecosystems. Automated processor chip…

    Submitted 5 June, 2025; originally announced June 2025.

  16. arXiv:2506.05000  [pdf, ps, other]

    cs.CL

    SCOP: Evaluating the Comprehension Process of Large Language Models from a Cognitive View

    Authors: Yongjie Xiao, Hongru Liang, Peixin Qin, Yao Zhang, Wenqiang Lei

    Abstract: Despite the great potential of large language models (LLMs) in machine comprehension, it is still disturbing to fully count on them in real-world scenarios. This is probably because there is no rational explanation for whether the comprehension process of LLMs is aligned with that of experts. In this paper, we propose SCOP to carefully examine how LLMs perform during the comprehension process from…

    Submitted 5 June, 2025; originally announced June 2025.

    Comments: arXiv admin note: text overlap with arXiv:2004.14535 by other authors

  17. arXiv:2506.04924  [pdf, ps, other]

    cs.LG

    Predicting ICU In-Hospital Mortality Using Adaptive Transformer Layer Fusion

    Authors: Han Wang, Ruoyun He, Guoguang Lao, Ting Liu, Hejiao Luo, Changqi Qin, Hongying Luo, Junmin Huang, Zihan Wei, Lu Chen, Yongzhi Xu, Ziqian Bi, Junhao Song, Tianyang Wang, Chia Xin Liang, Xinyuan Song, Huafeng Liu, Junfeng Hao, Chunjie Tian

    Abstract: Early identification of high-risk ICU patients is crucial for directing limited medical resources. We introduce ALFIA (Adaptive Layer Fusion with Intelligent Attention), a modular, attention-based architecture that jointly trains LoRA (Low-Rank Adaptation) adapters and an adaptive layer-weighting mechanism to fuse multi-layer semantic features from a BERT backbone. Trained on our rigorous cw-24 (C…

    Submitted 6 June, 2025; v1 submitted 5 June, 2025; originally announced June 2025.

    Comments: 21 pages, 6 figures

  18. arXiv:2506.04890  [pdf, ps, other]

    eess.AS

    Multivariate Probabilistic Assessment of Speech Quality

    Authors: Fredrik Cumlin, Xinyu Liang, Victor Ungureanu, Chandan K. A. Reddy, Christian Schüldt, Saikat Chatterjee

    Abstract: The mean opinion score (MOS) is a standard metric for assessing speech quality, but its singular focus fails to identify specific distortions when low scores are observed. The NISQA dataset addresses this limitation by providing ratings across four additional dimensions: noisiness, coloration, discontinuity, and loudness, alongside MOS. In this paper, we extend the explored univariate MOS estimati…

    Submitted 5 June, 2025; originally announced June 2025.

    Comments: Accepted at Interspeech 2025

  19. arXiv:2506.04821  [pdf, ps, other]

    cs.LG

    LogicPuzzleRL: Cultivating Robust Mathematical Reasoning in LLMs via Reinforcement Learning

    Authors: Zhen Hao Wong, Jingwen Deng, Runming He, Zirong Chen, Qijie You, Hejun Dong, Hao Liang, Chengyu Shen, Bin Cui, Wentao Zhang

    Abstract: Large language models (LLMs) excel at many supervised tasks but often struggle with structured reasoning in unfamiliar settings. This discrepancy suggests that standard fine-tuning pipelines may instill narrow, domain-specific heuristics rather than fostering general-purpose thinking strategies. In this work, we propose a "play to learn" framework that fine-tunes LLMs through reinforcement learnin…

    Submitted 5 June, 2025; originally announced June 2025.

  20. arXiv:2506.04810  [pdf, ps, other]

    cs.CL cs.AI cs.LO

    Dissecting Logical Reasoning in LLMs: A Fine-Grained Evaluation and Supervision Study

    Authors: Yujun Zhou, Jiayi Ye, Zipeng Ling, Yufei Han, Yue Huang, Haomin Zhuang, Zhenwen Liang, Kehan Guo, Taicheng Guo, Xiangqi Wang, Xiangliang Zhang

    Abstract: Logical reasoning is a core capability for many applications of large language models (LLMs), yet existing benchmarks often rely solely on final-answer accuracy, failing to capture the quality and structure of the reasoning process. We propose FineLogic, a fine-grained evaluation framework that assesses logical reasoning across three dimensions: overall benchmark accuracy, stepwise soundness, and…

    Submitted 5 June, 2025; originally announced June 2025.

  21. arXiv:2506.04743  [pdf, ps, other]

    cs.CV

    SRD: Reinforcement-Learned Semantic Perturbation for Backdoor Defense in VLMs

    Authors: Shuhan Xu, Siyuan Liang, Hongling Zheng, Yong Luo, Aishan Liu, Dacheng Tao

    Abstract: Vision-Language Models (VLMs) have achieved remarkable performance in image captioning, but recent studies show they are vulnerable to backdoor attacks. Attackers can inject imperceptible perturbations (such as local pixel triggers or global semantic phrases) into the training data, causing the model to generate malicious, attacker-controlled captions for specific inputs. These attacks are hard to d…

    Submitted 5 June, 2025; originally announced June 2025.

  22. arXiv:2506.04674  [pdf, ps, other]

    quant-ph

    Variational toolbox-based separability detection of multiqubit states

    Authors: Jin-Min Liang, Shao-Ming Fei, Qiongyi He

    Abstract: Parametrized quantum circuits (PQCs) are crucial in variational quantum algorithms. While it is commonly believed that the optimal PQC is solely used to reproduce the target state, we here reveal that the optimal PQC can also provide valuable insights into the state's properties. We propose variational toolboxes to identify the $k$-separability of pure states, with or without preparation noise, by…

    Submitted 5 June, 2025; originally announced June 2025.

    Comments: 9 pages, 4 figures

  23. arXiv:2506.04652  [pdf, ps, other]

    eess.AS cs.CL

    EMO-Debias: Benchmarking Gender Debiasing Techniques in Multi-Label Speech Emotion Recognition

    Authors: Yi-Cheng Lin, Huang-Cheng Chou, Yu-Hsuan Li Liang, Hung-yi Lee

    Abstract: Speech emotion recognition (SER) systems often exhibit gender bias. However, the effectiveness and robustness of existing debiasing methods in such multi-label scenarios remain underexplored. To address this gap, we present EMO-Debias, a large-scale comparison of 13 debiasing methods applied to multi-label SER. Our study encompasses techniques from pre-processing, regularization, adversarial learn…

    Submitted 5 June, 2025; originally announced June 2025.

    Comments: 8 pages

  24. arXiv:2506.04600  [pdf, ps, other]

    math.OC

    Achieving Linear Speedup and Near-Optimal Complexity for Decentralized Optimization over Row-stochastic Networks

    Authors: Liyuan Liang, Xinyi Chen, Gan Luo, Kun Yuan

    Abstract: A key challenge in decentralized optimization is determining the optimal convergence rate and designing algorithms to achieve it. While this problem has been extensively addressed for doubly-stochastic and column-stochastic mixing matrices, the row-stochastic scenario remains unexplored. This paper bridges this gap by introducing effective metrics to capture the influence of row-stochastic mixing…

    Submitted 4 June, 2025; originally announced June 2025.

  25. arXiv:2506.04570  [pdf, ps, other]

    astro-ph.GA

    RIDEN pilot survey: broad-band selection of candidate quasars with extended Lyman-$α$ nebulae using CLAUDS-HSC-SSP-DUNES$^2$ joint data

    Authors: Rhythm Shimakawa, Satoshi Kikuta, Haruka Kusakabe, Marcin Sawicki, Yongming Liang, Rieko Momose, Stephen Gwyn, Guillaume Desprez

    Abstract: The Vera C. Rubin Observatory will conduct the Legacy Survey of Space and Time (LSST), delivering deep, multi-band ($ugrizy$) imaging data across 18,000 square degrees over the next decade. Before this ultra-wide-field survey, we constructed a broad-band Ly$α$ imaging toward 483 SDSS/BOSS quasars at $z=$ 1.9-3.0, using deep, wide-field ultraviolet to near-infrared ($u$-to-$K$) data from the Hyper…

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: 18 pages, 14 figures, 4 tables, accepted for publication in MNRAS

  26. arXiv:2506.04556  [pdf, ps, other]

    cs.CR cs.AI

    BESA: Boosting Encoder Stealing Attack with Perturbation Recovery

    Authors: Xuhao Ren, Haotian Liang, Yajie Wang, Chuan Zhang, Zehui Xiong, Liehuang Zhu

    Abstract: To boost the encoder stealing attack under the perturbation-based defense that hinders the attack performance, we propose a boosting encoder stealing attack with perturbation recovery named BESA. It aims to overcome perturbation-based defenses. The core of BESA consists of two modules: perturbation detection and perturbation recovery, which can be combined with canonical encoder stealing attacks.…

    Submitted 4 June, 2025; originally announced June 2025.

  27. arXiv:2506.04516  [pdf, ps, other]

    cs.CL

    DRE: An Effective Dual-Refined Method for Integrating Small and Large Language Models in Open-Domain Dialogue Evaluation

    Authors: Kun Zhao, Bohao Yang, Chen Tang, Siyuan Dai, Haoteng Tang, Chenghua Lin, Liang Zhan

    Abstract: Large Language Models (LLMs) excel at many tasks but struggle with ambiguous scenarios where multiple valid responses exist, often yielding unreliable results. Conversely, Small Language Models (SLMs) demonstrate robustness in such scenarios but are susceptible to misleading or adversarial inputs. We observed that LLMs handle negative examples effectively, while SLMs excel with positive examples.…

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: arXiv admin note: text overlap with arXiv:2405.15924

  28. arXiv:2506.04335  [pdf, ps, other]

    cond-mat.mes-hall quant-ph

    Emergent gravity and gravitational lensing in quantum materials

    Authors: Yugo Onishi, Nisarga Paul, Liang Fu

    Abstract: We show that an effective gravitational field naturally emerges in quantum materials with long-wavelength spin (or pseudospin) textures. When the itinerant electrons' spin strongly couples to the background spin texture, it effectively behaves as a spinless particle in a curved space, with the curvature arising from quantum corrections to the electron's spin orientation. The emergent gravity gives…

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: 7 pages (including references and 3 figures) + Appendix (8 pages and 5 figures)

  29. arXiv:2506.04276  [pdf, ps, other]

    cs.MA cs.AI

    Autonomous Collaborative Scheduling of Time-dependent UAVs, Workers and Vehicles for Crowdsensing in Disaster Response

    Authors: Lei Han, Yitong Guo, Pengfei Yang, Zhiyong Yu, Liang Wang, Quan Wang, Zhiwen Yu

    Abstract: Natural disasters have caused significant losses to human society, and the timely and efficient acquisition of post-disaster environmental information is crucial for the effective implementation of rescue operations. Due to the complexity of post-disaster environments, existing sensing technologies face challenges such as weak environmental adaptability, insufficient specialized sensing capabiliti…

    Submitted 3 June, 2025; originally announced June 2025.

  30. arXiv:2506.04231  [pdf]

    physics.app-ph physics.class-ph

    Observation of Coherent Perfect Acoustic Absorption at an Exceptional Point

    Authors: Yi-Fei Xia, Zi-Xiang Xu, Yu-Ting Yan, An Chen, Jing Yang, Bin Liang, Jian-Chun Cheng, Johan Christensen

    Abstract: Non-Hermitian systems have recently shown new possibilities to manipulate wave scattering by exploiting loss, yet coherent perfect absorption at an exceptional point (CPA EP) remains elusive in acoustics. Here we demonstrate it based on a two-channel waveguide with compact lossy resonators. We realize imbalanced losses crucial for CPA EP by using active components to independently modulate the non…

    Submitted 19 May, 2025; originally announced June 2025.

  31. arXiv:2506.04217  [pdf, ps, other]

    cs.RO cs.AI

    OWMM-Agent: Open World Mobile Manipulation With Multi-modal Agentic Data Synthesis

    Authors: Junting Chen, Haotian Liang, Lingxiao Du, Weiyun Wang, Mengkang Hu, Yao Mu, Wenhai Wang, Jifeng Dai, Ping Luo, Wenqi Shao, Lin Shao

    Abstract: The rapid progress of navigation, manipulation, and vision models has made mobile manipulators capable in many specialized tasks. However, the open-world mobile manipulation (OWMM) task remains a challenge due to the need for generalization to open-ended instructions and environments, as well as the systematic complexity to integrate high-level decision making with low-level robot control based on…

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: 9 pages of main content, 19 pages in total

    ACM Class: I.2.4; I.2.9; I.2.10

  32. arXiv:2506.04160  [pdf, ps, other]

    physics.chem-ph physics.optics

    Interplay between ultrafast electronic and librational dynamics in liquid nitrobenzene probed with two-color four-wave mixing

    Authors: Niranjan Shivaram, Richard Thurston, Ali Belkacem, Thorsten Weber, Liang Z. Tan, Daniel S. Slaughter

    Abstract: We present an experimental and theoretical study of the interplay between ultrafast electron dynamics and librational dynamics in liquid nitrobenzene. A femtosecond ultraviolet pulse and two femtosecond near infrared pulses interact with nitrobenzene molecules, generating a four-wave mixing nonlinear signal that is measured in the Optical Kerr Effect geometry. The near infrared nonlinear signal is…

    Submitted 4 June, 2025; originally announced June 2025.

  33. arXiv:2506.04065  [pdf, ps, other]

    cs.CL

    Progressive Mastery: Customized Curriculum Learning with Guided Prompting for Mathematical Reasoning

    Authors: Muling Wu, Qi Qian, Wenhao Liu, Xiaohua Wang, Zisu Huang, Di Liang, LI Miao, Shihan Dou, Changze Lv, Zhenghua Wang, Zhibo Xu, Lina Chen, Tianlong Li, Xiaoqing Zheng, Xuanjing Huang

    Abstract: Large Language Models (LLMs) have achieved remarkable performance across various reasoning tasks, yet post-training is constrained by inefficient sample utilization and inflexible difficulty samples processing. To address these limitations, we propose Customized Curriculum Learning (CCL), a novel framework with two key innovations. First, we introduce model-adaptive difficulty definition that cust…

    Submitted 4 June, 2025; originally announced June 2025.

  34. arXiv:2506.04015  [pdf, ps, other]

    cs.IR

    GORACS: Group-level Optimal Transport-guided Coreset Selection for LLM-based Recommender Systems

    Authors: Tiehua Mei, Hengrui Chen, Peng Yu, Jiaqing Liang, Deqing Yang

    Abstract: Although large language models (LLMs) have shown great potential in recommender systems, the prohibitive computational costs for fine-tuning LLMs on entire datasets hinder their successful deployment in real-world scenarios. To develop affordable and effective LLM-based recommender systems, we focus on the task of coreset selection which identifies a small subset of fine-tuning data to optimize th…

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: Accepted by KDD 2025

  35. arXiv:2506.03874  [pdf, ps, other]

    cs.IT

    The equivalent condition for GRL codes to be MDS, AMDS or self-dual

    Authors: Zhonghao Liang, Qunying Liao

    Abstract: It is well-known that MDS, AMDS or self-dual codes have good algebraic properties, and are applied in communication systems, data storage, quantum codes, and so on. In this paper, we focus on a class of generalized Roth-Lempel linear codes, and give an equivalent condition for them or their dual to be non-RS MDS, AMDS or non-RS self-dual and some corresponding examples.

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: 18 pages

    MSC Class: 94A24; 94B05

  36. arXiv:2506.03850  [pdf, ps, other]

    cs.LG

    Vulnerability-Aware Alignment: Mitigating Uneven Forgetting in Harmful Fine-Tuning

    Authors: Liang Chen, Xueting Han, Li Shen, Jing Bai, Kam-Fai Wong

    Abstract: Harmful fine-tuning (HFT), performed directly on open-source LLMs or through Fine-tuning-as-a-Service, breaks safety alignment and poses significant threats. Existing methods aim to mitigate HFT risks by learning robust representation on alignment data or making harmful data unlearnable, but they treat each data sample equally, leaving data vulnerability patterns understudied. In this work, we rev…

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: ICML 2025

  37. arXiv:2506.03747  [pdf, other]

    physics.optics

    Fast Non-Line-of-Sight Transient Data Simulation and an Open Benchmark Dataset

    Authors: Yingjie Shi, Jinye Miao, Taotao Qin, Fuyao Cai, Yi Wei, Lingfeng Liu, Tongyao Li, Chenyang Wu, Huan Liang, Yuyang Yin, Lianfa Bai, Enlai Guo, Jing Han

    Abstract: Non-Line-of-Sight (NLOS) imaging reconstructs the shape and depth of hidden objects from picosecond-resolved transient signals, offering potential applications in autonomous driving, security, and medical diagnostics. However, current NLOS experiments rely on expensive hardware and complex system alignment, limiting their scalability. This manuscript presents a simplified simulation method that ge…

    Submitted 4 June, 2025; originally announced June 2025.

  38. arXiv:2506.03724  [pdf, ps, other]

    math.FA

    Uncertainty principles for free metaplectic transformation and associated metaplectic operators

    Authors: Ping Liang, Pei Dang, Weixiong Mai

    Abstract: In this paper, we systematically investigate the Heisenberg-Pauli-Weyl uncertainty principle for free metaplectic transformation, as well as metaplectic operators. Specifically, we obtain two different types of the uncertainty principle for free metaplectic transformations in terms of the so-called phase derivative, one of which can be generalized to the $L^p$-case with $1\le p\le 2$. The obtained…

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: 42 pages

  39. arXiv:2506.03714  [pdf, other]

    cs.CV

    FSHNet: Fully Sparse Hybrid Network for 3D Object Detection

    Authors: Shuai Liu, Mingyue Cui, Boyang Li, Quanmin Liang, Tinghe Hong, Kai Huang, Yunxiao Shan, Kai Huang

    Abstract: Fully sparse 3D detectors have recently gained significant attention due to their efficiency in long-range detection. However, sparse 3D detectors extract features only from non-empty voxels, which impairs long-range interactions and causes the center feature missing. The former weakens the feature extraction capability, while the latter hinders network optimization. To address these challenges, w…

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: Accepted by CVPR 2025

  40. arXiv:2506.03698  [pdf]

    cs.CV

    Advancements in Artificial Intelligence Applications for Cardiovascular Disease Research

    Authors: Yuanlin Mo, Haishan Huang, Bocheng Liang, Weibo Ma

    Abstract: Recent advancements in artificial intelligence (AI) have revolutionized cardiovascular medicine, particularly through integration with computed tomography (CT), magnetic resonance imaging (MRI), electrocardiography (ECG) and ultrasound (US). Deep learning architectures, including convolutional neural networks and generative adversarial networks, enable automated analysis of medical imaging and phy…

    Submitted 4 June, 2025; originally announced June 2025.

  41. arXiv:2506.03643  [pdf, ps, other]

    cs.CV

    Images are Worth Variable Length of Representations

    Authors: Lingjun Mao, Rodolfo Corona, Xin Liang, Wenhao Yan, Zineng Tang

    Abstract: Most existing vision encoders map images into a fixed-length sequence of tokens, overlooking the fact that different images contain varying amounts of information. For example, a visually complex image (e.g., a cluttered room) inherently carries more information and thus deserves more tokens than a simple image (e.g., a blank wall). To address this inefficiency, we propose DOVE, a dynamic vision e…

    Submitted 5 June, 2025; v1 submitted 4 June, 2025; originally announced June 2025.

  42. arXiv:2506.03586  [pdf, ps, other]

    cs.AI cs.IT

    Beamforming and Resource Allocation for Delay Optimization in RIS-Assisted OFDM Systems

    Authors: Yu Ma, Xiao Li, Chongtao Guo, Le Liang, Shi Jin

    Abstract: This paper investigates a joint phase design and resource allocation problem in downlink reconfigurable intelligent surface (RIS)-assisted orthogonal frequency division multiplexing (OFDM) systems to optimize average delay, where data packets for each user arrive at the base station stochastically. The sequential optimization problem is inherently a Markov decision process (MDP), making it fall wi…

    Submitted 12 June, 2025; v1 submitted 4 June, 2025; originally announced June 2025.

  43. arXiv:2506.03569  [pdf, ps, other]

    cs.CL

    MiMo-VL Technical Report

    Authors: Xiaomi LLM-Core Team, Zihao Yue, Zhenru Lin, Yifan Song, Weikun Wang, Shuhuai Ren, Shuhao Gu, Shicheng Li, Peidian Li, Liang Zhao, Lei Li, Kainan Bao, Hao Tian, Hailin Zhang, Gang Wang, Dawei Zhu, Cici, Chenhong He, Bowen Ye, Bowen Shen, Zihan Zhang, Zihan Jiang, Zhixian Zheng, Zhichao Song , et al. (50 additional authors not shown)

    Abstract: We open-source MiMo-VL-7B-SFT and MiMo-VL-7B-RL, two powerful vision-language models delivering state-of-the-art performance in both general visual understanding and multimodal reasoning. MiMo-VL-7B-RL outperforms Qwen2.5-VL-7B on 35 out of 40 evaluated tasks, and scores 59.4 on OlympiadBench, surpassing models with up to 78B parameters. For GUI grounding applications, it sets a new standard with…

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: 32 pages

  44. arXiv:2506.03524  [pdf, ps, other]

    cs.CL cs.SE

    Seed-Coder: Let the Code Model Curate Data for Itself

    Authors: ByteDance Seed, Yuyu Zhang, Jing Su, Yifan Sun, Chenguang Xi, Xia Xiao, Shen Zheng, Anxiang Zhang, Kaibo Liu, Daoguang Zan, Tao Sun, Jinhua Zhu, Shulin Xin, Dong Huang, Yetao Bai, Lixin Dong, Chao Li, Jianchong Chen, Hanzhi Zhou, Yifan Huang, Guanghan Ning, Xierui Song, Jiaze Chen, Siyao Liu, Kai Shen , et al. (2 additional authors not shown)

    Abstract: Code data in large language model (LLM) pretraining is recognized crucial not only for code-related tasks but also for enhancing general intelligence of LLMs. Current open-source LLMs often heavily rely on human effort to produce their code pretraining data, such as employing hand-crafted filtering rules tailored to individual programming languages, or using human-annotated data to train quality f…

    Submitted 4 June, 2025; v1 submitted 3 June, 2025; originally announced June 2025.

  45. arXiv:2506.03197  [pdf, ps, other]

    cs.CV cs.AI cs.CL cs.LG

    Infinity Parser: Layout Aware Reinforcement Learning for Scanned Document Parsing

    Authors: Baode Wang, Biao Wu, Weizhen Li, Meng Fang, Yanjie Liang, Zuming Huang, Haozhe Wang, Jun Huang, Ling Chen, Wei Chu, Yuan Qi

    Abstract: Automated parsing of scanned documents into richly structured, machine-readable formats remains a critical bottleneck in Document AI, as traditional multi-stage pipelines suffer from error propagation and limited adaptability to diverse layouts. We introduce layoutRL, an end-to-end reinforcement learning framework that trains models to be explicitly layout-aware by optimizing a composite reward of…

    Submitted 1 June, 2025; originally announced June 2025.

    Comments: 16 pages, 12 figures

    Report number: INF-CS-TR-2025-02

  46. arXiv:2506.03133  [pdf, ps, other]

    cs.LG cs.AI eess.SP math.OC

    PoLAR: Polar-Decomposed Low-Rank Adapter Representation

    Authors: Kai Lion, Liang Zhang, Bingcong Li, Niao He

    Abstract: We show that low-rank adaptation of large-scale models suffers from a low stable rank that is well below the linear algebraic rank of the subspace, degrading fine-tuning performance. To mitigate the underutilization of the allocated subspace, we propose PoLAR, a parameterization inspired by the polar decomposition that factorizes the low-rank update into two direction matrices constrained to Stief…

    Submitted 3 June, 2025; originally announced June 2025.

  47. arXiv:2506.03100  [pdf, ps, other]

    cs.LG cs.AI cs.CL cs.IR math.ST

    Retrieval-Augmented Generation as Noisy In-Context Learning: A Unified Theory and Risk Bounds

    Authors: Yang Guo, Yutian Tao, Yifei Ming, Robert D. Nowak, Yingyu Liang

    Abstract: Retrieval-augmented generation (RAG) has seen many empirical successes in recent years by aiding the LLM with external knowledge. However, its theoretical aspect has remained mostly unexplored. In this paper, we propose the first finite-sample generalization bound for RAG in in-context linear regression and derive an exact bias-variance tradeoff. Our framework views the retrieved texts as query-de…

    Submitted 9 June, 2025; v1 submitted 3 June, 2025; originally announced June 2025.

    Comments: Under Review

  48. arXiv:2506.03072  [pdf, ps, other]

    hep-ex

    Three-pion Bose-Einstein correlations measured in proton-proton collisions

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis, L. An , et al. (1125 additional authors not shown)

    Abstract: A study on the Bose-Einstein correlations for triplets of same-sign pions is presented. The analysis is performed using proton-proton collisions at a centre-of-mass energy of $\sqrt{s}$ = 7 TeV, recorded by the LHCb experiment, corresponding to an integrated luminosity of 1.0 fb$^{-1}$. For the first time, the results are interpreted in the core-halo model. The parameters of the model are determin…

    Submitted 9 June, 2025; v1 submitted 3 June, 2025; originally announced June 2025.

    Comments: All figures and tables, along with machine-readable versions and any supplementary material and additional information, are available at https://lbfence.cern.ch/alcm/public/analysis/full-details/3759/ (LHCb public pages)

    Report number: CERN-EP-2025-104, LHCb-PAPER-2025-007

  49. arXiv:2506.03068  [pdf, ps, other]

    stat.ML cs.CY cs.LG

    Causal Explainability of Machine Learning in Heart Failure Prediction from Electronic Health Records

    Authors: Yina Hou, Shourav B. Rabbani, Liang Hong, Norou Diawara, Manar D. Samad

    Abstract: The importance of clinical variables in the prognosis of the disease is explained using statistical correlation or machine learning (ML). However, the predictive importance of these variables may not represent their causal relationships with diseases. This paper uses clinical variables from a heart failure (HF) patient cohort to investigate the causal explainability of important variables obtained…

    Submitted 3 June, 2025; originally announced June 2025.

    Comments: 4 figures

  50. arXiv:2506.02969  [pdf, ps, other]

    hep-ex

    Measurement of the branching fractions of the Cabibbo-favored decays $Λ_{c}^{+}\toΛK_{S}^{0}K^{+}$ and $Λ_{c}^{+}\toΞ^{0}K_{S}^{0}π^{+}$ and search for $Λ_{c}^{+}\toΣ^{0} K_{S}^{0}K^{+}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (660 additional authors not shown)

    Abstract: Based on $e^{+}e^{-}$ collision data corresponding to an integrated luminosity of about 4.5 fb$^{-1}$ collected at center-of-mass energies between 4599.53 MeV and 4698.82 MeV with the BESIII detector, the absolute branching fraction of the Cabibbo-favored decay $Λ_{c}^{+}\toΛK_{S}^{0}K^{+}$ is measured to be $(3.12\pm0.46\pm0.15)\times10^{-3}$. Combined with a previous measurement from the BESIII…

    Submitted 3 June, 2025; originally announced June 2025.