Skip to main content

Showing 1–50 of 1,893 results for author: lu, C

.
  1. arXiv:2506.09340  [pdf, ps, other

    cs.CL cs.AI cs.LG

    RePO: Replay-Enhanced Policy Optimization

    Authors: Siheng Li, Zhanhui Zhou, Wai Lam, Chao Yang, Chaochao Lu

    Abstract: Reinforcement learning (RL) is vital for optimizing large language models (LLMs). Recent Group Relative Policy Optimization (GRPO) estimates advantages using multiple on-policy outputs per prompt, leading to high computational costs and low data efficiency. To address this, we introduce Replay-Enhanced Policy Optimization (RePO), which leverages diverse replay strategies to retrieve off-policy sam… ▽ More

    Submitted 10 June, 2025; originally announced June 2025.

    Comments: Project Page: https://github.com/SihengLi99/RePO

  2. arXiv:2506.09095  [pdf, ps, other

    eess.IV cs.AI cs.CV

    Foundation Models in Medical Imaging -- A Review and Outlook

    Authors: Vivien van Veldhuizen, Vanessa Botha, Chunyao Lu, Melis Erdal Cesur, Kevin Groot Lipman, Edwin D. de Jong, Hugo Horlings, Clárisa Sanchez, Cees Snoek, Ritse Mann, Eric Marcus, Jonas Teuwen

    Abstract: Foundation models (FMs) are changing the way medical images are analyzed by learning from large collections of unlabeled data. Instead of relying on manually annotated examples, FMs are pre-trained to learn general-purpose visual features that can later be adapted to specific clinical tasks with little additional supervision. In this review, we examine how FMs are being developed and applied in pa… ▽ More

    Submitted 10 June, 2025; originally announced June 2025.

  3. arXiv:2506.08399  [pdf, ps, other

    cs.AI cs.LG

    SafeCoT: Improving VLM Safety with Minimal Reasoning

    Authors: Jiachen Ma, Zhanhui Zhou, Chao Yang, Chaochao Lu

    Abstract: Ensuring safe and appropriate responses from vision-language models (VLMs) remains a critical challenge, particularly in high-risk or ambiguous scenarios. We introduce SafeCoT, a lightweight, interpretable framework that leverages rule-based chain-of-thought (CoT) supervision to improve refusal behavior in VLMs. Unlike prior methods that rely on large-scale safety annotations or complex modeling,… ▽ More

    Submitted 11 June, 2025; v1 submitted 9 June, 2025; originally announced June 2025.

  4. arXiv:2506.08334  [pdf, ps, other

    cs.GR cs.CV

    Generalizable Articulated Object Reconstruction from Casually Captured RGBD Videos

    Authors: Weikun Peng, Jun Lv, Cewu Lu, Manolis Savva

    Abstract: Articulated objects are prevalent in daily life. Understanding their kinematic structure and reconstructing them have numerous applications in embodied AI and robotics. However, current methods require carefully captured data for training or inference, preventing practical, scalable, and generalizable reconstruction of articulated objects. We focus on reconstruction of an articulated object from a… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

    Comments: Project website can be found at https://3dlg-hcvc.github.io/video2articulation/

  5. arXiv:2506.07664  [pdf, ps, other

    cs.CL cs.AI

    Synthesis by Design: Controlled Data Generation via Structural Guidance

    Authors: Lei Xu, Sirui Chen, Yuxuan Huang, Chaochao Lu

    Abstract: Mathematical reasoning remains challenging for LLMs due to complex logic and the need for precise computation. Existing methods enhance LLM reasoning by synthesizing datasets through problem rephrasing, but face issues with generation quality and problem complexity. To address this, we propose to extract structural information with generated problem-solving code from mathematical reasoning and gui… ▽ More

    Submitted 10 June, 2025; v1 submitted 9 June, 2025; originally announced June 2025.

  6. arXiv:2506.07639  [pdf, ps, other

    cs.RO

    Fast ECoT: Efficient Embodied Chain-of-Thought via Thoughts Reuse

    Authors: Zhekai Duan, Yuan Zhang, Shikai Geng, Gaowen Liu, Joschka Boedecker, Chris Xiaoxuan Lu

    Abstract: Embodied Chain-of-Thought (ECoT) reasoning enhances vision-language-action (VLA) models by improving performance and interpretability through intermediate reasoning steps. However, its sequential autoregressive token generation introduces significant inference latency, limiting real-time deployment. We propose Fast ECoT, an inference-time acceleration method that exploits the structured and repeti… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

  7. arXiv:2506.06729  [pdf, other

    cs.CV cs.CL

    Mitigating Object Hallucination via Robust Local Perception Search

    Authors: Zixian Gao, Chao Yang, Zhanhui Zhou, Xing Xu, Chaochao Lu

    Abstract: Recent advancements in Multimodal Large Language Models (MLLMs) have enabled them to effectively integrate vision and language, addressing a variety of downstream tasks. However, despite their significant success, these models still exhibit hallucination phenomena, where the outputs appear plausible but do not align with the content of the images. To mitigate this issue, we introduce Local Percept… ▽ More

    Submitted 7 June, 2025; originally announced June 2025.

  8. arXiv:2506.05445  [pdf, ps, other

    cs.LG cs.AI

    Causal Policy Learning in Reinforcement Learning: Backdoor-Adjusted Soft Actor-Critic

    Authors: Thanh Vinh Vo, Young Lee, Haozhe Ma, Chien Lu, Tze-Yun Leong

    Abstract: Hidden confounders that influence both states and actions can bias policy learning in reinforcement learning (RL), leading to suboptimal or non-generalizable behavior. Most RL algorithms ignore this issue, learning policies from observational trajectories based solely on statistical associations rather than causal effects. We propose DoSAC (Do-Calculus Soft Actor-Critic with Backdoor Adjustment),… ▽ More

    Submitted 5 June, 2025; originally announced June 2025.

    Comments: Preprint

  9. arXiv:2506.03614  [pdf, ps, other

    cs.CV cs.AI cs.CL cs.CR

    VLMs Can Aggregate Scattered Training Patches

    Authors: Zhanhui Zhou, Lingjie Chen, Chao Yang, Chaochao Lu

    Abstract: One way to mitigate risks in vision-language models (VLMs) is to remove dangerous samples in their training data. However, such data moderation can be easily bypassed when harmful images are split into small, benign-looking patches, scattered across many training samples. VLMs may then learn to piece these fragments together during training and generate harmful responses at inference, either from… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

  10. arXiv:2506.03106  [pdf, ps, other

    cs.CL cs.AI

    Critique-GRPO: Advancing LLM Reasoning with Natural Language and Numerical Feedback

    Authors: Xiaoying Zhang, Hao Sun, Yipeng Zhang, Kaituo Feng, Chaochao Lu, Chao Yang, Helen Meng

    Abstract: Recent advances in reinforcement learning (RL) with numerical feedback, such as scalar rewards, have significantly enhanced the complex reasoning capabilities of large language models (LLMs). Despite this success, we identify three key challenges encountered by RL with solely numerical feedback: performance plateaus, limited effectiveness of self-reflection, and persistent failures. We then demons… ▽ More

    Submitted 4 June, 2025; v1 submitted 3 June, 2025; originally announced June 2025.

    Comments: 38 pages

  11. arXiv:2506.02860  [pdf, ps, other

    cs.RO cs.AI

    Tru-POMDP: Task Planning Under Uncertainty via Tree of Hypotheses and Open-Ended POMDPs

    Authors: Wenjing Tang, Xinyu He, Yongxi Huang, Yunxiao Xiao, Cewu Lu, Panpan Cai

    Abstract: Task planning under uncertainty is essential for home-service robots operating in the real world. Tasks involve ambiguous human instructions, hidden or unknown object locations, and open-vocabulary object types, leading to significant open-ended uncertainty and a boundlessly large planning space. To address these challenges, we propose Tru-POMDP, a planner that combines structured belief generatio… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

  12. arXiv:2506.02449  [pdf, ps, other

    cs.CL cs.HC

    IP-Dialog: Evaluating Implicit Personalization in Dialogue Systems with Synthetic Data

    Authors: Bo Peng, Zhiheng Wang, Heyang Gong, Chaochao Lu

    Abstract: In modern dialogue systems, the ability to implicitly infer user backgrounds from conversations and leverage this information for personalized assistance is crucial. However, the scarcity of high-quality data remains a fundamental challenge to evaluating and improving this capability. Traditional dataset construction methods are labor-intensive, resource-demanding, and raise privacy concerns. To a… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

  13. arXiv:2506.01687  [pdf, ps, other

    cs.CL

    StochasTok: Improving Fine-Grained Subword Understanding in LLMs

    Authors: Anya Sims, Thom Foster, Klara Kaleb, Tuan-Duy H. Nguyen, Joseph Lee, Jakob N. Foerster, Yee Whye Teh, Cong Lu

    Abstract: Subword-level understanding is integral to numerous tasks, including understanding multi-digit numbers, spelling mistakes, abbreviations, rhyming, and wordplay. Despite this, current large language models (LLMs) still often struggle with seemingly simple subword-level tasks like How many 'r's in 'strawberry'?. A key factor behind these failures is tokenization which obscures the fine-grained struc… ▽ More

    Submitted 10 June, 2025; v1 submitted 2 June, 2025; originally announced June 2025.

  14. arXiv:2506.01551  [pdf, ps, other

    cs.CV cs.AI cs.CL

    EvolveNav: Self-Improving Embodied Reasoning for LLM-Based Vision-Language Navigation

    Authors: Bingqian Lin, Yunshuang Nie, Khun Loun Zai, Ziming Wei, Mingfei Han, Rongtao Xu, Minzhe Niu, Jianhua Han, Liang Lin, Cewu Lu, Xiaodan Liang

    Abstract: Building Vision-Language Navigation (VLN) agents which can navigate following natural language instructions is a long-standing goal in human-robot interaction applications. Recent studies have revealed the potential of training open-source Large Language Models (LLMs) to unleash LLMs' reasoning ability for improving navigation, and simultaneously mitigate the domain gap between LLMs' training corp… ▽ More

    Submitted 2 June, 2025; originally announced June 2025.

  15. arXiv:2506.00902  [pdf

    cond-mat.mtrl-sci cond-mat.str-el

    Observation of universal topological magnetoelectric switching in multiferroic GdMn2O5

    Authors: Haowen Wang, Fan Wang, Ming Yang, Yuting Chang, Mengyi Shi, Liang Li, Jun-Ming Liu, Junfeng Wang, Shuai Dong, Chengliang Lu

    Abstract: Topological magnetoelectricity was recently revealed as an emergent topic, which opens a unique route to precisely control magnetoelectric functionality. Here we report the synchronous magnetic-electric-cycle operation of topological magnetoelectric switching in GdMn2O5. Compared with pure magnetic-cycle operation, this topological winding can be accessed in a much broader parameter space, i.e. or… ▽ More

    Submitted 1 June, 2025; originally announced June 2025.

    Journal ref: Phys. Rev. Lett. 134, 016708 (2025)

  16. arXiv:2506.00765  [pdf, ps, other

    cs.AI

    HouseTS: A Large-Scale, Multimodal Spatiotemporal U.S. Housing Dataset

    Authors: Shengkun Wang, Yanshen Sun, Fanglan Chen, Linhan Wang, Naren Ramakrishnan, Chang-Tien Lu, Yinlin Chen

    Abstract: Accurate house-price forecasting is essential for investors, planners, and researchers. However, reproducible benchmarks with sufficient spatiotemporal depth and contextual richness for long horizon prediction remain scarce. To address this, we introduce HouseTS a large scale, multimodal dataset covering monthly house prices from March 2012 to December 2023 across 6,000 ZIP codes in 30 major U.S.… ▽ More

    Submitted 31 May, 2025; originally announced June 2025.

  17. arXiv:2506.00298  [pdf, ps, other

    astro-ph.GA astro-ph.CO

    Millimeter-wave observations of Euclid Deep Field South using the South Pole Telescope: A data release of temperature maps and catalogs

    Authors: M. Archipley, A. Hryciuk, L. E. Bleem, K. Kornoelje, M. Klein, A. J. Anderson, B. Ansarinejad, M. Aravena, L. Balkenhol, P. S. Barry, K. Benabed, A. N. Bender, B. A. Benson, F. Bianchini, S. Bocquet, F. R. Bouchet, E. Camphuis, M. G. Campitiello, J. E. Carlstrom, J. Cathey, C. L. Chang, S. C. Chapman, P. Chaubal, P. M. Chichura, A. Chokshi , et al. (86 additional authors not shown)

    Abstract: Context. The South Pole Telescope third-generation camera (SPT-3G) has observed over 10,000 square degrees of sky at 95, 150, and 220 GHz (3.3, 2.0, 1.4 mm, respectively) overlapping the ongoing 14,000 square-degree Euclid Wide Survey. The Euclid collaboration recently released Euclid Deep Field observations in the first quick data release (Q1). Aims. With the goal of releasing complementary milli… ▽ More

    Submitted 30 May, 2025; originally announced June 2025.

    Comments: 26 pages, 12 figures, to be submitted to A&A

  18. arXiv:2505.24833  [pdf

    cond-mat.other physics.ins-det

    Cryogenic scanning photocurrent spectroscopy for materials responses to structured optical fields

    Authors: Duxing Hao, Chun-I Lu, Ziqi Sun, Yu-Chen Chang, Wen-Hao Chang, Ye-Ru Chen, Akiyoshi Park, Beining Rao, Siyuan Qiu, Yann-Wen Lan, Ting-Hua Lu, Nai-Chang Yeh

    Abstract: Circular dichroism spectroscopy is known to provide important insights into the interplay of different degrees of freedom in quantum materials, and yet spectroscopic study of the optoelectronic responses of quantum materials to structured optical fields, such as light with finite spin and orbital angular momentum, has not yet been widely explored, particularly at cryogenic temperature. Here we dem… ▽ More

    Submitted 30 May, 2025; originally announced May 2025.

  19. arXiv:2505.24810  [pdf, ps, other

    hep-ex hep-ph

    New Physics Search at the CEPC: a General Perspective

    Authors: Stefan Antusch, Peter Athron, Daniele Barducci, Long Chen, Mingshui Chen, Xiang Chen, Huajie Cheng, Kingman Cheung, Joao Guimaraes da Costa, Arindam Das, Frank F. Deppisch, P. S. Bhupal Dev, Xiaokang Du, Yong Du, Yaquan Fang, Andrew Fowlie, Yu Gao, Bruce Mellado Garcia, Shao-Feng Ge, Jiayin Gu, Yu-Chen Guo, Jan Hajer, Chengcheng Han, Tao Han, Sven Heinemeyer , et al. (68 additional authors not shown)

    Abstract: The Circular Electron-Positron Collider (CEPC), a proposed next-generation Higgs factory, provides new opportunities to explore physics beyond the Standard Model (SM). With its clean electron-positron collision environment and the ability to collect large samples of Higgs, W, and Z bosons, the CEPC enables precision measurements and searches for new physics. This white paper outlines the CEPC's di… ▽ More

    Submitted 30 May, 2025; originally announced May 2025.

  20. arXiv:2505.24678  [pdf

    physics.optics

    All-optical diode via nonreciprocal nonlinear absorption and interfacial charge transfer in two-dimensional van der Waals heterostructures

    Authors: Erkang Li, Jinhong Liu, Yanqing Ge, Mingjian Shi, Yijie Wang, Chunhui Lu, Yixuan Zhou, Xinlong Xu

    Abstract: Nonreciprocity is fundamental to photonic and optoelectronic devices such as all-optical diodes for ultrafast optical signal processing. However, previous nonreciprocity is mainly based on linear optical response instead of nonlinear optical response based on recently developed two-dimensional (2D) van der Waals heterostructures. Herein, an all-optical diode prototype based on nonreciprocal nonlin… ▽ More

    Submitted 30 May, 2025; originally announced May 2025.

  21. arXiv:2505.24369  [pdf, ps, other

    cs.LG cs.AI

    Adversarial Preference Learning for Robust LLM Alignment

    Authors: Yuanfu Wang, Pengyu Wang, Chenyang Xi, Bo Tang, Junyi Zhu, Wenqiang Wei, Chen Chen, Chao Yang, Jingfeng Zhang, Chaochao Lu, Yijun Niu, Keming Mao, Zhiyu Li, Feiyu Xiong, Jie Hu, Mingchuan Yang

    Abstract: Modern language models often rely on Reinforcement Learning from Human Feedback (RLHF) to encourage safe behaviors. However, they remain vulnerable to adversarial attacks due to three key limitations: (1) the inefficiency and high cost of human annotation, (2) the vast diversity of potential adversarial attacks, and (3) the risk of feedback bias and reward hacking. To address these challenges, we… ▽ More

    Submitted 30 May, 2025; originally announced May 2025.

    Comments: Accepted at ACL2025 Findings

  22. arXiv:2505.22954  [pdf, ps, other

    cs.AI

    Darwin Godel Machine: Open-Ended Evolution of Self-Improving Agents

    Authors: Jenny Zhang, Shengran Hu, Cong Lu, Robert Lange, Jeff Clune

    Abstract: Today's AI systems have human-designed, fixed architectures and cannot autonomously and continuously improve themselves. The advance of AI could itself be automated. If done safely, that would accelerate AI development and allow us to reap its benefits much sooner. Meta-learning can automate the discovery of novel algorithms, but is limited by first-order improvements and the human design of a sui… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

    Comments: Code at https://github.com/jennyzzt/dgm

  23. arXiv:2505.22170  [pdf, ps, other

    eess.SP cs.IT

    Attention-Enhanced Prompt Decision Transformers for UAV-Assisted Communications with AoI

    Authors: Chi Lu, Yiyang Ni, Zhe Wang, Xiaoli Shi, Jun Li, Shi Jin

    Abstract: Decision Transformer (DT) has recently demonstrated strong generalizability in dynamic resource allocation within unmanned aerial vehicle (UAV) networks, compared to conventional deep reinforcement learning (DRL). However, its performance is hindered due to zero-padding for varying state dimensions, inability to manage long-term energy constraint, and challenges in acquiring expert samples for few… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

  24. arXiv:2505.22159  [pdf, ps, other

    cs.RO cs.CV

    ForceVLA: Enhancing VLA Models with a Force-aware MoE for Contact-rich Manipulation

    Authors: Jiawen Yu, Hairuo Liu, Qiaojun Yu, Jieji Ren, Ce Hao, Haitong Ding, Guangyu Huang, Guofan Huang, Yan Song, Panpan Cai, Cewu Lu, Wenqiang Zhang

    Abstract: Vision-Language-Action (VLA) models have advanced general-purpose robotic manipulation by leveraging pretrained visual and linguistic representations. However, they struggle with contact-rich tasks that require fine-grained control involving force, especially under visual occlusion or dynamic uncertainty. To address these limitations, we propose \textbf{ForceVLA}, a novel end-to-end manipulation f… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

  25. arXiv:2505.20709  [pdf, ps, other

    math.CV

    Fractional order derivative characterizations of Besov-Morrey type spaces with applications

    Authors: Chen Lu, Mingjin Li, Jianren Long

    Abstract: On the one hand, the fractional order derivative characterization of the Besov-Morrey type space $B_{p}^{K}(s)$ is established by $K$-Carleson measures, and it was also shown that $f \in B_{p}^{K}(s_1) \Leftrightarrow f^{\left(\frac{s_2 - s_1}{p}\right)} \in B_{p}^{K}(s_2)$, which extended the results of Sun et al. on the fractional derivative of Morrey type space. On the other hand, some sufficie… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

    MSC Class: Primary 32A37; 32K15; Second 32M10

  26. arXiv:2505.20678  [pdf, ps, other

    eess.AS cs.SD eess.SP

    PromptEVC: Controllable Emotional Voice Conversion with Natural Language Prompts

    Authors: Tianhua Qi, Shiyan Wang, Cheng Lu, Tengfei Song, Hao Yang, Zhanglin Wu, Wenming Zheng

    Abstract: Controllable emotional voice conversion (EVC) aims to manipulate emotional expressions to increase the diversity of synthesized speech. Existing methods typically rely on predefined labels, reference audios, or prespecified factor values, often overlooking individual differences in emotion perception and expression. In this paper, we introduce PromptEVC that utilizes natural language prompts for p… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: Accepted to INTERSPEECH2025

  27. arXiv:2505.20149  [pdf, ps, other

    eess.IV cs.AI cs.CV

    Improvement Strategies for Few-Shot Learning in OCT Image Classification of Rare Retinal Diseases

    Authors: Cheng-Yu Tai, Ching-Wen Chen, Chi-Chin Wu, Bo-Chen Chiu, Cheng-Hung, Lin, Cheng-Kai Lu, Jia-Kang Wang, Tzu-Lun Huang

    Abstract: This paper focuses on using few-shot learning to improve the accuracy of classifying OCT diagnosis images with major and rare classes. We used the GAN-based augmentation strategy as a baseline and introduced several novel methods to further enhance our model. The proposed strategy contains U-GAT-IT for improving the generative part and uses the data balance technique to narrow down the skew of acc… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

  28. arXiv:2505.19806  [pdf, other

    cs.CL cs.CY cs.LG

    Exploring Consciousness in LLMs: A Systematic Survey of Theories, Implementations, and Frontier Risks

    Authors: Sirui Chen, Shuqin Ma, Shu Yu, Hanwang Zhang, Shengjie Zhao, Chaochao Lu

    Abstract: Consciousness stands as one of the most profound and distinguishing features of the human mind, fundamentally shaping our understanding of existence and agency. As large language models (LLMs) develop at an unprecedented pace, questions concerning intelligence and consciousness have become increasingly significant. However, discourse on LLM consciousness remains largely unexplored territory. In th… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

  29. arXiv:2505.19507  [pdf, ps, other

    cs.CV cs.LG

    Multimodal Machine Translation with Visual Scene Graph Pruning

    Authors: Chenyu Lu, Shiliang Sun, Jing Zhao, Nan Zhang, Tengfei Song, Hao Yang

    Abstract: Multimodal machine translation (MMT) seeks to address the challenges posed by linguistic polysemy and ambiguity in translation tasks by incorporating visual information. A key bottleneck in current MMT research is the effective utilization of visual data. Previous approaches have focused on extracting global or region-level image features and using attention or gating mechanisms for multimodal inf… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

  30. arXiv:2505.19505  [pdf, other

    cs.IR cs.AI

    Hierarchical Tree Search-based User Lifelong Behavior Modeling on Large Language Model

    Authors: Yu Xia, Rui Zhong, Hao Gu, Wei Yang, Chi Lu, Peng Jiang, Kun Gai

    Abstract: Large Language Models (LLMs) have garnered significant attention in Recommendation Systems (RS) due to their extensive world knowledge and robust reasoning capabilities. However, a critical challenge lies in enabling LLMs to effectively comprehend and extract insights from massive user behaviors. Current approaches that directly leverage LLMs for user interest learning face limitations in handling… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

  31. arXiv:2505.19219  [pdf, other

    cs.AI cs.LG cs.MA math.CO

    Where Paths Collide: A Comprehensive Survey of Classic and Learning-Based Multi-Agent Pathfinding

    Authors: Shiyue Wang, Haozheng Xu, Yuhan Zhang, Jingran Lin, Changhong Lu, Xiangfeng Wang, Wenhao Li

    Abstract: Multi-Agent Path Finding (MAPF) is a fundamental problem in artificial intelligence and robotics, requiring the computation of collision-free paths for multiple agents navigating from their start locations to designated goals. As autonomous systems become increasingly prevalent in warehouses, urban transportation, and other complex environments, MAPF has evolved from a theoretical challenge to a c… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

    Comments: 112 pages, 21 figures, 20 tables

  32. arXiv:2505.19058  [pdf, ps, other

    cs.LG math.OC q-fin.PM stat.ML

    Distributionally Robust Deep Q-Learning

    Authors: Chung I Lu, Julian Sester, Aijia Zhang

    Abstract: We propose a novel distributionally robust $Q$-learning algorithm for the non-tabular case accounting for continuous state spaces where the state transition of the underlying Markov decision process is subject to model uncertainty. The uncertainty is taken into account by considering the worst-case transition from a ball around a reference probability measure. To determine the optimal policy under… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

  33. arXiv:2505.17650  [pdf, ps, other

    cs.AI

    Does Chain-of-Thought Reasoning Really Reduce Harmfulness from Jailbreaking?

    Authors: Chengda Lu, Xiaoyu Fan, Yu Huang, Rongwu Xu, Jijie Li, Wei Xu

    Abstract: Jailbreak attacks have been observed to largely fail against recent reasoning models enhanced by Chain-of-Thought (CoT) reasoning. However, the underlying mechanism remains underexplored, and relying solely on reasoning capacity may raise security concerns. In this paper, we try to answer the question: Does CoT reasoning really reduce harmfulness from jailbreaking? Through rigorous theoretical ana… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

  34. arXiv:2505.17621  [pdf, ps, other

    cs.LG

    Navigate the Unknown: Enhancing LLM Reasoning with Intrinsic Motivation Guided Exploration

    Authors: Jingtong Gao, Ling Pan, Yejing Wang, Rui Zhong, Chi Lu, Qingpeng Cai, Peng Jiang, Xiangyu Zhao

    Abstract: Reinforcement learning (RL) has emerged as a pivotal method for improving the reasoning capabilities of Large Language Models (LLMs). However, prevalent RL approaches such as Proximal Policy Optimization (PPO) and Group-Regularized Policy Optimization (GRPO) face critical limitations due to their reliance on sparse outcome-based rewards and inadequate mechanisms for incentivizing exploration. Thes… ▽ More

    Submitted 27 May, 2025; v1 submitted 23 May, 2025; originally announced May 2025.

  35. arXiv:2505.16288  [pdf, ps, other

    cs.AI

    No Black Boxes: Interpretable and Interactable Predictive Healthcare with Knowledge-Enhanced Agentic Causal Discovery

    Authors: Xiaoxue Han, Pengfei Hu, Jun-En Ding, Chang Lu, Feng Liu, Yue Ning

    Abstract: Deep learning models trained on extensive Electronic Health Records (EHR) data have achieved high accuracy in diagnosis prediction, offering the potential to assist clinicians in decision-making and treatment planning. However, these models lack two crucial features that clinicians highly value: interpretability and interactivity. The ``black-box'' nature of these models makes it difficult for cli… ▽ More

    Submitted 22 May, 2025; originally announced May 2025.

  36. arXiv:2505.12797  [pdf, ps, other

    physics.optics

    Simulation and Experimental Studies of DWDM Nonlinear Phase/Polarization/Power Crosstalk Between DFOS and Communication Channels in 27.6-Tb/s 800ZR Metro Network

    Authors: Jingchuan Wang, Maoqi Liu, Liwang Lu, Alan Pak Tao Lau, Chao Lu

    Abstract: We comprehensively analyze the fiber nonlinearity crosstalks between DAS and communication channels through numerical results and 40 x 800-Gb/s 90-km experimental demonstration. Our findings indicate that conventional pulse-based DAS is unsuitable for in-band DWDM coexistence system, whereas pulse-compression DAS shows negligible penalties with legacy coherent transceivers.

    Submitted 19 May, 2025; originally announced May 2025.

  37. arXiv:2505.11820  [pdf, other

    cs.CL

    Chain-of-Model Learning for Language Model

    Authors: Kaitao Song, Xiaohua Wang, Xu Tan, Huiqiang Jiang, Chengruidong Zhang, Yongliang Shen, Cen LU, Zihao Li, Zifan Song, Caihua Shan, Yansen Wang, Kan Ren, Xiaoqing Zheng, Tao Qin, Yuqing Yang, Dongsheng Li, Lili Qiu

    Abstract: In this paper, we propose a novel learning paradigm, termed Chain-of-Model (CoM), which incorporates the causal relationship into the hidden states of each layer as a chain style, thereby introducing great scaling efficiency in model training and inference flexibility in deployment. We introduce the concept of Chain-of-Representation (CoR), which formulates the hidden states at each layer as a com… ▽ More

    Submitted 23 May, 2025; v1 submitted 17 May, 2025; originally announced May 2025.

  38. arXiv:2505.11172  [pdf, ps, other

    math.DS

    On the holomorphic foliations admitting a common invariant algebraic set

    Authors: Guangfeng Dong, Chujun Lu

    Abstract: In this paper, we study the holomorphic foliations admitting a common invariant algebraic set $C$ defined by a polynomial $f$ in $ \mathbb{K}[x_1,x_2,...,x_n]$ over any characteristic $0$ subfield $\mathbb{K}\subseteq\mathbb{C}$. For the $\mathbb{K}[x_1,x_2,...,x_n]$-module $V_f$ of vector fields inducing foliations admitting $C$ as an invariant set, we present several conditions for $V_f$ to be f… ▽ More

    Submitted 10 June, 2025; v1 submitted 16 May, 2025; originally announced May 2025.

    Comments: 10 pages. In this version, we include several results addressing more general cases (where $f$ need not be weakly tame) and higher-dimensional spaces (n>2)

    MSC Class: Primary: 37F75; Secondary: 32M25;

  39. arXiv:2505.08503  [pdf, ps, other

    cs.SE

    ICVul: A Well-labeled C/C++ Vulnerability Dataset with Comprehensive Metadata and VCCs

    Authors: Chaomeng Lu, Tianyu Li, Toon Dehaene, Bert Lagaisse

    Abstract: Machine learning-based software vulnerability detection requires high-quality datasets, which is essential for training effective models. To address challenges related to data label quality, diversity, and comprehensiveness, we constructed ICVul, a dataset emphasizing data quality and enriched with comprehensive metadata, including Vulnerability-Contributing Commits (VCCs). We began by filtering C… ▽ More

    Submitted 13 May, 2025; originally announced May 2025.

    Comments: 5 pages, to appear in the Proceedings of the 22nd IEEE/ACM International Conference on Mining Software Repositories (MSR'25)

  40. arXiv:2505.06585  [pdf

    cond-mat.mtrl-sci

    General First-Principles Approach to Crystals in Finite Magnetic Fields

    Authors: Chengye Lü, Yingwei Chen, Yuzhi Wang, Zhihao Dai, Zhong Fang, Xin-Gao Gong, Quansheng Wu, Hongjun Xiang

    Abstract: We introduce a general first-principles methodology for computing electronic structure in a finite uniform magnetic field which allows for an arbitrary rational magnetic flux and nonlocal pseudopotentials, at a comparable time complexity of conventional plane-wave pseudopotential approaches in zero-field conditions. The versatility of this method is demonstrated through comprehensive applications… ▽ More

    Submitted 10 May, 2025; originally announced May 2025.

    Comments: 17 pages, 3 figures

  41. arXiv:2505.04354  [pdf, other

    math.OC cs.AI

    Optimization Problem Solving Can Transition to Evolutionary Agentic Workflows

    Authors: Wenhao Li, Bo Jin, Mingyi Hong, Changhong Lu, Xiangfeng Wang

    Abstract: This position paper argues that optimization problem solving can transition from expert-dependent to evolutionary agentic workflows. Traditional optimization practices rely on human specialists for problem formulation, algorithm selection, and hyperparameter tuning, creating bottlenecks that impede industrial adoption of cutting-edge methods. We contend that an evolutionary agentic workflow, power… ▽ More

    Submitted 7 May, 2025; originally announced May 2025.

    Comments: 27 pages, 5 figures

  42. arXiv:2505.02827  [pdf, ps, other

    astro-ph.CO

    Constraints on Inflationary Gravitational Waves with Two Years of SPT-3G Data

    Authors: J. A. Zebrowski, C. L. Reichardt, A. J. Anderson, B. Ansarinejad, M. Archipley, L. Balkenhol, P. Barry, K. Benabed, A. N. Bender, B. A. Benson, F. Bianchini, L. E. Bleem, F. R. Bouchet, L. Bryant, E. Camphuis, J. E. Carlstrom, C. L. Chang, P. Chaubal, P. M. Chichura, A. Chokshi, T. -L. Chou, A. Coerver, T. M. Crawford, C. Daley, T. de Haan , et al. (73 additional authors not shown)

    Abstract: We present a measurement of the $B$-mode polarization power spectrum of the cosmic microwave background anisotropies at 32 $\le$ $\ell$ $<$ 502 for three bands centered at 95, 150, and 220 GHz using data from the SPT-3G receiver on the South Pole Telescope. This work uses SPT-3G observations from the 2019 and 2020 winter observing seasons of a $\sim$1500 deg$^2$ patch of sky that directly overlaps… ▽ More

    Submitted 5 May, 2025; originally announced May 2025.

    Comments: Submitted to PRD, 14 pages, 8 figures

  43. arXiv:2505.01978  [pdf, other

    quant-ph

    Generation of 95-qubit genuine entanglement and verification of symmetry-protected topological phases

    Authors: Tao Jiang, Jianbin Cai, Junxiang Huang, Naibin Zhou, Yukun Zhang, Jiahao Bei, Guoqing Cai, Sirui Cao, Fusheng Chen, Jiang Chen, Kefu Chen, Xiawei Chen, Xiqing Chen, Zhe Chen, Zhiyuan Chen, Zihua Chen, Wenhao Chu, Hui Deng, Zhibin Deng, Pei Ding, Xun Ding, Zhuzhengqi Ding, Shuai Dong, Bo Fan, Daojin Fan , et al. (130 additional authors not shown)

    Abstract: Symmetry-protected topological (SPT) phases are fundamental features of cluster states, serving as key resources for measurement-based quantum computation (MBQC). Generating large-scale cluster states and verifying their SPT phases are essential steps toward practical MBQC, which however still presents significant experimental challenges. In this work, we address these challenges by utilizing adva… ▽ More

    Submitted 3 May, 2025; originally announced May 2025.

    Comments: Main text: 15 pages, 4 figures; supplementary materials: 42 pages, 19 figures. Total: 57 pages, 23 figures

  44. arXiv:2505.01396  [pdf, ps, other

    cs.RO cs.AI cs.LG

    SIME: Enhancing Policy Self-Improvement with Modal-level Exploration

    Authors: Yang Jin, Jun Lv, Wenye Yu, Hongjie Fang, Yong-Lu Li, Cewu Lu

    Abstract: Self-improvement requires robotic systems to initially learn from human-provided data and then gradually enhance their capabilities through interaction with the environment. This is similar to how humans improve their skills through continuous practice. However, achieving effective self-improvement is challenging, primarily because robots tend to repeat their existing abilities during interactions… ▽ More

    Submitted 2 May, 2025; originally announced May 2025.

  45. arXiv:2505.00053  [pdf, other

    physics.ins-det nucl-ex physics.acc-ph

    Full realization of the RIBLL2 separator at the HIRFL-CSR facility

    Authors: Xiao-Dong Xu, Yong Zheng, Zhi-Yu Sun, Yu-Nan Song, Bao-Hua Sun, Satoru Terashima, Chang-Jian Wang, Ge Guo, Guang-Shuai Li, Xiu-Lin Wei, Jun-Yao Xu, Ji-Chao Zhang, Yong Cao, Bing-Shui Gao, Jia-Xing Han, Jin-Rong Liu, Chen-Gui Lu, Shu-Ya Jin, Hooi Jin Ong, Hao-Tian Qi, Yun Qin, Ya-Zhou Sun, Isao Tanihata, Lu-Ping Wan, Kai-Long Wang , et al. (11 additional authors not shown)

    Abstract: A new experimental platform was constructed at the Second Radioactive Ion Beam Line in Lanzhou (RIBLL2) of HIRFL-CSR accelerator facility at Lanzhou, China. Its performance, along with several newly developed detectors, was tested in two radioactive ion beam experiments utilizing a 400 MeV/u 40Ar beam and a 350 MeV/u 78Kr beam, respectively. The first results from these two experiments demonstrate… ▽ More

    Submitted 30 April, 2025; originally announced May 2025.

    Comments: 5 pages, 2 figures

    Journal ref: Science Bulletin, 70, 1026-1029, (2025)

  46. arXiv:2504.20624  [pdf, other

    cs.AI

    PaRT: Enhancing Proactive Social Chatbots with Personalized Real-Time Retrieval

    Authors: Zihan Niu, Zheyong Xie, Shaosheng Cao, Chonggang Lu, Zheyu Ye, Tong Xu, Zuozhu Liu, Yan Gao, Jia Chen, Zhe Xu, Yi Wu, Yao Hu

    Abstract: Social chatbots have become essential intelligent companions in daily scenarios ranging from emotional support to personal interaction. However, conventional chatbots with passive response mechanisms usually rely on users to initiate or sustain dialogues by bringing up new topics, resulting in diminished engagement and shortened dialogue duration. In this paper, we present PaRT, a novel framework… ▽ More

    Submitted 29 April, 2025; originally announced April 2025.

  47. arXiv:2504.20038  [pdf, other

    astro-ph.CO

    Unified and consistent structure growth measurements from joint ACT, SPT and \textit{Planck} CMB lensing

    Authors: Frank J. Qu, Fei Ge, W. L. Kimmy Wu, Irene Abril-Cabezas, Mathew S. Madhavacheril, Marius Millea, Ethan Anderes, Adam J. Anderson, Behzad Ansarinejad, Melanie Archipley, Zachary Atkins, Lennart Balkenhol, Nicholas Battaglia, Karim Benabed, Amy N. Bender, Bradford A. Benson, Federico Bianchini, Lindsey. E. Bleem, Boris Bolliet, J Richard Bond, François. R. Bouchet, Lincoln Bryant, Erminia Calabrese, Etienne Camphuis, John E. Carlstrom , et al. (120 additional authors not shown)

    Abstract: We present the tightest cosmic microwave background (CMB) lensing constraints to date on the growth of structure by combining CMB lensing measurements from the Atacama Cosmology Telescope (ACT), the South Pole Telescope (SPT) and \textit{Planck}. Each of these surveys individually provides lensing measurements with similarly high statistical power, achieving signal-to-noise ratios of approximately… ▽ More

    Submitted 28 April, 2025; originally announced April 2025.

    Comments: 9+9 pages, 6 figures Likelihood software and data available at this https URL: https://github.com/qujia7/spt_act_likelihood/

  48. arXiv:2504.19898  [pdf, other

    cs.CL

    GenCLS++: Pushing the Boundaries of Generative Classification in LLMs Through Comprehensive SFT and RL Studies Across Diverse Datasets

    Authors: Mingqian He, Fei Zhao, Chonggang Lu, Ziyan Liu, Yue Wang, Haofu Qian

    Abstract: As a fundamental task in machine learning, text classification plays a crucial role in many areas. With the rapid scaling of Large Language Models (LLMs), particularly through reinforcement learning (RL), there is a growing need for more capable discriminators. Consequently, advances in classification are becoming increasingly vital for enhancing the overall capabilities of LLMs. Traditional discr… ▽ More

    Submitted 28 April, 2025; originally announced April 2025.

  49. arXiv:2504.19067  [pdf, other

    hep-ph

    Searching for elusive dark Higgs boson in spin-1/2 inelastic dark matter models at Belle II

    Authors: P. Ko, Youngjoon Kwon, Chih-Ting Lu, Xinqi Wei

    Abstract: Spin-1/2 inelastic dark matter (DM) models are popular among sub-GeV to GeV thermal DM scenarios due to the dominant role of co-annihilation in determining the DM relic abundance. In these models, the dark Higgs boson plays a crucial role in generating the mass of the new gauge boson, the dark photon ($A^{'}$), and in establishing the mass splitting between the excited ($χ_2$) and ground ($χ_1$) s… ▽ More

    Submitted 26 April, 2025; originally announced April 2025.

  50. arXiv:2504.18015  [pdf, ps, other

    cs.CR cs.CV cs.LG

    DiffUMI: Training-Free Universal Model Inversion via Unconditional Diffusion for Face Recognition

    Authors: Hanrui Wang, Shuo Wang, Chun-Shien Lu, Isao Echizen

    Abstract: Face recognition technology presents serious privacy risks due to its reliance on sensitive and immutable biometric data. To address these concerns, such systems typically convert raw facial images into embeddings, which are traditionally viewed as privacy-preserving. However, model inversion attacks challenge this assumption by reconstructing private facial images from embeddings, highlighting a… ▽ More

    Submitted 11 June, 2025; v1 submitted 24 April, 2025; originally announced April 2025.