Skip to main content

Showing 151–200 of 5,652 results for author: wang, D

.
  1. arXiv:2505.22140  [pdf, other

    hep-ex

    Search for a dark baryon in the $Ξ^-\rightarrowπ^-+{\rm invisible}$ decay

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (697 additional authors not shown)

    Abstract: A search for a dark baryon is performed for the first time in the two-body decay $Ξ^-\rightarrowπ^-+{\rm invisible}$ using $(10.087\pm0.044)\times10^{9}$ $J/ψ$ events collected at a center-of-mass energy of $\sqrt{s}=3.097\,\mbox{GeV}$ with the BESIII detector at the BEPCII collider. No significant signal is observed, and the 90% (95%) confidence level upper limits on the branching fraction… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

    Comments: 11 pages, 4 figures, 1 table

  2. arXiv:2505.21805  [pdf, ps, other

    cs.SD eess.AS

    An Investigation on Speaker Augmentation for End-to-End Speaker Extraction

    Authors: Zhenghai You, Zhenyu Zhou, Lantian Li, Dong Wang

    Abstract: Target confusion, defined as occasional switching to non-target speakers, poses a key challenge for end-to-end speaker extraction (E2E-SE) systems. We argue that this problem is largely caused by the lack of generalizability and discrimination of the speaker embeddings, and introduce a simple yet effective speaker augmentation strategy to tackle the problem. Specifically, we propose a time-domain… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

  3. arXiv:2505.21432  [pdf, ps, other

    cs.RO cs.AI

    Hume: Introducing System-2 Thinking in Visual-Language-Action Model

    Authors: Haoming Song, Delin Qu, Yuanqi Yao, Qizhi Chen, Qi Lv, Yiwen Tang, Modi Shi, Guanghui Ren, Maoqing Yao, Bin Zhao, Dong Wang, Xuelong Li

    Abstract: Humans practice slow thinking before performing actual actions when handling complex tasks in the physical world. This thinking paradigm, recently, has achieved remarkable advancement in boosting Large Language Models (LLMs) to solve complex tasks in digital domains. However, the potential of slow thinking remains largely unexplored for robotic foundation models interacting with the physical world… ▽ More

    Submitted 8 July, 2025; v1 submitted 27 May, 2025; originally announced May 2025.

  4. arXiv:2505.21375  [pdf, ps, other

    cs.CV

    GeoLLaVA-8K: Scaling Remote-Sensing Multimodal Large Language Models to 8K Resolution

    Authors: Fengxiang Wang, Mingshuo Chen, Yueying Li, Di Wang, Haotian Wang, Zonghao Guo, Zefan Wang, Boqi Shan, Long Lan, Yulin Wang, Hongzhen Wang, Wenjing Yang, Bo Du, Jing Zhang

    Abstract: Ultra-high-resolution (UHR) remote sensing (RS) imagery offers valuable data for Earth observation but pose challenges for existing multimodal foundation models due to two key bottlenecks: (1) limited availability of UHR training data, and (2) token explosion caused by the large image size. To address data scarcity, we introduce SuperRS-VQA (avg. 8,376$\times$8,376) and HighRS-VQA (avg. 2,000… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

  5. arXiv:2505.21226  [pdf, ps, other

    cs.LG

    Why Do More Experts Fail? A Theoretical Analysis of Model Merging

    Authors: Zijing Wang, Xingle Xu, Yongkang Liu, Yiqun Zhang, Peiqin Lin, Shi Feng, Xiaocui Yang, Daling Wang, Hinrich Schütze

    Abstract: Model merging dramatically reduces storage and computational resources by combining multiple expert models into a single multi-task model. Although recent model merging methods have shown promising results, they struggle to maintain performance gains as the number of merged models increases. In this paper, we investigate the key obstacles that limit the scalability of model merging when integratin… ▽ More

    Submitted 3 June, 2025; v1 submitted 27 May, 2025; originally announced May 2025.

  6. arXiv:2505.21049  [pdf, ps, other

    cs.CV

    Robust Video-Based Pothole Detection and Area Estimation for Intelligent Vehicles with Depth Map and Kalman Smoothing

    Authors: Dehao Wang, Haohang Zhu, Yiwen Xu, Kaiqi Liu

    Abstract: Road potholes pose a serious threat to driving safety and comfort, making their detection and assessment a critical task in fields such as autonomous driving. When driving vehicles, the operators usually avoid large potholes and approach smaller ones at reduced speeds to ensure safety. Therefore, accurately estimating pothole area is of vital importance. Most existing vision-based methods rely on… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

  7. arXiv:2505.20897  [pdf, ps, other

    cs.CV cs.AI cs.CL cs.RO

    Cross from Left to Right Brain: Adaptive Text Dreamer for Vision-and-Language Navigation

    Authors: Pingrui Zhang, Yifei Su, Pengyuan Wu, Dong An, Li Zhang, Zhigang Wang, Dong Wang, Yan Ding, Bin Zhao, Xuelong Li

    Abstract: Vision-and-Language Navigation (VLN) requires the agent to navigate by following natural instructions under partial observability, making it difficult to align perception with language. Recent methods mitigate this by imagining future scenes, yet they rely on vision-based synthesis, leading to high computational cost and redundant details. To this end, we propose to adaptively imagine key environm… ▽ More

    Submitted 22 June, 2025; v1 submitted 27 May, 2025; originally announced May 2025.

  8. arXiv:2505.20589  [pdf, ps, other

    cs.LG q-bio.QM

    Prot2Token: A Unified Framework for Protein Modeling via Next-Token Prediction

    Authors: Mahdi Pourmirzaei, Farzaneh Esmaili, Salhuldin Alqarghuli, Mohammadreza Pourmirzaei, Ye Han, Kai Chen, Mohsen Rezaei, Duolin Wang, Dong Xu

    Abstract: The diverse nature of protein prediction tasks has traditionally necessitated specialized models, hindering the development of broadly applicable and computationally efficient Protein Language Models (PLMs). In this work, we introduce Prot2Token, a unified framework that overcomes these challenges by converting a wide spectrum of protein-related predictions, from sequence-level properties and resi… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

  9. arXiv:2505.20293  [pdf, ps, other

    cs.CL

    Enhancing the Comprehensibility of Text Explanations via Unsupervised Concept Discovery

    Authors: Yifan Sun, Danding Wang, Qiang Sheng, Juan Cao, Jintao Li

    Abstract: Concept-based explainable approaches have emerged as a promising method in explainable AI because they can interpret models in a way that aligns with human reasoning. However, their adaption in the text domain remains limited. Most existing methods rely on predefined concept annotations and cannot discover unseen concepts, while other methods that extract concepts without supervision often produce… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: ACL 2025 Findings

  10. arXiv:2505.20279  [pdf, ps, other

    cs.CV cs.CL

    VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction

    Authors: Zhiwen Fan, Jian Zhang, Renjie Li, Junge Zhang, Runjin Chen, Hezhen Hu, Kevin Wang, Huaizhi Qu, Dilin Wang, Zhicheng Yan, Hongyu Xu, Justin Theiss, Tianlong Chen, Jiachen Li, Zhengzhong Tu, Zhangyang Wang, Rakesh Ranjan

    Abstract: The rapid advancement of Large Multimodal Models (LMMs) for 2D images and videos has motivated extending these models to understand 3D scenes, aiming for human-like visual-spatial intelligence. Nevertheless, achieving deep spatial understanding comparable to human capabilities poses significant challenges in model encoding and data acquisition. Existing methods frequently depend on external depth… ▽ More

    Submitted 1 June, 2025; v1 submitted 26 May, 2025; originally announced May 2025.

    Comments: Project Page: https://vlm-3r.github.io/

  11. arXiv:2505.20124  [pdf, ps, other

    cs.CV cs.MM

    TUNA: Comprehensive Fine-grained Temporal Understanding Evaluation on Dense Dynamic Videos

    Authors: Fanheng Kong, Jingyuan Zhang, Hongzhi Zhang, Shi Feng, Daling Wang, Linhao Yu, Xingguang Ji, Yu Tian, Victoria W., Fuzheng Zhang

    Abstract: Videos are unique in their integration of temporal elements, including camera, scene, action, and attribute, along with their dynamic relationships over time. However, existing benchmarks for video understanding often treat these properties separately or narrowly focus on specific aspects, overlooking the holistic nature of video content. To address this, we introduce TUNA, a temporal-oriented ben… ▽ More

    Submitted 27 May, 2025; v1 submitted 26 May, 2025; originally announced May 2025.

    Comments: Accepted to ACL 2025 Main. Project page: https://friedrichor.github.io/projects/TUNA

  12. arXiv:2505.20075  [pdf, other

    cs.AI

    Curriculum-RLAIF: Curriculum Alignment with Reinforcement Learning from AI Feedback

    Authors: Mengdi Li, Jiaye Lin, Xufeng Zhao, Wenhao Lu, Peilin Zhao, Stefan Wermter, Di Wang

    Abstract: Reward models trained with conventional Reinforcement Learning from AI Feedback (RLAIF) methods suffer from limited generalizability, which hinders the alignment performance of the policy model during reinforcement learning (RL). This challenge stems from various issues, including distribution shift, preference label noise, and mismatches between overly challenging samples and model capacity. In t… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

  13. arXiv:2505.20016  [pdf, ps, other

    cs.CL

    TTPA: Token-level Tool-use Preference Alignment Training Framework with Fine-grained Evaluation

    Authors: Chengrui Huang, Shen Gao, Zhengliang Shi, Dongsheng Wang, Shuo Shang

    Abstract: Existing tool-learning methods usually rely on supervised fine-tuning, they often overlook fine-grained optimization of internal tool call details, leading to limitations in preference alignment and error discrimination. To overcome these challenges, we propose Token-level Tool-use Preference Alignment Training Framework (TTPA), a training paradigm for constructing token-level tool-use preference… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: 16 pages, 5 figures

  14. arXiv:2505.19943  [pdf, other

    cs.LG

    Beyond Freezing: Sparse Tuning Enhances Plasticity in Continual Learning with Pre-Trained Models

    Authors: Huan Zhang, Fan Lyu, Shuyu Dong, Shenghua Fan, Yujin Zheng, Dingwen Wang

    Abstract: Continual Learning with Pre-trained Models holds great promise for efficient adaptation across sequential tasks. However, most existing approaches freeze PTMs and rely on auxiliary modules like prompts or adapters, limiting model plasticity and leading to suboptimal generalization when facing significant distribution shifts. While full fine-tuning can improve adaptability, it risks disrupting cruc… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

  15. arXiv:2505.19907  [pdf, ps, other

    hep-ex nucl-ex

    First measurement of $Σ^{+}n\rightarrowΛp$ and $Σ^{+}n\rightarrowΣ^{0}p$ cross-sections via $Σ^+$-nucleus scattering at an electron-positron collider

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (680 additional authors not shown)

    Abstract: Using $(1.0087\pm0.0044)\times10^{10}$ $J/ψ$ events collected with the BESIII detector at the BEPCII storage ring, the reactions $Σ^{+}n\rightarrowΛp$ and $Σ^{+}n\rightarrowΣ^{0}p$ are studied, where the $Σ^{+}$ baryon is produced in the process $J/ψ\rightarrowΣ^{+}\barΣ^-$ and the neutron is a component of the $^9\rm{Be}$, $^{12}\rm{C}$ and $^{197}\rm{Au}$ nuclei in the beam pipe. Clear signals o… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: 9 pages, 2 figures

  16. arXiv:2505.19819  [pdf, other

    cs.CE cs.AI

    FinLoRA: Benchmarking LoRA Methods for Fine-Tuning LLMs on Financial Datasets

    Authors: Dannong Wang, Jaisal Patel, Daochen Zha, Steve Y. Yang, Xiao-Yang Liu

    Abstract: Low-rank adaptation (LoRA) methods show great potential for scaling pre-trained general-purpose Large Language Models (LLMs) to hundreds or thousands of use scenarios. However, their efficacy in high-stakes domains like finance is rarely explored, e.g., passing CFA exams and analyzing SEC filings. In this paper, we present the open-source FinLoRA project that benchmarks LoRA methods on both genera… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

  17. arXiv:2505.19797  [pdf, ps, other

    cs.CL

    The Avengers: A Simple Recipe for Uniting Smaller Language Models to Challenge Proprietary Giants

    Authors: Yiqun Zhang, Hao Li, Chenxu Wang, Linyao Chen, Qiaosheng Zhang, Peng Ye, Shi Feng, Daling Wang, Zhen Wang, Xinrun Wang, Jia Xu, Lei Bai, Wanli Ouyang, Shuyue Hu

    Abstract: Proprietary giants are increasingly dominating the race for ever-larger language models. Can open-source, smaller models remain competitive across a broad range of tasks? In this paper, we present the Avengers -- a simple recipe that leverages the collective intelligence of these smaller models. The Avengers builds upon four lightweight operations: (i) embedding: encode queries using a text embedd… ▽ More

    Submitted 18 June, 2025; v1 submitted 26 May, 2025; originally announced May 2025.

    Comments: 9 pages, 4 figures, 6 tables, supplementary material (appendix) included separately

  18. arXiv:2505.19699  [pdf, ps, other

    cs.LG cs.AI cs.DC

    Mosaic: Data-Free Knowledge Distillation via Mixture-of-Experts for Heterogeneous Distributed Environments

    Authors: Junming Liu, Yanting Gao, Siyuan Meng, Yifei Sun, Aoqi Wu, Yufei Jin, Yirong Chen, Ding Wang, Guosun Zeng

    Abstract: Federated Learning (FL) is a decentralized machine learning paradigm that enables clients to collaboratively train models while preserving data privacy. However, the coexistence of model and data heterogeneity gives rise to inconsistent representations and divergent optimization dynamics across clients, ultimately hindering robust global performance. To transcend these challenges, we propose Mosai… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: 43 pages, 23 figures, 15 tables; the last dance

  19. arXiv:2505.19650  [pdf, ps, other

    cs.CV cs.IR cs.MM

    Modality Curation: Building Universal Embeddings for Advanced Multimodal Information Retrieval

    Authors: Fanheng Kong, Jingyuan Zhang, Yahui Liu, Hongzhi Zhang, Shi Feng, Xiaocui Yang, Daling Wang, Yu Tian, Victoria W., Fuzheng Zhang, Guorui Zhou

    Abstract: Multimodal information retrieval (MIR) faces inherent challenges due to the heterogeneity of data sources and the complexity of cross-modal alignment. While previous studies have identified modal gaps in feature spaces, a systematic approach to address these challenges remains unexplored. In this work, we introduce UNITE, a universal framework that tackles these challenges through two critical yet… ▽ More

    Submitted 27 May, 2025; v1 submitted 26 May, 2025; originally announced May 2025.

    Comments: 26 pages, project page: https://friedrichor.github.io/projects/UNITE

  20. arXiv:2505.19066  [pdf, ps, other

    hep-th

    Bootstrapping the Cosmological Collider with Resonant Features

    Authors: Dong-Gang Wang, Bowei Zhang

    Abstract: Signatures of heavy particles during inflation are exponentially suppressed by the Boltzmann factor when the masses are far above the Hubble scale. In more realistic scenarios, however, scale-dependent features may change this conventional picture and boost the cosmological collider signals. In this paper, we compute cosmological correlators of the primordial curvature perturbations exchanging an… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

    Comments: 43 pages, 10 figures

  21. arXiv:2505.18992  [pdf, ps, other

    cs.CV

    VPGS-SLAM: Voxel-based Progressive 3D Gaussian SLAM in Large-Scale Scenes

    Authors: Tianchen Deng, Wenhua Wu, Junjie He, Yue Pan, Xirui Jiang, Shenghai Yuan, Danwei Wang, Hesheng Wang, Weidong Chen

    Abstract: 3D Gaussian Splatting has recently shown promising results in dense visual SLAM. However, existing 3DGS-based SLAM methods are all constrained to small-room scenarios and struggle with memory explosion in large-scale scenes and long sequences. To this end, we propose VPGS-SLAM, the first 3DGS-based large-scale RGBD SLAM framework for both indoor and outdoor scenarios. We design a novel voxel-based… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

  22. arXiv:2505.18533  [pdf, ps, other

    eess.AS cs.AI

    TS-URGENet: A Three-stage Universal Robust and Generalizable Speech Enhancement Network

    Authors: Xiaobin Rong, Dahan Wang, Qinwen Hu, Yushi Wang, Yuxiang Hu, Jing Lu

    Abstract: Universal speech enhancement aims to handle input speech with different distortions and input formats. To tackle this challenge, we present TS-URGENet, a Three-Stage Universal, Robust, and Generalizable speech Enhancement Network. To address various distortions, the proposed system employs a novel three-stage architecture consisting of a filling stage, a separation stage, and a restoration stage.… ▽ More

    Submitted 24 May, 2025; originally announced May 2025.

    Comments: Accepted by Interspeech 2025

  23. arXiv:2505.18454  [pdf, other

    cs.CL

    Hybrid Latent Reasoning via Reinforcement Learning

    Authors: Zhenrui Yue, Bowen Jin, Huimin Zeng, Honglei Zhuang, Zhen Qin, Jinsung Yoon, Lanyu Shang, Jiawei Han, Dong Wang

    Abstract: Recent advances in large language models (LLMs) have introduced latent reasoning as a promising alternative to autoregressive reasoning. By performing internal computation with hidden states from previous steps, latent reasoning benefit from more informative features rather than sampling a discrete chain-of-thought (CoT) path. Yet latent reasoning approaches are often incompatible with LLMs, as th… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

  24. arXiv:2505.18154  [pdf, ps, other

    cs.CL cs.CY

    The Staircase of Ethics: Probing LLM Value Priorities through Multi-Step Induction to Complex Moral Dilemmas

    Authors: Ya Wu, Qiang Sheng, Danding Wang, Guang Yang, Yifan Sun, Zhengjia Wang, Yuyan Bu, Juan Cao

    Abstract: Ethical decision-making is a critical aspect of human judgment, and the growing use of LLMs in decision-support systems necessitates a rigorous evaluation of their moral reasoning capabilities. However, existing assessments primarily rely on single-step evaluations, failing to capture how models adapt to evolving ethical challenges. Addressing this gap, we introduce the Multi-step Moral Dilemmas (… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

    Comments: 25 pages, 8 figures

  25. arXiv:2505.18004  [pdf, ps, other

    hep-ex

    Measurement of branching fractions of $Λ_{c}^{+}$ decays to $Σ^{+} η$ and $Σ^{+} η'$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (644 additional authors not shown)

    Abstract: By analyzing $e^+e^-$ collision data taken at center-of-mass energies $\sqrt{s} = 4.600 \sim 4.699$ $\mbox{GeV}$ with the BESIII detector at the BEPCII collider, corresponding to an integrated luminosity of $\rm 4.5~fb^{-1}$, we study the hadronic decays $Λ_{c}^{+} \rightarrow Σ^{+} η$ and $Λ_{c}^{+} \rightarrow Σ^{+} η^{\prime}$ using the single-tag method. The branching fraction ratio of… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

  26. arXiv:2505.17827  [pdf, ps, other

    cs.CL

    Not All Tokens Are What You Need In Thinking

    Authors: Hang Yuan, Bin Yu, Haotian Li, Shijun Yang, Christina Dan Wang, Zhou Yu, Xueyin Xu, Weizhen Qi, Kai Chen

    Abstract: Modern reasoning models, such as OpenAI's o1 and DeepSeek-R1, exhibit impressive problem-solving capabilities but suffer from critical inefficiencies: high inference latency, excessive computational resource consumption, and a tendency toward overthinking -- generating verbose chains of thought (CoT) laden with redundant tokens that contribute minimally to the final answer. To address these issues… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

    Comments: 11 pages, 7 figures and 3 tables

  27. arXiv:2505.17712  [pdf, ps, other

    cs.CL

    Understanding How Value Neurons Shape the Generation of Specified Values in LLMs

    Authors: Yi Su, Jiayi Zhang, Shu Yang, Xinhai Wang, Lijie Hu, Di Wang

    Abstract: Rapid integration of large language models (LLMs) into societal applications has intensified concerns about their alignment with universal ethical principles, as their internal value representations remain opaque despite behavioral alignment advancements. Current approaches struggle to systematically interpret how values are encoded in neural architectures, limited by datasets that prioritize supe… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

  28. arXiv:2505.16969  [pdf, ps, other

    cs.RO

    3D Equivariant Visuomotor Policy Learning via Spherical Projection

    Authors: Boce Hu, Dian Wang, David Klee, Heng Tian, Xupeng Zhu, Haojie Huang, Robert Platt, Robin Walters

    Abstract: Equivariant models have recently been shown to improve the data efficiency of diffusion policy by a significant margin. However, prior work that explored this direction focused primarily on point cloud inputs generated by multiple cameras fixed in the workspace. This type of point cloud input is not compatible with the now-common setting where the primary input modality is an eye-in-hand RGB camer… ▽ More

    Submitted 2 June, 2025; v1 submitted 22 May, 2025; originally announced May 2025.

  29. arXiv:2505.16856  [pdf, ps, other

    cs.LG cs.AI cs.RO

    Efficient Online RL Fine Tuning with Offline Pre-trained Policy Only

    Authors: Wei Xiao, Jiacheng Liu, Zifeng Zhuang, Runze Suo, Shangke Lyu, Donglin Wang

    Abstract: Improving the performance of pre-trained policies through online reinforcement learning (RL) is a critical yet challenging topic. Existing online RL fine-tuning methods require continued training with offline pretrained Q-functions for stability and performance. However, these offline pretrained Q-functions commonly underestimate state-action pairs beyond the offline dataset due to the conservatis… ▽ More

    Submitted 22 May, 2025; originally announced May 2025.

  30. arXiv:2505.16660  [pdf

    cs.CL cs.AI

    Can reasoning models comprehend mathematical problems in Chinese ancient texts? An empirical study based on data from Suanjing Shishu

    Authors: Chang Liu, Dongbo Wang, Liu liu, Zhixiao Zhao

    Abstract: This study addresses the challenges in intelligent processing of Chinese ancient mathematical classics by constructing Guji_MATH, a benchmark for evaluating classical texts based on Suanjing Shishu. It systematically assesses the mathematical problem-solving capabilities of mainstream reasoning models under the unique linguistic constraints of classical Chinese. Through machine-assisted annotation… ▽ More

    Submitted 13 June, 2025; v1 submitted 22 May, 2025; originally announced May 2025.

    Comments: 29pages, 7 figures

  31. arXiv:2505.16557  [pdf, ps, other

    cs.MA

    Is Your LLM-Based Multi-Agent a Reliable Real-World Planner? Exploring Fraud Detection in Travel Planning

    Authors: Junchi Yao, Jianhua Xu, Tianyu Xin, Ziyi Wang, Shenzhe Zhu, Shu Yang, Di Wang

    Abstract: The rise of Large Language Model-based Multi-Agent Planning has leveraged advanced frameworks to enable autonomous and collaborative task execution. Some systems rely on platforms like review sites and social media, which are prone to fraudulent information, such as fake reviews or misleading descriptions. This reliance poses risks, potentially causing financial losses and harming user experiences… ▽ More

    Submitted 13 June, 2025; v1 submitted 22 May, 2025; originally announced May 2025.

    Comments: Accepted by ICML 2025 Workshop MAS

  32. arXiv:2505.16379  [pdf, other

    cond-mat.mtrl-sci cs.AI

    Materials Generation in the Era of Artificial Intelligence: A Comprehensive Survey

    Authors: Zhixun Li, Bin Cao, Rui Jiao, Liang Wang, Ding Wang, Yang Liu, Dingshuo Chen, Jia Li, Qiang Liu, Yu Rong, Liang Wang, Tong-yi Zhang, Jeffrey Xu Yu

    Abstract: Materials are the foundation of modern society, underpinning advancements in energy, electronics, healthcare, transportation, and infrastructure. The ability to discover and design new materials with tailored properties is critical to solving some of the most pressing global challenges. In recent years, the growing availability of high-quality materials data combined with rapid advances in Artific… ▽ More

    Submitted 22 May, 2025; originally announced May 2025.

    Comments: Work in progress

  33. arXiv:2505.16321  [pdf, ps, other

    cs.CV

    Efficient Motion Prompt Learning for Robust Visual Tracking

    Authors: Jie Zhao, Xin Chen, Yongsheng Yuan, Michael Felsberg, Dong Wang, Huchuan Lu

    Abstract: Due to the challenges of processing temporal information, most trackers depend solely on visual discriminability and overlook the unique temporal coherence of video data. In this paper, we propose a lightweight and plug-and-play motion prompt tracking method. It can be easily integrated into existing vision-based trackers to build a joint tracking framework leveraging both motion and vision cues,… ▽ More

    Submitted 22 May, 2025; originally announced May 2025.

    Comments: Accepted by ICML2025

  34. arXiv:2505.15791  [pdf, ps, other

    cs.CV cs.LG

    VARD: Efficient and Dense Fine-Tuning for Diffusion Models with Value-based RL

    Authors: Fengyuan Dai, Zifeng Zhuang, Yufei Huang, Siteng Huang, Bangyan Liao, Donglin Wang, Fajie Yuan

    Abstract: Diffusion models have emerged as powerful generative tools across various domains, yet tailoring pre-trained models to exhibit specific desirable properties remains challenging. While reinforcement learning (RL) offers a promising solution,current methods struggle to simultaneously achieve stable, efficient fine-tuning and support non-differentiable rewards. Furthermore, their reliance on sparse r… ▽ More

    Submitted 2 June, 2025; v1 submitted 21 May, 2025; originally announced May 2025.

    Comments: Under review

  35. arXiv:2505.15647  [pdf, ps, other

    cs.LG cs.AI

    Second-Order Convergence in Private Stochastic Non-Convex Optimization

    Authors: Youming Tao, Zuyuan Zhang, Dongxiao Yu, Xiuzhen Cheng, Falko Dressler, Di Wang

    Abstract: We investigate the problem of finding second-order stationary points (SOSP) in differentially private (DP) stochastic non-convex optimization. Existing methods suffer from two key limitations: (i) inaccurate convergence error rate due to overlooking gradient variance in the saddle point escape analysis, and (ii) dependence on auxiliary private model selection procedures for identifying DP-SOSP, wh… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

  36. arXiv:2505.15620  [pdf, ps, other

    hep-ex

    Observation of $χ_{cJ}\to 3K_S^0K^\pmπ^\mp$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (678 additional authors not shown)

    Abstract: By analyzing $(2712.4\pm14.3)\times10^6$ $ψ(3686)$ events collected with the BESIII detector operating at the BEPCII collider, the decays $χ_{c0,1,2} \to 3K_S^0K^\pmπ^\mp$ are observed for the first time with statistical significances greater than $10σ$. The branching fractions of these decays are determined to be $\mathcal{B}(χ_{c0}\to 3K_S^0K^\pmπ^\mp )=(7.95\pm0.50\pm0.65)\times10^{-5},$… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

    Comments: 11 pages, 6 figures

  37. arXiv:2505.15431  [pdf, ps, other

    cs.CL

    Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought

    Authors: Tencent Hunyuan Team, Ao Liu, Botong Zhou, Can Xu, Chayse Zhou, ChenChen Zhang, Chengcheng Xu, Chenhao Wang, Decheng Wu, Dengpeng Wu, Dian Jiao, Dong Du, Dong Wang, Feng Zhang, Fengzong Lian, Guanghui Xu, Guanwei Zhang, Hai Wang, Haipeng Luo, Han Hu, Huilin Xu, Jiajia Wu, Jianchen Zhu, Jianfeng Yan, Jiaqi Zhu , et al. (230 additional authors not shown)

    Abstract: As Large Language Models (LLMs) rapidly advance, we introduce Hunyuan-TurboS, a novel large hybrid Transformer-Mamba Mixture of Experts (MoE) model. It synergistically combines Mamba's long-sequence processing efficiency with Transformer's superior contextual understanding. Hunyuan-TurboS features an adaptive long-short chain-of-thought (CoT) mechanism, dynamically switching between rapid response… ▽ More

    Submitted 4 July, 2025; v1 submitted 21 May, 2025; originally announced May 2025.

  38. arXiv:2505.15317  [pdf

    cond-mat.mes-hall quant-ph

    Procedure of tuning up a three-site artificial Kitaev chain based on transmon measurements

    Authors: Xiaozhou Yang, Zhaozheng Lyu, Xiang Wang, Enna Zhuo, Yunxiao Zhang, Duolin Wang, Yukun Shi, Yuyang Huang, Bing Li, Xiaohui Song, Peiling Li, Bingbing Tong, Ziwei Dou, Jie Shen, Guangtong Liu, Fanming Qu, Li Lu

    Abstract: Artificial Kitaev chains (AKCs), formed of quantum dot-superconductor linear arrays, provide a promising platform for hosting Majorana bound states (MBSs) and implementing topological quantum computing. The main challenges along this research direction would include the tuning up of AKCs for hosting MBSs and the readout of the parity of the chains. In this work, we present a step-by-step procedure… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

  39. arXiv:2505.15110  [pdf, other

    cs.CL

    RoT: Enhancing Table Reasoning with Iterative Row-Wise Traversals

    Authors: Xuanliang Zhang, Dingzirui Wang, Keyan Xu, Qingfu Zhu, Wanxiang Che

    Abstract: The table reasoning task, crucial for efficient data acquisition, aims to answer questions based on the given table. Recently, reasoning large language models (RLLMs) with Long Chain-of-Thought (Long CoT) significantly enhance reasoning capabilities, leading to brilliant performance on table reasoning. However, Long CoT suffers from high cost for training and exhibits low reliability due to table… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

  40. Test of local realism via entangled $Λ\barΛ$ system

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (597 additional authors not shown)

    Abstract: The non-locality of quantum correlations is a fundamental feature of quantum theory. The Bell inequality serves as a benchmark for distinguishing between predictions made by quantum theory and local hidden variable theory (LHVT). Recent advancements in photon-entanglement experiments have addressed potential loopholes and have observed significant violations of variants of Bell inequality. However… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

    Journal ref: Nat Commun 16, 4948 (2025)

  41. arXiv:2505.14886  [pdf, ps, other

    cs.CL

    Strategic Planning and Rationalizing on Trees Make LLMs Better Debaters

    Authors: Danqing Wang, Zhuorui Ye, Xinran Zhao, Fei Fang, Lei Li

    Abstract: Winning competitive debates requires sophisticated reasoning and argument skills. There are unique challenges in the competitive debate: (1) The time constraints force debaters to make strategic choices about which points to pursue rather than covering all possible arguments; (2) The persuasiveness of the debate relies on the back-and-forth interaction between arguments, which a single final game… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

    Comments: 9 main pages

  42. arXiv:2505.14757  [pdf

    cs.CY cs.AI

    Bridge2AI: Building A Cross-disciplinary Curriculum Towards AI-Enhanced Biomedical and Clinical Care

    Authors: John Rincon, Alexander R. Pelletier, Destiny Gilliland, Wei Wang, Ding Wang, Baradwaj S. Sankar, Lori Scott-Sheldon, Samson Gebreab, William Hersh, Parisa Rashidi, Sally Baxter, Wade Schulz, Trey Ideker, Yael Bensoussan, Paul C. Boutros, Alex A. T. Bui, Colin Walsh, Karol E. Watson, Peipei Ping

    Abstract: Objective: As AI becomes increasingly central to healthcare, there is a pressing need for bioinformatics and biomedical training systems that are personalized and adaptable. Materials and Methods: The NIH Bridge2AI Training, Recruitment, and Mentoring (TRM) Working Group developed a cross-disciplinary curriculum grounded in collaborative innovation, ethical data stewardship, and professional devel… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

  43. arXiv:2505.14519  [pdf, ps, other

    quant-ph cs.AR cs.DC

    Distributed quantum computing with black-box subroutines

    Authors: X. Xu, Y. -D. Liu, S. Shi, Y. -J. Wang, D. -S. Wang

    Abstract: In this work, we propose a general protocol for distributed quantum computing that accommodates arbitrary unknown subroutines. It can be applied to scale up quantum computing through multi-chip interconnection, as well as to tasks such as estimating unknown parameters or processes for circuit depth reduction and constructing secure quantum cryptographic protocols. Our protocol builds upon a few te… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

  44. arXiv:2505.14447  [pdf, ps, other

    astro-ph.HE hep-ex

    First Identification and Precise Spectral Measurement of the Proton Component in the Cosmic-Ray `Knee'

    Authors: The LHAASO Collaboration, Zhen Cao, F. Aharonian, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, W. Bian, A. V. Bukevich, C. M. Cai, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, G. H. Chen, H. X. Chen, Liang Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. Chen , et al. (292 additional authors not shown)

    Abstract: We report the first high-purity identification of cosmic-ray (CR) protons and a precise measurement of their energy spectrum from 0.15 to 12 PeV using the Large High Altitude Air Shower Observatory (LHAASO). Abundant event statistics, combined with the simultaneous detection of electrons/photons, muons, and Cherenkov light in air showers, enable spectroscopic measurements with statistical and syst… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

  45. arXiv:2505.14437  [pdf, ps, other

    cs.SE

    Building Reuse-Sensitive Control Flow Graphs (CFGs) for EVM Bytecode

    Authors: Dingding Wang, Jianting He, Yizheng Yang, Lei Wu, Rui Chang, Yajin Zhou

    Abstract: The emergence of smart contracts brings security risks, exposing users to the threat of losing valuable cryptocurrencies, underscoring the urgency of meticulous scrutiny. Nevertheless, the static analysis of smart contracts in EVM bytecode faces obstacles due to flawed primitives resulting from code reuse introduced by compilers. Code reuse, a phenomenon where identical code executes in diverse co… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

  46. arXiv:2505.14135  [pdf, other

    cs.CV

    Hunyuan-Game: Industrial-grade Intelligent Game Creation Model

    Authors: Ruihuang Li, Caijin Zhou, Shoujian Zheng, Jianxiang Lu, Jiabin Huang, Comi Chen, Junshu Tang, Guangzheng Xu, Jiale Tao, Hongmei Wang, Donghao Li, Wenqing Yu, Senbo Wang, Zhimin Li, Yetshuan Shi, Haoyu Yang, Yukun Wang, Wenxun Dai, Jiaqi Li, Linqing Wang, Qixun Wang, Zhiyong Xu, Yingfang Zhang, Jiangfeng Xiong, Weijie Kong , et al. (33 additional authors not shown)

    Abstract: Intelligent game creation represents a transformative advancement in game development, utilizing generative artificial intelligence to dynamically generate and enhance game content. Despite notable progress in generative models, the comprehensive synthesis of high-quality game assets, including both images and videos, remains a challenging frontier. To create high-fidelity game content that simult… ▽ More

    Submitted 28 May, 2025; v1 submitted 20 May, 2025; originally announced May 2025.

  47. arXiv:2505.13949  [pdf, ps, other

    cs.CL cs.AI

    FlashThink: An Early Exit Method For Efficient Reasoning

    Authors: Guochao Jiang, Guofeng Quan, Zepeng Ding, Ziqin Luo, Dixuan Wang, Zheng Hu

    Abstract: Large Language Models (LLMs) have shown impressive performance in reasoning tasks. However, LLMs tend to generate excessively long reasoning content, leading to significant computational overhead. Our observations indicate that even on simple problems, LLMs tend to produce unnecessarily lengthy reasoning content, which is against intuitive expectations. Preliminary experiments show that at a certa… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

  48. arXiv:2505.13431  [pdf, other

    cs.RO

    A Practical Guide for Incorporating Symmetry in Diffusion Policy

    Authors: Dian Wang, Boce Hu, Shuran Song, Robin Walters, Robert Platt

    Abstract: Recently, equivariant neural networks for policy learning have shown promising improvements in sample efficiency and generalization, however, their wide adoption faces substantial barriers due to implementation complexity. Equivariant architectures typically require specialized mathematical formulations and custom network design, posing significant challenges when integrating with modern policy fr… ▽ More

    Submitted 20 May, 2025; v1 submitted 19 May, 2025; originally announced May 2025.

  49. arXiv:2505.13413  [pdf, ps, other

    cs.LG

    Joint Velocity-Growth Flow Matching for Single-Cell Dynamics Modeling

    Authors: Dongyi Wang, Yuanwei Jiang, Zhenyi Zhang, Xiang Gu, Peijie Zhou, Jian Sun

    Abstract: Learning the underlying dynamics of single cells from snapshot data has gained increasing attention in scientific and machine learning research. The destructive measurement technique and cell proliferation/death result in unpaired and unbalanced data between snapshots, making the learning of the underlying dynamics challenging. In this paper, we propose joint Velocity-Growth Flow Matching (VGFM),… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

  50. arXiv:2505.13222  [pdf, ps, other

    hep-ex

    Partial Wave Analysis of $e^{+}e^{-} \rightarrow π^{+}π^{-}J/ψ$ and Cross Section Measurement of $e^{+}e^{-} \rightarrow π^{\pm}Z_{c}(3900)^{\mp}$ from 4.1271 to 4.3583 GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (639 additional authors not shown)

    Abstract: Based on 12.0 $\mathrm{fb^{-1}}$ of $e^{+}e^{-}$ collision data samples collected by the BESIII detector at center-of-mass energies from 4.1271 to 4.3583 GeV, a partial wave analysis is performed for the process $e^{+}e^{-} \rightarrow π^{+}π^{-}J/ψ$. The cross sections for the sub processes ${e^{+}e^{-}\rightarrowπ^{+}Z_{c}(3900)^{-}+c.c.\rightarrowπ^{+}π^{-}J/ψ}$,… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.