Skip to main content

Showing 151–200 of 6,528 results for author: Suen, Y

.
  1. arXiv:2505.24268  [pdf, ps, other

    physics.atom-ph quant-ph

    Heterodyne detection of low-frequency fields via Rydberg EIT with phase demodulation

    Authors: Shenchao Jin, Xiayang Fan, Xin Wang, Yi Song, Yuan Sun

    Abstract: Recently, the rapid progress of quantum sensing research reveals that the Rydberg atoms have great potentials in becoming high-precision centimeter-scale antenna of low-frequency fields. In order to facilitate efficient and reliable detection of low-frequency fields via Rydberg atoms, we design, implement and analyze a special but low-cost and scalable method based on heterodyning processes under… ▽ More

    Submitted 30 May, 2025; originally announced May 2025.

    Comments: 5 figures;

  2. arXiv:2505.24241  [pdf, ps, other

    cs.CL

    Advantageous Parameter Expansion Training Makes Better Large Language Models

    Authors: Naibin Gu, Yilong Chen, Zhenyu Zhang, Peng Fu, Zheng Lin, Shuohuan Wang, Yu Sun, Hua Wu, Weiping Wang, Haifeng Wang

    Abstract: Although scaling up the number of trainable parameters in both pre-training and fine-tuning can effectively improve the performance of large language models, it also leads to increased computational overhead. When delving into the parameter difference, we find that a subset of parameters, termed advantageous parameters, plays a crucial role in determining model performance. Further analysis reveal… ▽ More

    Submitted 30 May, 2025; originally announced May 2025.

  3. arXiv:2505.24164  [pdf, ps, other

    cs.CL cs.CV

    Mixed-R1: Unified Reward Perspective For Reasoning Capability in Multimodal Large Language Models

    Authors: Shilin Xu, Yanwei Li, Rui Yang, Tao Zhang, Yueyi Sun, Wei Chow, Linfeng Li, Hang Song, Qi Xu, Yunhai Tong, Xiangtai Li, Hao Fei

    Abstract: Recent works on large language models (LLMs) have successfully demonstrated the emergence of reasoning capabilities via reinforcement learning (RL). Although recent efforts leverage group relative policy optimization (GRPO) for MLLMs post-training, they constantly explore one specific aspect, such as grounding tasks, math problems, or chart analysis. There are no works that can leverage multi-sour… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

    Report number: arxiv:2505.24164

  4. arXiv:2505.22140  [pdf, other

    hep-ex

    Search for a dark baryon in the $Ξ^-\rightarrowπ^-+{\rm invisible}$ decay

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (697 additional authors not shown)

    Abstract: A search for a dark baryon is performed for the first time in the two-body decay $Ξ^-\rightarrowπ^-+{\rm invisible}$ using $(10.087\pm0.044)\times10^{9}$ $J/ψ$ events collected at a center-of-mass energy of $\sqrt{s}=3.097\,\mbox{GeV}$ with the BESIII detector at the BEPCII collider. No significant signal is observed, and the 90% (95%) confidence level upper limits on the branching fraction… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

    Comments: 11 pages, 4 figures, 1 table

  5. arXiv:2505.21946  [pdf, ps, other

    cs.GR physics.flu-dyn

    Fluid Simulation on Vortex Particle Flow Maps

    Authors: Sinan Wang, Junwei Zhou, Fan Feng, Zhiqi Li, Yuchen Sun, Duowen Chen, Greg Turk, Bo Zhu

    Abstract: We propose the Vortex Particle Flow Map (VPFM) method to simulate incompressible flow with complex vortical evolution in the presence of dynamic solid boundaries. The core insight of our approach is that vorticity is an ideal quantity for evolution on particle flow maps, enabling significantly longer flow map distances compared to other fluid quantities like velocity or impulse. To achieve this go… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

    Comments: ACM Transactions on Graphics (SIGGRAPH 2025), 24 pages

  6. arXiv:2505.21502  [pdf, ps, other

    cs.CV

    Generalizable and Relightable Gaussian Splatting for Human Novel View Synthesis

    Authors: Yipengjing Sun, Chenyang Wang, Shunyuan Zheng, Zonglin Li, Shengping Zhang, Xiangyang Ji

    Abstract: We propose GRGS, a generalizable and relightable 3D Gaussian framework for high-fidelity human novel view synthesis under diverse lighting conditions. Unlike existing methods that rely on per-character optimization or ignore physical constraints, GRGS adopts a feed-forward, fully supervised strategy that projects geometry, material, and illumination cues from multi-view 2D observations into 3D Gau… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

    Comments: Project Webpage: https://sypj-98.github.io/grgs/

  7. arXiv:2505.21277  [pdf, ps, other

    cs.CR cs.AI cs.CL

    Breaking the Ceiling: Exploring the Potential of Jailbreak Attacks through Expanding Strategy Space

    Authors: Yao Huang, Yitong Sun, Shouwei Ruan, Yichi Zhang, Yinpeng Dong, Xingxing Wei

    Abstract: Large Language Models (LLMs), despite advanced general capabilities, still suffer from numerous safety risks, especially jailbreak attacks that bypass safety protocols. Understanding these vulnerabilities through black-box jailbreak attacks, which better reflect real-world scenarios, offers critical insights into model robustness. While existing methods have shown improvements through various prom… ▽ More

    Submitted 28 May, 2025; v1 submitted 27 May, 2025; originally announced May 2025.

    Comments: 19 pages, 20 figures, accepted by ACL 2025, Findings

  8. arXiv:2505.20771  [pdf, ps, other

    cs.IR cs.AI

    Bridging the Gap: Self-Optimized Fine-Tuning for LLM-based Recommender Systems

    Authors: Heng Tang, Feng Liu, Xinbo Chen, Jiawei Chen, Bohao Wang, Changwang Zhang, Jun Wang, Yuegang Sun, Bingde Hu, Can Wang

    Abstract: Recent years have witnessed extensive exploration of Large Language Models (LLMs) on the field of Recommender Systems (RS). There are currently two commonly used strategies to enable LLMs to have recommendation capabilities: 1) The "Guidance-Only" strategy uses in-context learning to exploit and amplify the inherent semantic understanding and item recommendation capabilities of LLMs; 2) The "Tunin… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

  9. arXiv:2505.20602  [pdf, ps, other

    math.NA

    Connecting randomized iterative methods with Krylov subspaces

    Authors: Yonghan Sun, Deren Han, Jiaxin Xie

    Abstract: Randomized iterative methods, such as the randomized Kaczmarz method, have gained significant attention for solving large-scale linear systems due to their simplicity and efficiency. Meanwhile, Krylov subspace methods have emerged as a powerful class of algorithms, known for their robust theoretical foundations and rapid convergence properties. Despite the individual successes of these two paradig… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

  10. arXiv:2505.20513  [pdf, other

    cs.CV

    MetaWriter: Personalized Handwritten Text Recognition Using Meta-Learned Prompt Tuning

    Authors: Wenhao Gu, Li Gu, Ching Yee Suen, Yang Wang

    Abstract: Recent advancements in handwritten text recognition (HTR) have enabled the effective conversion of handwritten text to digital formats. However, achieving robust recognition across diverse writing styles remains challenging. Traditional HTR methods lack writer-specific personalization at test time due to limitations in model architecture and training strategies. Existing attempts to bridge this ga… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: CVPR2025

  11. arXiv:2505.20510  [pdf, other

    cs.CV

    CPathAgent: An Agent-based Foundation Model for Interpretable High-Resolution Pathology Image Analysis Mimicking Pathologists' Diagnostic Logic

    Authors: Yuxuan Sun, Yixuan Si, Chenglu Zhu, Kai Zhang, Zhongyi Shui, Bowen Ding, Tao Lin, Lin Yang

    Abstract: Recent advances in computational pathology have led to the emergence of numerous foundation models. However, these approaches fail to replicate the diagnostic process of pathologists, as they either simply rely on general-purpose encoders with multi-instance learning for classification or directly apply multimodal models to generate reports from images. A significant limitation is their inability… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: 49 pages, 33 figures

  12. arXiv:2505.20437  [pdf, ps, other

    math.PR

    Rough backward SDEs with discontinuous Young drivers

    Authors: Dirk Becherer, Yuchen Sun

    Abstract: We study solutions to backward differential equations that are driven hybridly by a deterministic discontinuous rough path $W$ of finite $q$-variation for $q \in [1, 2)$ and by Brownian motion $B$. To distinguish between integration of jumps in a forward- or Marcus-sense, we refer to these equations as forward- respectively Marcus-type rough backward stochastic differential equations (RBSDEs). We… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    MSC Class: 60L90; 60J76; 60H20; 60H15; 37H30

  13. arXiv:2505.20349  [pdf, ps, other

    physics.flu-dyn cs.LG

    FD-Bench: A Modular and Fair Benchmark for Data-driven Fluid Simulation

    Authors: Haixin Wang, Ruoyan Li, Fred Xu, Fang Sun, Kaiqiao Han, Zijie Huang, Guancheng Wan, Ching Chang, Xiao Luo, Wei Wang, Yizhou Sun

    Abstract: Data-driven modeling of fluid dynamics has advanced rapidly with neural PDE solvers, yet a fair and strong benchmark remains fragmented due to the absence of unified PDE datasets and standardized evaluation protocols. Although architectural innovations are abundant, fair assessment is further impeded by the lack of clear disentanglement between spatial, temporal and loss modules. In this paper, we… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

    Comments: 31 pages, 18 figures, paper under review

  14. arXiv:2505.20293  [pdf, ps, other

    cs.CL

    Enhancing the Comprehensibility of Text Explanations via Unsupervised Concept Discovery

    Authors: Yifan Sun, Danding Wang, Qiang Sheng, Juan Cao, Jintao Li

    Abstract: Concept-based explainable approaches have emerged as a promising method in explainable AI because they can interpret models in a way that aligns with human reasoning. However, their adaption in the text domain remains limited. Most existing methods rely on predefined concept annotations and cannot discover unseen concepts, while other methods that extract concepts without supervision often produce… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: ACL 2025 Findings

  15. arXiv:2505.20131  [pdf, ps, other

    cs.LG q-bio.QM

    MolEditRL: Structure-Preserving Molecular Editing via Discrete Diffusion and Reinforcement Learning

    Authors: Yuanxin Zhuang, Dazhong Shen, Ying Sun

    Abstract: Molecular editing aims to modify a given molecule to optimize desired chemical properties while preserving structural similarity. However, current approaches typically rely on string-based or continuous representations, which fail to adequately capture the discrete, graph-structured nature of molecules, resulting in limited structural fidelity and poor controllability. In this paper, we propose Mo… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

  16. arXiv:2505.19907  [pdf, ps, other

    hep-ex nucl-ex

    First measurement of $Σ^{+}n\rightarrowΛp$ and $Σ^{+}n\rightarrowΣ^{0}p$ cross-sections via $Σ^+$-nucleus scattering at an electron-positron collider

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (680 additional authors not shown)

    Abstract: Using $(1.0087\pm0.0044)\times10^{10}$ $J/ψ$ events collected with the BESIII detector at the BEPCII storage ring, the reactions $Σ^{+}n\rightarrowΛp$ and $Σ^{+}n\rightarrowΣ^{0}p$ are studied, where the $Σ^{+}$ baryon is produced in the process $J/ψ\rightarrowΣ^{+}\barΣ^-$ and the neutron is a component of the $^9\rm{Be}$, $^{12}\rm{C}$ and $^{197}\rm{Au}$ nuclei in the beam pipe. Clear signals o… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: 9 pages, 2 figures

  17. arXiv:2505.19699  [pdf, ps, other

    cs.LG cs.AI cs.DC

    Mosaic: Data-Free Knowledge Distillation via Mixture-of-Experts for Heterogeneous Distributed Environments

    Authors: Junming Liu, Yanting Gao, Siyuan Meng, Yifei Sun, Aoqi Wu, Yufei Jin, Yirong Chen, Ding Wang, Guosun Zeng

    Abstract: Federated Learning (FL) is a decentralized machine learning paradigm that enables clients to collaboratively train models while preserving data privacy. However, the coexistence of model and data heterogeneity gives rise to inconsistent representations and divergent optimization dynamics across clients, ultimately hindering robust global performance. To transcend these challenges, we propose Mosai… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: 43 pages, 23 figures, 15 tables; the last dance

  18. arXiv:2505.19640  [pdf, other

    cs.CL

    Interleaved Reasoning for Large Language Models via Reinforcement Learning

    Authors: Roy Xie, David Qiu, Deepak Gopinath, Dong Lin, Yanchao Sun, Chong Wang, Saloni Potdar, Bhuwan Dhingra

    Abstract: Long chain-of-thought (CoT) significantly enhances large language models' (LLM) reasoning capabilities. However, the extensive reasoning traces lead to inefficiencies and an increased time-to-first-token (TTFT). We propose a novel training paradigm that uses reinforcement learning (RL) to guide reasoning LLMs to interleave thinking and answering for multi-hop questions. We observe that models inhe… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

  19. arXiv:2505.19612  [pdf, ps, other

    cs.SI stat.ME

    Optimal Intervention for Self-triggering Spatial Networks with Application to Urban Crime Analytics

    Authors: Pramit Das, Moulinath Banerjee, Yuekai Sun

    Abstract: In many network systems, events at one node trigger further activity at other nodes, e.g., social media users reacting to each other's posts or the clustering of criminal activity in urban environments. These systems are typically referred to as self-exciting networks. In such systems, targeted intervention at critical nodes can be an effective strategy for mitigating undesirable consequences such… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

  20. arXiv:2505.19597  [pdf, ps, other

    eess.AS

    A Lightweight Hybrid Dual Channel Speech Enhancement System under Low-SNR Conditions

    Authors: Zheng Wang, Xiaobin Rong, Yu Sun, Tianchi Sun, Zhibin Lin, Jing Lu

    Abstract: Although deep learning based multi-channel speech enhancement has achieved significant advancements, its practical deployment is often limited by constrained computational resources, particularly in low signal-to-noise ratio (SNR) conditions. In this paper, we propose a lightweight hybrid dual-channel speech enhancement system that combines independent vector analysis (IVA) with a modified version… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: Accepted by Interspeech 2025

  21. arXiv:2505.19490  [pdf, other

    cs.AI

    Automated CAD Modeling Sequence Generation from Text Descriptions via Transformer-Based Large Language Models

    Authors: Jianxing Liao, Junyan Xu, Yatao Sun, Maowen Tang, Sicheng He, Jingxian Liao, Shui Yu, Yun Li, Hongguan Xiao

    Abstract: Designing complex computer-aided design (CAD) models is often time-consuming due to challenges such as computational inefficiency and the difficulty of generating precise models. We propose a novel language-guided framework for industrial design automation to address these issues, integrating large language models (LLMs) with computer-automated design (CAutoD).Through this framework, CAD models ar… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: Accepted by ACL 2025 Main Conference

    ACM Class: I.2.7; I.2.6

  22. arXiv:2505.19473  [pdf, ps, other

    cs.IR

    Improving Recommendation Fairness without Sensitive Attributes Using Multi-Persona LLMs

    Authors: Haoran Xin, Ying Sun, Chao Wang, Yanke Yu, Weijia Zhang, Hui Xiong

    Abstract: Despite the success of recommender systems in alleviating information overload, fairness issues have raised concerns in recent years, potentially leading to unequal treatment for certain user groups. While efforts have been made to improve recommendation fairness, they often assume that users' sensitive attributes are available during model training. However, collecting sensitive information can b… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

    Comments: 18 pages, 9 figures

  23. arXiv:2505.19464  [pdf, ps, other

    cs.IR

    LLMs as Better Recommenders with Natural Language Collaborative Signals: A Self-Assessing Retrieval Approach

    Authors: Haoran Xin, Ying Sun, Chao Wang, Weijia Zhang, Hui Xiong

    Abstract: Incorporating collaborative information (CI) effectively is crucial for leveraging LLMs in recommendation tasks. Existing approaches often encode CI using soft tokens or abstract identifiers, which introduces a semantic misalignment with the LLM's natural language pretraining and hampers knowledge integration. To address this, we propose expressing CI directly in natural language to better align w… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

    Comments: 13 pages, 6 figures

  24. arXiv:2505.19432  [pdf, ps, other

    cs.LG

    Advanced long-term earth system forecasting by learning the small-scale nature

    Authors: Hao Wu, Yuan Gao, Ruiqi Shu, Kun Wang, Ruijian Gou, Chuhan Wu, Xinliang Liu, Juncai He, Shuhao Cao, Junfeng Fang, Xingjian Shi, Feng Tao, Qi Song, Shengxuan Ji, Yanfei Xiang, Yuze Sun, Jiahao Li, Fan Xu, Huanshuo Dong, Haixin Wang, Fan Zhang, Penghao Zhao, Xian Wu, Qingsong Wen, Deliang Chen , et al. (1 additional authors not shown)

    Abstract: Reliable long-term forecast of Earth system dynamics is heavily hampered by instabilities in current AI models during extended autoregressive simulations. These failures often originate from inherent spectral bias, leading to inadequate representation of critical high-frequency, small-scale processes and subsequent uncontrolled error amplification. We present Triton, an AI framework designed to ad… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

  25. arXiv:2505.19371  [pdf, other

    cs.AI cs.LG math.ST

    Foundations of Top-$k$ Decoding For Language Models

    Authors: Georgy Noarov, Soham Mallick, Tao Wang, Sunay Joshi, Yan Sun, Yangxinyu Xie, Mengxin Yu, Edgar Dobriban

    Abstract: Top-$k$ decoding is a widely used method for sampling from LLMs: at each token, only the largest $k$ next-token-probabilities are kept, and the next token is sampled after re-normalizing them to sum to unity. Top-$k$ and other sampling methods are motivated by the intuition that true next-token distributions are sparse, and the noisy LLM probabilities need to be truncated. However, to our knowledg… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

  26. arXiv:2505.19009  [pdf, ps, other

    q-bio.NC

    Capturing Aperiodic Temporal Dynamics of EEG Signals through Stochastic Fluctuation Modeling

    Authors: Yuhao Sun, Zhiyuan Ma, Xinke Shen, Jinhao Li, Guan Wang, Sen Song

    Abstract: Electrophysiological brain signals, such as electroencephalography (EEG), exhibit both periodic and aperiodic components, with the latter often modeled as 1/f noise and considered critical to cognitive and neurological processes. Although various theoretical frameworks have been proposed to account for aperiodic activity, its scale-invariant and long-range temporal dependency remain insufficiently… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

  27. arXiv:2505.18812  [pdf, ps, other

    cs.CV

    SAMA: Towards Multi-Turn Referential Grounded Video Chat with Large Language Models

    Authors: Ye Sun, Hao Zhang, Henghui Ding, Tiehua Zhang, Xingjun Ma, Yu-Gang Jiang

    Abstract: Achieving fine-grained spatio-temporal understanding in videos remains a major challenge for current Video Large Multimodal Models (Video LMMs). Addressing this challenge requires mastering two core capabilities: video referring understanding, which captures the semantics of video regions, and video grounding, which segments object regions based on natural language descriptions. However, most exis… ▽ More

    Submitted 24 May, 2025; originally announced May 2025.

  28. arXiv:2505.18355  [pdf, ps, other

    cs.LG

    X-MethaneWet: A Cross-scale Global Wetland Methane Emission Benchmark Dataset for Advancing Science Discovery with AI

    Authors: Yiming Sun, Shuo Chen, Shengyu Chen, Chonghao Qiu, Licheng Liu, Youmi Oh, Sparkle L. Malone, Gavin McNicol, Qianlai Zhuang, Chris Smith, Yiqun Xie, Xiaowei Jia

    Abstract: Methane (CH$_4$) is the second most powerful greenhouse gas after carbon dioxide and plays a crucial role in climate change due to its high global warming potential. Accurately modeling CH$_4$ fluxes across the globe and at fine temporal scales is essential for understanding its spatial and temporal variability and developing effective mitigation strategies. In this work, we introduce the first-of… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

    Comments: 8 pages, 8 figures, 3 tables

  29. arXiv:2505.18302  [pdf, ps, other

    cs.CV cs.IT

    Sampling Strategies for Efficient Training of Deep Learning Object Detection Algorithms

    Authors: Gefei Shen, Yung-Hong Sun, Yu Hen Hu, Hongrui Jiang

    Abstract: Two sampling strategies are investigated to enhance efficiency in training a deep learning object detection model. These sampling strategies are employed under the assumption of Lipschitz continuity of deep learning models. The first strategy is uniform sampling which seeks to obtain samples evenly yet randomly through the state space of the object dynamics. The second strategy of frame difference… ▽ More

    Submitted 27 May, 2025; v1 submitted 23 May, 2025; originally announced May 2025.

  30. arXiv:2505.18154  [pdf, ps, other

    cs.CL cs.CY

    The Staircase of Ethics: Probing LLM Value Priorities through Multi-Step Induction to Complex Moral Dilemmas

    Authors: Ya Wu, Qiang Sheng, Danding Wang, Guang Yang, Yifan Sun, Zhengjia Wang, Yuyan Bu, Juan Cao

    Abstract: Ethical decision-making is a critical aspect of human judgment, and the growing use of LLMs in decision-support systems necessitates a rigorous evaluation of their moral reasoning capabilities. However, existing assessments primarily rely on single-step evaluations, failing to capture how models adapt to evolving ethical challenges. Addressing this gap, we introduce the Multi-step Moral Dilemmas (… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

    Comments: 25 pages, 8 figures

  31. arXiv:2505.18021  [pdf, other

    cs.CV

    Building Floor Number Estimation from Crowdsourced Street-Level Images: Munich Dataset and Baseline Method

    Authors: Yao Sun, Sining Chen, Yifan Tian, Xiao Xiang Zhu

    Abstract: Accurate information on the number of building floors, or above-ground storeys, is essential for household estimation, utility provision, risk assessment, evacuation planning, and energy modeling. Yet large-scale floor-count data are rarely available in cadastral and 3D city databases. This study proposes an end-to-end deep learning framework that infers floor numbers directly from unrestricted, c… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

    Comments: Code and data: https://github.com/ya0-sun/Munich-SVI-Floor-Benchmark

  32. arXiv:2505.18018  [pdf, ps, other

    cs.RO cs.AI

    ExoGait-MS: Learning Periodic Dynamics with Multi-Scale Graph Network for Exoskeleton Gait Recognition

    Authors: Lijiang Liu, Junyu Shi, Yong Sun, Zhiyuan Zhang, Jinni Zhou, Shugen Ma, Qiang Nie

    Abstract: Current exoskeleton control methods often face challenges in delivering personalized treatment. Standardized walking gaits can lead to patient discomfort or even injury. Therefore, personalized gait is essential for the effectiveness of exoskeleton robots, as it directly impacts their adaptability, comfort, and rehabilitation outcomes for individual users. To enable personalized treatment in exosk… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

  33. arXiv:2505.18004  [pdf, ps, other

    hep-ex

    Measurement of branching fractions of $Λ_{c}^{+}$ decays to $Σ^{+} η$ and $Σ^{+} η'$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (644 additional authors not shown)

    Abstract: By analyzing $e^+e^-$ collision data taken at center-of-mass energies $\sqrt{s} = 4.600 \sim 4.699$ $\mbox{GeV}$ with the BESIII detector at the BEPCII collider, corresponding to an integrated luminosity of $\rm 4.5~fb^{-1}$, we study the hadronic decays $Λ_{c}^{+} \rightarrow Σ^{+} η$ and $Λ_{c}^{+} \rightarrow Σ^{+} η^{\prime}$ using the single-tag method. The branching fraction ratio of… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

  34. arXiv:2505.17826  [pdf, other

    cs.LG cs.CL cs.DC

    Trinity-RFT: A General-Purpose and Unified Framework for Reinforcement Fine-Tuning of Large Language Models

    Authors: Xuchen Pan, Yanxi Chen, Yushuo Chen, Yuchang Sun, Daoyuan Chen, Wenhao Zhang, Yuexiang Xie, Yilun Huang, Yilei Zhang, Dawei Gao, Yaliang Li, Bolin Ding, Jingren Zhou

    Abstract: Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models. It is built with a decoupled design, consisting of (1) an RFT-core that unifies and generalizes synchronous/asynchronous, on-policy/off-policy, and online/offline modes of RFT, (2) seamless integration for agent-environment interaction with high efficiency and ro… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

    Comments: This technical report will be continuously updated as the codebase evolves. GitHub: https://github.com/modelscope/Trinity-RFT

  35. arXiv:2505.17288  [pdf, ps, other

    stat.ML cs.LG

    Learning to Choose or Choosing to Learn: Best-of-N vs. Supervised Fine-Tuning for Bit String Generation

    Authors: Seamus Somerstep, Vinod Raman, Unique Subedi, Yuekai Sun

    Abstract: Using the bit string generation problem as a case study, we theoretically compare two standard methods for adapting large language models to new tasks. The first, referred to as supervised fine-tuning, involves training a new next token predictor on good generations. The second method, Best-of-N, trains a reward model to select good responses from a collection generated by an unaltered base model.… ▽ More

    Submitted 22 May, 2025; originally announced May 2025.

  36. arXiv:2505.17249  [pdf, other

    cs.AI cs.LG

    Where You Go is Who You Are: Behavioral Theory-Guided LLMs for Inverse Reinforcement Learning

    Authors: Yuran Sun, Susu Xu, Chenguang Wang, Xilei Zhao

    Abstract: Big trajectory data hold great promise for human mobility analysis, but their utility is often constrained by the absence of critical traveler attributes, particularly sociodemographic information. While prior studies have explored predicting such attributes from mobility patterns, they often overlooked underlying cognitive mechanisms and exhibited low predictive accuracy. This study introduces SI… ▽ More

    Submitted 22 May, 2025; originally announced May 2025.

  37. arXiv:2505.17133  [pdf, ps, other

    stat.ML cs.AI cs.LG

    Learning Probabilities of Causation from Finite Population Data

    Authors: Shuai Wang, Song Jiang, Yizhou Sun, Judea Pearl, Ang Li

    Abstract: Probabilities of causation play a crucial role in modern decision-making. This paper addresses the challenge of predicting probabilities of causation for subpopulations with \textbf{insufficient} data using machine learning models. Tian and Pearl first defined and derived tight bounds for three fundamental probabilities of causation: the probability of necessity and sufficiency (PNS), the probabil… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

    Comments: arXiv admin note: text overlap with arXiv:2502.08858

  38. arXiv:2505.16643  [pdf, other

    cs.CV cs.AI

    From Evaluation to Defense: Advancing Safety in Video Large Language Models

    Authors: Yiwei Sun, Peiqi Jiang, Chuanbin Liu, Luohao Lin, Zhiying Lu, Hongtao Xie

    Abstract: While the safety risks of image-based large language models have been extensively studied, their video-based counterparts (Video LLMs) remain critically under-examined. To systematically study this problem, we introduce \textbf{VideoSafetyBench (VSB-77k) - the first large-scale, culturally diverse benchmark for Video LLM safety}, which compromises 77,646 video-query pairs and spans 19 principal ri… ▽ More

    Submitted 22 May, 2025; originally announced May 2025.

    Comments: 49 pages, 12 figures, 17 tables

  39. arXiv:2505.15656  [pdf, ps, other

    cs.CL

    Be Careful When Fine-tuning On Open-Source LLMs: Your Fine-tuning Data Could Be Secretly Stolen!

    Authors: Zhexin Zhang, Yuhao Sun, Junxiao Yang, Shiyao Cui, Hongning Wang, Minlie Huang

    Abstract: Fine-tuning on open-source Large Language Models (LLMs) with proprietary data is now a standard practice for downstream developers to obtain task-specific LLMs. Surprisingly, we reveal a new and concerning risk along with the practice: the creator of the open-source LLMs can later extract the private downstream fine-tuning data through simple backdoor training, only requiring black-box access to t… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

    Comments: 19 pages

  40. arXiv:2505.15620  [pdf, ps, other

    hep-ex

    Observation of $χ_{cJ}\to 3K_S^0K^\pmπ^\mp$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (678 additional authors not shown)

    Abstract: By analyzing $(2712.4\pm14.3)\times10^6$ $ψ(3686)$ events collected with the BESIII detector operating at the BEPCII collider, the decays $χ_{c0,1,2} \to 3K_S^0K^\pmπ^\mp$ are observed for the first time with statistical significances greater than $10σ$. The branching fractions of these decays are determined to be $\mathcal{B}(χ_{c0}\to 3K_S^0K^\pmπ^\mp )=(7.95\pm0.50\pm0.65)\times10^{-5},$… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

    Comments: 11 pages, 6 figures

  41. arXiv:2505.15151  [pdf, ps, other

    cs.LG

    Time Tracker: Mixture-of-Experts-Enhanced Foundation Time Series Forecasting Model with Decoupled Training Pipelines

    Authors: Xiaohou Shi, Ke Li, Aobo Liang, Yan Sun

    Abstract: In the past few years, time series foundation models have achieved superior predicting accuracy. However, real-world time series often exhibit significant diversity in their temporal patterns across different time spans and domains, making it challenging for a single model architecture to fit all complex scenarios. In addition, time series data may have multiple variables exhibiting complex correl… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

  42. Test of local realism via entangled $Λ\barΛ$ system

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (597 additional authors not shown)

    Abstract: The non-locality of quantum correlations is a fundamental feature of quantum theory. The Bell inequality serves as a benchmark for distinguishing between predictions made by quantum theory and local hidden variable theory (LHVT). Recent advancements in photon-entanglement experiments have addressed potential loopholes and have observed significant violations of variants of Bell inequality. However… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

    Journal ref: Nat Commun 16, 4948 (2025)

  43. arXiv:2505.14974  [pdf, ps, other

    physics.optics

    Full spectral response of grating-induced loss in photonic crystal microrings

    Authors: Daniel Pimbi, Yi Sun, Roy Zektzer, Xiyuan Lu, Kartik Srinivasan

    Abstract: Photonic crystal microrings (PhCRs) have emerged as powerful and versatile platforms for integrated nonlinear photonics, offering precise control over frequency and phase matching while maintaining high optical quality factors. Through grating-mediated mode coupling, PhCRs enable advanced dispersion engineering, which is critical for wideband nonlinear processes such as optical parametric oscillat… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

  44. arXiv:2505.14916  [pdf

    eess.IV cs.CV

    Super-Resolution Optical Coherence Tomography Using Diffusion Model-Based Plug-and-Play Priors

    Authors: Yaning Wang, Jinglun Yu, Wenhan Guo, Yu Sun, Jin U. Kang

    Abstract: We propose an OCT super-resolution framework based on a plug-and-play diffusion model (PnP-DM) to reconstruct high-quality images from sparse measurements (OCT B-mode corneal images). Our method formulates reconstruction as an inverse problem, combining a diffusion prior with Markov chain Monte Carlo sampling for efficient posterior inference. We collect high-speed under-sampled B-mode corneal ima… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

  45. arXiv:2505.14560  [pdf, ps, other

    eess.IV cs.CV

    Neural Inverse Scattering with Score-based Regularization

    Authors: Yuan Gao, Wenhan Guo, Yu Sun

    Abstract: Inverse scattering is a fundamental challenge in many imaging applications, ranging from microscopy to remote sensing. Solving this problem often requires jointly estimating two unknowns -- the image and the scattering field inside the object -- necessitating effective image prior to regularize the inference. In this paper, we propose a regularized neural field (NF) approach which integrates the d… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

  46. arXiv:2505.14161  [pdf, other

    cs.LG

    Personalized Bayesian Federated Learning with Wasserstein Barycenter Aggregation

    Authors: Ting Wei, Biao Mei, Junliang Lyu, Renquan Zhang, Feng Zhou, Yifan Sun

    Abstract: Personalized Bayesian federated learning (PBFL) handles non-i.i.d. client data and quantifies uncertainty by combining personalization with Bayesian inference. However, existing PBFL methods face two limitations: restrictive parametric assumptions in client posterior inference and naive parameter averaging for server aggregation. To overcome these issues, we propose FedWBA, a novel PBFL method tha… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

  47. arXiv:2505.14135  [pdf, other

    cs.CV

    Hunyuan-Game: Industrial-grade Intelligent Game Creation Model

    Authors: Ruihuang Li, Caijin Zhou, Shoujian Zheng, Jianxiang Lu, Jiabin Huang, Comi Chen, Junshu Tang, Guangzheng Xu, Jiale Tao, Hongmei Wang, Donghao Li, Wenqing Yu, Senbo Wang, Zhimin Li, Yetshuan Shi, Haoyu Yang, Yukun Wang, Wenxun Dai, Jiaqi Li, Linqing Wang, Qixun Wang, Zhiyong Xu, Yingfang Zhang, Jiangfeng Xiong, Weijie Kong , et al. (33 additional authors not shown)

    Abstract: Intelligent game creation represents a transformative advancement in game development, utilizing generative artificial intelligence to dynamically generate and enhance game content. Despite notable progress in generative models, the comprehensive synthesis of high-quality game assets, including both images and videos, remains a challenging frontier. To create high-fidelity game content that simult… ▽ More

    Submitted 28 May, 2025; v1 submitted 20 May, 2025; originally announced May 2025.

  48. arXiv:2505.14057  [pdf, ps, other

    cs.IR cs.AI

    Field Matters: A lightweight LLM-enhanced Method for CTR Prediction

    Authors: Yu Cui, Feng Liu, Jiawei Chen, Xingyu Lou, Changwang Zhang, Jun Wang, Yuegang Sun, Xiaohu Yang, Can Wang

    Abstract: Click-through rate (CTR) prediction is a fundamental task in modern recommender systems. In recent years, the integration of large language models (LLMs) has been shown to effectively enhance the performance of traditional CTR methods. However, existing LLM-enhanced methods often require extensive processing of detailed textual descriptions for large-scale instances or user/item entities, leading… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

  49. OGHReS: Star formation in the Outer Galaxy II ($\ell = 180^\circ$-$280^\circ$)

    Authors: J. S. Urquhart, C. Koenig, D. Colombo, A. Karska, A. Giannetti, T. J. T. Moore, A. Y. Yang, F. Wyrowski, Y. Sun, Z. Jiang, K. R. Neralwar, D. Eden, I. Grozdanova, S. Neupane, M. Figueira, E. Dann, V., S. Veena, W. -J. Kim, S. Leurini, J. Brand, M. -Y. Lee

    Abstract: The Outer Galaxy High-Resolution Survey (OGHReS) covers 100 square degrees ($180^\circ < \ell < 280^\circ$) in the (2--1) transitions of three CO-isotopologues. We use the spectra to refine the velocities and physical properties to 6706 \higal\ clumps located in the OGHReS region. In a previous paper, we analysed 3584 clumps between $\ell = 250^\circ$ and $280^\circ$. Here, we cover a further 3122… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

    Comments: 18 pages, 14 figues. Full versions of Tables 1 and 2 are only available in electronic form via CDS. arXiv admin note: text overlap with arXiv:2401.00808

  50. arXiv:2505.13633  [pdf, ps, other

    cs.CV

    IPENS:Interactive Unsupervised Framework for Rapid Plant Phenotyping Extraction via NeRF-SAM2 Fusion

    Authors: Wentao Song, He Huang, Youqiang Sun, Fang Qu, Jiaqi Zhang, Longhui Fang, Yuwei Hao, Chenyang Peng

    Abstract: Advanced plant phenotyping technologies play a crucial role in targeted trait improvement and accelerating intelligent breeding. Due to the species diversity of plants, existing methods heavily rely on large-scale high-precision manually annotated data. For self-occluded objects at the grain level, unsupervised methods often prove ineffective. This study proposes IPENS, an interactive unsupervised… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.