Showing 51–100 of 6,717 results for author: Mao, Y

  1. arXiv:2506.23271  [pdf, ps, other]

    cs.CV

    Mettle: Meta-Token Learning for Memory-Efficient Audio-Visual Adaptation

    Authors: Jinxing Zhou, Zhihui Li, Yongqiang Yu, Yanghao Zhou, Ruohao Guo, Guangyao Li, Yuxin Mao, Mingfei Han, Xiaojun Chang, Meng Wang

    Abstract: We present Meta-Token Learning (Mettle), a simple and memory-efficient method for adapting large-scale pretrained transformer models to downstream audio-visual tasks. Instead of sequentially modifying the output feature distribution of the transformer backbone, Mettle utilizes a lightweight Layer-Centric Distillation (LCD) module to distill in parallel the intac… ▽ More

    Submitted 29 June, 2025; originally announced June 2025.

    Comments: Technical Report

  2. arXiv:2506.23152  [pdf, ps, other]

    cs.RO

    DexH2R: A Benchmark for Dynamic Dexterous Grasping in Human-to-Robot Handover

    Authors: Youzhuo Wang, Jiayi Ye, Chuyang Xiao, Yiming Zhong, Heng Tao, Hang Yu, Yumeng Liu, Jingyi Yu, Yuexin Ma

    Abstract: Handover between a human and a dexterous robotic hand is a fundamental yet challenging task in human-robot collaboration. It requires handling dynamic environments and a wide variety of objects and demands robust and adaptive grasping strategies. However, progress in developing effective dynamic dexterous grasping methods is limited by the absence of high-quality, real-world human-to-robot handove… ▽ More

    Submitted 2 July, 2025; v1 submitted 29 June, 2025; originally announced June 2025.

    Comments: Accepted by ICCV 2025. Project page: https://dexh2r.github.io/

  3. arXiv:2506.23133  [pdf, ps, other]

    cs.CL

    Format-Adapter: Improving Reasoning Capability of LLMs by Adapting Suitable Format

    Authors: Dingzirui Wang, Xuanliang Zhang, Rongyu Cao, Longxu Dou, Xianzhen Luo, Yingwei Ma, Qingfu Zhu, Wanxiang Che, Binhua Li, Fei Huang, Yongbin Li

    Abstract: Generating and voting multiple answers is an effective method to mitigate reasoning inconsistencies of large language models (LLMs). Prior works have shown that multiple reasoning formats outperform a single format when generating multiple answers. However, previous works using multiple formats rely on formats labeled by humans, which could be unsuitable for all tasks and have high labeling costs.… ▽ More

    Submitted 29 June, 2025; originally announced June 2025.

  4. arXiv:2506.22937  [pdf, ps, other]

    cs.HC

    GamerAstra: Enhancing Video Game Accessibility for Blind and Low-Vision Players through a Multi-Agent AI Framework

    Authors: Tianrun Qiu, Changxin Chen, Sizhe Cheng, Yiming Yang, Yixiao Guo, Zhicong Lu, Yuxin Ma

    Abstract: Blind and low-vision (BLV) players encounter critical challenges in engaging with video games due to the inaccessibility of visual elements, difficulties in navigating interfaces, and limitations in sending interaction input. Moreover, the development of specialized accessibility features typically requires substantial programming effort and is often implemented on a game-by-game basis. To address… ▽ More

    Submitted 28 June, 2025; originally announced June 2025.

    Comments: 19 pages, 9 figures

    ACM Class: H.5.2

  5. arXiv:2506.22911  [pdf, ps, other]

    cs.GT cs.AI cs.LG

    Learning Truthful Mechanisms without Discretization

    Authors: Yunxuan Ma, Siqiang Wang, Zhijian Duan, Yukun Cheng, Xiaotie Deng

    Abstract: This paper introduces TEDI (Truthful, Expressive, and Dimension-Insensitive approach), a discretization-free algorithm to learn truthful and utility-maximizing mechanisms. Existing learning-based approaches often rely on discretization of outcome spaces to ensure truthfulness, which leads to inefficiency with increasing problem size. To address this limitation, we formalize the concept of pricing… ▽ More

    Submitted 28 June, 2025; originally announced June 2025.

    Comments: 66 pages

  6. arXiv:2506.22785  [pdf, ps, other]

    cond-mat.soft

    The mechanics of disclination emergence in 3D active nematics

    Authors: Yingyou Ma, Christopher Amey, Aparna Baskaran, Michael F. Hagan

    Abstract: The spontaneous creation of disclinations is a defining characteristic of active nematics, which is rarely observed in equilibrium systems or other active matter systems. Thus, understanding the mechanics of disclinations is crucial for developing reliable continuum theories and practical applications. In this work, we explore this intrinsic mechanics by performing large-scale 3D simulations of a… ▽ More

    Submitted 28 June, 2025; originally announced June 2025.

  7. arXiv:2506.22611  [pdf]

    q-fin.PM cs.LG math.OC q-fin.CP q-fin.RM

    Deep Hedging to Manage Tail Risk

    Authors: Yuming Ma

    Abstract: Extending Buehler et al.'s 2019 Deep Hedging paradigm, we innovatively employ deep neural networks to parameterize convex-risk minimization (CVaR/ES) for the portfolio tail-risk hedging problem. Through comprehensive numerical experiments on crisis-era bootstrap market simulators -- customizable with transaction costs, risk budgets, liquidity constraints, and market impact -- our end-to-end framew… ▽ More

    Submitted 27 June, 2025; originally announced June 2025.

    Comments: 59 pages

    MSC Class: 91G70 91G20 91G60

  8. arXiv:2506.22448  [pdf, ps, other]

    eess.SP cs.AI cs.IT

    Unsupervised Learning-Based Joint Resource Allocation and Beamforming Design for RIS-Assisted MISO-OFDMA Systems

    Authors: Yu Ma, Xingyu Zhou, Xiao Li, Le Liang, Shi Jin

    Abstract: Reconfigurable intelligent surfaces (RIS) are key enablers for 6G wireless systems. This paper studies downlink transmission in an RIS-assisted MISO-OFDMA system, addressing resource allocation challenges. A two-stage unsupervised learning-based framework is proposed to jointly design RIS phase shifts, BS beamforming, and resource block (RB) allocation. The framework includes BeamNet, which predic… ▽ More

    Submitted 12 June, 2025; originally announced June 2025.

    Comments: Due to the limitation "The abstract field cannot be longer than 1,920 characters", the abstract here is shorter than that in the PDF file

  9. arXiv:2506.22247  [pdf, ps, other]

    nucl-th hep-ex nucl-ex

    Spin polarization from nucleon-nucleon scatterings in intermediate-energy heavy-ion collisions

    Authors: Rong-Jun Liu, Jun Xu, Yu-Gang Ma

    Abstract: We propose a new mechanism of generating spin polarization in heavy-ion collisions dominated by nucleon degree of freedom. By incorporating the spin change in nucleon-nucleon scatterings based on the phase shift data together with the constraint of rigorous angular momentum conservation and Pauli blocking, we illustrate through a Boltzmann-Uehling-Uhlenbeck transport model that appreciable spin po… ▽ More

    Submitted 27 June, 2025; originally announced June 2025.

    Comments: 6 pages, 4 figures

    Journal ref: Physics Letters B 868, 139703 (2025)

  10. arXiv:2506.22059  [pdf, ps, other]

    eess.SP

    Hybrid Constellation Modulation for Symbol-Level Precoding in RIS-Enhanced MU-MISO Systems

    Authors: Yupeng Zheng, Yi Ma, Rahim Tafazolli

    Abstract: The application of symbol-level precoding (SLP) in reconfigurable intelligent surfaces (RIS) enhanced multi-user multiple-input single-output (MU-MISO) systems faces two main challenges. First, the state-of-the-art joint reflecting and SLP optimization approach requires exhaustive enumeration of all possible transmit symbol combinations, resulting in scalability issues as the modulation order and… ▽ More

    Submitted 27 June, 2025; originally announced June 2025.

    Comments: This work has been accepted by IEEE SPAWC 2025

  11. arXiv:2506.21962  [pdf, ps, other]

    cs.HC

    AnyAni: An Interactive System with Generative AI for Animation Effect Creation and Code Understanding in Web Development

    Authors: Tianrun Qiu, Yuxin Ma

    Abstract: Generative AI assistants have been widely used in front-end programming. However, besides code writing, developers often encounter the need to generate animation effects. As novices in creative design without the assistance of professional designers, developers typically face difficulties in describing, designing, and implementing desired animations. To address this issue, we conducted a formative… ▽ More

    Submitted 27 June, 2025; originally announced June 2025.

    ACM Class: J.6

  12. arXiv:2506.21579  [pdf, ps, other]

    cs.IR cs.AI

    LLM2Rec: Large Language Models Are Powerful Embedding Models for Sequential Recommendation

    Authors: Yingzhi He, Xiaohao Liu, An Zhang, Yunshan Ma, Tat-Seng Chua

    Abstract: Sequential recommendation aims to predict users' future interactions by modeling collaborative filtering (CF) signals from historical behaviors of similar users or items. Traditional sequential recommenders predominantly rely on ID-based embeddings, which capture CF signals through high-order co-occurrence patterns. However, these embeddings depend solely on past interactions, lacking transferable… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

    Comments: KDD 2025

  13. arXiv:2506.21574  [pdf, ps, other]

    cs.CL cs.AI

    Digital Gatekeepers: Exploring Large Language Model's Role in Immigration Decisions

    Authors: Yicheng Mao, Yang Zhao

    Abstract: With globalization and increasing immigrant populations, immigration departments face significant workloads and the challenge of ensuring fairness in decision-making processes. Integrating artificial intelligence offers a promising solution to these challenges. This study investigates the potential of large language models (LLMs), such as GPT-3.5 and GPT-4, in supporting immigration decision-makin… ▽ More

    Submitted 15 June, 2025; originally announced June 2025.

  14. arXiv:2506.21370  [pdf, ps, other]

    cs.IT eess.SP

    Cluster-Aware Two-Stage Method for Fast Iterative MIMO Detection in LEO Satellite Communications

    Authors: Jiuyu Liu, Yi Ma, Qihao Peng, Rahim Tafazolli

    Abstract: In this paper, a cluster-aware two-stage multiple-input multiple-output (MIMO) detection method is proposed for direct-to-cell satellite communications. The method achieves computational efficiency by exploiting a distinctive property of satellite MIMO channels: users within the same geographical cluster exhibit highly correlated channel characteristics due to their physical proximity, which typic… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: This work has been accepted by IEEE/CIC ICCC 2025

  15. arXiv:2506.21190  [pdf, ps, other]

    stat.ME

    Survival analysis under label shift

    Authors: Yuxiang Zong, Yanyuan Ma, Ingrid Van Keilegom

    Abstract: Let P represent the source population with complete data, containing covariate $\mathbf{Z}$ and response $T$, and Q the target population, where only the covariate $\mathbf{Z}$ is available. We consider a setting with both label shift and label censoring. Label shift assumes that the marginal distribution of $T$ differs between $P$ and $Q$, while the conditional distribution of $\mathbf{Z}$ given… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

  16. arXiv:2506.21038  [pdf]

    cond-mat.mtrl-sci

    Ferroelectricity in 6 Angstrom-Thick Two-dimensional Ga$_2$O$_3$

    Authors: Tong Jiang, Han Chen, Yubo Yuan, Xiang Xu, Junwei Cao, Hao Wang, Xuechun Sun, Junshuai Li, Yaqing Ma, Huaze Zhu, Wenbin Li, Wei Kong

    Abstract: Atomic-scale ferroelectric thin films hold great promise for high-density, low-power applications but face stability and voltage scaling challenges at extreme thinness. Here, we demonstrate ferroelectricity in single-crystalline two-dimensional (2D) Ga$_2$O$_3$, an ultra-wide-bandgap semiconductor, at just 6 angstrom thickness, exhibiting exceptional retention and thermal stability. We show that e… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: 38 pages, 13 figures

  17. arXiv:2506.21032  [pdf, ps, other]

    cs.IR

    RecCoT: Enhancing Recommendation via Chain-of-Thought

    Authors: Shuo Yang, Jiangxia Cao, Haipeng Li, Yuqi Mao, Shuchao Pang

    Abstract: In real-world applications, users always interact with items in multiple aspects, such as through implicit binary feedback (e.g., clicks, dislikes, long views) and explicit feedback (e.g., comments, reviews). Modern recommendation systems (RecSys) learn user-item collaborative signals from these implicit feedback signals as a large-scale binary data-streaming, subsequently recommending other highl… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: Work in progress

  18. arXiv:2506.20986  [pdf, ps, other]

    cs.CV

    EVA: Mixture-of-Experts Semantic Variant Alignment for Compositional Zero-Shot Learning

    Authors: Xiao Zhang, Yongqiang Ma, Haodong Jing, Nanning Zheng

    Abstract: Compositional Zero-Shot Learning (CZSL) investigates compositional generalization capacity to recognize unknown state-object pairs based on learned primitive concepts. Existing CZSL methods typically derive primitives features through a simple composition-prototype mapping, which is suboptimal for a set of individuals that can be divided into distinct semantic subsets. Moreover, the all-to-one cro… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

  19. arXiv:2506.20931  [pdf, ps, other]

    cs.CR

    SPA: Towards More Stealth and Persistent Backdoor Attacks in Federated Learning

    Authors: Chengcheng Zhu, Ye Li, Bosen Rao, Jiale Zhang, Yunlong Mao, Sheng Zhong

    Abstract: Federated Learning (FL) has emerged as a leading paradigm for privacy-preserving distributed machine learning, yet the distributed nature of FL introduces unique security challenges, notably the threat of backdoor attacks. Existing backdoor strategies predominantly rely on end-to-end label supervision, which, despite their efficacy, often results in detectable feature disentanglement and limited p… ▽ More

    Submitted 25 June, 2025; originally announced June 2025.

    Comments: 18 pages

  20. arXiv:2506.20443  [pdf, ps, other]

    physics.plasm-ph

    MHD simulation of tilt instability during the dynamic FRC magnetic compression process

    Authors: Yiming Ma, Ping Zhu, Bo Rao, Haolong Li

    Abstract: The nonlinear evolution of the tilt instability in a field reversed configuration (FRC) during the dynamic magnetic compression process has been investigated using magnetohydrodynamic (MHD) simulations with the NIMROD code [C. R. Sovinec et al., J. Comput. Phys. 195, 355 (2004)]. The tilt mode induces significant deformations in the linear growth phase and results in complete con… ▽ More

    Submitted 25 June, 2025; originally announced June 2025.

  21. arXiv:2506.19955  [pdf, ps, other]

    cs.CV

    EBC-ZIP: Improving Blockwise Crowd Counting with Zero-Inflated Poisson Regression

    Authors: Yiming Ma, Victor Sanchez, Tanaya Guha

    Abstract: Density map estimation has become the mainstream paradigm in crowd counting. However, most existing methods overlook the extreme sparsity of ground-truth density maps. In real-world crowd scenes, the vast majority of spatial regions (often over 95%) contain no people, leading to heavily imbalanced count distributions. Ignoring this imbalance can bias models toward overestimating dense regions and… ▽ More

    Submitted 24 June, 2025; originally announced June 2025.

  22. arXiv:2506.19180  [pdf, ps, other]

    hep-ex hep-ph

    Precise Measurement of the $Λ$ Electric Dipole Moment through the Entangled Strange Baryon-Antibaryon System

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai, et al. (696 additional authors not shown)

    Abstract: The dominance of matter over antimatter in the universe has consistently driven the pursuit of new physics beyond the Standard Model that violates charge-parity symmetry. Unlike the well-constrained electrons and neutrons, strange baryons (hyperons) remain a largely unexplored territory, in which interactions between hyperons and particles from new physics could induce a non-trivial electric dipol… ▽ More

    Submitted 28 June, 2025; v1 submitted 23 June, 2025; originally announced June 2025.

  23. A new window into the sub-parsec scale magnetic field in the Milky Way? Unveiling small-scale magneto-ionic structures with Faraday complexity

    Authors: Yik Ki Ma, Amit Seta, N. M. McClure-Griffiths, C. L. Van Eck, S. A. Mao, A. Ordog, J. C. Brown, T. O. Kovacs, Takuya Akahori, K. Kurahara, L. Oberhelman, C. S. Anderson

    Abstract: Radio broadband spectro-polarimetric observations are sensitive to the spatial fluctuations of the Faraday depth (FD) within the telescope beam. Such FD fluctuations are referred to as "Faraday complexity", and can unveil small-scale magneto-ionic structures in both the synchrotron-emitting and the foreground volumes. We explore the astrophysical origin of the Faraday complexity exhibited by 191 p… ▽ More

    Submitted 23 June, 2025; originally announced June 2025.

    Comments: 32 pages, 19 figures, MNRAS accepted

  24. arXiv:2506.18240  [pdf, ps, other]

    cs.LG cs.AI physics.optics

    Quantum-Classical Hybrid Quantized Neural Network

    Authors: Wenxin Li, Chuan Wang, Hongdong Zhu, Qi Gao, Yin Ma, Hai Wei, Kai Wen

    Abstract: In this work, we present a novel Quadratic Binary Optimization (QBO) model for quantized neural network training, enabling the use of arbitrary activation and loss functions through spline interpolation. We introduce Forward Interval Propagation (FIP), a method designed to tackle the challenges of non-linearity and the multi-layer composite structure in neural networks by discretizing activat… ▽ More

    Submitted 24 June, 2025; v1 submitted 22 June, 2025; originally announced June 2025.

    Comments: 27 pages, 5 figures, comments are welcome

  25. arXiv:2506.17125  [pdf, ps, other]

    cs.SE

    Large Language Model Unlearning for Source Code

    Authors: Xue Jiang, Yihong Dong, Zheng Fang, Yingwei Ma, Tangxinyu Wang, Rongyu Cao, Binhua Li, Zhi Jin, Wenpin Jiao, Yongbin Li, Ge Li

    Abstract: LLM4SE has demonstrated significant success, but LLMs' potential memorization of sensitive or outdated training data introduces critical risks to legal compliance, software security, and code quality. LLM unlearning techniques, which can eliminate the influence of undesired data from LLMs in a post-training way, present a promising solution to address these concerns. While recent efforts in LLM un… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

  26. arXiv:2506.16986  [pdf, ps, other]

    cs.RO

    Learning Accurate Whole-body Throwing with High-frequency Residual Policy and Pullback Tube Acceleration

    Authors: Yuntao Ma, Yang Liu, Kaixian Qu, Marco Hutter

    Abstract: Throwing is a fundamental skill that enables robots to manipulate objects in ways that extend beyond the reach of their arms. We present a control framework that combines learning and model-based control for prehensile whole-body throwing with legged mobile manipulators. Our framework consists of three components: a nominal tracking policy for the end-effector, a high-frequency residual policy to… ▽ More

    Submitted 23 June, 2025; v1 submitted 20 June, 2025; originally announced June 2025.

    Comments: 8 pages, IROS 2025

    MSC Class: 68T40; 93C85; 70E60 ACM Class: I.2.9; I.2.10; I.2.8

  27. arXiv:2506.16968  [pdf, ps, other]

    cs.CR cs.CY

    MM-AttacKG: A Multimodal Approach to Attack Graph Construction with Large Language Models

    Authors: Yongheng Zhang, Xinyun Zhao, Yunshan Ma, Haokai Ma, Yingxiao Guan, Guozheng Yang, Yuliang Lu, Xiang Wang

    Abstract: Cyber Threat Intelligence (CTI) parsing aims to extract key threat information from massive data, transform it into actionable intelligence, enhance threat detection and defense efficiency, including attack graph construction, intelligence fusion and indicator extraction. Among these research topics, Attack Graph Construction (AGC) is essential for visualizing and understanding the potential attac… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

  28. arXiv:2506.16967  [pdf, ps, other]

    math.PR

    How fast does spectral radius of truncated circular unitary ensemble converge?

    Authors: Yutao Ma, Xujia Meng

    Abstract: Let $z_1, \cdots, z_p$ be the eigenvalues of $A,$ which is the left-top $p\times p$ submatrix of an $n\times n$ Haar-invariant unitary matrix. Suppose there exist two constants $0<h_1<h_2<1$ such that $h_1<\frac pn<h_2.$ Then, $$\sup_{x\in \mathbb{R}}|\mathbb{P}(X_n\le x)-e^{-e^{-x}}|=\frac{(\log \log n)^{2}}{2e\log n}(1+o(1))$$ and further… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

    MSC Class: 60F10; 15B52

  29. arXiv:2506.16346  [pdf]

    cond-mat.mtrl-sci physics.app-ph

    Preferred Synthesis of Armchair SnS2 Nanotubes

    Authors: Abid, Luneng Zhao, Ju Huang, Yongjia Zheng, Yuta Sato, Qingyun Lin, Zhen Han, Chunxia Yang, Tianyu Wang, Bill Herve Nduwarugira, Yicheng Ma, Lingfeng Wang, Yige Zheng, Hang Wang, Salman Ullah, Afzal Khan, Qi Zhang, Wenbin Li, Junfeng Gao, Bingfeng Ju, Feng Ding, Yan Li, Kazu Suenaga, Shigeo Maruyama, Huayong Yang, et al. (1 additional author not shown)

    Abstract: In this work, we present the synthesis of tin disulfide (SnS2) nanotubes (NTs) with a preferred chiral angle. A sacrificial template is used to create channels of boron nitride nanotubes (BNNTs) with an optimized diameter of 4-5 nm, inside of which SnS2 NTs are formed with high yield and structural purity. Atomic resolution imaging and nano-area electron diffraction reveal that these synthesized… ▽ More

    Submitted 19 June, 2025; originally announced June 2025.

  30. arXiv:2506.16299  [pdf, ps, other]

    cs.CG cs.CV

    Wavelet-based Global Orientation and Surface Reconstruction for Point Clouds

    Authors: Yueji Ma, Yanzun Meng, Dong Xiao, Zuoqiang Shi, Bin Wang

    Abstract: Unoriented surface reconstruction is an important task in computer graphics and has extensive applications. Based on the compact support of wavelet and orthogonality properties, classic wavelet surface reconstruction achieves good and fast reconstruction. However, this method can only handle oriented points. Despite some improved attempts for unoriented points, such as iWSR, these methods perform… ▽ More

    Submitted 19 June, 2025; originally announced June 2025.

    Comments: 22 pages

  31. arXiv:2506.16067  [pdf, ps, other]

    cond-mat.mes-hall cond-mat.mtrl-sci

    Shift current in 2D Janus Transition-Metal Dichalcogenides: the role of excitons

    Authors: Yuncheng Mao, Ju Zhou, Myrta Grüning, Claudio Attaccalite

    Abstract: We study the shift current in two two-dimensional (2D) Janus transition metal dichalcogenides: molybdenum diselenide (MoSSe) and tungsten diselenide (WSSe). The shift current is evaluated using a real-time approach, in which the coupling with an external field is described in terms of a dynamical Berry phase. This approach incorporates electron-hole interactions and quasiparticle band structure re… ▽ More

    Submitted 19 June, 2025; originally announced June 2025.

  32. arXiv:2506.15672  [pdf, ps, other]

    cs.AI cs.MA

    SwarmAgentic: Towards Fully Automated Agentic System Generation via Swarm Intelligence

    Authors: Yao Zhang, Chenyang Lin, Shijie Tang, Haokun Chen, Shijie Zhou, Yunpu Ma, Volker Tresp

    Abstract: The rapid progress of Large Language Models has advanced agentic systems in decision-making, coordination, and task execution. Yet, existing agentic system generation frameworks lack full autonomy, missing from-scratch agent generation, self-optimizing agent functionality, and collaboration, limiting adaptability and scalability. We propose SwarmAgentic, a framework for fully automated agentic sys… ▽ More

    Submitted 18 June, 2025; originally announced June 2025.

    Comments: 41 pages

  33. arXiv:2506.15533  [pdf, ps, other]

    hep-ex

    Measurements of the absolute branching fractions of the doubly Cabibbo-suppressed decays $D^+\to K^+π^0$, $D^+\to K^+η$ and $D^+\to K^+η^{\prime}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai, et al. (697 additional authors not shown)

    Abstract: Using $20.3\,\rm fb^{-1}$ of $e^+e^-$ collision data collected at a center-of-mass energy of 3.773 GeV with the BESIII detector, we present improved measurements of the absolute branching fractions of the doubly Cabibbo-suppressed decays $D^+\to K^+π^0$, $D^+\to K^+η$ and $D^+\to K^+η^{\prime}$ with the double-tag method. The statistical significance of each signal decay exceeds $10σ$. The bra… ▽ More

    Submitted 18 June, 2025; originally announced June 2025.

    Comments: 20 pages, 4 figures

  34. ECFA Higgs, electroweak, and top Factory Study

    Authors: H. Abidi, J. A. Aguilar-Saavedra, S. Airen, S. Ajmal, M. Al-Thakeel, G. L. Alberghi, J. Alcaraz Maestre, J. Alimena, S. Alshamaily, J. Altmann, W. Altmannshofer, Y. Amhis, A. Amiri, A. Andreazza, S. Antusch, O. Arnaez, K. A. Assamagan, S. Aumiller, K. Azizi, P. Azzi, P. Azzurri, E. Bagnaschi, Z. Baharyioon, H. Bahl, V. Balagura, et al. (346 additional authors not shown)

    Abstract: The ECFA Higgs, electroweak, and top Factory Study ran between 2021 and 2025 as a broad effort across the experimental and theoretical particle physics communities, bringing together participants from many different proposed future collider projects. Activities across three main working groups advanced the joint development of tools and analysis techniques, fostered new considerations of detector… ▽ More

    Submitted 18 June, 2025; originally announced June 2025.

    Report number: CERN-2025-005

  35. arXiv:2506.15256  [pdf, ps, other]

    hep-ex

    Determination of $|V_{cb}|$ using $B\to D\ellν_\ell$ Decays at Belle II

    Authors: Belle II Collaboration, I. Adachi, K. Adamczyk, L. Aggarwal, H. Ahmed, Y. Ahn, H. Aihara, N. Akopov, S. Alghamdi, M. Alhakami, A. Aloisio, K. Amos, M. Angelsmark, N. Anh Ky, C. Antonioli, D. M. Asner, H. Atmacan, T. Aushev, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, N. K. Baghel, S. Bahinipati, et al. (385 additional authors not shown)

    Abstract: We present a determination of the Cabibbo-Kobayashi-Maskawa matrix element $|V_{cb}|$ from the decay $B\to D\ellν_\ell$ using a $365~\mathrm{fb}^{-1}$ $e^+e^-\toΥ(4S)\to B\bar B$ data sample recorded by the Belle II experiment at the SuperKEKB collider. The semileptonic decay of one $B$ meson is reconstructed in the modes $B^0\to D^-(\to K^+π^-π^-)\ell^+ν_\ell$ and… ▽ More

    Submitted 18 June, 2025; originally announced June 2025.

    Report number: Belle II Preprint 2025-004, KEK Preprint 2025-1

  36. arXiv:2506.15170  [pdf, ps, other]

    cs.CR

    From LLMs to MLLMs to Agents: A Survey of Emerging Paradigms in Jailbreak Attacks and Defenses within LLM Ecosystem

    Authors: Yanxu Mao, Tiehan Cui, Peipei Liu, Datao You, Hongsong Zhu

    Abstract: Large language models (LLMs) are rapidly evolving from single-modal systems to multimodal LLMs and intelligent agents, significantly expanding their capabilities while introducing increasingly severe security risks. This paper presents a systematic survey of the growing complexity of jailbreak attacks and corresponding defense mechanisms within the expanding LLM ecosystem. We first trace the devel… ▽ More

    Submitted 18 June, 2025; originally announced June 2025.

  37. arXiv:2506.14824  [pdf, ps, other]

    cs.LG cs.AI cs.MM

    FedNano: Toward Lightweight Federated Tuning for Pretrained Multimodal Large Language Models

    Authors: Yao Zhang, Hewei Gao, Haokun Chen, Weiguo Li, Yunpu Ma, Volker Tresp

    Abstract: Multimodal Large Language Models (MLLMs) excel in tasks like multimodal reasoning and cross-modal retrieval but face deployment challenges in real-world scenarios due to distributed multimodal data and strict privacy requirements. Federated Learning (FL) offers a solution by enabling collaborative model training without centralizing data. However, realizing FL for MLLMs presents significant challe… ▽ More

    Submitted 12 June, 2025; originally announced June 2025.

    Comments: 12 pages, 3 figures

  38. arXiv:2506.14766  [pdf, ps, other]

    cs.CV cs.CL

    ASCD: Attention-Steerable Contrastive Decoding for Reducing Hallucination in MLLM

    Authors: Yujun Wang, Jinhe Bi, Yunpu Ma, Soeren Pirk

    Abstract: Multimodal Large Language Models (MLLMs) often suffer from hallucinations. They over-rely on partial cues and generate incorrect responses. Recently, methods like Visual Contrastive Decoding (VCD) and Instruction Contrastive Decoding (ICD) have been proposed to mitigate hallucinations by contrasting predictions from perturbed or negatively prefixed inputs against original outputs. In this work, we u… ▽ More

    Submitted 17 June, 2025; originally announced June 2025.

    Comments: 15 pages, 7 figures

    MSC Class: 68T45

  39. arXiv:2506.14697  [pdf, ps, other]

    cs.CR cs.RO

    AGENTSAFE: Benchmarking the Safety of Embodied Agents on Hazardous Instructions

    Authors: Aishan Liu, Zonghao Ying, Le Wang, Junjie Mu, Jinyang Guo, Jiakai Wang, Yuqing Ma, Siyuan Liang, Mingchuan Zhang, Xianglong Liu, Dacheng Tao

    Abstract: The rapid advancement of vision-language models (VLMs) and their integration into embodied agents have unlocked powerful capabilities for decision-making. However, as these systems are increasingly deployed in real-world environments, they face mounting safety concerns, particularly when responding to hazardous instructions. In this work, we propose AGENTSAFE, the first comprehensive benchmark for… ▽ More

    Submitted 17 June, 2025; originally announced June 2025.

    Comments: 11 pages

  40. arXiv:2506.14531  [pdf, ps, other]

    stat.AP stat.ME

    A statistical framework for dynamic cognitive diagnosis in digital learning environments

    Authors: Yawen Ma, Anastasia Ushakova, Kate Cain, Gabriel Wallin

    Abstract: Reading is foundational for educational, employment, and economic outcomes, but a persistent proportion of students globally struggle to develop adequate reading skills. Some countries promote digital tools to support reading development, alongside regular classroom instruction. Such tools generate rich log data capturing students' behaviour and performance. This study proposes a dynamic cognitive… ▽ More

    Submitted 17 June, 2025; originally announced June 2025.

  41. arXiv:2506.14418  [pdf, ps, other]

    cs.CV cs.AI

    Compositional Attribute Imbalance in Vision Datasets

    Authors: Jiayi Chen, Yanbiao Ma, Andi Zhang, Weidong Tang, Wei Dai, Bowei Liu

    Abstract: Visual attribute imbalance is a common yet underexplored issue in image classification, significantly impacting model performance and generalization. In this work, we first define the first-level and second-level attributes of images and then introduce a CLIP-based framework to construct a visual attribute dictionary, enabling automatic evaluation of image attributes. By systematically analyzing b… ▽ More

    Submitted 17 June, 2025; originally announced June 2025.

  42. arXiv:2506.14407  [pdf, ps, other]

    cs.CL cs.AI

    ImpliRet: Benchmarking the Implicit Fact Retrieval Challenge

    Authors: Zeinab Sadat Taghavi, Ali Modarressi, Yunpu Ma, Hinrich Schütze

    Abstract: Retrieval systems are central to many NLP pipelines, but often rely on surface-level cues such as keyword overlap and lexical semantic similarity. To evaluate retrieval beyond these shallow signals, recent benchmarks introduce reasoning-heavy queries; however, they primarily shift the burden to query-side processing techniques -- like prompting or multi-hop retrieval -- that can help resolve compl…

    Submitted 17 June, 2025; originally announced June 2025.

  43. arXiv:2506.14358  [pdf, ps, other]

    quant-ph

    Temperature dependent single- and double-quantum relaxation of negatively charged boron vacancies in hexagonal boron nitride

    Authors: Lin-Ke Xie, Wei Liu, Kaiyu Huang, Nai-Jie Guo, Jun-You Liu, Yu-Hang Ma, Ya-Qi Wu, Yi-Tao Wang, Zhao-an Wang, Xiao-Dong Zeng, Jia-Ming Ren, Chun Ao, Shuo Deng, Haifei Lu, Jian-Shun Tang, Chuan-Feng Li, Guang-Can Guo

    Abstract: The negatively charged boron vacancy in two-dimensional hexagonal boron nitride has emerged as a promising candidate for quantum sensing. The coherence time of these defect spins, which underpins coherent quantum sensing, is limited by spin-phonon interactions, while the underlying physical mechanism of the corresponding high-temperature behavior is still not fully understood. Here, we probe the single…

    Submitted 17 June, 2025; originally announced June 2025.

  44. arXiv:2506.14315  [pdf, ps, other]

    cs.GR cs.CV

    ImmerseGen: Agent-Guided Immersive World Generation with Alpha-Textured Proxies

    Authors: Jinyan Yuan, Bangbang Yang, Keke Wang, Panwang Pan, Lin Ma, Xuehai Zhang, Xiao Liu, Zhaopeng Cui, Yuewen Ma

    Abstract: Automatic creation of 3D scenes for immersive VR presence has been a significant research focus for decades. However, existing methods often rely on either high-poly mesh modeling with post-hoc simplification or massive 3D Gaussians, resulting in a complex pipeline or limited visual realism. In this paper, we demonstrate that such exhaustive modeling is unnecessary for achieving compelling immersi…

    Submitted 18 June, 2025; v1 submitted 17 June, 2025; originally announced June 2025.

    Comments: Project webpage: https://immersegen.github.io

  45. FEWSim: A Visual Analytic Framework for Exploring the Nexus of Food-Energy-Water Simulations

    Authors: Fan Lei, David A. Sampson, Jiayi Hong, Yuxin Ma, Giuseppe Mascaro, Dave White, Rimjhim Agarwal, Ross Maciejewski

    Abstract: The interdependencies of food, energy, and water (FEW) systems create a nexus opportunity to explore the strengths and vulnerabilities of individual and cross-sector interactions within FEW systems. However, the variables quantifying nexus interactions are hard to observe, which hinders the cross-sector analysis. To overcome such challenges, we present FEWSim, a visual analytics framework designed…

    Submitted 16 June, 2025; originally announced June 2025.

    Comments: Accepted by IEEE Computer Graphics and Applications (CG&A)

  46. arXiv:2506.14005  [pdf]

    astro-ph.CO

    MeerKAT HI observations of Low Surface Brightness/Ultradiffuse Galaxy Candidates Projected around Two Southern Loose Groups

    Authors: Chandreyee Sengupta, Tom C. Scott, Hao Chen, Hyein Yoon, Yogesh Chandola, Mengtian Li, Gyula I. G. Józsa, O. Ivy Wong, Yin-Zhe Ma, Patricio Lagos, Ruta Kale, Denis Tramonte

    Abstract: A large catalogue of low surface brightness galaxies (LSBGs) from the Dark Energy Survey showed significant clustering around nearby galaxy groups and clusters. Using the HIPASS survey, we tried to determine the redshift of a sub-sample of these LSBGs and determine whether they were members of the groups they were projected near, but this was hampered by HIPASS's high spectral rms. This letter rep…

    Submitted 16 June, 2025; originally announced June 2025.

    Comments: Accepted in ApJ Letters

  47. arXiv:2506.13737  [pdf, ps, other]

    cs.CR

    ExtendAttack: Attacking Servers of LRMs via Extending Reasoning

    Authors: Zhenhao Zhu, Yue Liu, Yingwei Ma, Hongcheng Gao, Nuo Chen, Yanpei Guo, Wenjie Qu, Huiying Xu, Xinzhong Zhu, Jiaheng Zhang

    Abstract: Large Reasoning Models (LRMs) have demonstrated promising performance in complex tasks. However, the resource-consuming reasoning processes may be exploited by attackers to maliciously occupy the resources of the servers, leading to a crash, much like a DDoS attack in cybersecurity. To this end, we propose a novel attack method on LRMs termed ExtendAttack to maliciously occupy the resources of servers by ste…

    Submitted 16 June, 2025; originally announced June 2025.

  48. arXiv:2506.13721  [pdf, ps, other]

    cond-mat.mtrl-sci cond-mat.str-el cond-mat.supr-con

    Catalogue of chiral phonon materials

    Authors: Yue Yang, Zhenyu Xiao, Yu Mao, Zhanghuan Li, Zhenyang Wang, Tianqi Deng, Yanhao Tang, Zhi-Da Song, Yuan Li, Huiqiu Yuan, Ming Shi, Yuanfeng Xu

    Abstract: Chiral phonons, circularly polarized lattice vibrations carrying intrinsic angular momentum, offer unprecedented opportunities for controlling heat flow, manipulating quantum states through spin-phonon coupling, and realizing exotic transport phenomena. Despite their fundamental importance, a universal framework for identifying and classifying these elusive excitations has remained out of reach. H…

    Submitted 16 June, 2025; originally announced June 2025.

    Comments: 163 pages, 4+173 figures. The Chiral Phonon Materials Database can be accessed at https://materialsfingerprint.com

  49. arXiv:2506.13585  [pdf, ps, other]

    cs.CL cs.LG

    MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

    Authors: MiniMax: Aili Chen, Aonian Li, Bangwei Gong, Binyang Jiang, Bo Fei, Bo Yang, Boji Shan, Changqing Yu, Chao Wang, Cheng Zhu, Chengjun Xiao, Chengyu Du, Chi Zhang, Chu Qiao, Chunhao Zhang, Chunhui Du, Congchao Guo, Da Chen, Deming Ding, Dianjun Sun, Dong Li, Enwei Jiao, Haigang Zhou, et al. (103 additional authors not shown)

    Abstract: We introduce MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model. MiniMax-M1 is powered by a hybrid Mixture-of-Experts (MoE) architecture combined with a lightning attention mechanism. The model is developed based on our previous MiniMax-Text-01 model, which contains a total of 456 billion parameters with 45.9 billion parameters activated per token. The M1 model…

    Submitted 16 June, 2025; originally announced June 2025.

    Comments: A technical report from MiniMax. The authors are listed in alphabetical order. We open-source our MiniMax-M1 at https://github.com/MiniMax-AI/MiniMax-M1

  50. arXiv:2506.13558  [pdf, ps, other]

    cs.CV

    X-Scene: Large-Scale Driving Scene Generation with High Fidelity and Flexible Controllability

    Authors: Yu Yang, Alan Liang, Jianbiao Mei, Yukai Ma, Yong Liu, Gim Hee Lee

    Abstract: Diffusion models are advancing autonomous driving by enabling realistic data synthesis, predictive end-to-end planning, and closed-loop simulation, with a primary focus on temporally consistent generation. However, the generation of large-scale 3D scenes that require spatial coherence remains underexplored. In this paper, we propose X-Scene, a novel framework for large-scale driving scene generati…

    Submitted 16 June, 2025; originally announced June 2025.

    Comments: 28 pages, 9 figures, Project page at https://x-scene.github.io/