Skip to main content

Showing 51–100 of 4,775 results for author: Hu, Y

.
  1. arXiv:2504.19302  [pdf, other

    physics.flu-dyn physics.ao-ph

    Wave Energy Is Conserved in a Spatially Varying and Inhomogeneously Moving Medium

    Authors: Zhaohua Wu, Jie Sun, Zhe-Min Tan, Ming Cai, Yongyun Hu, Norden E. Huang

    Abstract: Waves are propagating disturbances that redistribute energy across space. Previous studies have shown that for waves propagating through an inhomogeneously moving mean flow, the conserved quantity is wave action rather than wave energy, raising questions about the validity of energy conservation, which is one of the foundational principles of physics. In this study, we prove that wave action conse… ▽ More

    Submitted 27 April, 2025; originally announced April 2025.

    Comments: 25 pages, 5 figures

  2. arXiv:2504.19213  [pdf, other

    hep-ex

    Measurements of branching fractions of $D^0\to K^- 3π^+2π^-$, $D^0\to K^- 2π^+π^-2π^0$ and $D^+\to K^- 3π^+π^-π^0$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (693 additional authors not shown)

    Abstract: Utilizing $7.9\,\rm fb^{-1}$ of $e^+e^-$ collision data taken with the BESIII detector at the center-of-mass energy of 3.773 GeV, we report the measurements of absolute branching fractions of the hadronic decays $D^0\to K^- 3π^+2π^-$, $D^0\to K^- 2π^+π^-2π^0$ and $D^+\to K^- 3π^+π^-π^0$. The $D^0\to K^- 3π^+2π^-$ decay is measured with improved precision, while the latter two decays are observed w… ▽ More

    Submitted 27 April, 2025; originally announced April 2025.

    Comments: 12pages, 6 figures, 4 tables

    Report number: BAM-00843

  3. arXiv:2504.19099  [pdf, other

    cs.SE cs.AI cs.AR

    VeriDebug: A Unified LLM for Verilog Debugging via Contrastive Embedding and Guided Correction

    Authors: Ning Wang, Bingkun Yao, Jie Zhou, Yuchen Hu, Xi Wang, Nan Guan, Zhe Jiang

    Abstract: Large Language Models (LLMs) have demonstrated remarkable potential in debugging for various programming languages. However, the application of LLMs to Verilog debugging remains insufficiently explored. Here, we present VeriDebug, an approach that integrates contrastive representation and guided correction capabilities for automated Verilog debugging. Unlike existing methods, VeriDebug employs an… ▽ More

    Submitted 27 April, 2025; originally announced April 2025.

  4. arXiv:2504.19087  [pdf, ps, other

    hep-ex

    Search for $η_{1}(1855)$ in $χ_{cJ}\toηηη^{\prime}$ decays

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (697 additional authors not shown)

    Abstract: Based on a sample of $2.7\times10^{9}$ $ψ(3686)$ events collected by the BESIII detector operating at the BEPCII collider, an analysis of the decay $ψ(3686)\toγχ_{cJ}, χ_{cJ}\toηηη^{\prime}$ is performed. The decay modes $χ_{c1}$ and $χ_{c2}\toηηη^{\prime}$ are observed for the first time, and their corresponding branching fractions are determined to be… ▽ More

    Submitted 26 April, 2025; originally announced April 2025.

  5. arXiv:2504.18926  [pdf, other

    cond-mat.mes-hall quant-ph

    Critical Non-Hermitian Edge Modes

    Authors: Kunling Zhou, Zihe Yang, Bowen Zeng, Yong Hu

    Abstract: We unveil a unique critical phenomenon of topological edge modes in non-Hermitian systems, dubbed the critical non-Hermitian edge modes (CNHEM). Specifically, in the thermodynamic limit, the eigenvectors of edge modes jump discontinuously under infinitesimal on-site staggered perturbations. The CNHEM arises from the competition between the introduced on-site staggered potentials and size-dependent… ▽ More

    Submitted 26 April, 2025; originally announced April 2025.

    Comments: 5 pages, 2 figures

  6. arXiv:2504.18509  [pdf, other

    cs.CV

    Eval3D: Interpretable and Fine-grained Evaluation for 3D Generation

    Authors: Shivam Duggal, Yushi Hu, Oscar Michel, Aniruddha Kembhavi, William T. Freeman, Noah A. Smith, Ranjay Krishna, Antonio Torralba, Ali Farhadi, Wei-Chiu Ma

    Abstract: Despite the unprecedented progress in the field of 3D generation, current systems still often fail to produce high-quality 3D assets that are visually appealing and geometrically and semantically consistent across multiple viewpoints. To effectively assess the quality of the generated 3D data, there is a need for a reliable 3D evaluation tool. Unfortunately, existing 3D evaluation metrics often ov… ▽ More

    Submitted 25 April, 2025; originally announced April 2025.

    Comments: CVPR 2025. Project page and codes: https://eval3d.github.io/

  7. arXiv:2504.17789  [pdf, other

    cs.CV

    Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models

    Authors: Xu Ma, Peize Sun, Haoyu Ma, Hao Tang, Chih-Yao Ma, Jialiang Wang, Kunpeng Li, Xiaoliang Dai, Yujun Shi, Xuan Ju, Yushi Hu, Artsiom Sanakoyeu, Felix Juefei-Xu, Ji Hou, Junjiao Tian, Tao Xu, Tingbo Hou, Yen-Cheng Liu, Zecheng He, Zijian He, Matt Feiszli, Peizhao Zhang, Peter Vajda, Sam Tsai, Yun Fu

    Abstract: Autoregressive (AR) models, long dominant in language generation, are increasingly applied to image synthesis but are often considered less competitive than Diffusion-based models. A primary limitation is the substantial number of image tokens required for AR models, which constrains both training and inference efficiency, as well as image resolution. To address this, we present Token-Shuffle, a n… ▽ More

    Submitted 27 April, 2025; v1 submitted 24 April, 2025; originally announced April 2025.

    Comments: Project Page: https://ma-xu.github.io/token-shuffle/ Add related works

  8. arXiv:2504.17705  [pdf, other

    cs.HC

    LUIDA: Large-scale Unified Infrastructure for Digital Assessments based on Commercial Metaverse Platform

    Authors: Yong-Hao Hu, Sotaro Yokoi, Yuji Hatada, Yuichi Hiroi, Takuji Narumi, Takefumi Hiraki

    Abstract: Online experiments using metaverse platforms have gained significant traction in Human-Computer Interaction and Virtual Reality (VR) research. However, current research workflows are highly fragmented, as researchers must use separate tools for system implementation, participant recruitment, experiment execution, and data collection, reducing consistency and increasing workload. We present LUIDA (… ▽ More

    Submitted 24 April, 2025; originally announced April 2025.

  9. arXiv:2504.17533  [pdf, other

    gr-qc astro-ph.CO hep-th

    Relic gravitational waves from cosmological horizon radiation during de Sitter period: as zero-order approximation of inflation

    Authors: Chen-Hao Wu, Xiao Liang, Ya-Peng Hu

    Abstract: It is well known that the event horizon of the de Sitter universe can produce particles, and one can get sizable Hawking radiation by considering inflationary phases as de Sitter spacetimes with large Hubble rates. In this compact paper, we consider the graviton emission part of these radiations and assume that these graviton signals can exist in the current universe in the form of gravitational w… ▽ More

    Submitted 24 April, 2025; originally announced April 2025.

    Comments: 7 pages, 2 figures

  10. arXiv:2504.17034  [pdf, other

    astro-ph.HE

    An extremely soft and weak fast X-ray transient associated with a luminous supernova

    Authors: W. -X. Li, Z. -P. Zhu, X. -Z. Zou, J. -J. Geng, L. -D. Liu, Y. -H. Wang, R. -Z. Li, D. Xu, H. Sun, X. -F. Wang, Y. -W. Yu, B. Zhang, X. -F. Wu, Y. Yang, A. V. Filippenko, X. -W. Liu, W. -M. Yuan, D. Aguado, J. An, T. An, D. A. H. Buckley, A. J. Castro-Tirado, S. -Y. Fu, J. P. U. Fynbo, D. A. Howell , et al. (80 additional authors not shown)

    Abstract: Long gamma-ray bursts (LGRBs), including their subclasses of low-luminosity GRBs (LL-GRBs) and X-ray flashes (XRFs) characterized by low spectral peak energies, are known to be associated with broad-lined Type Ic supernovae (SNe Ic-BL), which result from the core collapse of massive stars that lose their outer hydrogen and helium envelopes. However, the soft and weak end of the GRB/XRF population… ▽ More

    Submitted 23 April, 2025; originally announced April 2025.

    Comments: 54 pages, 10 figures, submitted

  11. arXiv:2504.16789  [pdf, other

    econ.EM stat.AP

    MLOps Monitoring at Scale for Digital Platforms

    Authors: Yu Jeffrey Hu, Jeroen Rombouts, Ines Wilms

    Abstract: Machine learning models are widely recognized for their strong performance in forecasting. To keep that performance in streaming data settings, they have to be monitored and frequently re-trained. This can be done with machine learning operations (MLOps) techniques under supervision of an MLOps engineer. However, in digital platform settings where the number of data streams is typically large and… ▽ More

    Submitted 23 April, 2025; originally announced April 2025.

  12. arXiv:2504.16214  [pdf, other

    cs.LG cs.AI cs.PL

    Hexcute: A Tile-based Programming Language with Automatic Layout and Task-Mapping Synthesis

    Authors: Xiao Zhang, Yaoyao Ding, Yang Hu, Gennady Pekhimenko

    Abstract: Deep learning (DL) workloads mainly run on accelerators like GPUs. Recent DL quantization techniques demand a new matrix multiplication operator with mixed input data types, further complicating GPU optimization. Prior high-level compilers like Triton lack the expressiveness to implement key optimizations like fine-grained data pipelines and hardware-friendly memory layouts for these operators, wh… ▽ More

    Submitted 30 April, 2025; v1 submitted 22 April, 2025; originally announced April 2025.

    Comments: 17 pages, 24 figures

  13. arXiv:2504.16074  [pdf, other

    cs.CL

    PHYBench: Holistic Evaluation of Physical Perception and Reasoning in Large Language Models

    Authors: Shi Qiu, Shaoyang Guo, Zhuo-Yang Song, Yunbo Sun, Zeyu Cai, Jiashen Wei, Tianyu Luo, Yixuan Yin, Haoxu Zhang, Yi Hu, Chenyang Wang, Chencheng Tang, Haoling Chang, Qi Liu, Ziheng Zhou, Tianyu Zhang, Jingtian Zhang, Zhangyi Liu, Minghao Li, Yuku Zhang, Boxuan Jing, Xianqi Yin, Yutong Ren, Zizhuo Fu, Weike Wang , et al. (27 additional authors not shown)

    Abstract: We introduce PHYBench, a novel, high-quality benchmark designed for evaluating reasoning capabilities of large language models (LLMs) in physical contexts. PHYBench consists of 500 meticulously curated physics problems based on real-world physical scenarios, designed to assess the ability of models to understand and reason about realistic physical processes. Covering mechanics, electromagnetism, t… ▽ More

    Submitted 22 April, 2025; originally announced April 2025.

    Comments: 21 pages ,8 figures, 4 tables

  14. arXiv:2504.16068  [pdf, other

    physics.comp-ph cs.LG physics.chem-ph

    High-performance training and inference for deep equivariant interatomic potentials

    Authors: Chuin Wei Tan, Marc L. Descoteaux, Mit Kotak, Gabriel de Miranda Nascimento, Seán R. Kavanagh, Laura Zichi, Menghang Wang, Aadit Saluja, Yizhong R. Hu, Tess Smidt, Anders Johansson, William C. Witt, Boris Kozinsky, Albert Musaelian

    Abstract: Machine learning interatomic potentials, particularly those based on deep equivariant neural networks, have demonstrated state-of-the-art accuracy and computational efficiency in atomistic modeling tasks like molecular dynamics and high-throughput screening. The size of datasets and demands of downstream workflows are growing rapidly, making robust and scalable software essential. This work presen… ▽ More

    Submitted 22 April, 2025; originally announced April 2025.

  15. arXiv:2504.15956  [pdf, other

    cs.LG cs.AI stat.ML

    Universal Approximation with Softmax Attention

    Authors: Jerry Yao-Chieh Hu, Hude Liu, Hong-Yu Chen, Weimin Wu, Han Liu

    Abstract: We prove that with linear transformations, both (i) two-layer self-attention and (ii) one-layer self-attention followed by a softmax function are universal approximators for continuous sequence-to-sequence functions on compact domains. Our main technique is a new interpolation-based method for analyzing attention's internal mechanism. This leads to our key insight: self-attention is able to approx… ▽ More

    Submitted 22 April, 2025; originally announced April 2025.

  16. arXiv:2504.15804  [pdf, other

    cs.AR cs.AI

    Insights from Verification: Training a Verilog Generation LLM with Reinforcement Learning with Testbench Feedback

    Authors: Ning Wang, Bingkun Yao, Jie Zhou, Yuchen Hu, Xi Wang, Nan Guan, Zhe Jiang

    Abstract: Large language models (LLMs) have shown strong performance in Verilog generation from natural language description. However, ensuring the functional correctness of the generated code remains a significant challenge. This paper introduces a method that integrates verification insights from testbench into the training of Verilog generation LLMs, aligning the training with the fundamental goal of har… ▽ More

    Submitted 22 April, 2025; originally announced April 2025.

  17. arXiv:2504.15677  [pdf, ps, other

    math.DG

    Affine isoperimetric type inequalities for static convex domains in hyperbolic space

    Authors: Yingxiang Hu, Haizhong Li, Yao Wan, Botong Xu

    Abstract: In this paper, the notion of hyperbolic ellipsoids in hyperbolic space is introduced. Using a natural orthogonal projection from hyperbolic space to Euclidean space, we establish affine isoperimetric type inequalities for static convex domains in hyperbolic space. Moreover, equality of such inequalities is characterized by these hyperbolic ellipsoids.

    Submitted 22 April, 2025; originally announced April 2025.

    Comments: 18 pages, 1 figure. Comments are welcome

    MSC Class: 52A40; 53C24; 53A15

    Journal ref: Journal of Mathematical Study, Vol. 58 (2025), Iss. 1 : pp. 62-81

  18. arXiv:2504.15281  [pdf, other

    cs.CV

    StyleMe3D: Stylization with Disentangled Priors by Multiple Encoders on 3D Gaussians

    Authors: Cailin Zhuang, Yaoqi Hu, Xuanyang Zhang, Wei Cheng, Jiacheng Bao, Shengqi Liu, Yiying Yang, Xianfang Zeng, Gang Yu, Ming Li

    Abstract: 3D Gaussian Splatting (3DGS) excels in photorealistic scene reconstruction but struggles with stylized scenarios (e.g., cartoons, games) due to fragmented textures, semantic misalignment, and limited adaptability to abstract aesthetics. We propose StyleMe3D, a holistic framework for 3D GS style transfer that integrates multi-modal style conditioning, multi-level semantic alignment, and perceptual… ▽ More

    Submitted 21 April, 2025; originally announced April 2025.

    Comments: 16 pages; Project page: https://styleme3d.github.io/

  19. arXiv:2504.14558  [pdf, other

    cond-mat.str-el cond-mat.mes-hall

    A Review on the Applications of Density Functional Theory to the FQH System

    Authors: Yi Yang, Yayun Hu, Zi-Xiang Hu

    Abstract: The fractional quantum Hall (FQH) effect remains a captivating area in condensed matter physics, characterized by strongly correlated topological order, fractionalized excitations, and anyonic statistics. Numerical simulations, such as exact diagonalization, density matrix renormalization group, matrix product states, and Monte Carlo methods, are essential to examine the properties of strongly cor… ▽ More

    Submitted 20 April, 2025; originally announced April 2025.

    Comments: 11 pages, 11 figures

  20. arXiv:2504.14308  [pdf, other

    math.NA

    The Schur complements for $SDD_{1}$ matrices and their application to linear complementarity problems

    Authors: Yang Hu, Jianzhou Liu, Wenlong Zeng

    Abstract: In this paper we propose a new scaling method to study the Schur complements of $SDD_{1}$ matrices. Its core is related to the non-negative property of the inverse $M$-matrix, while numerically improving the Quotient formula. Based on the Schur complement and a novel norm splitting manner, we establish an upper bound for the infinity norm of the inverse of $SDD_{1}$ matrices, which depends solely… ▽ More

    Submitted 19 April, 2025; originally announced April 2025.

    Comments: 26pages

  21. arXiv:2504.14174  [pdf, other

    cs.LG cs.AI

    A Physics-guided Multimodal Transformer Path to Weather and Climate Sciences

    Authors: Jing Han, Hanting Chen, Kai Han, Xiaomeng Huang, Yongyun Hu, Wenjun Xu, Dacheng Tao, Ping Zhang

    Abstract: With the rapid development of machine learning in recent years, many problems in meteorology can now be addressed using AI models. In particular, data-driven algorithms have significantly improved accuracy compared to traditional methods. Meteorological data is often transformed into 2D images or 3D videos, which are then fed into AI models for learning. Additionally, these models often incorporat… ▽ More

    Submitted 19 April, 2025; originally announced April 2025.

    Comments: Perspective article

  22. arXiv:2504.13950  [pdf, other

    cs.LG cs.AI

    Open-Medical-R1: How to Choose Data for RLVR Training at Medicine Domain

    Authors: Zhongxi Qiu, Zhang Zhang, Yan Hu, Heng Li, Jiang Liu

    Abstract: This paper explores optimal data selection strategies for Reinforcement Learning with Verified Rewards (RLVR) training in the medical domain. While RLVR has shown exceptional potential for enhancing reasoning capabilities in large language models, most prior implementations have focused on mathematics and logical puzzles, with limited exploration of domain-specific applications like medicine. We i… ▽ More

    Submitted 16 April, 2025; originally announced April 2025.

    Comments: 15 figures

  23. arXiv:2504.13916  [pdf, other

    cs.HC cs.RO

    Task Matters: Investigating Human Questioning Behavior in Different Household Service for Learning by Asking Robots

    Authors: Yuanda Hu, Hou Jiani, Zhang Junyu, Yate Ge, Xiaohua Sun, Weiwei Guo

    Abstract: Learning by Asking (LBA) enables robots to identify knowledge gaps during task execution and acquire the missing information by asking targeted questions. However, different tasks often require different types of questions, and how to adapt questioning strategies accordingly remains underexplored. This paper investigates human questioning behavior in two representative household service tasks: a G… ▽ More

    Submitted 10 April, 2025; originally announced April 2025.

  24. arXiv:2504.13771  [pdf, other

    hep-ex

    Search for $J/ψ\rightarrow K^{0}_{S}K^{0}_{S}$ and $ψ(3686)\rightarrow K^{0}_{S}K^{0}_{S}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (680 additional authors not shown)

    Abstract: Using data samples of $(10087\pm 44)\times10^{6}$ $J/ψ$ events and $(2712.4\pm 14.3)\times10^{6}$ $ψ(3686)$ events collected with the BESIII detector at the BEPCII collider, we search for the CP violating decays $J/ψ\rightarrow K^{0}_{S}K^{0}_{S}$ and $ψ(3686)\rightarrow K^{0}_{S}K^{0}_{S}$. No significant signals are observed over the expected background yields. The upper limits on their branchin… ▽ More

    Submitted 18 April, 2025; originally announced April 2025.

  25. arXiv:2504.13616  [pdf, other

    quant-ph physics.atom-ph

    Realizing exceptional points by Floquet dissipative couplings in thermal atoms

    Authors: Zimo Zhang, Fengbo Zhang, Zhongxiao Xu, Ying Hu, Han Bao, Heng Shen

    Abstract: Exceptional degeneracies and generically complex spectra of non-Hermitian systems are at the heart of numerous phenomena absent in the Hermitian realm. Recently, it was suggested that Floquet dissipative coupling in the space-time domain may provide a novel mechanism to drive intriguing spectral topology with no static analogues, though its experimental investigation in quantum systems remains elu… ▽ More

    Submitted 18 April, 2025; originally announced April 2025.

    Comments: 6 pages, 4 figures

    Journal ref: Phys. Rev. Lett. 133, 133601 (2024)

  26. arXiv:2504.13539  [pdf, other

    hep-ex

    Search for $1^{-+}$ charmonium-like hybrid via $e^{+}e^{-}\rightarrow γη^{(\prime)} η_{c}$ at center-of-mass energies between 4.258 and 4.681 GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (696 additional authors not shown)

    Abstract: Using $e^{+}e^{-}$ collision data corresponding to an integrated luminosity of 10.6 fb$^{-1}$ collected at center-of-mass energies between 4.258 and 4.681 GeV with the BESIII detector at the BEPCII collider, we search for the $1^{- +}$ charmonium-like hybrid via $e^{+}e^{-}\rightarrowγηη_{c}$ and $e^{+}e^{-}\rightarrowγη^{\prime}η_{c}$ decays for the first time. No significant signal is observed a… ▽ More

    Submitted 18 April, 2025; originally announced April 2025.

  27. arXiv:2504.13413  [pdf, other

    cs.LG cs.RO eess.SY

    A Model-Based Approach to Imitation Learning through Multi-Step Predictions

    Authors: Haldun Balim, Yang Hu, Yuyang Zhang, Na Li

    Abstract: Imitation learning is a widely used approach for training agents to replicate expert behavior in complex decision-making tasks. However, existing methods often struggle with compounding errors and limited generalization, due to the inherent challenge of error correction and the distribution shift between training and deployment. In this paper, we present a novel model-based imitation learning fram… ▽ More

    Submitted 17 April, 2025; originally announced April 2025.

  28. arXiv:2504.12983  [pdf, other

    astro-ph.IM gr-qc

    Adaptive Modeling of Correlated Noise in Space-Based Gravitational Wave Detectors

    Authors: Ya-Nan Li, Yi-Ming Hu, En-Kun Li

    Abstract: Accurately estimating the statistical properties of noise is important in space-based gravitational wave data analysis. Traditional methods often assume uncorrelated noise or impose restrictive parametric forms on cross-channel correlations, which could lead to biased estimation in complex instrumental noise. This paper introduces a spline-based framework with trans-dimensional Bayesian inference… ▽ More

    Submitted 17 April, 2025; originally announced April 2025.

    Comments: 5 figures, submitted

  29. arXiv:2504.12754  [pdf, other

    quant-ph

    A tight consecutive measurement theorem and its applications

    Authors: Chen-Xun Weng, Minglong Qin, Yanglin Hu, Marco Tomamichel

    Abstract: In many cryptographic tasks, we encounter scenarios where information about two incompatible observables must be retrieved. A natural approach is to perform consecutive measurements, raising a key question: How does the information gained from the first measurement compare to that from both? The consecutive measurement theorem provides a general relation between these quantities and has been used… ▽ More

    Submitted 17 April, 2025; originally announced April 2025.

  30. arXiv:2504.12285  [pdf, other

    cs.CL cs.LG

    BitNet b1.58 2B4T Technical Report

    Authors: Shuming Ma, Hongyu Wang, Shaohan Huang, Xingxing Zhang, Ying Hu, Ting Song, Yan Xia, Furu Wei

    Abstract: We introduce BitNet b1.58 2B4T, the first open-source, native 1-bit Large Language Model (LLM) at the 2-billion parameter scale. Trained on a corpus of 4 trillion tokens, the model has been rigorously evaluated across benchmarks covering language understanding, mathematical reasoning, coding proficiency, and conversational ability. Our results demonstrate that BitNet b1.58 2B4T achieves performanc… ▽ More

    Submitted 24 April, 2025; v1 submitted 16 April, 2025; originally announced April 2025.

    Comments: Work in progress

  31. arXiv:2504.11967  [pdf, other

    cs.CV cs.AI cs.RO

    Securing the Skies: A Comprehensive Survey on Anti-UAV Methods, Benchmarking, and Future Directions

    Authors: Yifei Dong, Fengyi Wu, Sanjian Zhang, Guangyu Chen, Yuzhi Hu, Masumi Yano, Jingdong Sun, Siyu Huang, Feng Liu, Qi Dai, Zhi-Qi Cheng

    Abstract: Unmanned Aerial Vehicles (UAVs) are indispensable for infrastructure inspection, surveillance, and related tasks, yet they also introduce critical security challenges. This survey provides a wide-ranging examination of the anti-UAV domain, centering on three core objectives-classification, detection, and tracking-while detailing emerging methodologies such as diffusion-based data synthesis, multi-… ▽ More

    Submitted 17 April, 2025; v1 submitted 16 April, 2025; originally announced April 2025.

    Comments: Accepted at CVPR Workshop Anti-UAV 2025. 15 pages

  32. arXiv:2504.11354  [pdf, other

    cs.AI

    Kimina-Prover Preview: Towards Large Formal Reasoning Models with Reinforcement Learning

    Authors: Haiming Wang, Mert Unsal, Xiaohan Lin, Mantas Baksys, Junqi Liu, Marco Dos Santos, Flood Sung, Marina Vinyes, Zhenzhe Ying, Zekai Zhu, Jianqiao Lu, Hugues de Saxcé, Bolton Bailey, Chendong Song, Chenjun Xiao, Dehao Zhang, Ebony Zhang, Frederick Pu, Han Zhu, Jiawei Liu, Jonas Bayer, Julien Michel, Longhui Yu, Léo Dreyfus-Schmidt, Lewis Tunstall , et al. (15 additional authors not shown)

    Abstract: We introduce Kimina-Prover Preview, a large language model that pioneers a novel reasoning-driven exploration paradigm for formal theorem proving, as showcased in this preview release. Trained with a large-scale reinforcement learning pipeline from Qwen2.5-72B, Kimina-Prover demonstrates strong performance in Lean 4 proof generation by employing a structured reasoning pattern we term \textit{forma… ▽ More

    Submitted 15 April, 2025; originally announced April 2025.

    Comments: 22 pages

  33. arXiv:2504.11023  [pdf, other

    math.OC

    An Inexact Variable Metric Proximal Gradient-subgradient Algorithm for a Class of Fractional Optimization Problems

    Authors: Lei Yang, Xiangrui Kong, Min Zhang, Yaohua Hu

    Abstract: In this paper, we study a class of fractional optimization problems, in which the numerator of the objective is the sum of a convex function and a differentiable function with a Lipschitz continuous gradient, while the denominator is a nonsmooth convex function. This model has broad applicability and encompasses several important optimization problems in the literature. To address these problems,… ▽ More

    Submitted 15 April, 2025; originally announced April 2025.

    Comments: arXiv admin note: text overlap with arXiv:2406.04646

  34. arXiv:2504.11004  [pdf, other

    cs.CL cs.AI

    Dynamic Compressing Prompts for Efficient Inference of Large Language Models

    Authors: Jinwu Hu, Wei Zhang, Yufeng Wang, Yu Hu, Bin Xiao, Mingkui Tan, Qing Du

    Abstract: Large Language Models (LLMs) have shown outstanding performance across a variety of tasks, partly due to advanced prompting techniques. However, these techniques often require lengthy prompts, which increase computational costs and can hinder performance because of the limited context windows of LLMs. While prompt compression is a straightforward solution, existing methods confront the challenges… ▽ More

    Submitted 15 April, 2025; originally announced April 2025.

    Comments: Under review (submited in 2024.11)

  35. arXiv:2504.10867  [pdf, other

    hep-ex

    Precise measurement of the form factors in $D^0\rightarrow K^*(892)^-μ^+ν_μ$ and test of lepton universality with $D^0\rightarrow K^*(892)^-\ell^+ν_{\ell}$ decays

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (696 additional authors not shown)

    Abstract: We report a study of the semileptonic decay $D^0 \rightarrow \bar{K}^0π^-μ^+ν_μ$ based on a sample of $7.9~\mathrm{fb}^{-1}$ of $e^+e^-$ annihilation data collected at a center-of-mass energy of 3.773~GeV with the BESIII detector at the BEPCII collider. The branching fraction of the decay is measured for the first time to be… ▽ More

    Submitted 15 April, 2025; originally announced April 2025.

    Comments: 9 pages, 4 figures

  36. arXiv:2504.10418  [pdf, other

    cs.CL

    CliniChat: A Multi-Source Knowledge-Driven Framework for Clinical Interview Dialogue Reconstruction and Evaluation

    Authors: Jing Chen, Zhihua Wei, Wei Zhang, Yingying Hu, Qiong Zhang

    Abstract: Large language models (LLMs) hold great promise for assisting clinical interviews due to their fluent interactive capabilities and extensive medical knowledge. However, the lack of high-quality interview dialogue data and widely accepted evaluation methods has significantly impeded this process. So we propose CliniChat, a framework that integrates multi-source knowledge to enable LLMs to simulate… ▽ More

    Submitted 14 April, 2025; originally announced April 2025.

  37. arXiv:2504.10352  [pdf, other

    eess.AS cs.CL

    Pseudo-Autoregressive Neural Codec Language Models for Efficient Zero-Shot Text-to-Speech Synthesis

    Authors: Yifan Yang, Shujie Liu, Jinyu Li, Yuxuan Hu, Haibin Wu, Hui Wang, Jianwei Yu, Lingwei Meng, Haiyang Sun, Yanqing Liu, Yan Lu, Kai Yu, Xie Chen

    Abstract: Recent zero-shot text-to-speech (TTS) systems face a common dilemma: autoregressive (AR) models suffer from slow generation and lack duration controllability, while non-autoregressive (NAR) models lack temporal modeling and typically require complex designs. In this paper, we introduce a novel pseudo-autoregressive (PAR) codec language modeling approach that unifies AR and NAR modeling. Combining… ▽ More

    Submitted 14 April, 2025; originally announced April 2025.

    Comments: Submitted to ACM MM 2025

  38. arXiv:2504.10329  [pdf, other

    cs.CV

    InstructEngine: Instruction-driven Text-to-Image Alignment

    Authors: Xingyu Lu, Yuhang Hu, YiFan Zhang, Kaiyu Jiang, Changyi Liu, Tianke Zhang, Jinpeng Wang, Chun Yuan, Bin Wen, Fan Yang, Tingting Gao, Di Zhang

    Abstract: Reinforcement Learning from Human/AI Feedback (RLHF/RLAIF) has been extensively utilized for preference alignment of text-to-image models. Existing methods face certain limitations in terms of both data and algorithm. For training data, most approaches rely on manual annotated preference data, either by directly fine-tuning the generators or by training reward models to provide training signals. H… ▽ More

    Submitted 21 April, 2025; v1 submitted 14 April, 2025; originally announced April 2025.

    Comments: 8 pages, 7 figures

  39. arXiv:2504.10281  [pdf, other

    cond-mat.mtrl-sci cond-mat.mes-hall cs.AI cs.CV cs.LG

    Zero-shot Autonomous Microscopy for Scalable and Intelligent Characterization of 2D Materials

    Authors: Jingyun Yang, Ruoyan Avery Yin, Chi Jiang, Yuepeng Hu, Xiaokai Zhu, Xingjian Hu, Sutharsika Kumar, Xiao Wang, Xiaohua Zhai, Keran Rong, Yunyue Zhu, Tianyi Zhang, Zongyou Yin, Jing Kong, Neil Zhenqiang Gong, Zhichu Ren, Haozhe Wang

    Abstract: Characterization of atomic-scale materials traditionally requires human experts with months to years of specialized training. Even for trained human operators, accurate and reliable characterization remains challenging when examining newly discovered materials such as two-dimensional (2D) structures. This bottleneck drives demand for fully autonomous experimentation systems capable of comprehendin… ▽ More

    Submitted 14 April, 2025; originally announced April 2025.

    Comments: 13 pages, 4 figures

  40. arXiv:2504.10160  [pdf, other

    cs.CL cs.AI cs.LG

    MT-R1-Zero: Advancing LLM-based Machine Translation via R1-Zero-like Reinforcement Learning

    Authors: Zhaopeng Feng, Shaosheng Cao, Jiahan Ren, Jiayuan Su, Ruizhe Chen, Yan Zhang, Zhe Xu, Yao Hu, Jian Wu, Zuozhu Liu

    Abstract: Large-scale reinforcement learning (RL) methods have proven highly effective in enhancing the reasoning abilities of large language models (LLMs), particularly for tasks with verifiable solutions such as mathematics and coding. However, applying this idea to machine translation (MT), where outputs are flexibly formatted and difficult to automatically evaluate with explicit rules, remains underexpl… ▽ More

    Submitted 14 April, 2025; originally announced April 2025.

    Comments: Work in progress. Our code is available at https://github.com/fzp0424/MT-R1-Zero

  41. arXiv:2504.10157  [pdf, other

    cs.CL cs.CY

    SocioVerse: A World Model for Social Simulation Powered by LLM Agents and A Pool of 10 Million Real-World Users

    Authors: Xinnong Zhang, Jiayu Lin, Xinyi Mou, Shiyue Yang, Xiawei Liu, Libo Sun, Hanjia Lyu, Yihang Yang, Weihong Qi, Yue Chen, Guanying Li, Ling Yan, Yao Hu, Siming Chen, Yu Wang, Xuanjing Huang, Jiebo Luo, Shiping Tang, Libo Wu, Baohua Zhou, Zhongyu Wei

    Abstract: Social simulation is transforming traditional social science research by modeling human behavior through interactions between virtual individuals and their environments. With recent advances in large language models (LLMs), this approach has shown growing potential in capturing individual differences and predicting group behaviors. However, existing methods face alignment challenges related to the… ▽ More

    Submitted 23 April, 2025; v1 submitted 14 April, 2025; originally announced April 2025.

    Comments: work in progress

  42. arXiv:2504.09847  [pdf, other

    cond-mat.str-el

    $\mathbb{Z}_N$ generalizations of three-dimensional stabilizer codes

    Authors: Chanbeen Lee, Yaozong Hu, Gil Young Cho, Haruki Watanabe

    Abstract: In this work, we generalize several three-dimensional Z2 stabilizer models--including the X-cube model, the three-dimensional toric code, and Haah's code--to their ZN counterparts. Under periodic boundary conditions, we analyze their ground state degeneracies and topological excitations, and uncover behaviors that strongly depend on system size. For the X-cube model, we identify excitations with m… ▽ More

    Submitted 13 April, 2025; originally announced April 2025.

    Comments: 36 pages, 25 figures

  43. arXiv:2504.09844  [pdf, other

    cs.DC cs.AI

    OVERLORD: Ultimate Scaling of DataLoader for Multi-Source Large Foundation Model Training

    Authors: Juntao Zhao, Qi Lu, Wei Jia, Borui Wan, Lei Zuo, Junda Feng, Jianyu Jiang, Yangrui Chen, Shuaishuai Cao, Jialing He, Kaihua Jiang, Yuanzhe Hu, Yanghua Peng, Haibin Lin, Xin Liu, Chuan Wu

    Abstract: Modern frameworks for training large foundation models (LFMs) employ data loaders in a data parallel paradigm. While this design offers implementation simplicity, it introduces two fundamental challenges. First, due to the quadratic computational complexity of the attention operator, the non-uniform sample distribution over data-parallel ranks leads to a significant workload imbalance among loader… ▽ More

    Submitted 13 April, 2025; originally announced April 2025.

  44. arXiv:2504.09669  [pdf, other

    cs.GT cs.DS

    Nash Social Welfare with Submodular Valuations: Approximation Algorithms and Integrality Gaps

    Authors: Xiaohui Bei, Yuda Feng, Yang Hu, Shi Li, Ruilong Zhang

    Abstract: We study the problem of allocating items to agents such that the (un)weighted Nash social welfare (NSW) is maximized under submodular valuations. The best-known results for unweighted and weighted problems are the $(4+ε)$ approximation given by Garg, Husic, Li, Vega, and Vondrak~\cite{stoc/GargHLVV23} and the $(233+ε)$ approximation given by Feng, Hu, Li, and Zhang~\cite{stoc/FHLZ25}, respectively… ▽ More

    Submitted 13 April, 2025; originally announced April 2025.

  45. arXiv:2504.09622  [pdf, other

    physics.soc-ph

    Predicting the critical behavior of complex dynamic systems via learning the governing mechanisms

    Authors: Xiangrong Wang, Dan Lu, Zongze Wu, Weina Xu, Hongru Hou, Yanqing Hu, Yamir Moreno

    Abstract: Critical points separate distinct dynamical regimes of complex systems, often delimiting functional or macroscopic phases in which the system operates. However, the long-term prediction of critical regimes and behaviors is challenging given the narrow set of parameters from which they emerge. Here, we propose a framework to learn the rules that govern the dynamic processes of a system. The learned… ▽ More

    Submitted 13 April, 2025; originally announced April 2025.

    Comments: 22 pages including figures. Submitted for publication

  46. arXiv:2504.09587  [pdf, other

    cs.RO

    GeoNav: Empowering MLLMs with Explicit Geospatial Reasoning Abilities for Language-Goal Aerial Navigation

    Authors: Haotian Xu, Yue Hu, Chen Gao, Zhengqiu Zhu, Yong Zhao, Yong Li, Quanjun Yin

    Abstract: Language-goal aerial navigation is a critical challenge in embodied AI, requiring UAVs to localize targets in complex environments such as urban blocks based on textual specification. Existing methods, often adapted from indoor navigation, struggle to scale due to limited field of view, semantic ambiguity among objects, and lack of structured spatial reasoning. In this work, we propose GeoNav, a g… ▽ More

    Submitted 11 May, 2025; v1 submitted 13 April, 2025; originally announced April 2025.

  47. arXiv:2504.09466  [pdf, other

    cs.CR cs.CL

    AdaSteer: Your Aligned LLM is Inherently an Adaptive Jailbreak Defender

    Authors: Weixiang Zhao, Jiahe Guo, Yulin Hu, Yang Deng, An Zhang, Xingyu Sui, Xinyang Han, Yanyan Zhao, Bing Qin, Tat-Seng Chua, Ting Liu

    Abstract: Despite extensive efforts in safety alignment, large language models (LLMs) remain vulnerable to jailbreak attacks. Activation steering offers a training-free defense method but relies on fixed steering coefficients, resulting in suboptimal protection and increased false rejections of benign inputs. To address this, we propose AdaSteer, an adaptive activation steering method that dynamically adjus… ▽ More

    Submitted 13 April, 2025; originally announced April 2025.

    Comments: 17 pages, 6 figures, 9 tables

  48. arXiv:2504.09405  [pdf, other

    cs.LG

    Tin-Tin: Towards Tiny Learning on Tiny Devices with Integer-based Neural Network Training

    Authors: Yi Hu, Jinhang Zuo, Eddie Zhang, Bob Iannucci, Carlee Joe-Wong

    Abstract: Recent advancements in machine learning (ML) have enabled its deployment on resource-constrained edge devices, fostering innovative applications such as intelligent environmental sensing. However, these devices, particularly microcontrollers (MCUs), face substantial challenges due to limited memory, computing capabilities, and the absence of dedicated floating-point units (FPUs). These constraints… ▽ More

    Submitted 12 April, 2025; originally announced April 2025.

  49. arXiv:2504.09320  [pdf, ps, other

    math.DG math.AP math.MG

    Capillary Christoffel-Minkowski problem

    Authors: Yingxiang Hu, Mohammad N. Ivaki, Julian Scheuer

    Abstract: The result of Guan and Ma (Invent. Math. 151 (2003)) states that if $φ^{-1/k} : \mathbb{S}^n \to (0,\infty)$ is spherically convex, then $φ$ arises as the $σ_k$ curvature (the $k$-th elementary symmetric function of the principal radii of curvature) of a strictly convex hypersurface. In this paper, we establish an analogous result in the capillary setting in the half-space for $θ\in(0,π/2)$: if… ▽ More

    Submitted 12 April, 2025; originally announced April 2025.

  50. arXiv:2504.09160  [pdf, other

    cs.CV

    SCFlow2: Plug-and-Play Object Pose Refiner with Shape-Constraint Scene Flow

    Authors: Qingyuan Wang, Rui Song, Jiaojiao Li, Kerui Cheng, David Ferstl, Yinlin Hu

    Abstract: We introduce SCFlow2, a plug-and-play refinement framework for 6D object pose estimation. Most recent 6D object pose methods rely on refinement to get accurate results. However, most existing refinement methods either suffer from noises in establishing correspondences, or rely on retraining for novel objects. SCFlow2 is based on the SCFlow model designed for refinement with shape constraint, but f… ▽ More

    Submitted 12 April, 2025; originally announced April 2025.

    Comments: Accepted by CVPR 2025