
Showing 51–100 of 3,998 results for author: Lu, J

  1. arXiv:2505.23966  [pdf, ps, other]

    cs.CL

    FLAT-LLM: Fine-grained Low-rank Activation Space Transformation for Large Language Model Compression

    Authors: Jiayi Tian, Ryan Solgi, Jinming Lu, Yifan Yang, Hai Li, Zheng Zhang

    Abstract: Large Language Models (LLMs) have enabled remarkable progress in natural language processing, yet their high computational and memory demands pose challenges for deployment in resource-constrained environments. Although recent low-rank decomposition methods offer a promising path for structural compression, they often suffer from accuracy degradation, expensive calibration procedures, and result i…

    Submitted 29 May, 2025; originally announced May 2025.

  2. arXiv:2505.23732  [pdf, ps, other]

    cs.LG

    EmotionRankCLAP: Bridging Natural Language Speaking Styles and Ordinal Speech Emotion via Rank-N-Contrast

    Authors: Shreeram Suresh Chandra, Lucas Goncalves, Junchen Lu, Carlos Busso, Berrak Sisman

    Abstract: Current emotion-based contrastive language-audio pretraining (CLAP) methods typically learn by naïvely aligning audio samples with corresponding text prompts. Consequently, this approach fails to capture the ordinal nature of emotions, hindering inter-emotion understanding and often resulting in a wide modality gap between the audio and text embeddings due to insufficient alignment. To handle thes…

    Submitted 29 May, 2025; originally announced May 2025.

    Comments: Accepted at Interspeech 2025

  3. arXiv:2505.22424  [pdf, ps, other]

    cs.NI eess.SP

    Hybrid Learning for Cold-Start-Aware Microservice Scheduling in Dynamic Edge Environments

    Authors: Jingxi Lu, Wenhao Li, Jianxiong Guo, Xingjian Ding, Zhiqing Tang, Tian Wang, Weijia Jia

    Abstract: With the rapid growth of IoT devices and their diverse workloads, container-based microservices deployed at edge nodes have become a lightweight and scalable solution. However, existing microservice scheduling algorithms often assume static resource availability, which is unrealistic when multiple containers are assigned to an edge node. Besides, containers suffer from cold-start inefficiencies du…

    Submitted 28 May, 2025; originally announced May 2025.

  4. arXiv:2505.22140  [pdf, other]

    hep-ex

    Search for a dark baryon in the $Ξ^-\rightarrowπ^-+{\rm invisible}$ decay

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai, et al. (697 additional authors not shown)

    Abstract: A search for a dark baryon is performed for the first time in the two-body decay $Ξ^-\rightarrowπ^-+{\rm invisible}$ using $(10.087\pm0.044)\times10^{9}$ $J/ψ$ events collected at a center-of-mass energy of $\sqrt{s}=3.097\,\mbox{GeV}$ with the BESIII detector at the BEPCII collider. No significant signal is observed, and the 90% (95%) confidence level upper limits on the branching fraction…

    Submitted 28 May, 2025; originally announced May 2025.

    Comments: 11 pages, 4 figures, 1 table

  5. arXiv:2505.20642  [pdf, ps, other]

    cs.AI

    CoderAgent: Simulating Student Behavior for Personalized Programming Learning with Large Language Models

    Authors: Yi Zhan, Qi Liu, Weibo Gao, Zheng Zhang, Tianfu Wang, Shuanghong Shen, Junyu Lu, Zhenya Huang

    Abstract: Personalized programming tutoring, such as exercise recommendation, can enhance learners' efficiency, motivation, and outcomes, which is increasingly important in modern digital education. However, the lack of sufficient and high-quality programming data, combined with the mismatch between offline evaluation and real-world learning, hinders the practical deployment of such systems. To address this…

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: Accepted by IJCAI2025

  6. arXiv:2505.20296  [pdf, ps, other]

    cs.CL cs.AI cs.LG cs.MM

    Reasoning LLMs are Wandering Solution Explorers

    Authors: Jiahao Lu, Ziwei Xu, Mohan Kankanhalli

    Abstract: Large Language Models (LLMs) have demonstrated impressive reasoning abilities through test-time computation (TTC) techniques such as chain-of-thought prompting and tree-based reasoning. However, we argue that current reasoning LLMs (RLLMs) lack the ability to systematically explore the solution space. This paper formalizes what constitutes systematic problem solving and identifies common failure m…

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: 71 pages, 14 figures, 2 tables

  7. arXiv:2505.20246  [pdf, ps, other]

    cs.AI cs.CL

    On Path to Multimodal Historical Reasoning: HistBench and HistAgent

    Authors: Jiahao Qiu, Fulian Xiao, Yimin Wang, Yuchen Mao, Yijia Chen, Xinzhe Juan, Siran Wang, Xuan Qi, Tongcheng Zhang, Zixin Yao, Jiacheng Guo, Yifu Lu, Charles Argon, Jundi Cui, Daixin Chen, Junran Zhou, Shuyao Zhou, Zhanpeng Zhou, Ling Yang, Shilong Liu, Hongru Wang, Kaixuan Huang, Xun Jiang, Yuming Cao, Yue Chen, et al. (73 additional authors not shown)

    Abstract: Recent advances in large language models (LLMs) have led to remarkable progress across domains, yet their capabilities in the humanities, particularly history, remain underexplored. Historical reasoning poses unique challenges for AI, involving multimodal source interpretation, temporal inference, and cross-linguistic analysis. While general-purpose agents perform well on many existing benchmarks,…

    Submitted 7 June, 2025; v1 submitted 26 May, 2025; originally announced May 2025.

    Comments: 17 pages, 7 figures

  8. arXiv:2505.19907  [pdf, ps, other]

    hep-ex nucl-ex

    First measurement of $Σ^{+}n\rightarrowΛp$ and $Σ^{+}n\rightarrowΣ^{0}p$ cross-sections via $Σ^+$-nucleus scattering at an electron-positron collider

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai, et al. (680 additional authors not shown)

    Abstract: Using $(1.0087\pm0.0044)\times10^{10}$ $J/ψ$ events collected with the BESIII detector at the BEPCII storage ring, the reactions $Σ^{+}n\rightarrowΛp$ and $Σ^{+}n\rightarrowΣ^{0}p$ are studied, where the $Σ^{+}$ baryon is produced in the process $J/ψ\rightarrowΣ^{+}\barΣ^-$ and the neutron is a component of the $^9\rm{Be}$, $^{12}\rm{C}$ and $^{197}\rm{Au}$ nuclei in the beam pipe. Clear signals o…

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: 9 pages, 2 figures

  9. arXiv:2505.19597  [pdf, ps, other]

    eess.AS

    A Lightweight Hybrid Dual Channel Speech Enhancement System under Low-SNR Conditions

    Authors: Zheng Wang, Xiaobin Rong, Yu Sun, Tianchi Sun, Zhibin Lin, Jing Lu

    Abstract: Although deep learning based multi-channel speech enhancement has achieved significant advancements, its practical deployment is often limited by constrained computational resources, particularly in low signal-to-noise ratio (SNR) conditions. In this paper, we propose a lightweight hybrid dual-channel speech enhancement system that combines independent vector analysis (IVA) with a modified version…

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: Accepted by Interspeech 2025

  10. arXiv:2505.18714  [pdf, ps, other]

    cs.RO

    YOPO-Rally: A Sim-to-Real Single-Stage Planner for Off-Road Terrain

    Authors: Hongyu Cao, Junjie Lu, Xuewei Zhang, Yulin Hui, Zhiyu Li, Bailing Tian

    Abstract: Off-road navigation remains challenging for autonomous robots due to the harsh terrain and clustered obstacles. In this letter, we extend the YOPO (You Only Plan Once) end-to-end navigation framework to off-road environments, explicitly focusing on forest terrains, consisting of a high-performance, multi-sensor supported off-road simulator YOPO-Sim, a zero-shot transfer sim-to-real planner YOPO-Ra…

    Submitted 24 May, 2025; originally announced May 2025.

    Comments: 8 pages, 8 figures

  11. arXiv:2505.18533  [pdf, ps, other]

    eess.AS cs.AI

    TS-URGENet: A Three-stage Universal Robust and Generalizable Speech Enhancement Network

    Authors: Xiaobin Rong, Dahan Wang, Qinwen Hu, Yushi Wang, Yuxiang Hu, Jing Lu

    Abstract: Universal speech enhancement aims to handle input speech with different distortions and input formats. To tackle this challenge, we present TS-URGENet, a Three-Stage Universal, Robust, and Generalizable speech Enhancement Network. To address various distortions, the proposed system employs a novel three-stage architecture consisting of a filling stage, a separation stage, and a restoration stage.…

    Submitted 24 May, 2025; originally announced May 2025.

    Comments: Accepted by Interspeech 2025

  12. arXiv:2505.18060  [pdf, ps, other]

    cs.CV

    Semantic Correspondence: Unified Benchmarking and a Strong Baseline

    Authors: Kaiyan Zhang, Xinghui Li, Jingyi Lu, Kai Han

    Abstract: Establishing semantic correspondence is a challenging task in computer vision, aiming to match keypoints with the same semantic information across different images. Benefiting from the rapid development of deep learning, remarkable progress has been made over the past decade. However, a comprehensive review and analysis of this task remains absent. In this paper, we present the first extensive sur…

    Submitted 27 May, 2025; v1 submitted 23 May, 2025; originally announced May 2025.

  13. arXiv:2505.18004  [pdf, ps, other]

    hep-ex

    Measurement of branching fractions of $Λ_{c}^{+}$ decays to $Σ^{+} η$ and $Σ^{+} η'$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (644 additional authors not shown)

    Abstract: By analyzing $e^+e^-$ collision data taken at center-of-mass energies $\sqrt{s} = 4.600 \sim 4.699$ $\mbox{GeV}$ with the BESIII detector at the BEPCII collider, corresponding to an integrated luminosity of $\rm 4.5~fb^{-1}$, we study the hadronic decays $Λ_{c}^{+} \rightarrow Σ^{+} η$ and $Λ_{c}^{+} \rightarrow Σ^{+} η^{\prime}$ using the single-tag method. The branching fraction ratio of…

    Submitted 23 May, 2025; originally announced May 2025.

  14. arXiv:2505.17987  [pdf, other]

    cs.LG cs.AI

    ADLGen: Synthesizing Symbolic, Event-Triggered Sensor Sequences for Human Activity Modeling

    Authors: Weihang You, Hanqi Jiang, Zishuai Liu, Zihang Xie, Tianming Liu, Jin Lu, Fei Dou

    Abstract: Real-world collection of Activities of Daily Living data is challenging due to privacy concerns, costly deployment and labeling, and the inherent sparsity and imbalance of human behavior. We present ADLGen, a generative framework specifically designed to synthesize realistic, event-triggered, and symbolic sensor sequences for ambient assistive environments. ADLGen integrates a decoder-only Transfo…

    Submitted 23 May, 2025; originally announced May 2025.

  15. arXiv:2505.17928  [pdf, ps, other]

    cs.SE cs.AI cs.CL cs.LG

    Towards Practical Defect-Focused Automated Code Review

    Authors: Junyi Lu, Lili Jiang, Xiaojia Li, Jianbing Fang, Fengjun Zhang, Li Yang, Chun Zuo

    Abstract: The complexity of code reviews has driven efforts to automate review comments, but prior approaches oversimplify this task by treating it as snippet-level code-to-text generation and relying on text similarity metrics like BLEU for evaluation. These methods overlook repository context, real-world merge request evaluation, and defect detection, limiting their practicality. To address these issues,…

    Submitted 28 May, 2025; v1 submitted 23 May, 2025; originally announced May 2025.

    Comments: Accepted as Spotlight at the 42nd International Conference on Machine Learning (ICML 2025)

  16. arXiv:2505.17457  [pdf, ps, other]

    cs.CV cs.AI

    Graph Mamba for Efficient Whole Slide Image Understanding

    Authors: Jiaxuan Lu, Junyan Shi, Yuhui Lin, Fang Yan, Yue Gao, Shaoting Zhang, Xiaosong Wang

    Abstract: Whole Slide Images (WSIs) in histopathology present a significant challenge for large-scale medical image analysis due to their high resolution, large size, and complex tile relationships. Existing Multiple Instance Learning (MIL) methods, such as Graph Neural Networks (GNNs) and Transformer-based models, face limitations in scalability and computational cost. To bridge this gap, we propose the WS…

    Submitted 23 May, 2025; originally announced May 2025.

  17. arXiv:2505.17192  [pdf, ps, other]

    cond-mat.mes-hall cond-mat.mtrl-sci physics.comp-ph

    Unconventional tunnel magnetoresistance scaling with altermagnets

    Authors: Zongmeng Yang, Xingyue Yang, Jianhua Wang, Rui Peng, Lee Ching Hua, Lay Kee Ang, Jing Lu, Yee Sin Ang, Shibo Fang

    Abstract: In conventional magnetic tunnel junctions (MTJs), the tunnel magnetoresistance (TMR) typically increases with barrier thickness as electron transmission in the antiparallel configuration decays faster than that of the parallel configuration. In this work, we reveal an anomalous scaling effect in altermagnetic tunnel junctions (AMTJs), where the TMR decreases anomalously with an increasing barrier…

    Submitted 22 May, 2025; originally announced May 2025.

  18. arXiv:2505.16858  [pdf, other]

    astro-ph.GA

    Non-Parametric Attenuation Curves in Local Star-Forming Galaxies: Geometry Effect, Dust Evolution, and ISS

    Authors: Jiafeng Lu, Xi Kang, Shiyin Shen, Qi Zeng, Shuai Feng

    Abstract: We introduce a non-parametric approach, the Stellar Population Synthesis with Equivalent Widths (SEW) method, to reconstruct spectral-resolution wavelength-dependent attenuation curves for 169,568 star-forming galaxies from the SDSS DR7. Composite attenuation curves, stacked across stellar mass and inclination bins, reveal systematic trends: higher stellar mass correlates with steeper attenuation…

    Submitted 22 May, 2025; originally announced May 2025.

    Comments: 11 pages, 7 figures

  19. arXiv:2505.16502  [pdf, ps, other]

    cs.DC cs.NI

    Recursive Offloading for LLM Serving in Multi-tier Networks

    Authors: Zhiyuan Wu, Sheng Sun, Yuwei Wang, Min Liu, Bo Gao, Jinda Lu, Zheming Yang, Tian Wen

    Abstract: Heterogeneous device-edge-cloud computing infrastructures have become widely adopted in telecommunication operators and Wide Area Networks (WANs), offering multi-tier computational support for emerging intelligent services. With the rapid proliferation of Large Language Model (LLM) services, efficiently coordinating inference tasks and reducing communication overhead within these multi-tier networ…

    Submitted 24 May, 2025; v1 submitted 22 May, 2025; originally announced May 2025.

    Comments: 7 figures, 3 tables

  20. arXiv:2505.16314  [pdf, ps, other]

    cs.CV cs.AI

    NTIRE 2025 challenge on Text to Image Generation Model Quality Assessment

    Authors: Shuhao Han, Haotian Fan, Fangyuan Kong, Wenjie Liao, Chunle Guo, Chongyi Li, Radu Timofte, Liang Li, Tao Li, Junhui Cui, Yunqiu Wang, Yang Tai, Jingwei Sun, Jianhui Sun, Xinli Yue, Tianyi Wang, Huan Hou, Junda Lu, Xinyang Huang, Zitang Zhou, Zijian Zhang, Xuhui Zheng, Xuecheng Wu, Chong Peng, Xuezhi Cao, et al. (90 additional authors not shown)

    Abstract: This paper reports on the NTIRE 2025 challenge on Text to Image (T2I) generation model quality assessment, which will be held in conjunction with the New Trends in Image Restoration and Enhancement Workshop (NTIRE) at CVPR 2025. The aim of this challenge is to address the fine-grained quality assessment of text-to-image generation models. This challenge evaluates text-to-image models from two aspe…

    Submitted 22 May, 2025; originally announced May 2025.

  21. arXiv:2505.16307  [pdf, other]

    cs.CL cs.AI cs.LG

    PMPO: Probabilistic Metric Prompt Optimization for Small and Large Language Models

    Authors: Chenzhuo Zhao, Ziqian Liu, Xingda Wang, Junting Lu, Chaoyi Ruan

    Abstract: Prompt optimization offers a practical and broadly applicable alternative to fine-tuning for improving large language model (LLM) performance. However, existing methods often rely on costly output generation, self-critiquing abilities, or human-annotated preferences, which limit their scalability, especially for smaller or non-instruction-tuned models. We introduce PMPO (Probabilistic Metric Promp…

    Submitted 22 May, 2025; originally announced May 2025.

  22. arXiv:2505.16060  [pdf, ps, other]

    cs.LG

    Few-Shot Test-Time Optimization Without Retraining for Semiconductor Recipe Generation and Beyond

    Authors: Shangding Gu, Donghao Ying, Ming Jin, Yu Joe Lu, Jun Wang, Javad Lavaei, Costas Spanos

    Abstract: We introduce Model Feedback Learning (MFL), a novel test-time optimization framework for optimizing inputs to pre-trained AI models or deployed hardware systems without requiring any retraining of the models or modifications to the hardware. In contrast to existing methods that rely on adjusting model parameters, MFL leverages a lightweight reverse model to iteratively search for optimal inputs, e…

    Submitted 21 May, 2025; originally announced May 2025.

  23. arXiv:2505.15662  [pdf, ps, other]

    quant-ph cs.AI cs.ET

    Neural Quantum Digital Twins for Optimizing Quantum Annealing

    Authors: Jianlong Lu, Hanqiu Peng, Ying Chen

    Abstract: Quantum annealers have shown potential in addressing certain combinatorial optimization problems, though their performance is often limited by scalability and error rates. In this work, we propose a Neural Quantum Digital Twin (NQDT) framework that reconstructs the energy landscape of quantum many-body systems relevant to quantum annealing. The digital twin models both ground and excited state dy…

    Submitted 21 May, 2025; originally announced May 2025.

    Comments: 20 pages, 11 figures, 2 tables

  24. arXiv:2505.15620  [pdf, ps, other]

    hep-ex

    Observation of $χ_{cJ}\to 3K_S^0K^\pmπ^\mp$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai, et al. (678 additional authors not shown)

    Abstract: By analyzing $(2712.4\pm14.3)\times10^6$ $ψ(3686)$ events collected with the BESIII detector operating at the BEPCII collider, the decays $χ_{c0,1,2} \to 3K_S^0K^\pmπ^\mp$ are observed for the first time with statistical significances greater than $10σ$. The branching fractions of these decays are determined to be $\mathcal{B}(χ_{c0}\to 3K_S^0K^\pmπ^\mp )=(7.95\pm0.50\pm0.65)\times10^{-5},$…

    Submitted 21 May, 2025; originally announced May 2025.

    Comments: 11 pages, 6 figures

  25. arXiv:2505.15154  [pdf, other]

    cs.CL cs.AI cs.MM

    Prolonged Reasoning Is Not All You Need: Certainty-Based Adaptive Routing for Efficient LLM/MLLM Reasoning

    Authors: Jinghui Lu, Haiyang Yu, Siliang Xu, Shiwei Ran, Guozhi Tang, Siqi Wang, Bin Shan, Teng Fu, Hao Feng, Jingqun Tang, Han Wang, Can Huang

    Abstract: Recent advancements in reasoning have significantly enhanced the capabilities of Large Language Models (LLMs) and Multimodal Large Language Models (MLLMs) across diverse tasks. However, excessive reliance on chain-of-thought (CoT) reasoning can impair model performance and bring unnecessarily lengthened outputs, reducing efficiency. Our work reveals that prolonged reasoning does not universally i…

    Submitted 21 May, 2025; originally announced May 2025.

  26. arXiv:2505.14988  [pdf, ps, other]

    hep-ex

    Test of local realism via entangled $Λ\barΛ$ system

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, et al. (597 additional authors not shown)

    Abstract: The non-locality of quantum correlations is a fundamental feature of quantum theory. The Bell inequality serves as a benchmark for distinguishing between predictions made by quantum theory and local hidden variable theory (LHVT). Recent advancements in photon-entanglement experiments have addressed potential loopholes and have observed significant violations of variants of Bell inequality. However…

    Submitted 20 May, 2025; originally announced May 2025.

  27. arXiv:2505.14970  [pdf, ps, other]

    cs.AI cs.LG

    Self-Evolving Curriculum for LLM Reasoning

    Authors: Xiaoyin Chen, Jiarui Lu, Minsu Kim, Dinghuai Zhang, Jian Tang, Alexandre Piché, Nicolas Gontier, Yoshua Bengio, Ehsan Kamalloo

    Abstract: Reinforcement learning (RL) has proven effective for fine-tuning large language models (LLMs), significantly enhancing their reasoning abilities in domains such as mathematics and code generation. A crucial factor influencing RL fine-tuning success is the training curriculum: the order in which training problems are presented. While random curricula serve as common baselines, they remain suboptima…

    Submitted 29 May, 2025; v1 submitted 20 May, 2025; originally announced May 2025.

  28. arXiv:2505.14682  [pdf, ps, other]

    cs.CV

    UniGen: Enhanced Training & Test-Time Strategies for Unified Multimodal Understanding and Generation

    Authors: Rui Tian, Mingfei Gao, Mingze Xu, Jiaming Hu, Jiasen Lu, Zuxuan Wu, Yinfei Yang, Afshin Dehghan

    Abstract: We introduce UniGen, a unified multimodal large language model (MLLM) capable of image understanding and generation. We study the full training pipeline of UniGen from a data-centric perspective, including multi-stage pre-training, supervised fine-tuning, and direct preference optimization. More importantly, we propose a new Chain-of-Thought Verification (CoT-V) strategy for test-time scaling, whi…

    Submitted 20 May, 2025; originally announced May 2025.

    Comments: Technical report

  29. arXiv:2505.14135  [pdf, other]

    cs.CV

    Hunyuan-Game: Industrial-grade Intelligent Game Creation Model

    Authors: Ruihuang Li, Caijin Zhou, Shoujian Zheng, Jianxiang Lu, Jiabin Huang, Comi Chen, Junshu Tang, Guangzheng Xu, Jiale Tao, Hongmei Wang, Donghao Li, Wenqing Yu, Senbo Wang, Zhimin Li, Yetshuan Shi, Haoyu Yang, Yukun Wang, Wenxun Dai, Jiaqi Li, Linqing Wang, Qixun Wang, Zhiyong Xu, Yingfang Zhang, Jiangfeng Xiong, Weijie Kong, et al. (33 additional authors not shown)

    Abstract: Intelligent game creation represents a transformative advancement in game development, utilizing generative artificial intelligence to dynamically generate and enhance game content. Despite notable progress in generative models, the comprehensive synthesis of high-quality game assets, including both images and videos, remains a challenging frontier. To create high-fidelity game content that simult…

    Submitted 28 May, 2025; v1 submitted 20 May, 2025; originally announced May 2025.

  30. arXiv:2505.14059  [pdf, ps, other]

    cs.CV

    Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting

    Authors: Hao Feng, Shu Wei, Xiang Fei, Wei Shi, Yingdong Han, Lei Liao, Jinghui Lu, Binghong Wu, Qi Liu, Chunhui Lin, Jingqun Tang, Hao Liu, Can Huang

    Abstract: Document image parsing is challenging due to its complexly intertwined elements such as text paragraphs, figures, formulas, and tables. Current approaches either assemble specialized expert models or directly generate page-level content autoregressively, facing integration overhead, efficiency bottlenecks, and layout structure degradation despite their decent performance. To address these limitati…

    Submitted 20 May, 2025; originally announced May 2025.

    Comments: Accepted to ACL 2025

  31. arXiv:2505.13383  [pdf, ps, other]

    physics.optics

    Inverse-Designed Silicon Nitride Nanophotonics

    Authors: Toby Bi, Shuangyou Zhang, Egemen Bostan, Danxian Liu, Aditya Paul, Olga Ohletz, Irina Harder, Yaojing Zhang, Alekhya Ghosh, Abdullah Alabbadi, Masoud Kheyri, Tianyi Zeng, Jesse Lu, Kiyoul Yang, Pascal Del'Haye

    Abstract: Silicon nitride photonics has enabled integration of a variety of components for applications in linear and nonlinear optics, including telecommunications, optical clocks, astrocombs, bio-sensing, and LiDAR. With the advent of inverse design - where desired device performance is specified and closely achieved through iterative, gradient-based optimization - and the increasing availability of silic…

    Submitted 19 May, 2025; originally announced May 2025.

  32. arXiv:2505.13328  [pdf, other]

    cs.CL

    Rethinking Stateful Tool Use in Multi-Turn Dialogues: Benchmarks and Challenges

    Authors: Hongru Wang, Wenyu Huang, Yufei Wang, Yuanhao Xi, Jianqiao Lu, Huan Zhang, Nan Hu, Zeming Liu, Jeff Z. Pan, Kam-Fai Wong

    Abstract: Existing benchmarks that assess Language Models (LMs) as Language Agents (LAs) for tool use primarily focus on stateless, single-turn interactions or partial evaluations, such as tool selection in a single turn, overlooking the inherent stateful nature of interactions in multi-turn applications. To fill this gap, we propose DialogTool, a multi-turn dialogue dataset with stateful tool i…

    Submitted 19 May, 2025; originally announced May 2025.

  33. arXiv:2505.13222  [pdf, ps, other]

    hep-ex

    Partial Wave Analysis of $e^{+}e^{-} \rightarrow π^{+}π^{-}J/ψ$ and Cross Section Measurement of $e^{+}e^{-} \rightarrow π^{\pm}Z_{c}(3900)^{\mp}$ from 4.1271 to 4.3583 GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (639 additional authors not shown)

    Abstract: Based on 12.0 $\mathrm{fb^{-1}}$ of $e^{+}e^{-}$ collision data samples collected by the BESIII detector at center-of-mass energies from 4.1271 to 4.3583 GeV, a partial wave analysis is performed for the process $e^{+}e^{-} \rightarrow π^{+}π^{-}J/ψ$. The cross sections for the sub processes ${e^{+}e^{-}\rightarrowπ^{+}Z_{c}(3900)^{-}+c.c.\rightarrowπ^{+}π^{-}J/ψ}$,…

    Submitted 19 May, 2025; originally announced May 2025.

  34. arXiv:2505.13081  [pdf, ps, other]

    cs.LG cs.CV

    Walking the Tightrope: Disentangling Beneficial and Detrimental Drifts in Non-Stationary Custom-Tuning

    Authors: Xiaoyu Yang, Jie Lu, En Yu

    Abstract: This paper uncovers a critical yet overlooked phenomenon in multi-modal large language models (MLLMs): detrimental concept drift within chain-of-thought (CoT) reasoning during non-stationary reinforcement fine-tuning (RFT), where reasoning token distributions evolve unpredictably, thereby introducing significant biases in final predictions. To address this, we are pioneers in establishing the theo…

    Submitted 19 May, 2025; originally announced May 2025.

    Comments: 17 pages, 5 figures

  35. arXiv:2505.13077  [pdf, ps, other]

    cs.CL cs.AI cs.LG

    Advancing Sequential Numerical Prediction in Autoregressive Models

    Authors: Xiang Fei, Jinghui Lu, Qi Sun, Hao Feng, Yanjie Wang, Wei Shi, An-Lan Wang, Jingqun Tang, Can Huang

    Abstract: Autoregressive models have become the de facto choice for sequence generation tasks, but standard approaches treat digits as independent tokens and apply cross-entropy loss, overlooking the coherent structure of numerical sequences. This paper introduces Numerical Token Integrity Loss (NTIL) to address this gap. NTIL operates at two levels: (1) token-level, where it extends the Earth Mover's Dista…

    Submitted 28 May, 2025; v1 submitted 19 May, 2025; originally announced May 2025.

    Comments: Accepted to ACL 2025 Main Conference

  36. arXiv:2505.12916  [pdf, other]

    hep-ph

    Modular Symmetry with Weighton

    Authors: Gui-Jun Ding, Stephen F. King, Jun-Nan Lu, Ming-Hua Weng

    Abstract: We systematically develop the weighton mechanism for natural quark and charged lepton mass hierarchies in the framework of modular symmetry with a single modulus field $τ$. The weighton $φ$ is defined as a complete singlet with unit modular weight, leading to fermion mass suppression by powers of $\tildeφ$, which is the vacuum expectation value of the field scaled by a flavour cut-off. Further mas…

    Submitted 19 May, 2025; originally announced May 2025.

    Comments: 81 pages, 3 figures

  37. arXiv:2505.12585  [pdf, other]

    cs.LG cs.AI

    Learning Robust Spectral Dynamics for Temporal Domain Generalization

    Authors: En Yu, Jie Lu, Xiaoyu Yang, Guangquan Zhang, Zhen Fang

    Abstract: Modern machine learning models struggle to maintain performance in dynamic environments where temporal distribution shifts, i.e., concept drift, are prevalent. Temporal Domain Generalization (TDG) seeks to enable model generalization across evolving domains, yet existing approaches typically assume smooth incremental changes, struggling with complex real-world drifts involving long-term str…

    Submitted 18 May, 2025; originally announced May 2025.

  38. arXiv:2505.12234  [pdf, other]

    hep-ex

    Observation of $χ_{cJ}(J=0,1,2)\rightarrow p\bar{p}ηη$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai, et al. (678 additional authors not shown)

    Abstract: Using $(2712.4\pm14.3)\times10^6$ $ψ(3686)$ events collected by the BESIII detector operating at the BEPCII storage ring, the decays $χ_{cJ}(J=0,1,2)\rightarrow p\bar{p}ηη$ are observed for the first time through the radiative transition $ψ(3686)\toγχ_{cJ}$. The statistical significances for $χ_{cJ}$ signals are all larger than 5$σ$. The branching fractions of $χ_{c0,1,2}\to p\bar{p} ηη$ are deter…

    Submitted 18 May, 2025; originally announced May 2025.

    Comments: 17 pages, 16 figures

  39. arXiv:2505.12187  [pdf, ps, other]

    math.PR math.FA quant-ph

    Speeding up quantum Markov processes through lifting

    Authors: Bowen Li, Jianfeng Lu

    Abstract: We generalize the concept of non-reversible lifts for reversible diffusion processes initiated by Eberle and Lorler (2024) to quantum Markov dynamics. The lifting operation, which naturally results in hypocoercive processes, can be formally interpreted as, though not restricted to, the reverse of the overdamped limit. We prove that the $L^2$ convergence rate of the lifted process is bounded above…

    Submitted 17 May, 2025; originally announced May 2025.

  40. arXiv:2505.12086  [pdf, ps, other]

    hep-ex

    Observation of an Altered $a_{0}(980)$ Line-shape in $D^{+} \rightarrow π^{+}ηη$ due to the Triangle Loop Rescattering Effect

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai, et al. (705 additional authors not shown)

    Abstract: Using 20.3~${\rm fb}^{-1}$ of $e^{+}e^{-}$ collision data taken with the BESIII detector at the center-of-mass energy 3.773~GeV, we report the first amplitude analysis of the hadronic decay $D^{+} \rightarrow π^{+}ηη$. The intermediate process $D^{+} \to a_{0}(980)^{+}η, a_{0}(980)^{+} \to π^{+}η$ is observed and is found to be the only component and its branching fraction is measured to be…

    Submitted 17 May, 2025; originally announced May 2025.

  41. arXiv:2505.12082  [pdf, other]

    cs.CL cs.LG

    Model Merging in Pre-training of Large Language Models

    Authors: Yunshui Li, Yiyuan Ma, Shen Yan, Chaoyi Zhang, Jing Liu, Jianqiao Lu, Ziwen Xu, Mengzhao Chen, Minrui Wang, Shiyi Zhan, Jin Ma, Xunhao Lai, Deyi Liu, Yao Luo, Xingyan Bin, Hongbin Ren, Mingji Han, Wenhao Hao, Bairen Yi, LingJun Liu, Bole Ma, Xiaoying Jia, Xun Zhou, Siyuan Qiao, Liang Xiang, et al. (1 additional author not shown)

    Abstract: Model merging has emerged as a promising technique for enhancing large language models, though its application in large-scale pre-training remains relatively unexplored. In this paper, we present a comprehensive investigation of model merging techniques during the pre-training process. Through extensive experiments with both dense and Mixture-of-Experts (MoE) architectures ranging from millions to…

    Submitted 22 May, 2025; v1 submitted 17 May, 2025; originally announced May 2025.

  42. arXiv:2505.11493  [pdf, ps, other]

    cs.CV

    GIE-Bench: Towards Grounded Evaluation for Text-Guided Image Editing

    Authors: Yusu Qian, Jiasen Lu, Tsu-Jui Fu, Xinze Wang, Chen Chen, Yinfei Yang, Wenze Hu, Zhe Gan

    Abstract: Editing images using natural language instructions has become a natural and expressive way to modify visual content; yet, evaluating the performance of such models remains challenging. Existing evaluation approaches often rely on image-text similarity metrics like CLIP, which lack precision. In this work, we introduce a new benchmark designed to evaluate text-guided image editing models in a more…

    Submitted 16 May, 2025; originally announced May 2025.

  43. arXiv:2505.11066  [pdf, ps, other]

    cs.AI cs.MM

    A Multi-modal Fusion Network for Terrain Perception Based on Illumination Aware

    Authors: Rui Wang, Shichun Yang, Yuyi Chen, Zhuoyang Li, Zexiang Tong, Jianyi Xu, Jiayi Lu, Xinjie Feng, Yaoguang Cao

    Abstract: Road terrains play a crucial role in ensuring the driving safety of autonomous vehicles (AVs). However, existing sensors of AVs, including cameras and Lidars, are susceptible to variations in lighting and weather conditions, making it challenging to achieve real-time perception of road conditions. In this paper, we propose an illumination-aware multi-modal fusion network (IMF), which leverages bot…

    Submitted 16 May, 2025; originally announced May 2025.

  44. arXiv:2505.11015  [pdf, ps, other]

    cs.CV

    WildDoc: How Far Are We from Achieving Comprehensive and Robust Document Understanding in the Wild?

    Authors: An-Lan Wang, Jingqun Tang, Liao Lei, Hao Feng, Qi Liu, Xiang Fei, Jinghui Lu, Han Wang, Weiwei Liu, Hao Liu, Yuliang Liu, Xiang Bai, Can Huang

    Abstract: The rapid advancements in Multimodal Large Language Models (MLLMs) have significantly enhanced capabilities in Document Understanding. However, prevailing benchmarks like DocVQA and ChartQA predominantly comprise scanned or digital documents, inadequately reflecting the intricate challenges posed by diverse real-world scenarios, such as variable illumination and physical distortions. This…

    Submitted 27 May, 2025; v1 submitted 16 May, 2025; originally announced May 2025.

  45. arXiv:2505.10641  [pdf, ps, other]

    cs.LG

    FRET: Feature Redundancy Elimination for Test Time Adaptation

    Authors: Linjing You, Jiabao Lu, Xiayuan Huang, Xiangli Nie

    Abstract: Test-Time Adaptation (TTA) aims to enhance the generalization of deep learning models when faced with test data that exhibits distribution shifts from the training data. In this context, only a pre-trained model and unlabeled test data are available, making it particularly relevant for privacy-sensitive applications. In practice, we observe that feature redundancy in embeddings tends to increase a…

    Submitted 15 May, 2025; originally announced May 2025.

  46. arXiv:2505.08601  [pdf, other]

    cs.CV cond-mat.mtrl-sci

    Rejoining fragmented ancient bamboo slips with physics-driven deep learning

    Authors: Jinchi Zhu, Zhou Zhao, Hailong Lei, Xiaoguang Wang, Jialiang Lu, Jing Li, Qianqian Tang, Jiachen Shen, Gui-Song Xia, Bo Du, Yongchao Xu

    Abstract: Bamboo slips are a crucial medium for recording ancient civilizations in East Asia, and offer invaluable archaeological insights for reconstructing the Silk Road, studying material culture exchanges, and global history. However, many excavated bamboo slips have been fragmented into thousands of irregular pieces, making their rejoining a vital yet challenging step for understanding their content.…

    Submitted 13 May, 2025; originally announced May 2025.

  47. arXiv:2505.08494  [pdf, ps, other]

    math.AC

    Universal enveloping H-pseudoalgebras of DGP pseudoalgebras

    Authors: Ying Chen, Jiafeng Lü, Jiaqun Wei

    Abstract: The notions of Poisson $H$-pseudoalgebras are generalizations of Poisson algebras in a pseudotensor category $\mathcal{M}^{\ast}(H)$. This paper introduces an analogue of Poisson-Ore extension in Poisson $H$-pseudoalgebras. Poisson $H$-pseudoalgebras with the differential graded setting induce the notions of differential graded Poisson $H$-pseudoalgebras (DGP pseudoalgebras, for short). The DGP p…

    Submitted 13 May, 2025; originally announced May 2025.

  48. arXiv:2505.08409  [pdf, ps, other]

    gr-qc astro-ph.CO

    Observational constraints on the Kerr and its several single-parameter modified spacetimes using quasi-periodic oscillation data

    Authors: Shining Yang, Jianbo Lu, Wenmei Li, Mou Xu, Jingyang Xu

    Abstract: This paper investigates the dynamical effects of particles moving in the Kerr spacetime and its nine single-parameter modified spacetimes, including Bardeen, Ayon-Beato and Garcia (ABG), Hayward, Kerr-Newman (KN), Kerr-Taub-NUT (KTN), Braneworld Kerr (BK), Kerr-MOG, Kerr-Sen, and Perfect Fluid Dark Matter (PFDM) black holes. Using quasi-periodic oscillation (QPO) observational data, we constrain t…

    Submitted 13 May, 2025; originally announced May 2025.

  49. arXiv:2505.08283  [pdf, ps, other]

    cs.LG cs.CV

    Decoupled Multimodal Prototypes for Visual Recognition with Missing Modalities

    Authors: Jueqing Lu, Yuanyuan Qi, Xiaohao Yang, Shujie Zhou, Lan Du

    Abstract: Multimodal learning enhances deep learning models by enabling them to perceive and understand information from multiple data modalities, such as visual and textual inputs. However, most existing approaches assume the availability of all modalities, an assumption that often fails in real-world applications. Recent works have introduced learnable missing-case-aware prompts to mitigate performance de…

    Submitted 13 May, 2025; originally announced May 2025.

  50. arXiv:2505.07476  [pdf, ps, other]

    gr-qc quant-ph

    Multiqubit coherence of mixed states near event horizon

    Authors: Wen-Mei Li, Jianbo Lu, Shu-Min Wu

    Abstract: We study physically accessible and inaccessible N-qubit coherence of the mixed Greenberger-Horne-Zeilinger (GHZ) and W states for bosonic and fermionic fields when any $n$ ($n<N$) qubits hover over the Schwarzschild black hole. We derive a comprehensive analytical expression for the coherence of mixed N-qubit systems, taking into account both accessible and inaccessible components in the curved sp…

    Submitted 12 May, 2025; originally announced May 2025.

    Comments: 28 pages, 7 figures