Skip to main content

Showing 1–50 of 2,187 results for author: Guo, X

.
  1. arXiv:2510.02029  [pdf, ps, other

    eess.SP

    Joint DOA and Attitude Sensing Based on Tri-Polarized Continuous Aperture Array

    Authors: Haonan Si, Zhaolin Wang, Xiansheng Guo, Jin Zhang, Yuanwei Liu

    Abstract: This paper investigates joint direction-of-arrival (DOA) and attitude sensing using tri-polarized continuous aperture arrays (CAPAs). By employing electromagnetic (EM) information theory, the spatially continuous received signals in tri-polarized CAPA are modeled, thereby enabling accurate DOA and attitude estimation. To facilitate subspace decomposition for continuous operators, an equivalent con… ▽ More

    Submitted 2 October, 2025; originally announced October 2025.

    Comments: 13 pages, 10 figures

  2. arXiv:2509.26574  [pdf, ps, other

    cs.AI cond-mat.other cs.CL hep-th quant-ph

    Probing the Critical Point (CritPt) of AI Reasoning: a Frontier Physics Research Benchmark

    Authors: Minhui Zhu, Minyang Tian, Xiaocheng Yang, Tianci Zhou, Penghao Zhu, Eli Chertkov, Shengyan Liu, Yufeng Du, Lifan Yuan, Ziming Ji, Indranil Das, Junyi Cao, Yufeng Du, Jinchen He, Yifan Su, Jiabin Yu, Yikun Jiang, Yujie Zhang, Chang Liu, Ze-Min Huang, Weizhen Jia, Xinan Chen, Peixue Wu, Yunkai Wang, Juntai Zhou , et al. (40 additional authors not shown)

    Abstract: While large language models (LLMs) with reasoning capabilities are progressing rapidly on high-school math competitions and coding, can they reason effectively through complex, open-ended challenges found in frontier physics research? And crucially, what kinds of reasoning tasks do physicists want LLMs to assist with? To address these questions, we present the CritPt (Complex Research using Integr… ▽ More

    Submitted 30 September, 2025; v1 submitted 30 September, 2025; originally announced September 2025.

    Comments: 39 pages, 6 figures, 6 tables

  3. arXiv:2509.25748  [pdf, ps, other

    cs.CV cs.AI

    Dolphin v1.0 Technical Report

    Authors: Taohan Weng, Chi zhang, Chaoran Yan, Siya Liu, Xiaoyang Liu, Yalun Wu, Boyang Wang, Boyan Wang, Jiren Ren, Kaiwen Yan, Jinze Yu, Kaibing Hu, Henan Liu, Haoyun Zheng, Zhenyu Liu, Duo Zhang, Xiaoqing Guo, Anjie Le, Hongcheng Guo

    Abstract: Ultrasound is crucial in modern medicine but faces challenges like operator dependence, image noise, and real-time scanning, hindering AI integration. While large multimodal models excel in other medical imaging areas, they struggle with ultrasound's complexities. To address this, we introduce Dolphin v1.0 (V1) and its reasoning-augmented version, Dolphin R1-the first large-scale multimodal ultras… ▽ More

    Submitted 30 September, 2025; v1 submitted 30 September, 2025; originally announced September 2025.

  4. arXiv:2509.25004  [pdf, ps, other

    cs.AI

    CLPO: Curriculum Learning meets Policy Optimization for LLM Reasoning

    Authors: Shijie Zhang, Guohao Sun, Kevin Zhang, Xiang Guo, Rujun Guo

    Abstract: Recently, online Reinforcement Learning with Verifiable Rewards (RLVR) has become a key paradigm for enhancing the reasoning capabilities of Large Language Models (LLMs). However, existing methods typically treat all training samples uniformly, overlooking the vast differences in problem difficulty relative to the model's current capabilities. This uniform training strategy leads to inefficient ex… ▽ More

    Submitted 29 September, 2025; originally announced September 2025.

  5. arXiv:2509.23761  [pdf, ps, other

    hep-ex

    Observation of a resonance-like structure near the $π^+π^-$ mass threshold in $ψ(3686) \rightarrow π^{+}π^{-}J/ψ$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. B. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (677 additional authors not shown)

    Abstract: Based on the $(2712.4\pm14.4)\times 10^{6}$ $ψ(3686)$ events collected with the BESIII detector, we present a high-precision study of the $π^+π^-$ mass spectrum in $ψ(3686)\rightarrowπ^{+}π^{-}J/ψ$ decays. A clear resonance-like structure is observed near the $π^+π^-$ mass threshold for the first time. A fit with a Breit-Wigner function yields a mass of $285.6\pm 2.5~{\rm MeV}/c^2$ and a width of… ▽ More

    Submitted 28 September, 2025; originally announced September 2025.

  6. arXiv:2509.23711  [pdf, ps, other

    cs.LG cs.AI math.OC stat.ML

    Bridging Discrete and Continuous RL: Stable Deterministic Policy Gradient with Martingale Characterization

    Authors: Ziheng Cheng, Xin Guo, Yufei Zhang

    Abstract: The theory of discrete-time reinforcement learning (RL) has advanced rapidly over the past decades. Although primarily designed for discrete environments, many real-world RL applications are inherently continuous and complex. A major challenge in extending discrete-time algorithms to continuous-time settings is their sensitivity to time discretization, often leading to poor stability and slow conv… ▽ More

    Submitted 28 September, 2025; originally announced September 2025.

  7. arXiv:2509.22681  [pdf, ps, other

    cs.DC

    FLAME: A Serving System Optimized for Large-Scale Generative Recommendation with Efficiency

    Authors: Xianwen Guo, Bin Huang, Xiaomeng Wu, Guanlin Wu, Fangjian Li, Shijia Wang, Qiang Xiao, Chuanjiang Luo, Yong Li

    Abstract: Generative recommendation (GR) models possess greater scaling power compared to traditional deep learning recommendation models (DLRMs), yet they also impose a tremendous increase in computational burden. Measured in FLOPs, a typical GR model's workload sits in $10^9 \sim 10^{11}$ range, roughly four orders of magnitude higher than traditional DLRMs. Delivering accurate results in a few tens of mi… ▽ More

    Submitted 17 September, 2025; originally announced September 2025.

  8. arXiv:2509.22670  [pdf, ps, other

    stat.AP math.PR

    Modeling Tennis In-Match Momentum Using Probability Method

    Authors: Jackson Graves, Daniel X. Guo, Ridge Shepherd, Alexander Young

    Abstract: This paper investigates the Tennis Momentum Model (TMM), which aims to enhance the understanding of match dynamics by integrating key factors such as efficiency, historical scoring probabilities, and real-time scoring data. The model is designed to explore how momentum affects player performance throughout a match and how it might influence overall match outcomes. By leveraging this model, players… ▽ More

    Submitted 11 September, 2025; originally announced September 2025.

    Comments: 18 pages, 7 figures

    MSC Class: 65C20; 65C50; 68U20

  9. arXiv:2509.22496  [pdf, ps, other

    cs.CV

    Where MLLMs Attend and What They Rely On: Explaining Autoregressive Token Generation

    Authors: Ruoyu Chen, Xiaoqing Guo, Kangwei Liu, Siyuan Liang, Shiming Liu, Qunli Zhang, Hua Zhang, Xiaochun Cao

    Abstract: Multimodal large language models (MLLMs) have demonstrated remarkable capabilities in aligning visual inputs with natural language outputs. Yet, the extent to which generated tokens depend on visual modalities remains poorly understood, limiting interpretability and reliability. In this work, we present EAGLE, a lightweight black-box framework for explaining autoregressive token generation in MLLM… ▽ More

    Submitted 26 September, 2025; originally announced September 2025.

  10. arXiv:2509.19999  [pdf

    cs.MM cs.CV cs.SD

    MultiSoundGen: Video-to-Audio Generation for Multi-Event Scenarios via SlowFast Contrastive Audio-Visual Pretraining and Direct Preference Optimization

    Authors: Jianxuan Yang, Xiaoran Yang, Lipan Zhang, Xinyue Guo, Zhao Wang, Gongping Huang

    Abstract: Current video-to-audio (V2A) methods struggle in complex multi-event scenarios (video scenarios involving multiple sound sources, sound events, or transitions) due to two critical limitations. First, existing methods face challenges in precisely aligning intricate semantic information together with rapid dynamic features. Second, foundational training lacks quantitative preference optimization for… ▽ More

    Submitted 24 September, 2025; originally announced September 2025.

  11. arXiv:2509.18080  [pdf, ps, other

    quant-ph physics.optics

    Distribution of non-Gaussian states in a deployed telecommunication fiber channel

    Authors: Casper A. Breum, Xueshi Guo, Mikkel V. Larsen, Shigehito Miki, Hirotaka Terai, Ulrik L. Andersen, Jonas S. Neergaard-Nielsen

    Abstract: Optical non-Gaussian states hold great promise as a pivotal resource for advanced optical quantum information processing and fault-tolerant long-distance quantum communication. Establishing their faithful transmission in a real-world communication channel, therefore, marks an important milestone. In this study, we experimentally demonstrate the distribution of such non-Gaussian states in a functio… ▽ More

    Submitted 22 September, 2025; originally announced September 2025.

    Comments: 10 pages, 6 figures

  12. arXiv:2509.16521  [pdf, ps, other

    cs.LG

    mmExpert: Integrating Large Language Models for Comprehensive mmWave Data Synthesis and Understanding

    Authors: Yifan Yan, Shuai Yang, Xiuzhen Guo, Xiangguang Wang, Wei Chow, Yuanchao Shu, Shibo He

    Abstract: Millimeter-wave (mmWave) sensing technology holds significant value in human-centric applications, yet the high costs associated with data acquisition and annotation limit its widespread adoption in our daily lives. Concurrently, the rapid evolution of large language models (LLMs) has opened up opportunities for addressing complex human needs. This paper presents mmExpert, an innovative mmWave und… ▽ More

    Submitted 20 September, 2025; originally announced September 2025.

    Comments: Accepted to ACM MobiHoc '25

  13. arXiv:2509.16204  [pdf, ps, other

    cs.CE cs.HC cs.RO

    Toward Engineering AGI: Benchmarking the Engineering Design Capabilities of LLMs

    Authors: Xingang Guo, Yaxin Li, Xiangyi Kong, Yilan Jiang, Xiayu Zhao, Zhihua Gong, Yufan Zhang, Daixuan Li, Tianle Sang, Beixiao Zhu, Gregory Jun, Yingbing Huang, Yiqi Liu, Yuqi Xue, Rahul Dev Kundu, Qi Jian Lim, Yizhou Zhao, Luke Alexander Granger, Mohamed Badr Younis, Darioush Keivan, Nippun Sabharwal, Shreyanka Sinha, Prakhar Agarwal, Kojo Vandyck, Hanlin Mai , et al. (40 additional authors not shown)

    Abstract: Today, industry pioneers dream of developing general-purpose AI engineers capable of designing and building humanity's most ambitious projects--from starships that will carry us to distant worlds to Dyson spheres that harness stellar energy. Yet engineering design represents a fundamentally different challenge for large language models (LLMs) compared to traditional textbook-style problem solving… ▽ More

    Submitted 1 July, 2025; originally announced September 2025.

  14. arXiv:2509.15791  [pdf, ps, other

    cs.CV

    Minimal Semantic Sufficiency Meets Unsupervised Domain Generalization

    Authors: Tan Pan, Kaiyu Guo, Dongli Xu, Zhaorui Tan, Chen Jiang, Deshu Chen, Xin Guo, Brian C. Lovell, Limei Han, Yuan Cheng, Mahsa Baktashmotlagh

    Abstract: The generalization ability of deep learning has been extensively studied in supervised settings, yet it remains less explored in unsupervised scenarios. Recently, the Unsupervised Domain Generalization (UDG) task has been proposed to enhance the generalization of models trained with prevalent unsupervised learning techniques, such as Self-Supervised Learning (SSL). UDG confronts the challenge of d… ▽ More

    Submitted 24 September, 2025; v1 submitted 19 September, 2025; originally announced September 2025.

    Comments: Accepted by NeurIPS 2025

  15. arXiv:2509.15464  [pdf, ps, other

    cs.LG

    Temporal Reasoning with Large Language Models Augmented by Evolving Knowledge Graphs

    Authors: Junhong Lin, Song Wang, Xiaojie Guo, Julian Shun, Yada Zhu

    Abstract: Large language models (LLMs) excel at many language understanding tasks but struggle to reason over knowledge that evolves. To address this, recent work has explored augmenting LLMs with knowledge graphs (KGs) to provide structured, up-to-date information. However, many existing approaches assume a static snapshot of the KG and overlook the temporal dynamics and factual inconsistencies inherent in… ▽ More

    Submitted 18 September, 2025; originally announced September 2025.

  16. arXiv:2509.15276  [pdf, ps, other

    hep-ex

    First Observation of $Λ$ Hyperon Transverse Polarization in $ψ(3686)\toΛ\barΛ$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. B. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (687 additional authors not shown)

    Abstract: Based on $(448.1\pm2.9)\times10^{6}$ $ψ(3686)$ events collected with the BESIII detector at the BEPCII collider, we present the first observation of spin transverse polarization of $Λ$ and $\barΛ$ hyperons produced coherently in the decay $ψ(3686)\toΛ(\to pπ^-)\barΛ(\to\bar pπ^+)$. The relative phase between the electric and magnetic hadronic form factors is measured to be… ▽ More

    Submitted 18 September, 2025; originally announced September 2025.

  17. arXiv:2509.14551  [pdf, ps, other

    cs.AR

    Shift-Left Techniques in Electronic Design Automation: A Survey

    Authors: Xinyue Wu, Zixuan Li, Fan Hu, Ting Lin, Xiaotian Zhao, Runxi Wang, Xinfei Guo

    Abstract: The chip design process involves numerous steps, beginning with defining product requirements and progressing through architectural planning, system-level design, and the physical layout of individual circuit blocks. As the enablers of large-scale chip development, Electronic Design Automation (EDA) tools play a vital role in helping designers achieve high-quality results. The Shift-Left methodolo… ▽ More

    Submitted 17 September, 2025; originally announced September 2025.

  18. arXiv:2509.14281  [pdf, ps, other

    cs.SE cs.AI

    SCoGen: Scenario-Centric Graph-Based Synthesis of Real-World Code Problems

    Authors: Xifeng Yao, Dongyu Lang, Wu Zhang, Xintong Guo, Huarui Xie, Yinhao Ni, Ping Liu, Guang Shen, Yi Bai, Dandan Tu, Changzheng Zhang

    Abstract: Significant advancements have been made in the capabilities of code large language models, leading to their rapid adoption and application across a wide range of domains. However, their further advancements are often constrained by the scarcity of real-world coding problems. To bridge this gap, we propose a novel framework for synthesizing code problems that emulate authentic real-world scenarios.… ▽ More

    Submitted 16 September, 2025; originally announced September 2025.

  19. arXiv:2509.13990  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Slim-SC: Thought Pruning for Efficient Scaling with Self-Consistency

    Authors: Colin Hong, Xu Guo, Anand Chaanan Singh, Esha Choukse, Dmitrii Ustiugov

    Abstract: Recently, Test-Time Scaling (TTS) has gained increasing attention for improving LLM reasoning performance at test time without retraining the model. A notable TTS technique is Self-Consistency (SC), which generates multiple reasoning chains in parallel and selects the final answer via majority voting. While effective, the order-of-magnitude computational overhead limits its broad deployment. Prior… ▽ More

    Submitted 17 September, 2025; originally announced September 2025.

    Comments: Accepted by EMNLP 2025 (Oral), 9 pages

    ACM Class: I.2.7

  20. arXiv:2509.13542  [pdf

    cond-mat.mtrl-sci

    Tuning Coupled Toroidic and Polar Orders in a Bilayer Antiferromagnet

    Authors: Chuangtang Wang, Xiaoyu Guo, Zixin Zhai, Meixin Cheng, Sang-Wook Cheong, Adam W. Tsen, Bing Lv, Liuyan Zhao

    Abstract: Magnetic toroidal order features a loop-like arrangement of magnetic dipole moments, thus breaking both spatial inversion (P) and time-reversal (T) symmetries while preserving their combined PT sym-metry. This PT symmetry enables a linear magnetoelectric effect, allowing the coupling between magnetic toroidicity and electric polarity. However, the detection and control of two-dimensional (2D) magn… ▽ More

    Submitted 16 September, 2025; originally announced September 2025.

    Comments: 12 pages, 4 figures

  21. arXiv:2509.12683  [pdf, ps, other

    cs.CV

    StereoCarla: A High-Fidelity Driving Dataset for Generalizable Stereo

    Authors: Xianda Guo, Chenming Zhang, Ruilin Wang, Youmin Zhang, Wenzhao Zheng, Matteo Poggi, Hao Zhao, Qin Zou, Long Chen

    Abstract: Stereo matching plays a crucial role in enabling depth perception for autonomous driving and robotics. While recent years have witnessed remarkable progress in stereo matching algorithms, largely driven by learning-based methods and synthetic datasets, the generalization performance of these models remains constrained by the limited diversity of existing training data. To address these challenges,… ▽ More

    Submitted 16 September, 2025; originally announced September 2025.

  22. arXiv:2509.11499  [pdf

    cs.LG physics.data-an

    OASIS: A Deep Learning Framework for Universal Spectroscopic Analysis Driven by Novel Loss Functions

    Authors: Chris Young, Juejing Liu, Marie L. Mortensen, Yifu Feng, Elizabeth Li, Zheming Wang, Xiaofeng Guo, Kevin M. Rosso, Xin Zhang

    Abstract: The proliferation of spectroscopic data across various scientific and engineering fields necessitates automated processing. We introduce OASIS (Omni-purpose Analysis of Spectra via Intelligent Systems), a machine learning (ML) framework for technique-independent, automated spectral analysis, encompassing denoising, baseline correction, and comprehensive peak parameter (location, intensity, FWHM) r… ▽ More

    Submitted 14 September, 2025; originally announced September 2025.

  23. arXiv:2509.10005  [pdf, ps, other

    cs.CV

    TUNI: Real-time RGB-T Semantic Segmentation with Unified Multi-Modal Feature Extraction and Cross-Modal Feature Fusion

    Authors: Xiaodong Guo, Tong Liu, Yike Li, Zi'ang Lin, Zhihong Deng

    Abstract: RGB-thermal (RGB-T) semantic segmentation improves the environmental perception of autonomous platforms in challenging conditions. Prevailing models employ encoders pre-trained on RGB images to extract features from both RGB and infrared inputs, and design additional modules to achieve cross-modal feature fusion. This results in limited thermal feature extraction and suboptimal cross-modal fusion,… ▽ More

    Submitted 12 September, 2025; originally announced September 2025.

  24. arXiv:2509.09773  [pdf, ps, other

    stat.ME math.ST

    Optimal Inference of the Mean Outcome under Optimal Treatment Regime

    Authors: Shuoxun Xu, Xinzhou Guo

    Abstract: When an optimal treatment regime (OTR) is considered, we need to evaluate the OTR in a valid and efficient way. The classical inference applied to the mean outcome under OTR, assuming the OTR is the same as the estimated OTR, might be biased when the regularity assumption that OTR is unique is violated. Although several methods have been proposed to allow nonregularity in such inference, its optim… ▽ More

    Submitted 11 September, 2025; originally announced September 2025.

    Comments: 17 pages, 5 figures

    MSC Class: 62G20; 62C05 (Primary) 62B10 (Secondary)

  25. arXiv:2509.09505  [pdf, ps, other

    cs.AR

    Combating the Memory Walls: Optimization Pathways for Long-Context Agentic LLM Inference

    Authors: Haoran Wu, Can Xiao, Jiayi Nie, Xuan Guo, Binglei Lou, Jeffrey T. H. Wong, Zhiwen Mo, Cheng Zhang, Przemyslaw Forys, Wayne Luk, Hongxiang Fan, Jianyi Cheng, Timothy M. Jones, Rika Antonova, Robert Mullins, Aaron Zhao

    Abstract: LLMs now form the backbone of AI agents for a diverse array of applications, including tool use, command-line agents, and web or computer use agents. These agentic LLM inference tasks are fundamentally different from chatbot-focused inference -- they often have much larger context lengths to capture complex, prolonged inputs, such as entire webpage DOMs or complicated tool call trajectories. This,… ▽ More

    Submitted 24 September, 2025; v1 submitted 11 September, 2025; originally announced September 2025.

  26. arXiv:2509.08650  [pdf

    cond-mat.mtrl-sci

    Intertwined polar, chiral, and ferro-rotational orders in a rotation-only insulator

    Authors: Weizhe Zhang, June Ho Yeo, Xiaoyu Guo, Tony Chiang, Nishkarsh Agarwal, John T. Heron, Kai Sun, Junjie Yang, Sang-Wook Cheong, Youngjun Ahn, Liuyan Zhao

    Abstract: Intertwined orders refer to strongly coupled and mutually dependent orders that coexist in correlated electron systems, often underpinning key physical properties of the host materials. Among them, polar, chiral, and ferro-rotational orders have been theoretically known to form a closed set of intertwined orders. However, experimental investigation into their mutual coupling and physical consequen… ▽ More

    Submitted 10 September, 2025; originally announced September 2025.

    Comments: 15 pages, 4 figures

  27. arXiv:2509.07571  [pdf, ps, other

    cs.MA cs.AI

    Towards Generalized Routing: Model and Agent Orchestration for Adaptive and Efficient Inference

    Authors: Xiyu Guo, Shan Wang, Chunfang Ji, Xuefeng Zhao, Wenhao Xi, Yaoyao Liu, Qinglan Li, Chao Deng, Junlan Feng

    Abstract: The rapid advancement of large language models (LLMs) and domain-specific AI agents has greatly expanded the ecosystem of AI-powered services. User queries, however, are highly diverse and often span multiple domains and task types, resulting in a complex and heterogeneous landscape. This diversity presents a fundamental routing challenge: how to accurately direct each query to an appropriate exec… ▽ More

    Submitted 10 September, 2025; v1 submitted 9 September, 2025; originally announced September 2025.

  28. arXiv:2509.07322  [pdf, ps, other

    stat.ME

    Double Machine Learning for Estimating Time-Varying Delayed and Instantaneous Effects Using Digital Phenotypes

    Authors: Xingche Guo, Zexi Cai, Yuanjia Wang, Donglin Zeng

    Abstract: Mobile health (mHealth) leverages digital technologies, such as mobile phones, to capture objective, frequent, and real-world digital phenotypes from individuals, enabling the delivery of tailored interventions to accommodate substantial between-subject and temporal heterogeneity. However, evaluating heterogeneous treatment effects from digital phenotype data is challenging due to the dynamic natu… ▽ More

    Submitted 8 September, 2025; originally announced September 2025.

  29. arXiv:2509.06887  [pdf, ps, other

    cs.IR

    UniSearch: Rethinking Search System with a Unified Generative Architecture

    Authors: Jiahui Chen, Xiaoze Jiang, Zhibo Wang, Quanzhi Zhu, Junyao Zhao, Feng Hu, Kang Pan, Ao Xie, Maohua Pei, Zhiheng Qin, Hongjing Zhang, Zhixin Zhai, Xiaobo Guo, Runbin Zhou, Kefeng Wang, Mingyang Geng, Cheng Chen, Jingshan Lv, Yupeng Huang, Xiao Liang, Han Li

    Abstract: Modern search systems play a crucial role in facilitating information acquisition. Traditional search engines typically rely on a cascaded architecture, where results are retrieved through recall, pre-ranking, and ranking stages. The complexity of designing and maintaining multiple modules makes it difficult to achieve holistic performance gains. Recent advances in generative recommendation have m… ▽ More

    Submitted 10 September, 2025; v1 submitted 8 September, 2025; originally announced September 2025.

  30. arXiv:2509.06798  [pdf, ps, other

    cs.CV

    SynthDrive: Scalable Real2Sim2Real Sensor Simulation Pipeline for High-Fidelity Asset Generation and Driving Data Synthesis

    Authors: Zhengqing Chen, Ruohong Mei, Xiaoyang Guo, Qingjie Wang, Yubin Hu, Wei Yin, Weiqiang Ren, Qian Zhang

    Abstract: In the field of autonomous driving, sensor simulation is essential for generating rare and diverse scenarios that are difficult to capture in real-world environments. Current solutions fall into two categories: 1) CG-based methods, such as CARLA, which lack diversity and struggle to scale to the vast array of rare cases required for robust perception training; and 2) learning-based approaches, suc… ▽ More

    Submitted 8 September, 2025; originally announced September 2025.

    Comments: 8 pages

  31. arXiv:2509.06389  [pdf, ps, other

    cs.SD cs.AI

    MeanFlow-Accelerated Multimodal Video-to-Audio Synthesis via One-Step Generation

    Authors: Xiaoran Yang, Jianxuan Yang, Xinyue Guo, Haoyu Wang, Ningning Pan, Gongping Huang

    Abstract: A key challenge in synthesizing audios from silent videos is the inherent trade-off between synthesis quality and inference efficiency in existing methods. For instance, flow matching based models rely on modeling instantaneous velocity, inherently require an iterative sampling process, leading to slow inference speeds. To address this efficiency bottleneck, we introduce a MeanFlow-accelerated mod… ▽ More

    Submitted 8 September, 2025; originally announced September 2025.

  32. arXiv:2509.05950  [pdf, ps, other

    cond-mat.dis-nn

    Theory of Localized States in Quasiperiodic Lattices

    Authors: Jin-Rong Chen, Xin-Yu Guo, Shi-Ping Ding, Tian-Le Wu, Miao Liang, Jin-Hua Gao, X. C. Xie

    Abstract: The physics of localized states in quasiperiodic lattices has been extensively studied for decades, but still lacks an comprehensive theoretical framework. Recently, we developed a incommensurate energy band (IEB) theory, which extends the concept of energy bands to quasiperiodic systems lacking translational symmetry, thereby achieving a breakthrough in elucidating extended states. Here, we demon… ▽ More

    Submitted 7 September, 2025; originally announced September 2025.

    Comments: 6 pages, 3 figures

  33. arXiv:2509.04829  [pdf, ps, other

    physics.ins-det hep-ex

    Preparation and measurement of an $\rm ^{37}$Ar source for liquid xenon detector calibration

    Authors: Xu-Nan Guo, Chang Cai, Fei Gao, Yang Lei, Kai-Hang Li, Chun-Lei Su, Ze-Peng Wu, Xiang Xiao, Ling-Feng Xie, Yi-Fei Zhao, Xiao-Peng Zhou

    Abstract: We present the preparation and measurement of the radioactive isotope $\rm ^{37}Ar$, which was produced using thermal neutrons from a reactor, as a calibration source for liquid xenon time projection chambers. $\rm ^{37}Ar$ is a low-energy calibration source with a half-life of 35.01 days, making it suitable for calibration in the low-energy region of liquid xenon dark-matter experiments. Radioact… ▽ More

    Submitted 5 September, 2025; originally announced September 2025.

  34. arXiv:2509.04218  [pdf, ps, other

    astro-ph.CO astro-ph.HE

    Bright siren without electromagnetic counterpart by LISA-Taiji-TianQin network

    Authors: Yejing Zhan, David Izquierdo-Villalba, Xiao Guo, Qing Yang, Daniele Spinoso, Fa-Yin Wang

    Abstract: Gravitational waves (GWs) with electromagnetic counterparts (EMc) offer a novel approach to measure the Hubble constant ($H_0$), known as bright sirens, enabling $H_0$ measurements by combining GW-derived distances with EM-derived redshifts. Host galaxy identification is essential for redshift determination but remains challenging due to poor GW sky localization and uncertainties in EMc models. To… ▽ More

    Submitted 4 September, 2025; originally announced September 2025.

    Comments: 20 pages, 9 figures, 2 tables. Submitted to AAS journal

  35. arXiv:2509.03887  [pdf, ps, other

    cs.CV

    OccTENS: 3D Occupancy World Model via Temporal Next-Scale Prediction

    Authors: Bu Jin, Songen Gu, Xiaotao Hu, Yupeng Zheng, Xiaoyang Guo, Qian Zhang, Xiaoxiao Long, Wei Yin

    Abstract: In this paper, we propose OccTENS, a generative occupancy world model that enables controllable, high-fidelity long-term occupancy generation while maintaining computational efficiency. Different from visual generation, the occupancy world model must capture the fine-grained 3D geometry and dynamic evolution of the 3D scenes, posing great challenges for the generative models. Recent approaches bas… ▽ More

    Submitted 4 September, 2025; originally announced September 2025.

  36. arXiv:2509.03332  [pdf, ps, other

    astro-ph.CO astro-ph.HE gr-qc

    PTA Frequency Band Individual Gravitational Wave Sources and Dark Energy Detection Based on Cosmological Simulation

    Authors: Qing Yang, Gu-yue Zhang, Yi Huang, Xiao Guo

    Abstract: Nanohertz gravitational waves (GWs) from supermassive binary black holes (SMBBHs), detectable via pulsar timing arrays (PTAs), offer a novel avenue to constrain dark energy. Based on cosmological simulations and semi-analytic galaxy formation models, this study explores the detectability of individual nanohertz SMBBH sources using next-generation PTAs and their potential for constraining dark ener… ▽ More

    Submitted 3 September, 2025; originally announced September 2025.

    Comments: 15 pages, 8 figures, 5 tables

  37. arXiv:2509.03236  [pdf, ps, other

    cs.IR

    OneSearch: A Preliminary Exploration of the Unified End-to-End Generative Framework for E-commerce Search

    Authors: Ben Chen, Xian Guo, Siyuan Wang, Zihan Liang, Yue Lv, Yufei Ma, Xinlong Xiao, Bowen Xue, Xuxin Zhang, Ying Yang, Huangyu Dai, Xing Xu, Tong Zhao, Mingcan Peng, Xiaoyang Zheng, Chao Wang, Qihang Zhao, Zhixin Zhai, Yang Zhao, Bochao Liu, Jingshan Lv, Xiao Liang, Yuqing Ding, Jing Chen, Chenyi Lei , et al. (3 additional authors not shown)

    Abstract: Traditional e-commerce search systems employ multi-stage cascading architectures (MCA) that progressively filter items through recall, pre-ranking, and ranking stages. While effective at balancing computational efficiency with business conversion, these systems suffer from fragmented computation and optimization objective collisions across stages, which ultimately limit their performance ceiling.… ▽ More

    Submitted 30 September, 2025; v1 submitted 3 September, 2025; originally announced September 2025.

  38. arXiv:2509.01898  [pdf, ps, other

    cs.CV

    DroneSR: Rethinking Few-shot Thermal Image Super-Resolution from Drone-based Perspective

    Authors: Zhipeng Weng, Xiaopeng Liu, Ce Liu, Xingyuan Guo, Yukai Shi, Liang Lin

    Abstract: Although large scale models achieve significant improvements in performance, the overfitting challenge still frequently undermines their generalization ability. In super resolution tasks on images, diffusion models as representatives of generative models typically adopt large scale architectures. However, few-shot drone-captured infrared training data frequently induces severe overfitting in large… ▽ More

    Submitted 1 September, 2025; originally announced September 2025.

  39. arXiv:2509.01262  [pdf

    physics.optics

    Integrated photonic neuromorphic computing: device, architecture, chip, algorithm

    Authors: Shuiying Xiang, Chengyang Yu, Yizhi Wang, Xintao Zeng, Yuna Zhang, Dianzhuang Zheng, Xinran Niu, Haowen Zhao, Hanxu Zhou, Yanan Han, Xingxing Guo, Yahui Zhang, Yue Hao

    Abstract: Artificial intelligence (AI) has experienced explosive growth in recent years. The large models have been widely applied in various fields, including natural language processing, image generation, and complex decision-making systems, revolutionizing technological paradigms across multiple industries. Nevertheless, the substantial data processing demands during model training and inference result i… ▽ More

    Submitted 1 September, 2025; originally announced September 2025.

  40. arXiv:2509.00951  [pdf, ps, other

    astro-ph.HE

    The Most Luminous Known Fast Blue Optical Transient AT 2024wpp: Unprecedented Evolution and Properties in the Ultraviolet to the Near-Infrared

    Authors: Natalie LeBaron, Raffaella Margutti, Ryan Chornock, A. J. Nayana, Olivia Aspegren, Wenbin Lu, Brian Metzger, Daniel Kasen, Thomas Brink, Sergio Campana, Paolo D'Avanzo, Jakob Faber, Matteo Ferro, Alex Filippenko, Ryan Foley, Xinze Guo, Erica Hammerstein, Saurabh Jha, Charles Kilpatrick, Giulia Migliori, Dan Milisavljevic, Kishore Patra, Huei Sears, Jonathan Swift, Samaporn Tinyanont , et al. (23 additional authors not shown)

    Abstract: We present an extensive photometric and spectroscopic ultraviolet-optical-infrared campaign on the luminous fast blue optical transient (LFBOT) AT 2024wpp over the first ~100 d. AT 2024wpp is the most luminous LFBOT discovered to date, with $L_{\rm{pk}}\approx(2-4)\times10^{45}$ erg s$^{-1}$ (5-10 times that of the prototypical AT 2018cow). This extreme luminosity enabled the acquisition of the mo… ▽ More

    Submitted 31 August, 2025; originally announced September 2025.

    Comments: 33 pages, 13 figures, submitted to ApJL

  41. arXiv:2508.20395  [pdf, ps, other

    cs.CL cs.AI

    Measuring Reasoning Utility in LLMs via Conditional Entropy Reduction

    Authors: Xu Guo

    Abstract: Recent advancements in large language models (LLMs) often rely on generating intermediate reasoning steps to enhance accuracy. However, little work has examined how reasoning utility contributes to the final answer's correctness. Due to the stochastic nature of autoregressive generation, generating more context does not guarantee increased confidence in the answer. If we could predict, during gene… ▽ More

    Submitted 27 August, 2025; originally announced August 2025.

    Comments: 11 pages, 4 figures

    ACM Class: I.2.7

  42. arXiv:2508.18761  [pdf, ps, other

    hep-ex

    Study of the $χ_{cJ}\rightarrowΛ\barΛη^\prime$ decays

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. B. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (683 additional authors not shown)

    Abstract: Using a data sample of $(2.712\pm0.014)\times10^{9}$ $ψ(3686)$ events collected with the BESIII detector at the BEPCII collider, we investigate the decays $χ_{cJ} \rightarrow Λ\barΛ η^\prime$ for $J=0,~1,~2$ via the radiative transition $ψ(3686) \rightarrow γχ_{cJ}$. The decays $χ_{c0,2}\rightarrowΛ\barΛη^\prime$ are observed for the first time, with statistical significances of 6.7$\,σ$ and 6.4… ▽ More

    Submitted 26 August, 2025; originally announced August 2025.

  43. arXiv:2508.18241  [pdf, ps, other

    cond-mat.mtrl-sci

    Atomistic Structure of Transient Switching States in Ferroelectric AlScN

    Authors: Jiawei Huang, Jinyang Li, Xinyue Guo, Tongqi Wen, David J. Srolovitz, Zhen Chen, Zuhuang Chen, Shi Liu

    Abstract: We resolve the microscopic mechanism of polarization switching in wurtzite ferroelectric AlScN by integrating advanced thin-film fabrication, ferroelectric switching dynamics characterizations, high-resolution scanning transmission electron microscopy (STEM), and large-scale molecular dynamics simulations enabled by a deep neural network-based interatomic potential. Contrary to earlier interpretat… ▽ More

    Submitted 25 August, 2025; originally announced August 2025.

  44. arXiv:2508.18171  [pdf, ps, other

    hep-th gr-qc

    Quadratic curvature corrections to 5-dimensional Kerr-AdS black hole thermodynamics by background subtraction method

    Authors: Gerui Chen, Xiyao Guo, Xin Lan, Hongbao Zhang, Wei Zhang

    Abstract: We justify the applicability of the background subtraction method to both Einstein's gravity and its higher derivative corrections in 5-dimensional asymptotically AdS spacetimes, where the corresponding higher derivative corrections to the expression for the ADM mass and angular momentum are also worked out. Then we further apply the background subtraction method to calculate the first order corre… ▽ More

    Submitted 26 August, 2025; v1 submitted 25 August, 2025; originally announced August 2025.

    Comments: 11 pages, title sharpened, typos corrected

  45. arXiv:2508.17972  [pdf, ps, other

    cs.CV

    SAIL-Recon: Large SfM by Augmenting Scene Regression with Localization

    Authors: Junyuan Deng, Heng Li, Tao Xie, Weiqiang Ren, Qian Zhang, Ping Tan, Xiaoyang Guo

    Abstract: Scene regression methods, such as VGGT, solve the Structure-from-Motion (SfM) problem by directly regressing camera poses and 3D scene structures from input images. They demonstrate impressive performance in handling images under extreme viewpoint changes. However, these methods struggle to handle a large number of input images. To address this problem, we introduce SAIL-Recon, a feed-forward Tran… ▽ More

    Submitted 25 August, 2025; originally announced August 2025.

  46. arXiv:2508.17819  [pdf, ps, other

    hep-ex

    Search for CP violation in e+e- -> psi(3770) -> DDbar via D -> KsPi0

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. B. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (707 additional authors not shown)

    Abstract: Utilizing data sample of electron-positron collisions recorded with the BESIII detector at the center-of-mass energies of 3.773~GeV, corresponding to an integrated luminosity of 20.28~fb$^{-1}$, we report the first search for the CP forbidden process $e^+e^- \to ψ(3773) \to D^0\bar{D}^0 \to (K^0_Sπ^0)(K^0_Sπ^0)$. No significant signal is observed. We set the upper limit on the observed cross secti… ▽ More

    Submitted 26 August, 2025; v1 submitted 25 August, 2025; originally announced August 2025.

    Comments: 9 pages, 4 figures

  47. arXiv:2508.16653  [pdf, ps, other

    cs.PF

    H2EAL: Hybrid-Bonding Architecture with Hybrid Sparse Attention for Efficient Long-Context LLM Inference

    Authors: Zizhuo Fu, Xiaotian Guo, Wenxuan Zeng, Shuzhang Zhong, Yadong Zhang, Peiyu Chen, Runsheng Wang, Le Ye, Meng Li

    Abstract: Large language models (LLMs) have demonstrated remarkable proficiency in a wide range of natural language processing applications. However, the high energy and latency overhead induced by the KV cache limits the edge deployment, especially for long contexts. Emerging hybrid bonding (HB) technology has been proposed as a promising alternative to conventional near-memory processing (NMP) architectur… ▽ More

    Submitted 19 August, 2025; originally announced August 2025.

    Comments: International Conference on Computer-Aided Design (ICCAD) 2025

  48. arXiv:2508.15763  [pdf, ps, other

    cs.LG cs.CL cs.CV

    Intern-S1: A Scientific Multimodal Foundation Model

    Authors: Lei Bai, Zhongrui Cai, Yuhang Cao, Maosong Cao, Weihan Cao, Chiyu Chen, Haojiong Chen, Kai Chen, Pengcheng Chen, Ying Chen, Yongkang Chen, Yu Cheng, Pei Chu, Tao Chu, Erfei Cui, Ganqu Cui, Long Cui, Ziyun Cui, Nianchen Deng, Ning Ding, Nanqing Dong, Peijie Dong, Shihan Dou, Sinan Du, Haodong Duan , et al. (152 additional authors not shown)

    Abstract: In recent years, a plethora of open-source foundation models have emerged, achieving remarkable progress in some widely attended fields, with performance being quite close to that of closed-source models. However, in high-value but more challenging scientific professional fields, either the fields still rely on expert models, or the progress of general foundation models lags significantly compared… ▽ More

    Submitted 24 August, 2025; v1 submitted 21 August, 2025; originally announced August 2025.

  49. arXiv:2508.15376  [pdf, ps, other

    cs.CV

    DriveSplat: Decoupled Driving Scene Reconstruction with Geometry-enhanced Partitioned Neural Gaussians

    Authors: Cong Wang, Xianda Guo, Wenbo Xu, Wei Tian, Ruiqi Song, Chenming Zhang, Lingxi Li, Long Chen

    Abstract: In the realm of driving scenarios, the presence of rapidly moving vehicles, pedestrians in motion, and large-scale static backgrounds poses significant challenges for 3D scene reconstruction. Recent methods based on 3D Gaussian Splatting address the motion blur problem by decoupling dynamic and static components within the scene. However, these decoupling strategies overlook background optimizatio… ▽ More

    Submitted 21 September, 2025; v1 submitted 21 August, 2025; originally announced August 2025.

  50. arXiv:2508.14819  [pdf, ps, other

    physics.app-ph

    Synchronization driven acoustics: The nonlinear scattering of a self-oscillating meta-atom

    Authors: Alexander K. Stoychev, Xinxin Guo, Ulrich Kuhl, Nicolas Noiray

    Abstract: In this study we demonstrate a self-oscillating acoustic meta-atom functioning as an amplifying transistor, where a steady external flow serves as a control signal to switch between reflective (off-state) and transmissive (on-state) regimes. In the on-state, an acoustic limit cycle synchronizes with incident sound waves. This process governs the energy transfer across the device, with a transmissi… ▽ More

    Submitted 22 August, 2025; v1 submitted 20 August, 2025; originally announced August 2025.

    Comments: 10 pages, 9 figures