Skip to main content

Showing 201–250 of 6,538 results for author: Suen, Y

.
  1. arXiv:2505.15151  [pdf, ps, other

    cs.LG

    Time Tracker: Mixture-of-Experts-Enhanced Foundation Time Series Forecasting Model with Decoupled Training Pipelines

    Authors: Xiaohou Shi, Ke Li, Aobo Liang, Yan Sun

    Abstract: In the past few years, time series foundation models have achieved superior predicting accuracy. However, real-world time series often exhibit significant diversity in their temporal patterns across different time spans and domains, making it challenging for a single model architecture to fit all complex scenarios. In addition, time series data may have multiple variables exhibiting complex correl… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

  2. Test of local realism via entangled $Λ\barΛ$ system

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (597 additional authors not shown)

    Abstract: The non-locality of quantum correlations is a fundamental feature of quantum theory. The Bell inequality serves as a benchmark for distinguishing between predictions made by quantum theory and local hidden variable theory (LHVT). Recent advancements in photon-entanglement experiments have addressed potential loopholes and have observed significant violations of variants of Bell inequality. However… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

    Journal ref: Nat Commun 16, 4948 (2025)

  3. arXiv:2505.14974  [pdf, ps, other

    physics.optics

    Full spectral response of grating-induced loss in photonic crystal microrings

    Authors: Daniel Pimbi, Yi Sun, Roy Zektzer, Xiyuan Lu, Kartik Srinivasan

    Abstract: Photonic crystal microrings (PhCRs) have emerged as powerful and versatile platforms for integrated nonlinear photonics, offering precise control over frequency and phase matching while maintaining high optical quality factors. Through grating-mediated mode coupling, PhCRs enable advanced dispersion engineering, which is critical for wideband nonlinear processes such as optical parametric oscillat… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

  4. arXiv:2505.14916  [pdf

    eess.IV cs.CV

    Super-Resolution Optical Coherence Tomography Using Diffusion Model-Based Plug-and-Play Priors

    Authors: Yaning Wang, Jinglun Yu, Wenhan Guo, Yu Sun, Jin U. Kang

    Abstract: We propose an OCT super-resolution framework based on a plug-and-play diffusion model (PnP-DM) to reconstruct high-quality images from sparse measurements (OCT B-mode corneal images). Our method formulates reconstruction as an inverse problem, combining a diffusion prior with Markov chain Monte Carlo sampling for efficient posterior inference. We collect high-speed under-sampled B-mode corneal ima… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

  5. arXiv:2505.14560  [pdf, ps, other

    eess.IV cs.CV

    Neural Inverse Scattering with Score-based Regularization

    Authors: Yuan Gao, Wenhan Guo, Yu Sun

    Abstract: Inverse scattering is a fundamental challenge in many imaging applications, ranging from microscopy to remote sensing. Solving this problem often requires jointly estimating two unknowns -- the image and the scattering field inside the object -- necessitating effective image prior to regularize the inference. In this paper, we propose a regularized neural field (NF) approach which integrates the d… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

  6. arXiv:2505.14161  [pdf, other

    cs.LG

    Personalized Bayesian Federated Learning with Wasserstein Barycenter Aggregation

    Authors: Ting Wei, Biao Mei, Junliang Lyu, Renquan Zhang, Feng Zhou, Yifan Sun

    Abstract: Personalized Bayesian federated learning (PBFL) handles non-i.i.d. client data and quantifies uncertainty by combining personalization with Bayesian inference. However, existing PBFL methods face two limitations: restrictive parametric assumptions in client posterior inference and naive parameter averaging for server aggregation. To overcome these issues, we propose FedWBA, a novel PBFL method tha… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

  7. arXiv:2505.14135  [pdf, other

    cs.CV

    Hunyuan-Game: Industrial-grade Intelligent Game Creation Model

    Authors: Ruihuang Li, Caijin Zhou, Shoujian Zheng, Jianxiang Lu, Jiabin Huang, Comi Chen, Junshu Tang, Guangzheng Xu, Jiale Tao, Hongmei Wang, Donghao Li, Wenqing Yu, Senbo Wang, Zhimin Li, Yetshuan Shi, Haoyu Yang, Yukun Wang, Wenxun Dai, Jiaqi Li, Linqing Wang, Qixun Wang, Zhiyong Xu, Yingfang Zhang, Jiangfeng Xiong, Weijie Kong , et al. (33 additional authors not shown)

    Abstract: Intelligent game creation represents a transformative advancement in game development, utilizing generative artificial intelligence to dynamically generate and enhance game content. Despite notable progress in generative models, the comprehensive synthesis of high-quality game assets, including both images and videos, remains a challenging frontier. To create high-fidelity game content that simult… ▽ More

    Submitted 28 May, 2025; v1 submitted 20 May, 2025; originally announced May 2025.

  8. arXiv:2505.14057  [pdf, ps, other

    cs.IR cs.AI

    Field Matters: A lightweight LLM-enhanced Method for CTR Prediction

    Authors: Yu Cui, Feng Liu, Jiawei Chen, Xingyu Lou, Changwang Zhang, Jun Wang, Yuegang Sun, Xiaohu Yang, Can Wang

    Abstract: Click-through rate (CTR) prediction is a fundamental task in modern recommender systems. In recent years, the integration of large language models (LLMs) has been shown to effectively enhance the performance of traditional CTR methods. However, existing LLM-enhanced methods often require extensive processing of detailed textual descriptions for large-scale instances or user/item entities, leading… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

  9. OGHReS: Star formation in the Outer Galaxy II ($\ell = 180^\circ$-$280^\circ$)

    Authors: J. S. Urquhart, C. Koenig, D. Colombo, A. Karska, A. Giannetti, T. J. T. Moore, A. Y. Yang, F. Wyrowski, Y. Sun, Z. Jiang, K. R. Neralwar, D. Eden, I. Grozdanova, S. Neupane, M. Figueira, E. Dann, V., S. Veena, W. -J. Kim, S. Leurini, J. Brand, M. -Y. Lee

    Abstract: The Outer Galaxy High-Resolution Survey (OGHReS) covers 100 square degrees ($180^\circ < \ell < 280^\circ$) in the (2--1) transitions of three CO-isotopologues. We use the spectra to refine the velocities and physical properties to 6706 \higal\ clumps located in the OGHReS region. In a previous paper, we analysed 3584 clumps between $\ell = 250^\circ$ and $280^\circ$. Here, we cover a further 3122… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

    Comments: 18 pages, 14 figues. Full versions of Tables 1 and 2 are only available in electronic form via CDS. arXiv admin note: text overlap with arXiv:2401.00808

  10. arXiv:2505.13633  [pdf, ps, other

    cs.CV

    IPENS:Interactive Unsupervised Framework for Rapid Plant Phenotyping Extraction via NeRF-SAM2 Fusion

    Authors: Wentao Song, He Huang, Youqiang Sun, Fang Qu, Jiaqi Zhang, Longhui Fang, Yuwei Hao, Chenyang Peng

    Abstract: Advanced plant phenotyping technologies play a crucial role in targeted trait improvement and accelerating intelligent breeding. Due to the species diversity of plants, existing methods heavily rely on large-scale high-precision manually annotated data. For self-occluded objects at the grain level, unsupervised methods often prove ineffective. This study proposes IPENS, an interactive unsupervised… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

  11. arXiv:2505.13579  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    Learning Wavelet-Sparse FDK for 3D Cone-Beam CT Reconstruction

    Authors: Yipeng Sun, Linda-Sophie Schneider, Chengze Ye, Mingxuan Gu, Siyuan Mei, Siming Bayer, Andreas Maier

    Abstract: Cone-Beam Computed Tomography (CBCT) is essential in medical imaging, and the Feldkamp-Davis-Kress (FDK) algorithm is a popular choice for reconstruction due to its efficiency. However, FDK is susceptible to noise and artifacts. While recent deep learning methods offer improved image quality, they often increase computational complexity and lack the interpretability of traditional methods. In this… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

    Comments: Accepted by Fully3D 2025

  12. arXiv:2505.13222  [pdf, ps, other

    hep-ex

    Partial Wave Analysis of $e^{+}e^{-} \rightarrow π^{+}π^{-}J/ψ$ and Cross Section Measurement of $e^{+}e^{-} \rightarrow π^{\pm}Z_{c}(3900)^{\mp}$ from 4.1271 to 4.3583 GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (639 additional authors not shown)

    Abstract: Based on 12.0 $\mathrm{fb^{-1}}$ of $e^{+}e^{-}$ collision data samples collected by the BESIII detector at center-of-mass energies from 4.1271 to 4.3583 GeV, a partial wave analysis is performed for the process $e^{+}e^{-} \rightarrow π^{+}π^{-}J/ψ$. The cross sections for the sub processes ${e^{+}e^{-}\rightarrowπ^{+}Z_{c}(3900)^{-}+c.c.\rightarrowπ^{+}π^{-}J/ψ}$,… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

  13. arXiv:2505.13211  [pdf, ps, other

    cs.CV cs.AI

    MAGI-1: Autoregressive Video Generation at Scale

    Authors: Sand. ai, Hansi Teng, Hongyu Jia, Lei Sun, Lingzhi Li, Maolin Li, Mingqiu Tang, Shuai Han, Tianning Zhang, W. Q. Zhang, Weifeng Luo, Xiaoyang Kang, Yuchen Sun, Yue Cao, Yunpeng Huang, Yutong Lin, Yuxin Fang, Zewei Tao, Zheng Zhang, Zhongshu Wang, Zixun Liu, Dai Shi, Guoli Su, Hanwen Sun, Hong Pan , et al. (14 additional authors not shown)

    Abstract: We present MAGI-1, a world model that generates videos by autoregressively predicting a sequence of video chunks, defined as fixed-length segments of consecutive frames. Trained to denoise per-chunk noise that increases monotonically over time, MAGI-1 enables causal temporal modeling and naturally supports streaming generation. It achieves strong performance on image-to-video (I2V) tasks condition… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

  14. arXiv:2505.12884  [pdf, ps, other

    cs.LG cs.AI cs.CV

    TinyAlign: Boosting Lightweight Vision-Language Models by Mitigating Modal Alignment Bottlenecks

    Authors: Yuanze Hu, Zhaoxin Fan, Xinyu Wang, Gen Li, Ye Qiu, Zhichao Yang, Wenjun Wu, Kejian Wu, Yifan Sun, Xiaotie Deng, Jin Dong

    Abstract: Lightweight Vision-Language Models (VLMs) are indispensable for resource-constrained applications. The prevailing approach to aligning vision and language models involves freezing both the vision encoder and the language model while training small connector modules. However, this strategy heavily depends on the intrinsic capabilities of the language model, which can be suboptimal for lightweight m… ▽ More

    Submitted 30 June, 2025; v1 submitted 19 May, 2025; originally announced May 2025.

  15. arXiv:2505.12629  [pdf, ps, other

    cs.LG cs.CL

    Enhancing Latent Computation in Transformers with Latent Tokens

    Authors: Yuchang Sun, Yanxi Chen, Yaliang Li, Bolin Ding

    Abstract: Augmenting large language models (LLMs) with auxiliary tokens has emerged as a promising strategy for enhancing model performance. In this work, we introduce a lightweight method termed latent tokens; these are dummy tokens that may be non-interpretable in natural language but steer the autoregressive decoding process of a Transformer-based LLM via the attention mechanism. The proposed latent toke… ▽ More

    Submitted 18 May, 2025; originally announced May 2025.

  16. arXiv:2505.12539  [pdf, ps, other

    cs.GR

    Penetration-free Solid-Fluid Interaction on Shells and Rods

    Authors: Jinyuan Liu, Yuchen Sun, Yin Yang, Chenfanfu Jiang, Minchen Li, Bo Zhu

    Abstract: We introduce a novel approach to simulate the interaction between fluids and thin elastic solids without any penetration. Our approach is centered around an optimization system augmented with barriers, which aims to find a configuration that ensures the absence of penetration while enforcing incompressibility for the fluids and minimizing elastic potentials for the solids. Unlike previous methods… ▽ More

    Submitted 18 May, 2025; originally announced May 2025.

  17. arXiv:2505.12380  [pdf, ps, other

    cs.LG cs.DB cs.PL

    Graph-Reward-SQL: Execution-Free Reinforcement Learning for Text-to-SQL via Graph Matching and Stepwise Reward

    Authors: Han Weng, Puzhen Wu, Cui Longjie, Yi Zhan, Boyi Liu, Yuanfeng Song, Dun Zeng, Yingxiang Yang, Qianru Zhang, Dong Huang, Xiaoming Yin, Yang Sun, Xing Chen

    Abstract: Reinforcement learning (RL) has been widely adopted to enhance the performance of large language models (LLMs) on Text-to-SQL tasks. However, existing methods often rely on execution-based or LLM-based Bradley-Terry reward models. The former suffers from high execution latency caused by repeated database calls, whereas the latter imposes substantial GPU memory overhead, both of which significantly… ▽ More

    Submitted 27 June, 2025; v1 submitted 18 May, 2025; originally announced May 2025.

  18. arXiv:2505.12234  [pdf, other

    hep-ex

    Observation of $χ_{cJ}(J=0,1,2)\rightarrow p\bar{p}ηη$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (678 additional authors not shown)

    Abstract: Using $(2712.4\pm14.3)\times10^6$ $ψ(3686)$ events collected by the BESIII detector operating at the BEPCII storage ring, the decays $χ_{cJ}(J=0,1,2)\rightarrow p\bar{p}ηη$ are observed for the first time through the radiative transition $ψ(3686)\toγχ_{cJ}$. The statistical significances for $χ_{cJ}$ signals are all larger than 5$σ$. The branching fractions of $χ_{c0,1,2}\to p\bar{p} ηη$ are deter… ▽ More

    Submitted 18 May, 2025; originally announced May 2025.

    Comments: 17 pages, 16 figures

  19. arXiv:2505.12188  [pdf, ps, other

    cs.AR cs.AI

    LLM-DSE: Searching Accelerator Parameters with LLM Agents

    Authors: Hanyu Wang, Xinrui Wu, Zijian Ding, Su Zheng, Chengyue Wang, Tony Nowatzki, Yizhou Sun, Jason Cong

    Abstract: Even though high-level synthesis (HLS) tools mitigate the challenges of programming domain-specific accelerators (DSAs) by raising the abstraction level, optimizing hardware directive parameters remains a significant hurdle. Existing heuristic and learning-based methods struggle with adaptability and sample efficiency. We present LLM-DSE, a multi-agent framework designed specifically for optimizin… ▽ More

    Submitted 20 May, 2025; v1 submitted 17 May, 2025; originally announced May 2025.

  20. arXiv:2505.12108  [pdf, ps, other

    cs.CV cs.AI

    EarthSynth: Generating Informative Earth Observation with Diffusion Models

    Authors: Jiancheng Pan, Shiye Lei, Yuqian Fu, Jiahao Li, Yanxing Liu, Yuze Sun, Xiao He, Long Peng, Xiaomeng Huang, Bo Zhao

    Abstract: Remote sensing image (RSI) interpretation typically faces challenges due to the scarcity of labeled data, which limits the performance of RSI interpretation tasks. To tackle this challenge, we propose EarthSynth, a diffusion-based generative foundation model that enables synthesizing multi-category, cross-satellite labeled Earth observation for downstream RSI interpretation tasks. To the best of o… ▽ More

    Submitted 17 May, 2025; originally announced May 2025.

    Comments: 23 pages

  21. arXiv:2505.12086  [pdf, ps, other

    hep-ex

    Observation of an Altered $a_{0}(980)$ Line-shape in $D^{+} \rightarrow π^{+}ηη$ due to the Triangle Loop Rescattering Effect

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (705 additional authors not shown)

    Abstract: Using 20.3~${\rm fb}^{-1}$ of $e^{+}e^{-}$ collision data taken with the BESIII detector at the center-of-mass energy 3.773~GeV, we report the first amplitude analysis of the hadronic decay $D^{+} \rightarrow π^{+}ηη$. The intermediate process $D^{+} \to a_{0}(980)^{+}η, a_{0}(980)^{+} \to π^{+}η$ is observed and is found to be the only component and its branching fraction is measured to be… ▽ More

    Submitted 17 May, 2025; originally announced May 2025.

  22. arXiv:2505.12044  [pdf, ps, other

    cs.LG

    FlashBias: Fast Computation of Attention with Bias

    Authors: Haixu Wu, Minghao Guo, Yuezhou Ma, Yuanxu Sun, Jianmin Wang, Wojciech Matusik, Mingsheng Long

    Abstract: Attention mechanism has emerged as a foundation module of modern deep learning models and has also empowered many milestones in various domains. Moreover, FlashAttention with IO-aware speedup resolves the efficiency issue of standard attention, further promoting its practicality. Beyond canonical attention, attention with bias also widely exists, such as relative position bias in vision and langua… ▽ More

    Submitted 17 May, 2025; originally announced May 2025.

  23. arXiv:2505.12039  [pdf, ps, other

    cs.AI cs.CL physics.soc-ph

    AI-Driven Automation Can Become the Foundation of Next-Era Science of Science Research

    Authors: Renqi Chen, Haoyang Su, Shixiang Tang, Zhenfei Yin, Qi Wu, Hui Li, Ye Sun, Nanqing Dong, Wanli Ouyang, Philip Torr

    Abstract: The Science of Science (SoS) explores the mechanisms underlying scientific discovery, and offers valuable insights for enhancing scientific efficiency and fostering innovation. Traditional approaches often rely on simplistic assumptions and basic statistical tools, such as linear regression and rule-based simulations, which struggle to capture the complexity and scale of modern research ecosystems… ▽ More

    Submitted 17 May, 2025; originally announced May 2025.

  24. arXiv:2505.11823  [pdf, ps, other

    cs.LG math.OC q-bio.QM

    Variational Regularized Unbalanced Optimal Transport: Single Network, Least Action

    Authors: Yuhao Sun, Zhenyi Zhang, Zihan Wang, Tiejun Li, Peijie Zhou

    Abstract: Recovering the dynamics from a few snapshots of a high-dimensional system is a challenging task in statistical physics and machine learning, with important applications in computational biology. Many algorithms have been developed to tackle this problem, based on frameworks such as optimal transport and the Schrödinger bridge. A notable recent framework is Regularized Unbalanced Optimal Transport… ▽ More

    Submitted 17 May, 2025; originally announced May 2025.

  25. arXiv:2505.11197  [pdf, ps, other

    cs.LG math.OC q-bio.QM

    Modeling Cell Dynamics and Interactions with Unbalanced Mean Field Schrödinger Bridge

    Authors: Zhenyi Zhang, Zihan Wang, Yuhao Sun, Tiejun Li, Peijie Zhou

    Abstract: Modeling the dynamics from sparsely time-resolved snapshot data is crucial for understanding complex cellular processes and behavior. Existing methods leverage optimal transport, Schrödinger bridge theory, or their variants to simultaneously infer stochastic, unbalanced dynamics from snapshot data. However, these approaches remain limited in their ability to account for cell-cell interactions. Thi… ▽ More

    Submitted 1 June, 2025; v1 submitted 16 May, 2025; originally announced May 2025.

  26. arXiv:2505.11059  [pdf, other

    hep-th

    Upper bound of holographic entanglement entropy combinations

    Authors: Xin-Xiang Ju, Ya-Wen Sun, Yang Zhao

    Abstract: In this work, we develop a systematic formalism to evaluate the upper bound of a large family of holographic entanglement entropy combinations when fixing $n$ subsystems and fine-tuning one other subsystem. The upper bound configurations and values of these entropy combinations can be derived and classified. The upper bound of these entropy combinations reveals holographic $n+1$-partite entangleme… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

    Comments: 52 pages, 15 figures

  27. arXiv:2505.10996  [pdf, other

    cs.CV

    Visual Anomaly Detection under Complex View-Illumination Interplay: A Large-Scale Benchmark

    Authors: Yunkang Cao, Yuqi Cheng, Xiaohao Xu, Yiheng Zhang, Yihan Sun, Yuxiang Tan, Yuxin Zhang, Xiaonan Huang, Weiming Shen

    Abstract: The practical deployment of Visual Anomaly Detection (VAD) systems is hindered by their sensitivity to real-world imaging variations, particularly the complex interplay between viewpoint and illumination which drastically alters defect visibility. Current benchmarks largely overlook this critical challenge. We introduce Multi-View Multi-Illumination Anomaly Detection (M2AD), a new large-scale benc… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

    Comments: Homgepage: https://hustcyq.github.io/M2AD/. Yunkang Cao and Yuqi Cheng contribute equally to this work

  28. arXiv:2505.10311  [pdf, other

    eess.IV eess.SP stat.AP stat.ML

    Whitened Score Diffusion: A Structured Prior for Imaging Inverse Problems

    Authors: Jeffrey Alido, Tongyu Li, Yu Sun, Lei Tian

    Abstract: Conventional score-based diffusion models (DMs) may struggle with anisotropic Gaussian diffusion processes due to the required inversion of covariance matrices in the denoising score matching training objective \cite{vincent_connection_2011}. We propose Whitened Score (WS) diffusion models, a novel framework based on stochastic differential equations that learns the Whitened Score function instead… ▽ More

    Submitted 20 May, 2025; v1 submitted 15 May, 2025; originally announced May 2025.

  29. arXiv:2505.10297  [pdf, other

    cs.LG cs.AI cs.CR

    Defending the Edge: Representative-Attention for Mitigating Backdoor Attacks in Federated Learning

    Authors: Chibueze Peace Obioma, Youcheng Sun, Mustafa A. Mustafa

    Abstract: Federated learning (FL) enhances privacy and reduces communication cost for resource-constrained edge clients by supporting distributed model training at the edge. However, the heterogeneous nature of such devices produces diverse, non-independent, and identically distributed (non-IID) data, making the detection of backdoor attacks more challenging. In this paper, we propose a novel federated repr… ▽ More

    Submitted 15 May, 2025; originally announced May 2025.

    Comments: Submitted to ESORICS 2025

  30. arXiv:2505.10241  [pdf, ps, other

    physics.ao-ph

    Predicting Beyond Training Data via Extrapolation versus Translocation: AI Weather Models and Dubai's Unprecedented 2024 Rainfall

    Authors: Y. Qiang Sun, Pedram Hassanzadeh, Tiffany Shaw, Hamid A. Pahlavan

    Abstract: Artificial intelligence (AI) models have transformed weather forecasting, but their skill for gray swan extreme events is unclear. Here, we analyze GraphCast and FuXi forecasts of the unprecedented 2024 Dubai storm, which had twice the training set's highest rainfall in that region. Remarkably, GraphCast accurately forecasts this event 8 days ahead. FuXi forecasts the event, but underestimates the… ▽ More

    Submitted 15 May, 2025; originally announced May 2025.

  31. arXiv:2505.09965  [pdf, ps, other

    cs.CV

    MambaControl: Anatomy Graph-Enhanced Mamba ControlNet with Fourier Refinement for Diffusion-Based Disease Trajectory Prediction

    Authors: Hao Yang, Tao Tan, Shuai Tan, Weiqin Yang, Kunyan Cai, Calvin Chen, Yue Sun

    Abstract: Modelling disease progression in precision medicine requires capturing complex spatio-temporal dynamics while preserving anatomical integrity. Existing methods often struggle with longitudinal dependencies and structural consistency in progressive disorders. To address these limitations, we introduce MambaControl, a novel framework that integrates selective state-space modelling with diffusion pro… ▽ More

    Submitted 15 May, 2025; originally announced May 2025.

  32. arXiv:2505.09958  [pdf

    cond-mat.mtrl-sci physics.app-ph

    Ultrafast excitation of polar skyrons

    Authors: Huaiyu Wang, Vladimir Stoica, Cheng Dai, Marek Paściak, Sujit Das, Tiannan Yang, Mauro A. P. Gonçalves, Jiri Kulda, Margaret R. McCarter, Anudeep Mangu, Yue Cao, Hari Padma, Utkarsh Saha, Diling Zhu, Takahiro Sato, Sanghoon Song, Mathias Hoffmann, Patrick Kramer, Silke Nelson, Yanwen Sun, Quynh Nguyen, Zhan Zhang, Ramamoorthy Ramesh, Lane Martin, Aaron M. Lindenberg , et al. (5 additional authors not shown)

    Abstract: Unraveling collective modes arising from coupled degrees of freedom is crucial for understanding complex interactions in solids and developing new functionalities. Unique collective behaviors emerge when two degrees of freedom, ordered on distinct length scales, interact. Polar skyrmions, three-dimensional electric polarization textures in ferroelectric superlattices, disrupt the lattice continuit… ▽ More

    Submitted 19 June, 2025; v1 submitted 15 May, 2025; originally announced May 2025.

  33. arXiv:2505.09916  [pdf, other

    physics.optics

    Parasitic loss in microring-waveguide coupling and its impact on wideband nonlinear photonics

    Authors: Yi Sun, Daniel Pimbi, Xiyuan Lu, Jordan Stone, Junyeob Song, Zhimin Shi, Kartik Srinivasan

    Abstract: Microring resonators enable the enhancement of nonlinear frequency mixing processes, generating output fields at frequencies that widely differ from the inputs, in some cases by more than an octave. The efficiency of such devices depends on effective in- and out-coupling between access waveguides and the microrings at these widely separated frequencies. One successful approach is to separate the c… ▽ More

    Submitted 14 May, 2025; originally announced May 2025.

  34. arXiv:2505.09659  [pdf, ps, other

    cs.LG cs.CL

    LAS: Loss-less ANN-SNN Conversion for Fully Spike-Driven Large Language Models

    Authors: Long Chen, Xiaotian Song, Yanan Sun

    Abstract: Spiking Large Language Models (LLMs) have emerged as an energy-efficient alternative to conventional LLMs through their event-driven computation. To effectively obtain spiking LLMs, researchers develop different ANN-to-SNN conversion methods by leveraging pre-trained ANN parameters while inheriting the energy efficiency of SNN. However, existing conversion methods struggle with extreme activation… ▽ More

    Submitted 14 May, 2025; originally announced May 2025.

  35. arXiv:2505.09284  [pdf, ps, other

    cs.LG stat.ML

    Generating Full-field Evolution of Physical Dynamics from Irregular Sparse Observations

    Authors: Panqi Chen, Yifan Sun, Lei Cheng, Yang Yang, Weichang Li, Yang Liu, Weiqing Liu, Jiang Bian, Shikai Fang

    Abstract: Modeling and reconstructing multidimensional physical dynamics from sparse and off-grid observations presents a fundamental challenge in scientific research. Recently, diffusion-based generative modeling shows promising potential for physical simulation. However, current approaches typically operate on on-grid data with preset spatiotemporal resolution, but struggle with the sparsely observed and… ▽ More

    Submitted 24 May, 2025; v1 submitted 14 May, 2025; originally announced May 2025.

  36. arXiv:2505.08915  [pdf, ps, other

    cs.LG cond-mat.dis-nn cond-mat.stat-mech

    An Analytical Characterization of Sloppiness in Neural Networks: Insights from Linear Models

    Authors: Jialin Mao, Itay Griniasty, Yan Sun, Mark K. Transtrum, James P. Sethna, Pratik Chaudhari

    Abstract: Recent experiments have shown that training trajectories of multiple deep neural networks with different architectures, optimization algorithms, hyper-parameter settings, and regularization methods evolve on a remarkably low-dimensional "hyper-ribbon-like" manifold in the space of probability distributions. Inspired by the similarities in the training trajectories of deep networks and linear netwo… ▽ More

    Submitted 13 May, 2025; originally announced May 2025.

  37. arXiv:2505.08838  [pdf, ps, other

    eess.IV cs.AI cs.CV

    Ultrasound Report Generation with Multimodal Large Language Models for Standardized Texts

    Authors: Peixuan Ge, Tongkun Su, Faqin Lv, Baoliang Zhao, Peng Zhang, Chi Hong Wong, Liang Yao, Yu Sun, Zenan Wang, Pak Kin Wong, Ying Hu

    Abstract: Ultrasound (US) report generation is a challenging task due to the variability of US images, operator dependence, and the need for standardized text. Unlike X-ray and CT, US imaging lacks consistent datasets, making automation difficult. In this study, we propose a unified framework for multi-organ and multilingual US report generation, integrating fragment-based multilingual training and leveragi… ▽ More

    Submitted 19 May, 2025; v1 submitted 13 May, 2025; originally announced May 2025.

  38. arXiv:2505.08583  [pdf

    cond-mat.mtrl-sci

    Strain Induced Robust Skyrmion lattice at Room Temperature in van der Waals Ferromagnet

    Authors: Xinyi Zhou, Iftikhar Ahmed Malik, Ruihuan Duan, Hanqing Shi, Chen Liu, Yan Luo, Yue Sun, Ruixi Chen, Yilin Liu, Shian Xia, Vanessa Li Zhang, Sheng Liu, Chao Zhu, Xixiang Zhang, Yi Du, Zheng Liu, Ting Yu

    Abstract: Manipulating topological magnetic orders of two-dimensional (2D) magnets by strain, once achieved, offers enormous potential for future low-power flexible spintronic applications. In this work, by placing Fe3GaTe2 (FGaT), a room-temperature 2D ferromagnet, on flexible substrate, we demonstrate a field-free and robust formation of skyrmion lattice induced by strain. By applying a minimal strain of… ▽ More

    Submitted 13 May, 2025; originally announced May 2025.

  39. arXiv:2505.08534  [pdf, ps, other

    hep-th

    Holographic geometry/real-space entanglement correspondence and metric reconstruction

    Authors: Xuanting Ji, Xin-Xiang Ju, Ya-Wen Sun, Yuan-Tai Wang, He-Lin Zhou

    Abstract: In holography, the boundary entanglement structure is believed to be encoded in the bulk geometry. In this work, we investigate the precise correspondence between the boundary real-space entanglement and the bulk geometry. By the boundary real-space entanglement, we refer to the conditional mutual information (CMI) for two infinitesimal subsystems separated by a distance $l$, and the corresponding… ▽ More

    Submitted 13 May, 2025; originally announced May 2025.

    Comments: 30 pages, 12 figures

  40. arXiv:2505.08295  [pdf, ps, other

    cs.LG cs.AI

    A Practical Introduction to Deep Reinforcement Learning

    Authors: Yinghan Sun, Hongxi Wang, Hua Chen, Wei Zhang

    Abstract: Deep reinforcement learning (DRL) has emerged as a powerful framework for solving sequential decision-making problems, achieving remarkable success in a wide range of applications, including game AI, autonomous driving, biomedicine, and large language models. However, the diversity of algorithms and the complexity of theoretical foundations often pose significant challenges for beginners seeking t… ▽ More

    Submitted 13 May, 2025; originally announced May 2025.

  41. arXiv:2505.08289  [pdf, ps, other

    physics.optics cond-mat.mtrl-sci math.NA

    Nonlinear optical response in kagome lattice with inversion symmetry breaking

    Authors: Xiangyang Liu, Junwen Lai, Jie Zhan, Tianye Yu, Peitao Liu, Seiji Yunoki, Xing-Qiu Chen, Yan Sun

    Abstract: The kagome lattice is a fundamental model structure in condensed matter physics and materials science featuring symmetry-protected flat bands, saddle points, and Dirac points. This structure has emerged as an ideal platform for exploring various quantum physics. By combining effective model analysis and first-principles calculations, we propose that the synergy among inversion symmetry breaking, f… ▽ More

    Submitted 13 May, 2025; originally announced May 2025.

  42. Will Your Next Pair Programming Partner Be Human? An Empirical Evaluation of Generative AI as a Collaborative Teammate in a Semester-Long Classroom Setting

    Authors: Wenhan Lyu, Yimeng Wang, Yifan Sun, Yixuan Zhang

    Abstract: Generative AI (GenAI), especially Large Language Models (LLMs), is rapidly reshaping both programming workflows and computer science education. Many programmers now incorporate GenAI tools into their workflows, including for collaborative coding tasks such as pair programming. While prior research has demonstrated the benefits of traditional pair programming and begun to explore GenAI-assisted cod… ▽ More

    Submitted 12 May, 2025; originally announced May 2025.

    Comments: Accepted by Learning @ Scale 2025

  43. arXiv:2505.07763  [pdf, other

    astro-ph.GA

    Gravitationally Bound Gas Determines Star Formation in the Galaxy

    Authors: Sihan Jiao, Jingwen Wu, Zhi-Yu Zhang, Neal J. Evans II, Chao-Wei Tsai, Di Li, Hauyu Baobab Liu, Yong Shi, Junzhi Wang, Qizhou Zhang, Yuxin Lin, Linjing Feng, Xing Lu, Yan Sun, Hao Ruan, Fangyuan Deng

    Abstract: Stars form from molecular gas under complex conditions influenced by multiple competing physical mechanisms, such as gravity, turbulence, and magnetic fields. However, accurately identifying the fraction of gas actively involved in star formation remains challenging. Using dust continuum observations from the Herschel Space Observatory, we derived column density maps and their associated probabili… ▽ More

    Submitted 12 May, 2025; originally announced May 2025.

    Comments: 23 pages, 17 figures. Submitted to A&A

  44. arXiv:2505.07674  [pdf

    cs.LG

    Joint Graph Convolution and Sequential Modeling for Scalable Network Traffic Estimation

    Authors: Nan Jiang, Wenxuan Zhu, Xu Han, Weiqiang Huang, Yumeng Sun

    Abstract: This study focuses on the challenge of predicting network traffic within complex topological environments. It introduces a spatiotemporal modeling approach that integrates Graph Convolutional Networks (GCN) with Gated Recurrent Units (GRU). The GCN component captures spatial dependencies among network nodes, while the GRU component models the temporal evolution of traffic data. This combination al… ▽ More

    Submitted 12 May, 2025; originally announced May 2025.

  45. arXiv:2505.07546  [pdf, ps, other

    cs.IR cs.AI

    GRADA: Graph-based Reranker against Adversarial Documents Attack

    Authors: Jingjie Zheng, Aryo Pradipta Gema, Giwon Hong, Xuanli He, Pasquale Minervini, Youcheng Sun, Qiongkai Xu

    Abstract: Retrieval Augmented Generation (RAG) frameworks improve the accuracy of large language models (LLMs) by integrating external knowledge from retrieved documents, thereby overcoming the limitations of models' static intrinsic knowledge. However, these systems are susceptible to adversarial attacks that manipulate the retrieval process by introducing documents that are adversarial yet semantically si… ▽ More

    Submitted 12 May, 2025; originally announced May 2025.

  46. arXiv:2505.07396  [pdf, ps, other

    cs.CV cs.LG

    TUM2TWIN: Introducing the Large-Scale Multimodal Urban Digital Twin Benchmark Dataset

    Authors: Olaf Wysocki, Benedikt Schwab, Manoj Kumar Biswanath, Michael Greza, Qilin Zhang, Jingwei Zhu, Thomas Froech, Medhini Heeramaglore, Ihab Hijazi, Khaoula Kanna, Mathias Pechinger, Zhaiyu Chen, Yao Sun, Alejandro Rueda Segura, Ziyang Xu, Omar AbdelGafar, Mansour Mehranfar, Chandan Yeshwanth, Yueh-Cheng Liu, Hadi Yazdi, Jiapan Wang, Stefan Auer, Katharina Anders, Klaus Bogenberger, Andre Borrmann , et al. (9 additional authors not shown)

    Abstract: Urban Digital Twins (UDTs) have become essential for managing cities and integrating complex, heterogeneous data from diverse sources. Creating UDTs involves challenges at multiple process stages, including acquiring accurate 3D source data, reconstructing high-fidelity 3D models, maintaining models' updates, and ensuring seamless interoperability to downstream tasks. Current datasets are usually… ▽ More

    Submitted 13 May, 2025; v1 submitted 12 May, 2025; originally announced May 2025.

    Comments: Submitted to the ISPRS Journal of Photogrammetry and Remote Sensing

  47. arXiv:2505.07276  [pdf, ps, other

    stat.ME stat.AP stat.ML

    FCPCA: Fuzzy clustering of high-dimensional time series based on common principal component analysis

    Authors: Ziling Ma, Ángel López-Oriona, Hernando Ombao, Ying Sun

    Abstract: Clustering multivariate time series data is a crucial task in many domains, as it enables the identification of meaningful patterns and groups in time-evolving data. Traditional approaches, such as crisp clustering, rely on the assumption that clusters are sufficiently separated with little overlap. However, real-world data often defy this assumption, exhibiting overlapping distributions or overla… ▽ More

    Submitted 12 May, 2025; originally announced May 2025.

  48. arXiv:2505.06924  [pdf

    cond-mat.mtrl-sci

    Spontaneous Enhancement of Dzyaloshinskii-Moriya Interaction via Field-Cooling-Induced Interface Engineering in 2D van der Waals Ferromagnetic ternary Tellurides

    Authors: Shian Xia, Yan Luo, Iftikhar Ahmed Malik, Xinyi Zhou, Keying Han, Yue Sun, Haoyun Lin, Hanqing Shi, Yingchun Cheng, Vanessa Li Zhang, Yi Du, Sheng Liu, Chao Zhu, Ting Yu

    Abstract: The emergence of two-dimensional (2D) van der Waals (vdW) ferromagnets has opened new avenues for exploring topological spin textures and their applications in next-generation spintronics. Among these materials, Fe3GaTe2 (FGaT) emerges as a model system due to its room-temperature skyrmion phases, which are stabilized by strong Dzyaloshinskii-Moriya interaction (DMI). However, the atomistic origin… ▽ More

    Submitted 17 May, 2025; v1 submitted 11 May, 2025; originally announced May 2025.

  49. arXiv:2505.06896  [pdf, ps, other

    cs.DC stat.CO

    RCOMPSs: A Scalable Runtime System for R Code Execution on Manycore Systems

    Authors: Xiran Zhang, Javier Conejero, Sameh Abdulah, Jorge Ejarque, Ying Sun, Rosa M. Badia, David E. Keyes, Marc G. Genton

    Abstract: R has become a cornerstone of scientific and statistical computing due to its extensive package ecosystem, expressive syntax, and strong support for reproducible analysis. However, as data sizes and computational demands grow, native R parallelism support remains limited. This paper presents RCOMPSs, a scalable runtime system that enables efficient parallel execution of R applications on multicore… ▽ More

    Submitted 11 May, 2025; originally announced May 2025.

  50. arXiv:2505.06865  [pdf, ps, other

    physics.optics

    An ultrastable hard x-ray attosecond split-delay line

    Authors: Yanwen Sun, Haoyuan Li, Yoshio Ichii, Diling Zhu

    Abstract: We present a novel split-delay line design for generating hard x-ray attosecond pulse pulse pairs. The design introduces an unconventional delay adjustment mechanism, where an x-ray mirror pair rotation was used for adjusting the path length differential between two beam paths. The exit beam pointing stability is guaranteed by the mirror-pair self-compensating geometry, therefore enabling stable c… ▽ More

    Submitted 11 May, 2025; originally announced May 2025.