Skip to main content

Showing 1–50 of 3,180 results for author: Zhou, S

.
  1. arXiv:2507.05094  [pdf, ps, other

    hep-ex

    Observation of the decays $B^{+} \to Σ_{c}(2455)^{++} \overlineΞ_{c}^{-}$ and $B^{0} \to Σ_{c}(2455)^{0} \overlineΞ_{c}^{0}$

    Authors: Belle, Belle II Collaborations, :, M. Abumusabh, I. Adachi, L. Aggarwal, H. Ahmed, Y. Ahn, H. Aihara, N. Akopov, S. Alghamdi, M. Alhakami, A. Aloisio, N. Althubiti, K. Amos, N. Anh Ky, D. M. Asner, H. Atmacan, T. Aushev, V. Aushev, R. Ayad, V. Babu, H. Bae, N. K. Baghel, S. Bahinipati , et al. (364 additional authors not shown)

    Abstract: We report the first observation of the two-body baryonic decays $B^{+} \to Σ_{c}(2455)^{++} \overlineΞ_{c}^{-}$ and $B^{0} \to Σ_{c}(2455)^{0} \overlineΞ_{c}^{0}$ with significances of $7.3\,σ$ and $6.2\,σ$, respectively, including statistical and systematic uncertainties. The branching fractions are measured to be… ▽ More

    Submitted 7 July, 2025; originally announced July 2025.

    Report number: Belle II Preprint 2025-019, KEK Preprint 2025-18

  2. arXiv:2507.05050  [pdf, ps, other

    hep-ex

    Measurement of the $ D^{0}\rightarrow K^{-}π^{+}e^{+}e^{-} $ branching fraction and search for $ D^{0}\rightarrow π^{+}π^{-}e^{+}e^{-} $ and $D^{0}\rightarrow K^{+}K^{-}e^{+}e^{-} $ decays at Belle

    Authors: Belle, Belle II Collaborations, :, I. Adachi, L. Aggarwal, H. Ahmed, Y. Ahn, H. Aihara, N. Akopov, S. Alghamdi, M. Alhakami, A. Aloisio, N. Althubiti, K. Amos, M. Angelsmark, N. Anh Ky, C. Antonioli, D. M. Asner, H. Atmacan, T. Aushev, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae , et al. (458 additional authors not shown)

    Abstract: We present a study of the rare charm meson decays $ D^{0}\rightarrow K^{+}K^{-}e^{+}e^{-} $, $ π^{+}π^{-}e^{+}e^{-} $, and $ K^{-}π^{+}e^{+}e^{-} $ using a 942 fb$^{-1}$ data set collected by the Belle detector at the KEKB asymmetric-energy $ e^{+}e^{-} $ collider. We use $ D^{0} $ candidates identified by the charge of the pion in $ D^{*} \rightarrow D^{0} π$ decays and normalize the branching fr… ▽ More

    Submitted 7 July, 2025; originally announced July 2025.

    Report number: Belle II Preprint 2025-020; KEK Preprint 2025-19

  3. arXiv:2507.04896  [pdf, ps, other

    hep-ex

    Cross sections of $η$ mesons in $p$$+$$p$ collisions at forward rapidity at $\sqrt{s}=500$ GeV and central rapidity at $\sqrt{s}=510$ GeV

    Authors: PHENIX Collaboration, N. J. Abdulameer, U. Acharya, A. Adare, C. Aidala, N. N. Ajitanand, Y. Akiba, R. Akimoto, H. Al-Ta'ani, J. Alexander, M. Alfred, D. Anderson, K. R. Andrews, A. Angerami, S. Antsupov, K. Aoki, N. Apadula, E. Appelt, Y. Aramaki, R. Armendariz, H. Asano, E. C. Aschenauer, E. T. Atomssa, T. C. Awes, B. Azmoun , et al. (476 additional authors not shown)

    Abstract: We present the first measurements of the forward and midrapidity $η$-meson cross sections from $p$$+$$p$ collisions at $\sqrt{s}=500$ and $510$~GeV, respectively. We also report the midrapidity $η/π^0$ ratio at 510 GeV. The forward cross section is measured differentially in $η$-meson transverse momentum ($p_T$) from 1.0 to 6.5~GeV/$c$ for pseudorapidity $3.0<|η|<3.8$. The midrapidity cross sectio… ▽ More

    Submitted 7 July, 2025; originally announced July 2025.

    Comments: 500 authors from 81 institutions, 14 pages, 7 figures, 3 tables. v1 is version submitted to Physical Review D. HEPdata tables for the points plotted in figures for this and previous PHENIX publications are (or will be) publicly available at http://www.phenix.bnl.gov/papers.html

  4. arXiv:2507.04630  [pdf, ps, other

    cs.CV

    Learn 3D VQA Better with Active Selection and Reannotation

    Authors: Shengli Zhou, Yang Liu, Feng Zheng

    Abstract: 3D Visual Question Answering (3D VQA) is crucial for enabling models to perceive the physical world and perform spatial reasoning. In 3D VQA, the free-form nature of answers often leads to improper annotations that can confuse or mislead models when training on the entire dataset. While other text generation tasks can mitigate this issue by learning on large-scale datasets, the scarcity of 3D scen… ▽ More

    Submitted 6 July, 2025; originally announced July 2025.

    Comments: Accepted by ACM MM 2025

  5. arXiv:2507.04463  [pdf, ps, other

    nucl-ex

    Low-mass vector-meson production at forward rapidity in $p$$+$$p$ and Au$+$Au collisions at $\sqrt{s_{_{NN}}}=200$~GeV

    Authors: PHENIX Collaboration, N. J. Abdulameer, U. Acharya, A. Adare, C. Aidala, N. N. Ajitanand, Y. Akiba, M. Alfred, D. Anderson, V. Andrieux, S. Antsupov, N. Apadula, H. Asano, B. Azmoun, V. Babintsev, M. Bai, N. S. Bandara, B. Bannier, E. Bannikov, K. N. Barish, S. Bathe, A. Bazilevsky, M. Beaumier, S. Beckman, R. Belmont , et al. (331 additional authors not shown)

    Abstract: The PHENIX experiment at the Relativistic Heavy Ion Collider has measured low-mass vector-meson ($ω+ρ$ and $φ$) production through the dimuon decay channel at forward rapidity $(1.2<|\mbox{y}|<2.2)$ in $p$$+$$p$ and Au$+$Au collisions at $\sqrt{s_{_{NN}}}=200$~GeV. The low-mass vector-meson yield and nuclear-modification factor were measured as a function of the average number of participating nuc… ▽ More

    Submitted 6 July, 2025; originally announced July 2025.

    Comments: 356 authors from 71 institutions, 14 pages, 14 figures, 1 table. v1 is version submitted to Physical Review C. HEPdata tables for the points plotted in figures for this and previous PHENIX publications are (or will be) publicly available at http://www.phenix.bnl.gov/papers.html

  6. arXiv:2507.04289  [pdf, ps, other

    cs.CV cs.AI

    M$^3$-Med: A Benchmark for Multi-lingual, Multi-modal, and Multi-hop Reasoning in Medical Instructional Video Understanding

    Authors: Shenxi Liu, Kan Li, Mingyang Zhao, Yuhang Tian, Bin Li, Shoujun Zhou, Hongliang Li, Fuxia Yang

    Abstract: With the rapid progress of artificial intelligence (AI) in multi-modal understanding, there is increasing potential for video comprehension technologies to support professional domains such as medical education. However, existing benchmarks suffer from two primary limitations: (1) Linguistic Singularity: they are largely confined to English, neglecting the need for multilingual resources; and (2)… ▽ More

    Submitted 6 July, 2025; originally announced July 2025.

    Comments: 19 pages, 8 figures, 7 tables

  7. arXiv:2507.03604  [pdf, ps, other

    quant-ph

    Entanglement Purification by Integrated Silicon Photonics

    Authors: Yonghe Yu, Siyan Zhou, Mujtaba Zahidy, Caterina Vigliar, Karsten Rottwitt, Leif K. Oxenlowe, Yunhong Ding

    Abstract: We demonstrate the first on-chip deterministic entanglement purification based on silicon photonics. To evaluate the purification performance, we simulate the bit-flip and phase-flip errors by reconfigurable circuits on chip. The state fidelity improves from 0.71 to 0.82 under a 20% bit-flip error. rate

    Submitted 4 July, 2025; originally announced July 2025.

  8. arXiv:2507.03304  [pdf, ps, other

    cs.CV

    Bridging Domain Generalization to Multimodal Domain Generalization via Unified Representations

    Authors: Hai Huang, Yan Xia, Sashuai Zhou, Hanting Wang, Shulei Wang, Zhou Zhao

    Abstract: Domain Generalization (DG) aims to enhance model robustness in unseen or distributionally shifted target domains through training exclusively on source domains. Although existing DG techniques, such as data manipulation, learning strategies, and representation learning, have shown significant progress, they predominantly address single-modal data. With the emergence of numerous multi-modal dataset… ▽ More

    Submitted 4 July, 2025; originally announced July 2025.

    Comments: Accepted by ICCV 2025

  9. arXiv:2507.02665  [pdf, ps, other

    cs.SE

    Do Research Software Engineers and Software Engineering Researchers Speak the Same Language?

    Authors: Timo Kehrer, Robert Haines, Guido Juckeland, Shurui Zhou, David E. Bernholdt

    Abstract: Anecdotal evidence suggests that Research Software Engineers (RSEs) and Software Engineering Researchers (SERs) often use different terminologies for similar concepts, creating communication challenges. To better understand these divergences, we have started investigating how SE fundamentals from the SER community are interpreted within the RSE community, identifying aligned concepts, knowledge ga… ▽ More

    Submitted 3 July, 2025; originally announced July 2025.

    Comments: Early access journal version: T. Kehrer, R. Haines, G. Juckeland, S. Zhou and D. E. Bernholdt, "Do Research Software Engineers and Software Engineering Researchers Speak the Same Language?," in Computing in Science & Engineering, doi: 10.1109/MCSE.2025.3557236

  10. arXiv:2507.01800  [pdf, ps, other

    cs.CV cs.MM

    HCNQA: Enhancing 3D VQA with Hierarchical Concentration Narrowing Supervision

    Authors: Shengli Zhou, Jianuo Zhu, Qilin Huang, Fangjing Wang, Yanfu Zhang, Feng Zheng

    Abstract: 3D Visual Question-Answering (3D VQA) is pivotal for models to perceive the physical world and perform spatial reasoning. Answer-centric supervision is a commonly used training method for 3D VQA models. Many models that utilize this strategy have achieved promising results in 3D VQA tasks. However, the answer-centric approach only supervises the final output of models and allows models to develop… ▽ More

    Submitted 2 July, 2025; originally announced July 2025.

    Comments: ICANN 2025

  11. arXiv:2507.01392  [pdf

    cond-mat.supr-con

    Two-Dimensional Superconductivity at the CaZrO3/KTaO3 (001) Heterointerfaces

    Authors: Lu Chen, Siyi Zhou, Daming Tian, Yinan Xiao, Qixuan Gao, Yongchao Wang, Yuansha Chen, Fengxia Hu, Baogen Shen, Jirong Sun, Weisheng Zhao, Jinsong Zhang, Hui Zhang

    Abstract: We investigated the superconducting transport properties of two-dimensional electron gases (2DEGs) at (001)-oriented CaZrO3/KTaO3 (CZO/KTO) heterointerfaces. Our results unambiguously demonstrate the emergence of two-dimensional superconductivity, with a superconducting transition TC up to ~0.25 K. The two-dimensional nature of the superconducting state is corroborated by the Berezinskii-Kosterlit… ▽ More

    Submitted 2 July, 2025; originally announced July 2025.

    Comments: 7 Pages,4 figures

  12. arXiv:2507.01249  [pdf, ps, other

    hep-ex

    Search for an Axion-Like Particle in $B\rightarrow K^{(*)} a (\rightarrowγγ)$ Decays at Belle

    Authors: Belle, Belle II Collaborations, :, I. Adachi, L. Aggarwal, H. Ahmed, Y. Ahn, H. Aihara, N. Akopov, S. Alghamdi, M. Alhakami, A. Aloisio, N. Althubiti, K. Amos, M. Angelsmark, N. Anh Ky, C. Antonioli, D. M. Asner, H. Atmacan, T. Aushev, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae , et al. (400 additional authors not shown)

    Abstract: We report a search for an axion-like particle $a$ in $B\rightarrow K^{(*)} a (\rightarrowγγ)$ decays using data collected with the Belle detector at the KEKB asymmetric energy electron-positron collider. The search is based on a $711 \mathrm{fb^{-1}}$ data sample collected at the $Υ4S$ resonance energy, corresponding to a sample of $772\times10^6$ $Υ4S$ events. In this study, we search for the dec… ▽ More

    Submitted 3 July, 2025; v1 submitted 1 July, 2025; originally announced July 2025.

    Comments: 26 pages, 15 Figures

    Report number: Belle II Preprint: 2025-017 KEK Preprint: 2025-16

  13. arXiv:2507.00837  [pdf, ps, other

    cond-mat.str-el

    Spontaneous emergence of altermagnetism in the single-orbital extended Hubbard model

    Authors: Jin-Wei Dong, Yu-Han Lin, Ruiqing Fu, Xianxin Wu, Gang Su, Ziqiang Wang, Sen Zhou

    Abstract: Altermagnetism (AM), the recently discovered third class of collinear magnetic order, is characterized by non-relativistic momentum-dependent spin-split electronic structure with compensated zero net magnetization. It can arise from the conventional antiferromagnetism by introducing local anisotropy on the two opposite-spin sublattices, either through structural changes in local crystallographic s… ▽ More

    Submitted 1 July, 2025; originally announced July 2025.

    Comments: 7 pages, 3 figures

  14. arXiv:2506.23301  [pdf, ps, other

    cs.IT eess.SP

    Parallax QAMA: Novel Downlink Multiple Access for MISO Systems with Simple Receivers

    Authors: Jie Huang, Ming Zhao, Shengli Zhou, Ling Qiu, Jinkang Zhu

    Abstract: In this paper, we propose a novel downlink multiple access system with a multi-antenna transmitter and two single-antenna receivers, inspired by the underlying principles of hierarchical quadrature amplitude modulation (H-QAM) based multiple access (QAMA) and space-division multiple access (SDMA). In the proposed scheme, coded bits from two users are split and assigned to one shared symbol and two… ▽ More

    Submitted 29 June, 2025; originally announced June 2025.

  15. arXiv:2506.23132  [pdf, ps, other

    cs.CV

    Dare to Plagiarize? Plagiarized Painting Recognition and Retrieval

    Authors: Sophie Zhou, Shu Kong

    Abstract: Art plagiarism detection plays a crucial role in protecting artists' copyrights and intellectual property, yet it remains a challenging problem in forensic analysis. In this paper, we address the task of recognizing plagiarized paintings and explaining the detected plagarisms by retrieving visually similar authentic artworks. To support this study, we construct a dataset by collecting painting pho… ▽ More

    Submitted 29 June, 2025; originally announced June 2025.

    Comments: to appear at AVSS'25

  16. arXiv:2506.22608  [pdf, ps, other

    cs.DS

    On Fine-Grained Distinct Element Estimation

    Authors: Ilias Diakonikolas, Daniel M. Kane, Jasper C. H. Lee, Thanasis Pittas, David P. Woodruff, Samson Zhou

    Abstract: We study the problem of distributed distinct element estimation, where $α$ servers each receive a subset of a universe $[n]$ and aim to compute a $(1+\varepsilon)$-approximation to the number of distinct elements using minimal communication. While prior work establishes a worst-case bound of $Θ\left(α\log n+\fracα{\varepsilon^2}\right)$ bits, these results rely on assumptions that may not hold in… ▽ More

    Submitted 27 June, 2025; originally announced June 2025.

    Comments: ICML 2025

  17. arXiv:2506.22546  [pdf, ps, other

    hep-th

    Primal S-matrix bootstrap with dispersion relations

    Authors: Claudia de Rham, Andrew J. Tolley, Zhuo-Hui Wang, Shuang-Yong Zhou

    Abstract: We propose a new method for constructing the consistent space of scattering amplitudes by parameterizing the imaginary parts of partial waves and utilizing dispersion relations, crossing symmetry, and full unitarity. Using this framework, we explicitly compute bounds on the leading couplings and examine the Regge behaviors of the constructed amplitudes. The method also readily accommodates spinnin… ▽ More

    Submitted 27 June, 2025; originally announced June 2025.

    Comments: 41 pages, 14 figures

    Report number: Imperial/TP/2025/cdr/3, USTC-ICTS/PCFT-25-24

  18. arXiv:2506.21967  [pdf, ps, other

    cs.CL cs.LG

    More Vulnerable than You Think: On the Stability of Tool-Integrated LLM Agents

    Authors: Weimin Xiong, Ke Wang, Yifan Song, Hanchao Liu, Sai Zhou, Wei Peng, Sujian Li

    Abstract: Current evaluations of tool-integrated LLM agents typically focus on end-to-end tool-usage evaluation while neglecting their stability. This limits their real-world applicability, as various internal or external factors can cause agents to crash or behave abnormally. Our research addresses this by investigating whether agents are vulnerable to errors throughout the entire tool invocation process,… ▽ More

    Submitted 27 June, 2025; originally announced June 2025.

  19. arXiv:2506.21915  [pdf

    cs.NE math.OC

    An Effective Two-Phase Genetic Algorithm for Solving the Resource Constrained Project Scheduling Problem (RCPSP)

    Authors: D. Sun, S. Zhou

    Abstract: This note presents a simple and effective variation of genetic algorithm (GA) for solving RCPSP, denoted as 2-Phase Genetic Algorithm (2PGA). The 2PGA implements GA parent selection in two phases: Phase-1 includes the best current solutions in the parent pool, and Phase-2 excludes the best current solutions from the parent pool. The 2PGA carries out the GA evolution by alternating the two phases i… ▽ More

    Submitted 27 June, 2025; originally announced June 2025.

    Comments: 12 pages

    MSC Class: 90-08

  20. arXiv:2506.21619  [pdf, other

    cs.CL cs.AI cs.SD eess.AS

    IndexTTS2: A Breakthrough in Emotionally Expressive and Duration-Controlled Auto-Regressive Zero-Shot Text-to-Speech

    Authors: Siyi Zhou, Yiquan Zhou, Yi He, Xun Zhou, Jinchao Wang, Wei Deng, Jingchen Shu

    Abstract: Large-scale text-to-speech (TTS) models are typically categorized into autoregressive and non-autoregressive systems. Although autoregressive systems exhibit certain advantages in speech naturalness, their token-by-token generation mechanism makes it difficult to precisely control the duration of synthesized speech. This is a key limitation in applications such as video dubbing that require strict… ▽ More

    Submitted 23 June, 2025; originally announced June 2025.

  21. arXiv:2506.21409  [pdf

    cond-mat.mes-hall

    Observation of Cavity-Mediated Nonlinear Landau Fan and Modified Landau Level Degeneracy in Graphene Quantum Transport

    Authors: Hongxia Xue, Hsun-Chi Chan, Zuzhang Lin, Dalin Boriçi, Shaobo Zhou, Yanan Wang, Kenji Watanabe, Takashi Taniguchi, Cristiano Ciuti, Wang Yao, Dong-Keun Ki, Shuang Zhang

    Abstract: Recent studies on cavity-coupled two-dimensional electron gas demonstrate that vacuum-field engineering can tailor electronic transport properties of materials. By achieving ultra-strong coupling between a terahertz resonator and mesoscopic graphene, we demonstrate that cavity vacuum fields can alter the effective degeneracies of Landau levels, resulting in a nonlinear Landau fan diagram for massl… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: 11 pages, 4 figures

  22. arXiv:2506.21001  [pdf, ps, other

    cs.CV

    Style-Aligned Image Composition for Robust Detection of Abnormal Cells in Cytopathology

    Authors: Qiuyi Qi, Xin Li, Ming Kong, Zikang Xu, Bingdi Chen, Qiang Zhu, S Kevin Zhou

    Abstract: Challenges such as the lack of high-quality annotations, long-tailed data distributions, and inconsistent staining styles pose significant obstacles to training neural networks to detect abnormal cells in cytopathology robustly. This paper proposes a style-aligned image composition (SAIC) method that composes high-fidelity and style-preserved pathological images to enhance the effectiveness and ro… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: MIDL 2025 Oral

  23. arXiv:2506.20273  [pdf, ps, other

    math.CO

    Adjacency spectral radius and H-factors in 1-binding graphs

    Authors: Sizhong Zhou, Tao Zhang, Zhiren Sun

    Abstract: Let $G$ be a graph, and let $H:V(G)\longrightarrow\{\{1\},\{0,2\}\}$ be a set-valued function. Hence, $H(v)$ equals $\{1\}$ or $\{0,2\}$ for any $v\in V(G)$. We let $$ H^{-1}(1)=\{v: v\in V(G) \ \mbox{and} \ H(v)=1\}. $$ An $H$-factor of $G$ is a spanning subgraph $F$ of $G$ such that $d_F(v)\in H(v)$ for each $v\in V(G)$. Lu and Kano showed a characterization for the existence of an $H$-factor in… ▽ More

    Submitted 25 June, 2025; originally announced June 2025.

    Comments: 9 pages

    MSC Class: 05C50; 05C70

  24. arXiv:2506.19742  [pdf, ps, other

    eess.IV cs.AI cs.CV

    NeRF-based CBCT Reconstruction needs Normalization and Initialization

    Authors: Zhuowei Xu, Han Li, Dai Sun, Zhicheng Li, Yujia Li, Qingpeng Kong, Zhiwei Cheng, Nassir Navab, S. Kevin Zhou

    Abstract: Cone Beam Computed Tomography (CBCT) is widely used in medical imaging. However, the limited number and intensity of X-ray projections make reconstruction an ill-posed problem with severe artifacts. NeRF-based methods have achieved great success in this task. However, they suffer from a local-global training mismatch between their two key components: the hash encoder and the neural network. Specif… ▽ More

    Submitted 24 June, 2025; originally announced June 2025.

  25. arXiv:2506.19651  [pdf, ps, other

    cs.CV cs.LG cs.PF

    PEVLM: Parallel Encoding for Vision-Language Models

    Authors: Letian Kang, Shixian Luo, Yiqiang Li, Xiaoyang Yu, Shenxuan Zhou, Yong Wu

    Abstract: Vision-Language Models (VLMs) have demonstrated strong capabilities in multimodal understanding and generation tasks. However, their application to long video understanding remains hindered by the quadratic complexity of standard attention mechanisms. In this work, we introduce \textbf{PEVLM}, a fine-tuning-free parallel encoding method designed to enhance the prefilling efficiency of VLMs in long… ▽ More

    Submitted 7 July, 2025; v1 submitted 24 June, 2025; originally announced June 2025.

  26. arXiv:2506.19476  [pdf, ps, other

    eess.SP

    Neural Collapse based Deep Supervised Federated Learning for Signal Detection in OFDM Systems

    Authors: Kaidi Xu, Shenglong Zhou, Geoffrey Ye Li

    Abstract: Future wireless networks are expected to be AI-empowered, making their performance highly dependent on the quality of training datasets. However, physical-layer entities often observe only partial wireless environments characterized by different power delay profiles. Federated learning is capable of addressing this limited observability, but often struggles with data heterogeneity. To tackle this… ▽ More

    Submitted 24 June, 2025; originally announced June 2025.

  27. arXiv:2506.19180  [pdf, ps, other

    hep-ex hep-ph

    Precise Measurement of the $Λ$ Electric Dipole Moment through the Entangled Strange Baryon-Antibaryon System

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (696 additional authors not shown)

    Abstract: The dominance of matter over antimatter in the universe has consistently driven the pursuit of new physics beyond the Standard Model that violates charge-parity symmetry. Unlike the well-constrained electrons and neutrons, strange baryons (hyperons) remain a largely unexplored territory, in which interactions between hyperons and particles from new physics could induce a non-trivial electric dipol… ▽ More

    Submitted 28 June, 2025; v1 submitted 23 June, 2025; originally announced June 2025.

  28. arXiv:2506.18997  [pdf, ps, other

    astro-ph.GA

    From simulations to observations. Methodology and data release of mock TNG50 galaxies at 0.3 < z < 0.7 for WEAVE-StePS

    Authors: A. Ikhsanova, L. Costantin, A. Pizzella, E. M. Corsini, L. Morelli, F. R. Ditrani, A. Ferré-Mateu, L. Gabarra, M. Gullieuszik, C. P. Haines, A. Iovino, M. Longhetti, A. Mercurio, R. Ragusa, P. Sánchez-Blázquez, C. Tortora, B. Vulcani, S. Zhou, E. Gafton, F. Pistis

    Abstract: The new generation of optical spectrographs (i.e., WEAVE, 4MOST, DESI, and WST) offer unprecedented opportunities for statistically studying the star formation histories of galaxies. However, these observations are not easily comparable to predictions from cosmological simulations. Our goal is to build a reference framework for comparing spectroscopic observations with simulations and test tools f… ▽ More

    Submitted 23 June, 2025; originally announced June 2025.

  29. arXiv:2506.18930  [pdf, ps, other

    cs.CV cs.AI cs.LG

    Reinforcement Learning-Based Dynamic Grouping for Tubular Structure Tracking

    Authors: Chong Di, Shuwang Zhou, Da Chen, Jean-Marie Mirebeau, Minglei Shu, Laurent D. Cohen

    Abstract: The computation of minimal paths for the applications in tracking tubular structures such as blood vessels and roads is challenged by complex morphologies and environmental variations. Existing approaches can be roughly categorized into two research lines: the point-wise based models and the segment-wise based models. Although segment-wise approaches have obtained promising results in many scenari… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

  30. arXiv:2506.18897  [pdf, ps, other

    cs.RO cs.AI

    MinD: Unified Visual Imagination and Control via Hierarchical World Models

    Authors: Xiaowei Chi, Kuangzhi Ge, Jiaming Liu, Siyuan Zhou, Peidong Jia, Zichen He, Yuzhen Liu, Tingguang Li, Lei Han, Sirui Han, Shanghang Zhang, Yike Guo

    Abstract: Video generation models (VGMs) offer a promising pathway for unified world modeling in robotics by integrating simulation, prediction, and manipulation. However, their practical application remains limited due to (1) slowgeneration speed, which limits real-time interaction, and (2) poor consistency between imagined videos and executable actions. To address these challenges, we propose Manipulate i… ▽ More

    Submitted 23 June, 2025; originally announced June 2025.

  31. arXiv:2506.18851  [pdf, ps, other

    cs.CV

    Phantom-Data : Towards a General Subject-Consistent Video Generation Dataset

    Authors: Zhuowei Chen, Bingchuan Li, Tianxiang Ma, Lijie Liu, Mingcong Liu, Yi Zhang, Gen Li, Xinghui Li, Siyu Zhou, Qian He, Xinglong Wu

    Abstract: Subject-to-video generation has witnessed substantial progress in recent years. However, existing models still face significant challenges in faithfully following textual instructions. This limitation, commonly known as the copy-paste problem, arises from the widely used in-pair training paradigm. This approach inherently entangles subject identity with background and contextual attributes by samp… ▽ More

    Submitted 23 June, 2025; originally announced June 2025.

    Comments: Project page:https://phantom-video.github.io/Phantom-Data/

  32. arXiv:2506.18034  [pdf, ps, other

    cs.CV cs.AI cs.MM

    Pre-Trained LLM is a Semantic-Aware and Generalizable Segmentation Booster

    Authors: Fenghe Tang, Wenxin Ma, Zhiyang He, Xiaodong Tao, Zihang Jiang, S. Kevin Zhou

    Abstract: With the advancement of Large Language Model (LLM) for natural language processing, this paper presents an intriguing finding: a frozen pre-trained LLM layer can process visual tokens for medical image segmentation tasks. Specifically, we propose a simple hybrid structure that integrates a pre-trained, frozen LLM layer within the CNN encoder-decoder segmentation framework (LLM4Seg). Surprisingly,… ▽ More

    Submitted 22 June, 2025; originally announced June 2025.

    Comments: Accepted by MICCAI 2025. Code: https://github.com/FengheTan9/LLM4Seg

  33. arXiv:2506.18019  [pdf, ps, other

    cs.AI

    Graphs Meet AI Agents: Taxonomy, Progress, and Future Opportunities

    Authors: Yuanchen Bei, Weizhi Zhang, Siwen Wang, Weizhi Chen, Sheng Zhou, Hao Chen, Yong Li, Jiajun Bu, Shirui Pan, Yizhou Yu, Irwin King, Fakhri Karray, Philip S. Yu

    Abstract: AI agents have experienced a paradigm shift, from early dominance by reinforcement learning (RL) to the rise of agents powered by large language models (LLMs), and now further advancing towards a synergistic fusion of RL and LLM capabilities. This progression has endowed AI agents with increasingly strong abilities. Despite these advances, to accomplish complex real-world tasks, agents are require… ▽ More

    Submitted 4 July, 2025; v1 submitted 22 June, 2025; originally announced June 2025.

    Comments: 20 pages, 7 figures

  34. arXiv:2506.17784  [pdf, ps, other

    cs.AI

    AnyMAC: Cascading Flexible Multi-Agent Collaboration via Next-Agent Prediction

    Authors: Song Wang, Zhen Tan, Zihan Chen, Shuang Zhou, Tianlong Chen, Jundong Li

    Abstract: Recent progress in large language model (LLM)-based multi-agent collaboration highlights the power of structured communication in enabling collective intelligence. However, existing methods largely rely on static or graph-based inter-agent topologies, lacking the potential adaptability and flexibility in communication. In this work, we propose a new framework that rethinks multi-agent coordination… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

  35. arXiv:2506.16716  [pdf, ps, other

    cs.HC

    V-CASS: Vision-context-aware Expressive Speech Synthesis for Enhancing User Understanding of Videos

    Authors: Qixin Wang, Songtao Zhou, Zeyu Jin, Chenglin Guo, Shikun Sun, Xiaoyu Qin

    Abstract: Automatic video commentary systems are widely used on multimedia social media platforms to extract factual information about video content. However, current systems may overlook essential para-linguistic cues, including emotion and attitude, which are critical for fully conveying the meaning of visual content. The absence of these cues can limit user understanding or, in some cases, distort the vi… ▽ More

    Submitted 19 June, 2025; originally announced June 2025.

    Comments: Accepted by IJCNN 2025

  36. arXiv:2506.16661  [pdf, ps, other

    cs.LG cs.CR stat.ML

    Private Training & Data Generation by Clustering Embeddings

    Authors: Felix Zhou, Samson Zhou, Vahab Mirrokni, Alessandro Epasto, Vincent Cohen-Addad

    Abstract: Deep neural networks often use large, high-quality datasets to achieve high performance on many machine learning tasks. When training involves potentially sensitive data, this process can raise privacy concerns, as large models have been shown to unintentionally memorize and reveal sensitive information, including reconstructing entire training samples. Differential privacy (DP) provides a robust… ▽ More

    Submitted 19 June, 2025; originally announced June 2025.

  37. arXiv:2506.16201  [pdf, ps, other

    cs.RO cs.CV

    FlowRAM: Grounding Flow Matching Policy with Region-Aware Mamba Framework for Robotic Manipulation

    Authors: Sen Wang, Le Wang, Sanping Zhou, Jingyi Tian, Jiayi Li, Haowen Sun, Wei Tang

    Abstract: Robotic manipulation in high-precision tasks is essential for numerous industrial and real-world applications where accuracy and speed are required. Yet current diffusion-based policy learning methods generally suffer from low computational efficiency due to the iterative denoising process during inference. Moreover, these methods do not fully explore the potential of generative models for enhanci… ▽ More

    Submitted 19 June, 2025; originally announced June 2025.

  38. arXiv:2506.16114  [pdf, ps, other

    cs.IR cs.AI

    GFlowGR: Fine-tuning Generative Recommendation Frameworks with Generative Flow Networks

    Authors: Yejing Wang, Shengyu Zhou, Jinyu Lu, Qidong Liu, Xinhang Li, Wenlin Zhang, Feng Li, Pengjie Wang, Jian Xu, Bo Zheng, Xiangyu Zhao

    Abstract: Generative recommendations (GR), which usually include item tokenizers and generative Large Language Models (LLMs), have demonstrated remarkable success across a wide range of scenarios. The majority of existing research efforts primarily concentrate on developing powerful item tokenizers or advancing LLM decoding strategies to attain superior performance. However, the critical fine-tuning step in… ▽ More

    Submitted 19 June, 2025; originally announced June 2025.

  39. arXiv:2506.16082  [pdf, ps, other

    cs.CV

    PR-DETR: Injecting Position and Relation Prior for Dense Video Captioning

    Authors: Yizhe Li, Sanping Zhou, Zheng Qin, Le Wang

    Abstract: Dense video captioning is a challenging task that aims to localize and caption multiple events in an untrimmed video. Recent studies mainly follow the transformer-based architecture to jointly perform the two sub-tasks, i.e., event localization and caption generation, in an end-to-end manner. Based on the general philosophy of detection transformer, these methods implicitly learn the event locatio… ▽ More

    Submitted 19 June, 2025; originally announced June 2025.

  40. arXiv:2506.15712  [pdf, ps, other

    cs.LG cs.AI

    BatteryBERT for Realistic Battery Fault Detection Using Point-Masked Signal Modeling

    Authors: Songqi Zhou, Ruixue Liu, Yixing Wang, Jia Lu, Benben Jiang

    Abstract: Accurate fault detection in lithium-ion batteries is essential for the safe and reliable operation of electric vehicles and energy storage systems. However, existing methods often struggle to capture complex temporal dependencies and cannot fully leverage abundant unlabeled data. Although large language models (LLMs) exhibit strong representation capabilities, their architectures are not directly… ▽ More

    Submitted 31 May, 2025; originally announced June 2025.

  41. arXiv:2506.15672  [pdf, ps, other

    cs.AI cs.MA

    SwarmAgentic: Towards Fully Automated Agentic System Generation via Swarm Intelligence

    Authors: Yao Zhang, Chenyang Lin, Shijie Tang, Haokun Chen, Shijie Zhou, Yunpu Ma, Volker Tresp

    Abstract: The rapid progress of Large Language Models has advanced agentic systems in decision-making, coordination, and task execution. Yet, existing agentic system generation frameworks lack full autonomy, missing from-scratch agent generation, self-optimizing agent functionality, and collaboration, limiting adaptability and scalability. We propose SwarmAgentic, a framework for fully automated agentic sys… ▽ More

    Submitted 18 June, 2025; originally announced June 2025.

    Comments: 41 pages

  42. arXiv:2506.15533  [pdf, ps, other

    hep-ex

    Measurements of the absolute branching fractions of the doubly Cabibbo-suppressed decays $D^+\to K^+π^0$, $D^+\to K^+η$ and $D^+\to K^+η^{\prime}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (697 additional authors not shown)

    Abstract: Using $20.3\,\rm fb^{-1}$ of $e^+e^-$ collision data collected at a center-of-mass energy of 3.773\,GeV with the BESIII detector, we present improved measurements of the absolute branching fractions of the doubly Cabibbo-suppressed decays $D^+\to K^+π^0$, $D^+\to K^+η$ and $ D^+ \to K^+ η^{\prime}$ with the double-tag method. The statistical significance of each signal decay exceeds $10σ$. The bra… ▽ More

    Submitted 18 June, 2025; originally announced June 2025.

    Comments: 20 pages, 4 figures

  43. arXiv:2506.15256  [pdf, ps, other

    hep-ex

    Determination of $|V_{cb}|$ using $B\to D\ellν_\ell$ Decays at Belle II

    Authors: Belle II Collaboration, I. Adachi, K. Adamczyk, L. Aggarwal, H. Ahmed, Y. Ahn, H. Aihara, N. Akopov, S. Alghamdi, M. Alhakami, A. Aloisio, K. Amos, M. Angelsmark, N. Anh Ky, C. Antonioli, D. M. Asner, H. Atmacan, T. Aushev, V. Aushev, M. Aversano, R. Ayad, V. Babu, H. Bae, N. K. Baghel, S. Bahinipati , et al. (385 additional authors not shown)

    Abstract: We present a determination of the Cabibbo-Kobayashi-Maskawa matrix element $|V_{cb}|$ from the decay $B\to D\ellν_\ell$ using a $365~\mathrm{fb}^{-1}$ $e^+e^-\toΥ(4S)\to B\bar B$ data sample recorded by the Belle II experiment at the SuperKEKB collider. The semileptonic decay of one $B$ meson is reconstructed in the modes $B^0\to D^-(\to K^+π^-π^-)\ell^+ν_\ell$ and… ▽ More

    Submitted 18 June, 2025; originally announced June 2025.

    Report number: Belle II Preprint 2025-004, KEK Preprint 2025-1

  44. arXiv:2506.15120  [pdf, ps, other

    cs.IR cs.AI cs.LG

    Advancing Loss Functions in Recommender Systems: A Comparative Study with a Rényi Divergence-Based Solution

    Authors: Shengjia Zhang, Jiawei Chen, Changdong Li, Sheng Zhou, Qihao Shi, Yan Feng, Chun Chen, Can Wang

    Abstract: Loss functions play a pivotal role in optimizing recommendation models. Among various loss functions, Softmax Loss (SL) and Cosine Contrastive Loss (CCL) are particularly effective. Their theoretical connections and differences warrant in-depth exploration. This work conducts comprehensive analyses of these losses, yielding significant insights: 1) Common strengths -- both can be viewed as augment… ▽ More

    Submitted 17 June, 2025; originally announced June 2025.

    Comments: AAAI 2025

  45. arXiv:2506.14785  [pdf, other

    math.NA math-ph physics.flu-dyn

    Moment-enhanced shallow water equations for non-slip boundary conditions

    Authors: Shiping Zhou, Juntao Huang, Andrew J. Christlieb

    Abstract: The shallow water equations often assume a constant velocity profile along the vertical axis. However, this assumption does not hold in many practical applications. To better approximate the vertical velocity distribution, models such as the shallow water moment expansion models have been proposed. Nevertheless, under non-slip bottom boundary conditions, both the standard shallow water equation an… ▽ More

    Submitted 26 May, 2025; originally announced June 2025.

    MSC Class: 35L65; 65M22; 65L04

  46. arXiv:2506.14675  [pdf, ps, other

    hep-ph

    Analysis of three-body charmed $B$ meson decays $B \to {D}(V^* \to){V P}$

    Authors: Jing Ou-Yang, Run-Hui Li, Si-Hong Zhou

    Abstract: We systematically analyze the decays $B_{(s)} \to D_{(s)} (V^* \to)\, V\, P$, where $V^*$ represents a vector resonance ($ρ, \, ω$ or $K^*$), and $V P$ denotes the final-state meson pairs $ ω\, π$, $ ρ\, π$ and $ ρ\, K$. The intermediate subprocesses $B_{(s)} \to D_{(s)} V^*$ are calculated in the factorization-assisted topological-amplitude approach, while the intermediate resonant states $V^*$ a… ▽ More

    Submitted 17 June, 2025; originally announced June 2025.

    Comments: 25 pages, 2 figures

  47. arXiv:2506.14477  [pdf, ps, other

    cs.AI

    GUI-Robust: A Comprehensive Dataset for Testing GUI Agent Robustness in Real-World Anomalies

    Authors: Jingqi Yang, Zhilong Song, Jiawei Chen, Mingli Song, Sheng Zhou, linjun sun, Xiaogang Ouyang, Chun Chen, Can Wang

    Abstract: The development of high-quality datasets is crucial for benchmarking and advancing research in Graphical User Interface (GUI) agents. Despite their importance, existing datasets are often constructed under idealized conditions, overlooking the diverse anomalies frequently encountered in real-world deployments. To address this limitation, we introduce GUI-Robust, a novel dataset designed for compre… ▽ More

    Submitted 17 June, 2025; originally announced June 2025.

    Comments: 10 pages, 4 figures, submitted to NIPS 2025

  48. arXiv:2506.13301  [pdf, ps, other

    cs.CV

    AttentionDrag: Exploiting Latent Correlation Knowledge in Pre-trained Diffusion Models for Image Editing

    Authors: Biao Yang, Muqi Huang, Yuhui Zhang, Yun Xiong, Kun Zhou, Xi Chen, Shiyang Zhou, Huishuai Bao, Chuan Li, Feng Shi, Hualei Liu

    Abstract: Traditional point-based image editing methods rely on iterative latent optimization or geometric transformations, which are either inefficient in their processing or fail to capture the semantic relationships within the image. These methods often overlook the powerful yet underutilized image editing capabilities inherent in pre-trained diffusion models. In this work, we propose a novel one-step po… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

  49. arXiv:2506.13137  [pdf, ps, other

    cs.IT eess.SP

    On secure UAV-aided ISCC systems

    Authors: Hongjiang Lei, Congke Jiang, Ki-Hong Park, Mohamed A. Aboulhassan, Sen Zhou, Gaofeng Pan

    Abstract: Integrated communication and sensing, which can make full use of the limited spectrum resources to perform communication and sensing tasks simultaneously, is an up-and-coming technology in wireless communication networks. In this work, we investigate the secrecy performance of an uncrewed aerial vehicle (UAV)-assisted secure integrated communication, sensing, and computing system, where the UAV se… ▽ More

    Submitted 27 June, 2025; v1 submitted 16 June, 2025; originally announced June 2025.

    Comments: 11 pages, 7 figures, submitted to IEEE Journal for review

  50. arXiv:2506.12808  [pdf, ps, other

    cs.CV

    Leveraging MIMIC Datasets for Better Digital Health: A Review on Open Problems, Progress Highlights, and Future Promises

    Authors: Afifa Khaled, Mohammed Sabir, Rizwan Qureshi, Camillo Maria Caruso, Valerio Guarrasi, Suncheng Xiang, S Kevin Zhou

    Abstract: The Medical Information Mart for Intensive Care (MIMIC) datasets have become the Kernel of Digital Health Research by providing freely accessible, deidentified records from tens of thousands of critical care admissions, enabling a broad spectrum of applications in clinical decision support, outcome prediction, and healthcare analytics. Although numerous studies and surveys have explored the predic… ▽ More

    Submitted 15 June, 2025; originally announced June 2025.