Showing 51–100 of 557 results for author: Zhuang, Y

  1. arXiv:2505.08330  [pdf, ps, other]

    cs.LG cs.SI

    Structural-Temporal Coupling Anomaly Detection with Dynamic Graph Transformer

    Authors: Chang Zong, Yueting Zhuang, Jian Shao, Weiming Lu

    Abstract: Detecting anomalous edges in dynamic graphs is an important task in many applications over evolving triple-based data, such as social networks, transaction management, and epidemiology. A major challenge with this task is the absence of structural-temporal coupling information, which decreases the ability of the representation to distinguish anomalies from normal instances. Existing methods focus… ▽ More

    Submitted 13 May, 2025; originally announced May 2025.

    Comments: 20 pages, 6 figures

    MSC Class: 68T07; 68T09

  2. arXiv:2505.07782  [pdf, ps, other]

    cs.LG

    MLE-Dojo: Interactive Environments for Empowering LLM Agents in Machine Learning Engineering

    Authors: Rushi Qiang, Yuchen Zhuang, Yinghao Li, Dingu Sagar V K, Rongzhi Zhang, Changhao Li, Ian Shu-Hei Wong, Sherry Yang, Percy Liang, Chao Zhang, Bo Dai

    Abstract: We introduce MLE-Dojo, a Gym-style framework for systematically reinforcement learning, evaluating, and improving autonomous large language model (LLM) agents in iterative machine learning engineering (MLE) workflows. Unlike existing benchmarks that primarily rely on static datasets or single-attempt evaluations, MLE-Dojo provides an interactive environment enabling agents to iteratively experimen… ▽ More

    Submitted 12 May, 2025; originally announced May 2025.

  3. arXiv:2505.07110  [pdf]

    cs.HC cs.CV

    DeepSORT-Driven Visual Tracking Approach for Gesture Recognition in Interactive Systems

    Authors: Tong Zhang, Fenghua Shao, Runsheng Zhang, Yifan Zhuang, Liuqingqing Yang

    Abstract: Based on the DeepSORT algorithm, this study explores the application of visual tracking technology in intelligent human-computer interaction, especially in the field of gesture recognition and tracking. With the rapid development of artificial intelligence and deep learning technology, visual-based interaction has gradually replaced traditional input devices and become an important way for intelli… ▽ More

    Submitted 11 May, 2025; originally announced May 2025.

  4. arXiv:2505.04548  [pdf, other]

    eess.AS cs.HC cs.RO cs.SD

    Accelerating Audio Research with Robotic Dummy Heads

    Authors: Austin Lu, Kanad Sarkar, Yongjie Zhuang, Leo Lin, Ryan M Corey, Andrew C Singer

    Abstract: This work introduces a robotic dummy head that fuses the acoustic realism of conventional audiological mannequins with the mobility of robots. The proposed device is capable of moving, talking, and listening as people do, and can be used to automate spatially-stationary audio experiments, thus accelerating the pace of audio research. Critically, the device may also be used as a moving sound source… ▽ More

    Submitted 7 May, 2025; originally announced May 2025.

    Comments: WASPAA 2025

  5. arXiv:2505.03475  [pdf, other]

    cs.AI cs.LG

    am-ELO: A Stable Framework for Arena-based LLM Evaluation

    Authors: Zirui Liu, Jiatong Li, Yan Zhuang, Qi Liu, Shuanghong Shen, Jie Ouyang, Mingyue Cheng, Shijin Wang

    Abstract: Arena-based evaluation is a fundamental yet significant evaluation paradigm for modern AI models, especially large language models (LLMs). Existing framework based on ELO rating system suffers from the inevitable instability problem due to ranking inconsistency and the lack of attention to the varying abilities of annotators. In this paper, we introduce a novel stable arena framework to address th… ▽ More

    Submitted 29 May, 2025; v1 submitted 6 May, 2025; originally announced May 2025.

    Comments: ICML2025 Accepted

  6. arXiv:2504.21539  [pdf, other]

    hep-ex

    Search for the lepton number violation decay $ω\to π^+ π^+ e^-e^- +c.c.$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (698 additional authors not shown)

    Abstract: The lepton number violation decay $ω\to π^+ π^+ e^-e^- +c.c.$ is searched for via $J/ψ\to ωη$ using a data sample of $(1.0087 \pm 0.0044) \times 10^{10}$ $J/ψ$ events collected by the BESIII detector at the BEPCII collider. No significant signal is observed, and the upper limit on the branching fraction of $ω\to π^+ π^+ e^-e^- +c.c.$ at the 90\% confidence level is determined for the first time to… ▽ More

    Submitted 30 April, 2025; originally announced April 2025.

    Comments: 9 pages, 3 figures

  7. arXiv:2504.19213  [pdf, other]

    hep-ex

    Measurements of branching fractions of $D^0\to K^- 3π^+2π^-$, $D^0\to K^- 2π^+π^-2π^0$ and $D^+\to K^- 3π^+π^-π^0$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (693 additional authors not shown)

    Abstract: Utilizing $7.9\,\rm fb^{-1}$ of $e^+e^-$ collision data taken with the BESIII detector at the center-of-mass energy of 3.773 GeV, we report the measurements of absolute branching fractions of the hadronic decays $D^0\to K^- 3π^+2π^-$, $D^0\to K^- 2π^+π^-2π^0$ and $D^+\to K^- 3π^+π^-π^0$. The $D^0\to K^- 3π^+2π^-$ decay is measured with improved precision, while the latter two decays are observed w… ▽ More

    Submitted 27 April, 2025; originally announced April 2025.

    Comments: 12 pages, 6 figures, 4 tables

    Report number: BAM-00843

  8. arXiv:2504.19087  [pdf, ps, other]

    hep-ex

    Search for $η_{1}(1855)$ in $χ_{cJ}\toηηη^{\prime}$ decays

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (697 additional authors not shown)

    Abstract: Based on a sample of $2.7\times10^{9}$ $ψ(3686)$ events collected by the BESIII detector operating at the BEPCII collider, an analysis of the decay $ψ(3686)\toγχ_{cJ}, χ_{cJ}\toηηη^{\prime}$ is performed. The decay modes $χ_{c1}$ and $χ_{c2}\toηηη^{\prime}$ are observed for the first time, and their corresponding branching fractions are determined to be… ▽ More

    Submitted 3 June, 2025; v1 submitted 26 April, 2025; originally announced April 2025.

  9. arXiv:2504.13865  [pdf, ps, other]

    cs.HC cs.AI cs.CL cs.CV

    A Survey on (M)LLM-Based GUI Agents

    Authors: Fei Tang, Haolei Xu, Hang Zhang, Siqi Chen, Xingyu Wu, Yongliang Shen, Wenqi Zhang, Guiyang Hou, Zeqi Tan, Yuchen Yan, Kaitao Song, Jian Shao, Weiming Lu, Jun Xiao, Yueting Zhuang

    Abstract: Graphical User Interface (GUI) Agents have emerged as a transformative paradigm in human-computer interaction, evolving from rule-based automation scripts to sophisticated AI-driven systems capable of understanding and executing complex interface operations. This survey provides a comprehensive examination of the rapidly advancing field of LLM-based GUI Agents, systematically analyzing their archi… ▽ More

    Submitted 4 June, 2025; v1 submitted 27 March, 2025; originally announced April 2025.

  10. arXiv:2504.13771  [pdf, other]

    hep-ex

    Search for $J/ψ\rightarrow K^{0}_{S}K^{0}_{S}$ and $ψ(3686)\rightarrow K^{0}_{S}K^{0}_{S}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (680 additional authors not shown)

    Abstract: Using data samples of $(10087\pm 44)\times10^{6}$ $J/ψ$ events and $(2712.4\pm 14.3)\times10^{6}$ $ψ(3686)$ events collected with the BESIII detector at the BEPCII collider, we search for the CP violating decays $J/ψ\rightarrow K^{0}_{S}K^{0}_{S}$ and $ψ(3686)\rightarrow K^{0}_{S}K^{0}_{S}$. No significant signals are observed over the expected background yields. The upper limits on their branchin… ▽ More

    Submitted 18 April, 2025; originally announced April 2025.

  11. arXiv:2504.13650  [pdf, other]

    cs.CV

    EyecareGPT: Boosting Comprehensive Ophthalmology Understanding with Tailored Dataset, Benchmark and Model

    Authors: Sijing Li, Tianwei Lin, Lingshuai Lin, Wenqiao Zhang, Jiang Liu, Xiaoda Yang, Juncheng Li, Yucheng He, Xiaohui Song, Jun Xiao, Yueting Zhuang, Beng Chin Ooi

    Abstract: Medical Large Vision-Language Models (Med-LVLMs) demonstrate significant potential in healthcare, but their reliance on general medical data and coarse-grained global visual understanding limits them in intelligent ophthalmic diagnosis. Currently, intelligent ophthalmic diagnosis faces three major challenges: (i) Data. The lack of deeply annotated, high-quality, multi-modal ophthalmic visual instr… ▽ More

    Submitted 18 April, 2025; originally announced April 2025.

  12. arXiv:2504.13539  [pdf, other]

    hep-ex

    Search for $1^{-+}$ charmonium-like hybrid via $e^{+}e^{-}\rightarrow γη^{(\prime)} η_{c}$ at center-of-mass energies between 4.258 and 4.681 GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (696 additional authors not shown)

    Abstract: Using $e^{+}e^{-}$ collision data corresponding to an integrated luminosity of 10.6 fb$^{-1}$ collected at center-of-mass energies between 4.258 and 4.681 GeV with the BESIII detector at the BEPCII collider, we search for the $1^{- +}$ charmonium-like hybrid via $e^{+}e^{-}\rightarrowγηη_{c}$ and $e^{+}e^{-}\rightarrowγη^{\prime}η_{c}$ decays for the first time. No significant signal is observed a… ▽ More

    Submitted 18 April, 2025; originally announced April 2025.

  13. arXiv:2504.12795  [pdf, other]

    cs.CV

    EarthGPT-X: Enabling MLLMs to Flexibly and Comprehensively Understand Multi-Source Remote Sensing Imagery

    Authors: Wei Zhang, Miaoxin Cai, Yaqian Ning, Tong Zhang, Yin Zhuang, He Chen, Jun Li, Xuerui Mao

    Abstract: Recent advances in the visual-language area have developed natural multi-modal large language models (MLLMs) for spatial reasoning through visual prompting. However, due to remote sensing (RS) imagery containing abundant geospatial information that differs from natural images, it is challenging to effectively adapt natural spatial models to the RS domain. Moreover, current RS MLLMs are limited in… ▽ More

    Submitted 17 April, 2025; originally announced April 2025.

  14. arXiv:2504.12100  [pdf, other]

    cs.CV

    Generalized Visual Relation Detection with Diffusion Models

    Authors: Kaifeng Gao, Siqi Chen, Hanwang Zhang, Jun Xiao, Yueting Zhuang, Qianru Sun

    Abstract: Visual relation detection (VRD) aims to identify relationships (or interactions) between object pairs in an image. Although recent VRD models have achieved impressive performance, they are all restricted to pre-defined relation categories, while failing to consider the semantic ambiguity characteristic of visual relations. Unlike objects, the appearance of visual relations is always subtle and can… ▽ More

    Submitted 16 April, 2025; originally announced April 2025.

    Comments: Under review at IEEE TCSVT. The Appendix is provided additionally

  15. arXiv:2504.11301  [pdf, other]

    cs.AI

    Learning to Be A Doctor: Searching for Effective Medical Agent Architectures

    Authors: Yangyang Zhuang, Wenjia Jiang, Jiayu Zhang, Ze Yang, Joey Tianyi Zhou, Chi Zhang

    Abstract: Large Language Model (LLM)-based agents have demonstrated strong capabilities across a wide range of tasks, and their application in the medical domain holds particular promise due to the demand for high generalizability and reliance on interdisciplinary knowledge. However, existing medical agent systems often rely on static, manually crafted workflows that lack the flexibility to accommodate dive… ▽ More

    Submitted 15 April, 2025; originally announced April 2025.

  16. arXiv:2504.10867  [pdf, other]

    hep-ex

    Precise measurement of the form factors in $D^0\rightarrow K^*(892)^-μ^+ν_μ$ and test of lepton universality with $D^0\rightarrow K^*(892)^-\ell^+ν_{\ell}$ decays

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (696 additional authors not shown)

    Abstract: We report a study of the semileptonic decay $D^0 \rightarrow \bar{K}^0π^-μ^+ν_μ$ based on a sample of $7.9~\mathrm{fb}^{-1}$ of $e^+e^-$ annihilation data collected at a center-of-mass energy of 3.773~GeV with the BESIII detector at the BEPCII collider. The branching fraction of the decay is measured for the first time to be… ▽ More

    Submitted 15 April, 2025; originally announced April 2025.

    Comments: 9 pages, 4 figures

  17. arXiv:2504.08703  [pdf, other]

    cs.SE

    SWE-PolyBench: A multi-language benchmark for repository level evaluation of coding agents

    Authors: Muhammad Shihab Rashid, Christian Bock, Yuan Zhuang, Alexander Buchholz, Tim Esler, Simon Valentin, Luca Franceschi, Martin Wistuba, Prabhu Teja Sivaprasad, Woo Jung Kim, Anoop Deoras, Giovanni Zappella, Laurent Callot

    Abstract: Coding agents powered by large language models have shown impressive capabilities in software engineering tasks, but evaluating their performance across diverse programming languages and real-world scenarios remains challenging. We introduce SWE-PolyBench, a new multi-language benchmark for repository-level, execution-based evaluation of coding agents. SWE-PolyBench contains 2110 instances from 21… ▽ More

    Submitted 23 April, 2025; v1 submitted 11 April, 2025; originally announced April 2025.

    Comments: 20 pages, 6 figures, corrected author name spelling

  18. arXiv:2504.07817  [pdf, other]

    hep-ex

    Search for the baryon and lepton number violating decay $J/ψ\to pe^-$ + c.c

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (664 additional authors not shown)

    Abstract: Based on $(2712.4\pm 14.3) \times 10^{6} $ ${ψ(3686)}$ events collected by the BESIII detector operating at the BEPCII storage ring, we perform a search for the baryon- and lepton-number violating decay $J/ψ\to pe^{-}+c.c.$ via $ψ(3686) \to π^{+}π^{-}J/ψ$. No significant signal is found. An upper limit on the branching fraction of $\mathcal{B}(J/ψ\to p e^{-}+ c.c.) < 3.1 \times 10^{-8}$ at 90\% co… ▽ More

    Submitted 10 April, 2025; originally announced April 2025.

    Comments: 8 pages, 1 figure

  19. arXiv:2504.07729  [pdf, other]

    cs.CV cs.AI

    Benchmarking Multi-Organ Segmentation Tools for Multi-Parametric T1-weighted Abdominal MRI

    Authors: Nicole Tran, Anisa Prasad, Yan Zhuang, Tejas Sudharshan Mathai, Boah Kim, Sydney Lewis, Pritam Mukherjee, Jianfei Liu, Ronald M. Summers

    Abstract: The segmentation of multiple organs in multi-parametric MRI studies is critical for many applications in radiology, such as correlating imaging biomarkers with disease status (e.g., cirrhosis, diabetes). Recently, three publicly available tools, such as MRSegmentator (MRSeg), TotalSegmentator MRI (TS), and TotalVibeSegmentator (VIBE), have been proposed for multi-organ segmentation in MRI. However… ▽ More

    Submitted 10 April, 2025; originally announced April 2025.

    Comments: Published at SPIE Medical Imaging 2025

  20. arXiv:2504.06606  [pdf, other]

    cs.CV

    Benchmarking Multimodal CoT Reward Model Stepwise by Visual Program

    Authors: Minghe Gao, Xuqi Liu, Zhongqi Yue, Yang Wu, Shuang Chen, Juncheng Li, Siliang Tang, Fei Wu, Tat-Seng Chua, Yueting Zhuang

    Abstract: Recent advancements in reward signal usage for Large Language Models (LLMs) are remarkable. However, significant challenges exist when transitioning reward signal to the multimodal domain, including labor-intensive annotations, over-reliance on one-step rewards, and inadequate evaluation. To address these issues, we propose SVIP, a novel approach to train a step-level multi-dimensional Chain-of-Th… ▽ More

    Submitted 9 April, 2025; originally announced April 2025.

  21. arXiv:2504.05584  [pdf, other]

    hep-ex

    Observation of Transverse Polarization and Determination of Electromagnetic Form Factor of $Λ$ Hyperon at $\sqrt{s}= 3.773$ GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (697 additional authors not shown)

    Abstract: Using a 20.3 fb$^{-1}$ of $e^{+}e^{-}$ collision data sample collected by the BESIII detector at the BEPCII collider, we present an observation of transverse polarization and a complete determination of the electromagnetic form factor of the $Λ$ hyperon in $e^{+}e^{-}\toΛ\barΛ$ decay with the entangled $Λ-\barΛ$ pair at $\sqrt{s}=3.773$ GeV. The relative phase between the electric and magnetic for… ▽ More

    Submitted 7 April, 2025; originally announced April 2025.

    Comments: 9 pages, 1 table, 5 figures

  22. arXiv:2504.04915  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    Collab-RAG: Boosting Retrieval-Augmented Generation for Complex Question Answering via White-Box and Black-Box LLM Collaboration

    Authors: Ran Xu, Wenqi Shi, Yuchen Zhuang, Yue Yu, Joyce C. Ho, Haoyu Wang, Carl Yang

    Abstract: Retrieval-Augmented Generation (RAG) systems often struggle to handle multi-hop question-answering tasks accurately due to irrelevant context retrieval and limited complex reasoning capabilities. We introduce Collab-RAG, a collaborative training framework that leverages mutual enhancement between a white-box small language model (SLM) and a blackbox large language model (LLM) for RAG. Specifically… ▽ More

    Submitted 7 April, 2025; originally announced April 2025.

    Comments: Work in progress. Code: https://github.com/ritaranx/Collab-RAG/

  23. arXiv:2504.04420  [pdf, ps, other]

    hep-ex

    Observation of $ψ(3686) \to Ξ^- K^0_S \barΩ^+ $+c.c

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (680 additional authors not shown)

    Abstract: Using a sample of $(2.712\pm0.014) \times 10^{9}$ $ψ(3686)$ events collected with the BESIII detector at the electron positron collider BEPCII, the decay $ψ(3686) \to Ξ^- K^0_S \barΩ^+ +c.c.$ is observed for the first time, which has a significance of 5.9 standard deviations. The branching fraction of this decay is measured to be $(2.91\pm0.47\pm0.33)\times 10^{-6}$, where the first and second unc… ▽ More

    Submitted 13 June, 2025; v1 submitted 6 April, 2025; originally announced April 2025.

  24. arXiv:2504.04096  [pdf, ps, other]

    hep-ex

    Observation of a Three-Resonance Structure in the Cross Section of $e^+e^-\toπ^+π^- h_c$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (697 additional authors not shown)

    Abstract: Using $e^+e^-$ collision data collected with the BESIII detector operating at the Beijing Electron Positron Collider, the cross section of $e^+e^-\to π^+π^- h_c$ is measured at 59 points with center-of-mass energy $\sqrt{s}$ ranging from $4.009$ to $4.950~\mathrm{GeV}$ with a total integrated luminosity of $22.2~\mathrm{fb}^{-1}$. The cross section between $4.3$ and $4.45~\mathrm{GeV}$ exhibits a… ▽ More

    Submitted 5 April, 2025; originally announced April 2025.

  25. arXiv:2504.01823  [pdf, other]

    hep-ex

    Evidence of doubly OZI-suppressed decay $η_{c} \to ωφ$ in the radiative decay $J/ψ\to γη_{c}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (680 additional authors not shown)

    Abstract: Using a sample of $(10087\pm44) \times 10^{6}$ $J/ψ$ events collected with the BESIII detector at the BEPCII collider, the first evidence for the doubly OZI-suppressed decay $η_{c} \to ωφ$ is reported with a significance of 4.0$σ$. The branching fraction of $η_{c} \to ωφ$ is measured to be $\mathcal{B}(η_{c} \to ωφ) = (3.86 \pm 0.92 \pm 0.62) \times 10^{-5}$, where the first uncertainty is statist… ▽ More

    Submitted 2 April, 2025; originally announced April 2025.

  26. arXiv:2503.23260  [pdf, other]

    cs.SD cs.LG eess.AS eess.SP

    Mismatch-Robust Underwater Acoustic Localization Using A Differentiable Modular Forward Model

    Authors: Dariush Kari, Yongjie Zhuang, Andrew C. Singer

    Abstract: In this paper, we study the underwater acoustic localization in the presence of environmental mismatch. Especially, we exploit a pre-trained neural network for the acoustic wave propagation in a gradient-based optimization framework to estimate the source location. To alleviate the effect of mismatch between the training data and the test data, we simultaneously optimize over the network weights a… ▽ More

    Submitted 29 March, 2025; originally announced March 2025.

  27. arXiv:2503.22126  [pdf, other]

    hep-ex

    Updated model-independent measurement of the strong-phase differences between $D^0$ and $\bar{D}^0 \to K^{0}_{S/L}π^+π^-$ decays

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (696 additional authors not shown)

    Abstract: The strong-phase differences between $D^0\to K_{S/L}^0π^+π^-$ and $\bar{D}^0\to K_{S/L}^0π^+π^-$ decays are one of the most important inputs in measuring the $C\!P$ violating angle $γ$ via $B^- \to D K^-$ decays. They also play a key role in studies of charm mixing and indirect $C\!P$ violation. In this paper, the strong-phase differences are determined in a model-independent way with quantum-corr… ▽ More

    Submitted 18 April, 2025; v1 submitted 27 March, 2025; originally announced March 2025.

  28. arXiv:2503.21696  [pdf, other]

    cs.CL cs.CV

    Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks

    Authors: Wenqi Zhang, Mengna Wang, Gangao Liu, Xu Huixin, Yiwei Jiang, Yongliang Shen, Guiyang Hou, Zhe Zheng, Hang Zhang, Xin Li, Weiming Lu, Peng Li, Yueting Zhuang

    Abstract: Recent advances in deep thinking models have demonstrated remarkable reasoning capabilities on mathematical and coding tasks. However, their effectiveness in embodied domains which require continuous interaction with environments through image action interleaved trajectories remains largely unexplored. We present Embodied Reasoner, a model that extends o1 style reasoning to interactive embodied s… ▽ More

    Submitted 14 May, 2025; v1 submitted 27 March, 2025; originally announced March 2025.

    Comments: Code: https://github.com/zwq2018/embodied_reasoner Dataset: https://huggingface.co/datasets/zwq2018/embodied_reasoner

  29. arXiv:2503.19823  [pdf, other]

    q-bio.NC cs.AI cs.CV

    GyralNet Subnetwork Partitioning via Differentiable Spectral Modularity Optimization

    Authors: Yan Zhuang, Minheng Chen, Chao Cao, Tong Chen, Jing Zhang, Xiaowei Yu, Yanjun Lyu, Lu Zhang, Tianming Liu, Dajiang Zhu

    Abstract: Understanding the structural and functional organization of the human brain requires a detailed examination of cortical folding patterns, among which the three-hinge gyrus (3HG) has been identified as a key structural landmark. GyralNet, a network representation of cortical folding, models 3HGs as nodes and gyral crests as edges, highlighting their role as critical hubs in cortico-cortical connect… ▽ More

    Submitted 31 March, 2025; v1 submitted 25 March, 2025; originally announced March 2025.

    Comments: 10 pages, 3 figures

  30. Observation of the decay $ψ(3686)\rightarrow Σ^{0}\barΣ^{0}ω$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (695 additional authors not shown)

    Abstract: Using a dataset of $(27.12\pm 0.14)\times 10^{8}$ $ψ(3686)$ events collected by the BESIII detector operating at the BEPCII collider, we report the first observation of the decay $ψ(3686)\toΣ^{0}\barΣ^{0}ω$ with a statistical significance of 8.9$σ$. The measured branching fraction is $(1.24 \pm 0.16_{\textrm{stat}} \pm 0.11_{\textrm{sys}}) \times 10^{-5}$, where the first uncertainty i… ▽ More

    Submitted 24 March, 2025; originally announced March 2025.

  31. arXiv:2503.17165  [pdf, other]

    hep-ex

    Stringent test of $CP$ symmetry in $Σ^+$ hyperon decays

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (680 additional authors not shown)

    Abstract: The non-leptonic two-body weak decays $Σ^{+} \to p π^{0}$ and $\barΣ^{-} \to \bar{p} π^{0}$ are investigated, utilizing $(1.0087\pm0.0044)\times10^{10}$ $J/ψ$ events and $(2.7124\pm0.0143)\times10^{9}$ $ψ(3686)$ events collected by BESIII experiment. The precision of the weak-decay parameters for the decays $Σ^{+} \to p π^{0}$ ($α_{0}$) and $\barΣ^{-} \to \bar{p} π^{0}$ ($\barα_{0}$) is improved b… ▽ More

    Submitted 21 March, 2025; originally announced March 2025.

  32. arXiv:2503.16070  [pdf, other]

    hep-ex hep-ph

    Search for the radiative leptonic decay $D^+\toγe^+ν_e$ with Deep Learning

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (680 additional authors not shown)

    Abstract: Using 20.3$~\rm fb^{-1}$ of $e^+e^-$ annihilation data collected at a center-of-mass energy of 3.773$~\rm GeV$ with the BESIII detector, we report an improved search for the radiative leptonic decay $D^+\toγe^+ν_e$. An upper limit on its partial branching fraction for photon energies $E_γ>10~\rm MeV$ is determined to be $1.2\times10^{-5}$ at 90\% confidence level, which excludes most current theor… ▽ More

    Submitted 20 March, 2025; originally announced March 2025.

    Comments: 15 pages, 6 figures

  33. arXiv:2503.14655  [pdf, other]

    q-bio.NC cs.AI cs.CV eess.IV

    Core-Periphery Principle Guided State Space Model for Functional Connectome Classification

    Authors: Minheng Chen, Xiaowei Yu, Jing Zhang, Tong Chen, Chao Cao, Yan Zhuang, Yanjun Lyu, Lu Zhang, Tianming Liu, Dajiang Zhu

    Abstract: Understanding the organization of human brain networks has become a central focus in neuroscience, particularly in the study of functional connectivity, which plays a crucial role in diagnosing neurological disorders. Advances in functional magnetic resonance imaging and machine learning techniques have significantly improved brain network analysis. However, traditional machine learning approaches… ▽ More

    Submitted 18 March, 2025; originally announced March 2025.

  34. arXiv:2503.13330  [pdf, ps, other]

    eess.IV cs.AI cs.CV

    LEAVS: An LLM-based Labeler for Abdominal CT Supervision

    Authors: Ricardo Bigolin Lanfredi, Yan Zhuang, Mark Finkelstein, Praveen Thoppey Srinivasan Balamuralikrishna, Luke Krembs, Brandon Khoury, Arthi Reddy, Pritam Mukherjee, Neil M. Rofsky, Ronald M. Summers

    Abstract: Extracting structured labels from radiology reports has been employed to create vision models to simultaneously detect several types of abnormalities. However, existing works focus mainly on the chest region. Few works have been investigated on abdominal radiology reports due to more complex anatomy and a wider range of pathologies in the abdomen. We propose LEAVS (Large language model Extractor f… ▽ More

    Submitted 28 May, 2025; v1 submitted 17 March, 2025; originally announced March 2025.

    Comments: Early acceptance (top 9% of submissions) for MICCAI 2025

  35. arXiv:2503.11383  [pdf, other]

    hep-ex

    Study of $φ\to K\bar{K}$ and $K_{S}^{0}-K_{L}^{0}$ asymmetry in the amplitude analysis of $D_{s}^{+} \to K_{S}^{0}K_{L}^{0}π^{+}$ decay

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (701 additional authors not shown)

    Abstract: Using $e^+e^-$ annihilation data corresponding to a total integrated luminosity of 7.33 $\rm fb^{-1}$ collected at center-of-mass energies between 4.128 and 4.226~GeV with the BESIII detector, we provide the first amplitude analysis and absolute branching fraction measurement of the hadronic decay $D_{s}^{+} \to K_{S}^{0}K_{L}^{0}π^{+}$. The branching fraction of… ▽ More

    Submitted 23 March, 2025; v1 submitted 14 March, 2025; originally announced March 2025.

    Comments: 11 pages, 4 figures

  36. arXiv:2503.11001  [pdf, other]

    eess.SY

    A Weighted Predict-and-Optimize Framework for Power System Operation Considering Varying Impacts of Uncertainty

    Authors: Yingrui Zhuang, Lin Cheng, Can Wan, Rui Xie, Ning Qi, Yue Chen

    Abstract: Integrating prediction and optimization enhances decision-making quality by yielding near optimal solutions. Given that prediction errors associated with multiple uncertainties have varying impacts on downstream decision-making, improving the prediction accuracy of critical uncertainties with significant impacts on decision-making quality yields better optimization results. Inspired by this observ… ▽ More

    Submitted 13 March, 2025; originally announced March 2025.

    Comments: This is a paper submitted to IEEE Transactions on Power Systems

  37. arXiv:2503.09640  [pdf, other]

    cs.GR cs.CV

    Physics-Aware Human-Object Rendering from Sparse Views via 3D Gaussian Splatting

    Authors: Weiquan Wang, Jun Xiao, Yueting Zhuang, Long Chen

    Abstract: Rendering realistic human-object interactions (HOIs) from sparse-view inputs is challenging due to occlusions and incomplete observations, yet crucial for various real-world applications. Existing methods always struggle with either low rendering qualities (e.g., visual fidelity and physically plausible HOIs) or high computational costs. To address these limitations, we propose HOGS (Human-Object R…

    Submitted 12 March, 2025; originally announced March 2025.

  38. arXiv:2503.07640  [pdf]

    cs.LG cs.AI q-bio.NC

    BrainNet-MoE: Brain-Inspired Mixture-of-Experts Learning for Neurological Disease Identification

    Authors: Jing Zhang, Xiaowei Yu, Tong Chen, Chao Cao, Mingheng Chen, Yan Zhuang, Yanjun Lyu, Lu Zhang, Li Su, Tianming Liu, Dajiang Zhu

    Abstract: Lewy body dementia (LBD) is the second most common neurodegenerative dementia after Alzheimer's disease (AD). Early differentiation between AD and LBD is crucial because they require different treatment approaches, but this is challenging due to significant clinical overlap, heterogeneity, complex pathogenesis, and the rarity of LBD. While recent advances in artificial intelligence (AI) demons…

    Submitted 5 March, 2025; originally announced March 2025.

  39. arXiv:2503.06998  [pdf, other]

    cs.CV

    SOYO: A Tuning-Free Approach for Video Style Morphing via Style-Adaptive Interpolation in Diffusion Models

    Authors: Haoyu Zheng, Qifan Yu, Binghe Yu, Yang Dai, Wenqiao Zhang, Juncheng Li, Siliang Tang, Yueting Zhuang

    Abstract: Diffusion models have achieved remarkable progress in image and video stylization. However, most existing methods focus on single-style transfer, while video stylization involving multiple styles necessitates seamless transitions between them. We refer to this smooth style transition between video frames as video style morphing. Current approaches often generate stylized video frames with disconti…

    Submitted 10 March, 2025; originally announced March 2025.

  40. arXiv:2503.06692  [pdf, other]

    cs.CL cs.AI

    InftyThink: Breaking the Length Limits of Long-Context Reasoning in Large Language Models

    Authors: Yuchen Yan, Yongliang Shen, Yang Liu, Jin Jiang, Mengdi Zhang, Jian Shao, Yueting Zhuang

    Abstract: Advanced reasoning in large language models has achieved remarkable performance on challenging tasks, but the prevailing long-context reasoning paradigm faces critical limitations: quadratic computational scaling with sequence length, reasoning constrained by maximum context boundaries, and performance degradation beyond pre-training context windows. Existing approaches primarily compress reasonin…

    Submitted 24 May, 2025; v1 submitted 9 March, 2025; originally announced March 2025.

    Comments: Project Page: https://zju-real.github.io/InftyThink Code: https://github.com/ZJU-REAL/InftyThink

  41. arXiv:2503.06470  [pdf, other]

    cs.AI cs.CL cs.CV

    Think Twice, Click Once: Enhancing GUI Grounding via Fast and Slow Systems

    Authors: Fei Tang, Yongliang Shen, Hang Zhang, Siqi Chen, Guiyang Hou, Wenqi Zhang, Wenqiao Zhang, Kaitao Song, Weiming Lu, Yueting Zhuang

    Abstract: Humans can flexibly switch between different modes of thinking based on task complexity: from rapid intuitive judgments to in-depth analytical understanding. However, current Graphical User Interface (GUI) grounding systems which locate interface elements based on natural language instructions rely solely on immediate prediction without reasoning, struggling to understand complex interface layouts…

    Submitted 9 March, 2025; originally announced March 2025.

  42. arXiv:2503.05666  [pdf, other]

    cs.RO eess.SY

    Kinodynamic Model Predictive Control for Energy Efficient Locomotion of Legged Robots with Parallel Elasticity

    Authors: Yulun Zhuang, Yichen Wang, Yanran Ding

    Abstract: In this paper, we introduce a kinodynamic model predictive control (MPC) framework that exploits unidirectional parallel springs (UPS) to improve the energy efficiency of dynamic legged robots. The proposed method employs a hierarchical control structure, where the solution of MPC with simplified dynamic models is used to warm-start the kinodynamic MPC, which accounts for nonlinear centroidal dyna…

    Submitted 7 March, 2025; originally announced March 2025.

    Comments: 7 pages, 6 figures. Accepted for publication at ICRA 2025

  43. arXiv:2503.05382  [pdf, other]

    hep-ex

    Measurement of the branching fractions of $D^+ \to K^+K^-π^+π^+π^-$, $φπ^+π^+π^-$, $K^0_SK^+π^+π^-π^0$, $K^0_SK^+η$, and $K^0_SK^+ω$ decays

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai, et al. (693 additional authors not shown)

    Abstract: Using $20.3~\mathrm{fb}^{-1}$ of $e^+e^-$ collision data collected at a center-of-mass energy of 3.773 GeV with the BESIII detector operating at the BEPCII collider, the branching fractions of three hadronic charm meson decays, $D^+\to φπ^+π^+π^-$, $D^+\to K^0_SK^+π^+π^-π^0$, and $D^+\to K^0_SK^+ω$, are measured for the first time to be $(0.54\pm0.19\pm0.02)\times 10^{-4}$,…

    Submitted 7 March, 2025; originally announced March 2025.

    Comments: 11 pages, 3 figures

    Report number: BAM-00841

  44. arXiv:2503.04095  [pdf, other]

    cs.CL cs.AI

    Chart-HQA: A Benchmark for Hypothetical Question Answering in Charts

    Authors: Xiangnan Chen, Yuancheng Fang, Qian Xiao, Juncheng Li, Jun Lin, Siliang Tang, Yi Yang, Yueting Zhuang

    Abstract: Multimodal Large Language Models (MLLMs) have garnered significant attention for their strong visual-semantic understanding. Most existing chart benchmarks evaluate MLLMs' ability to parse information from charts to answer questions. However, they overlook the inherent output biases of MLLMs, where models rely on their parametric memory to answer questions rather than genuinely understanding the c…

    Submitted 7 March, 2025; v1 submitted 6 March, 2025; originally announced March 2025.

    Comments: Under review

  45. arXiv:2503.02268  [pdf, other]

    cs.AI

    AppAgentX: Evolving GUI Agents as Proficient Smartphone Users

    Authors: Wenjia Jiang, Yangyang Zhuang, Chenxi Song, Xu Yang, Joey Tianyi Zhou, Chi Zhang

    Abstract: Recent advancements in Large Language Models (LLMs) have led to the development of intelligent LLM-based agents capable of interacting with graphical user interfaces (GUIs). These agents demonstrate strong reasoning and adaptability, enabling them to perform complex tasks that traditionally required predefined rules. However, the reliance on step-by-step reasoning in LLM-based agents often results…

    Submitted 14 April, 2025; v1 submitted 3 March, 2025; originally announced March 2025.

  46. arXiv:2503.02242  [pdf, other]

    cs.CV eess.IV

    $\mathbf{Φ}$-GAN: Physics-Inspired GAN for Generating SAR Images Under Limited Data

    Authors: Xidan Zhang, Yihan Zhuang, Qian Guo, Haodong Yang, Xuelin Qian, Gong Cheng, Junwei Han, Zhongling Huang

    Abstract: Approaches for improving generative adversarial networks (GANs) training under a few samples have been explored for natural images. However, these methods have limited effectiveness for synthetic aperture radar (SAR) images, as they do not account for the unique electromagnetic scattering properties of SAR. To remedy this, we propose a physics-inspired regularization method dubbed $Φ$-GAN, which i…

    Submitted 3 March, 2025; originally announced March 2025.

  47. arXiv:2503.02196  [pdf, ps, other]

    hep-ex

    First Measurement of the Decay Dynamics in the Semileptonic Transition of the $D^{+(0)}$ into the Axial-vector Meson $\bar K_1(1270)$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai, et al. (680 additional authors not shown)

    Abstract: Using $e^+e^-$ collision data taken at the center-of-mass energy of 3.773 GeV with the BESIII detector, corresponding to an integrated luminosity of 20.3 fb$^{-1}$, we report the first amplitude and angular analyses of the semileptonic decays $D^{+(0)}\to K^-π^+π^{0(-)} e^+ν_e$. From the amplitude analysis, we determine for the first time the hadronic form factors of the semileptonic $D$ decays in…

    Submitted 3 March, 2025; originally announced March 2025.

    Comments: 15 pages, 6 figures, submitted to PRL

  48. arXiv:2502.21239  [pdf, other]

    cs.CL

    Semantic Volume: Quantifying and Detecting both External and Internal Uncertainty in LLMs

    Authors: Xiaomin Li, Zhou Yu, Ziji Zhang, Yingying Zhuang, Swair Shah, Narayanan Sadagopan, Anurag Beniwal

    Abstract: Large language models (LLMs) have demonstrated remarkable performance across diverse tasks by encoding vast amounts of factual knowledge. However, they are still prone to hallucinations, generating incorrect or misleading information, often accompanied by high uncertainty. Existing methods for hallucination detection primarily focus on quantifying internal uncertainty, which arises from missing or…

    Submitted 5 May, 2025; v1 submitted 28 February, 2025; originally announced February 2025.

  49. Improved measurement of absolute branching fraction of the inclusive decay $Λ_{c}^{+} \to K_{S}^{0} X$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai, et al. (679 additional authors not shown)

    Abstract: By analyzing $4.5$ fb$^{-1}$ of $e^{+}e^{-}$ collision data accumulated with the BESIII detector at center-of-mass energies ranging from $4599.53$ MeV to $4698.82$ MeV, we report the measurement of the absolute branching fraction (BF) of the inclusive decay $Λ_{c}^{+} \to K_{S}^{0} X$ using the double-tag technique. The result is $\mathcal{B}(Λ_{c}^{+} \to K_{S}^{0} X)=(10.9\pm0.2\pm0.1)\%$, where…

    Submitted 21 June, 2025; v1 submitted 28 February, 2025; originally announced February 2025.

    Journal ref: J. High Energ. Phys. 2025, 194 (2025)

  50. arXiv:2502.20742  [pdf, other]

    cs.CV cs.AI cs.CL

    Structured Preference Optimization for Vision-Language Long-Horizon Task Planning

    Authors: Xiwen Liang, Min Lin, Weiqi Ruan, Rongtao Xu, Yuecheng Liu, Jiaqi Chen, Bingqian Lin, Yuzheng Zhuang, Xiaodan Liang

    Abstract: Existing methods for vision-language task planning excel in short-horizon tasks but often fall short in complex, long-horizon planning within dynamic environments. These challenges primarily arise from the difficulty of effectively training models to produce high-quality reasoning processes for long-horizon tasks. To address this, we propose Structured Preference Optimization (SPO), which aims to…

    Submitted 15 May, 2025; v1 submitted 28 February, 2025; originally announced February 2025.

    Comments: 18 pages