-
Observation of the decays $B^{+} \to Σ_{c}(2455)^{++} \overlineΞ_{c}^{-}$ and $B^{0} \to Σ_{c}(2455)^{0} \overlineΞ_{c}^{0}$
Authors:
Belle,
Belle II Collaborations,
:,
M. Abumusabh,
I. Adachi,
L. Aggarwal,
H. Ahmed,
Y. Ahn,
H. Aihara,
N. Akopov,
S. Alghamdi,
M. Alhakami,
A. Aloisio,
N. Althubiti,
K. Amos,
N. Anh Ky,
D. M. Asner,
H. Atmacan,
T. Aushev,
V. Aushev,
R. Ayad,
V. Babu,
H. Bae,
N. K. Baghel,
S. Bahinipati
, et al. (364 additional authors not shown)
Abstract:
We report the first observation of the two-body baryonic decays $B^{+} \to Σ_{c}(2455)^{++} \overlineΞ_{c}^{-}$ and $B^{0} \to Σ_{c}(2455)^{0} \overlineΞ_{c}^{0}$ with significances of $7.3\,σ$ and $6.2\,σ$, respectively, including statistical and systematic uncertainties. The branching fractions are measured to be…
▽ More
We report the first observation of the two-body baryonic decays $B^{+} \to Σ_{c}(2455)^{++} \overlineΞ_{c}^{-}$ and $B^{0} \to Σ_{c}(2455)^{0} \overlineΞ_{c}^{0}$ with significances of $7.3\,σ$ and $6.2\,σ$, respectively, including statistical and systematic uncertainties. The branching fractions are measured to be $\mathcal{B}(B^{+} \to Σ_{c}(2455)^{++} \overlineΞ_{c}^{-}) = (5.74 \pm 1.11 \pm 0.42_{-1.53}^{+2.47}) \times 10^{-4}$ and $\mathcal{B}(B^{0} \to Σ_{c}(2455)^{0} \overlineΞ_{c}^{0}) = (4.83 \pm 1.12 \pm 0.37_{-0.60}^{+0.72}) \times 10^{-4}$. The first and second uncertainties are statistical and systematic, respectively, while the third ones arise from the absolute branching fractions of $\overlineΞ_{c}^{-}$ or $\overlineΞ_{c}^{0}$ decays. The data samples used for this analysis have integrated luminosities of 711~$\mathrm{fb}^{-1}$ and 365~$\mathrm{fb}^{-1}$, and were collected at the $Υ(4S)$ resonance by the Belle and Belle~II detectors operating at the KEKB and SuperKEKB asymmetric-energy $e^{+}e^{-}$ colliders, respectively.
△ Less
Submitted 7 July, 2025;
originally announced July 2025.
-
Strongly polarised radio pulses from a new white-dwarf-hosting long-period transient
Authors:
Sanne Bloot,
Harish K. Vedantham,
Cees G. Bassa,
Joseph R. Callingham,
William M. J. Best,
Michael C. Liu,
Eugene A. Magnier,
Timothy W. Shimwell,
Trent J. Dupuy
Abstract:
Long-period transients (LPTs) are a new and enigmatic class of objects that produce bright pulsations in the radio, with periods far exceeding those seen in rotationally powered pulsars. The proposed progenitors for LPTs are contested, with white dwarfs or magnetars being likely candidates. Here, we present the discovery of ILT\,J163430+445010, a new LPT detected in a blind search for Stokes\,V tr…
▽ More
Long-period transients (LPTs) are a new and enigmatic class of objects that produce bright pulsations in the radio, with periods far exceeding those seen in rotationally powered pulsars. The proposed progenitors for LPTs are contested, with white dwarfs or magnetars being likely candidates. Here, we present the discovery of ILT\,J163430+445010, a new LPT detected in a blind search for Stokes\,V transients in the LOFAR Two-Metre Sky Survey. Unusual for LPTs, J1634+44 shows pulses that are 100\% circularly polarised, as well as pulses that are 100\% linearly polarised, with the polarisation state changing from pulse to pulse. We detect 19 pulses in total, each with a total polarisation fraction of $\sim100\%$ and a pulse duration of at most 10\,s. The pulses show a periodicity at $841.24808\pm0.00015$\,s, implying a low duty cycle of $0.012$. J1634+44 has a marginally detected counterpart in the ultraviolet GALEX MIS survey and the ultraviolet/optical UNIONS survey, suggesting that it contains a white dwarf with an effective temperature between 15000\,K and 33000\,K. We do not detect J1634+44 with a deep $J$-band exposure with UKIRT at a $3σ$ AB magnitude limit of 24.7, ruling out a main-sequence star or ultracool dwarf with a spectral type earlier than M7. The pulses from J1634+44 follow a particular pattern, with two pulses being produced every five periods after a waiting time of two or three periods. This pattern could be a result of spin-orbit coupling in a binary system with a 5:2 or 5:3 resonance, where a companion induces beamed radio emission on the white dwarf. The companion is most likely an ultracool dwarf or another white dwarf, making J1634+44 unique among the currently known sample of LPTs.
△ Less
Submitted 7 July, 2025;
originally announced July 2025.
-
Measurement of the $ D^{0}\rightarrow K^{-}π^{+}e^{+}e^{-} $ branching fraction and search for $ D^{0}\rightarrow π^{+}π^{-}e^{+}e^{-} $ and $D^{0}\rightarrow K^{+}K^{-}e^{+}e^{-} $ decays at Belle
Authors:
Belle,
Belle II Collaborations,
:,
I. Adachi,
L. Aggarwal,
H. Ahmed,
Y. Ahn,
H. Aihara,
N. Akopov,
S. Alghamdi,
M. Alhakami,
A. Aloisio,
N. Althubiti,
K. Amos,
M. Angelsmark,
N. Anh Ky,
C. Antonioli,
D. M. Asner,
H. Atmacan,
T. Aushev,
V. Aushev,
M. Aversano,
R. Ayad,
V. Babu,
H. Bae
, et al. (458 additional authors not shown)
Abstract:
We present a study of the rare charm meson decays $ D^{0}\rightarrow K^{+}K^{-}e^{+}e^{-} $, $ π^{+}π^{-}e^{+}e^{-} $, and $ K^{-}π^{+}e^{+}e^{-} $ using a 942 fb$^{-1}$ data set collected by the Belle detector at the KEKB asymmetric-energy $ e^{+}e^{-} $ collider. We use $ D^{0} $ candidates identified by the charge of the pion in $ D^{*} \rightarrow D^{0} π$ decays and normalize the branching fr…
▽ More
We present a study of the rare charm meson decays $ D^{0}\rightarrow K^{+}K^{-}e^{+}e^{-} $, $ π^{+}π^{-}e^{+}e^{-} $, and $ K^{-}π^{+}e^{+}e^{-} $ using a 942 fb$^{-1}$ data set collected by the Belle detector at the KEKB asymmetric-energy $ e^{+}e^{-} $ collider. We use $ D^{0} $ candidates identified by the charge of the pion in $ D^{*} \rightarrow D^{0} π$ decays and normalize the branching fractions to $ D^{0} \rightarrow K^{-}π^{+}π^{-}π^{+} $ decays. The branching fraction for decay $ D^{0} \rightarrow K^{-}π^{+}e^{+}e^{-} $ is measured to be (39.6 $\pm$ 4.5 (stat) $\pm$ 2.9 (syst)) $\times$ $10^{-7}$, with the dielectron mass in the $ ρ/ω$ mass region $ 675 < m_{ee} < 875 $ MeV$/c^{2}$. We also search for $ D^{0}\rightarrow h^{-} h^{(\prime)+}e^{+}e^{-} $ ($ h^{(\prime)}=K,\,π$) decays with the dielectron mass near the $η$ and $φ$ resonances, and away from these resonances for the $ K^{+}K^{-}e^{+}e^{-} $ and $ π^{+}π^{-}e^{+}e^{-} $ modes. For these modes, we find no significant signals and set 90$\%$ confidence level upper limits on their branching fractions at the $\mathcal{O}$(10$^{-7}$) level.
△ Less
Submitted 7 July, 2025;
originally announced July 2025.
-
Cross sections of $η$ mesons in $p$$+$$p$ collisions at forward rapidity at $\sqrt{s}=500$ GeV and central rapidity at $\sqrt{s}=510$ GeV
Authors:
PHENIX Collaboration,
N. J. Abdulameer,
U. Acharya,
A. Adare,
C. Aidala,
N. N. Ajitanand,
Y. Akiba,
R. Akimoto,
H. Al-Ta'ani,
J. Alexander,
M. Alfred,
D. Anderson,
K. R. Andrews,
A. Angerami,
S. Antsupov,
K. Aoki,
N. Apadula,
E. Appelt,
Y. Aramaki,
R. Armendariz,
H. Asano,
E. C. Aschenauer,
E. T. Atomssa,
T. C. Awes,
B. Azmoun
, et al. (476 additional authors not shown)
Abstract:
We present the first measurements of the forward and midrapidity $η$-meson cross sections from $p$$+$$p$ collisions at $\sqrt{s}=500$ and $510$~GeV, respectively. We also report the midrapidity $η/π^0$ ratio at 510 GeV. The forward cross section is measured differentially in $η$-meson transverse momentum ($p_T$) from 1.0 to 6.5~GeV/$c$ for pseudorapidity $3.0<|η|<3.8$. The midrapidity cross sectio…
▽ More
We present the first measurements of the forward and midrapidity $η$-meson cross sections from $p$$+$$p$ collisions at $\sqrt{s}=500$ and $510$~GeV, respectively. We also report the midrapidity $η/π^0$ ratio at 510 GeV. The forward cross section is measured differentially in $η$-meson transverse momentum ($p_T$) from 1.0 to 6.5~GeV/$c$ for pseudorapidity $3.0<|η|<3.8$. The midrapidity cross section is measured from 3.5 to 44 GeV/$c$ for pseudorapidity $|η|<0.35$. Both cross sections serve as critical inputs to an updated global analysis of the $η$-meson fragmentation functions.
△ Less
Submitted 7 July, 2025;
originally announced July 2025.
-
MurreNet: Modeling Holistic Multimodal Interactions Between Histopathology and Genomic Profiles for Survival Prediction
Authors:
Mingxin Liu,
Chengfei Cai,
Jun Li,
Pengbo Xu,
Jinze Li,
Jiquan Ma,
Jun Xu
Abstract:
Cancer survival prediction requires integrating pathological Whole Slide Images (WSIs) and genomic profiles, a challenging task due to the inherent heterogeneity and the complexity of modeling both inter- and intra-modality interactions. Current methods often employ straightforward fusion strategies for multimodal feature integration, failing to comprehensively capture modality-specific and modali…
▽ More
Cancer survival prediction requires integrating pathological Whole Slide Images (WSIs) and genomic profiles, a challenging task due to the inherent heterogeneity and the complexity of modeling both inter- and intra-modality interactions. Current methods often employ straightforward fusion strategies for multimodal feature integration, failing to comprehensively capture modality-specific and modality-common interactions, resulting in a limited understanding of multimodal correlations and suboptimal predictive performance. To mitigate these limitations, this paper presents a Multimodal Representation Decoupling Network (MurreNet) to advance cancer survival analysis. Specifically, we first propose a Multimodal Representation Decomposition (MRD) module to explicitly decompose paired input data into modality-specific and modality-shared representations, thereby reducing redundancy between modalities. Furthermore, the disentangled representations are further refined then updated through a novel training regularization strategy that imposes constraints on distributional similarity, difference, and representativeness of modality features. Finally, the augmented multimodal features are integrated into a joint representation via proposed Deep Holistic Orthogonal Fusion (DHOF) strategy. Extensive experiments conducted on six TCGA cancer cohorts demonstrate that our MurreNet achieves state-of-the-art (SOTA) performance in survival prediction.
△ Less
Submitted 7 July, 2025;
originally announced July 2025.
-
MambaVideo for Discrete Video Tokenization with Channel-Split Quantization
Authors:
Dawit Mureja Argaw,
Xian Liu,
Joon Son Chung,
Ming-Yu Liu,
Fitsum Reda
Abstract:
Discrete video tokenization is essential for efficient autoregressive generative modeling due to the high dimensionality of video data. This work introduces a state-of-the-art discrete video tokenizer with two key contributions. First, we propose a novel Mamba-based encoder-decoder architecture that overcomes the limitations of previous sequencebased tokenizers. Second, we introduce a new quantiza…
▽ More
Discrete video tokenization is essential for efficient autoregressive generative modeling due to the high dimensionality of video data. This work introduces a state-of-the-art discrete video tokenizer with two key contributions. First, we propose a novel Mamba-based encoder-decoder architecture that overcomes the limitations of previous sequencebased tokenizers. Second, we introduce a new quantization scheme, channel-split quantization, which significantly enhances the representational power of quantized latents while preserving the token count. Our model sets a new state-of-the-art, outperforming both causal 3D convolutionbased and Transformer-based approaches across multiple datasets. Experimental results further demonstrate its robustness as a tokenizer for autoregressive video generation.
△ Less
Submitted 6 July, 2025;
originally announced July 2025.
-
Low-mass vector-meson production at forward rapidity in $p$$+$$p$ and Au$+$Au collisions at $\sqrt{s_{_{NN}}}=200$~GeV
Authors:
PHENIX Collaboration,
N. J. Abdulameer,
U. Acharya,
A. Adare,
C. Aidala,
N. N. Ajitanand,
Y. Akiba,
M. Alfred,
D. Anderson,
V. Andrieux,
S. Antsupov,
N. Apadula,
H. Asano,
B. Azmoun,
V. Babintsev,
M. Bai,
N. S. Bandara,
B. Bannier,
E. Bannikov,
K. N. Barish,
S. Bathe,
A. Bazilevsky,
M. Beaumier,
S. Beckman,
R. Belmont
, et al. (331 additional authors not shown)
Abstract:
The PHENIX experiment at the Relativistic Heavy Ion Collider has measured low-mass vector-meson ($ω+ρ$ and $φ$) production through the dimuon decay channel at forward rapidity $(1.2<|\mbox{y}|<2.2)$ in $p$$+$$p$ and Au$+$Au collisions at $\sqrt{s_{_{NN}}}=200$~GeV. The low-mass vector-meson yield and nuclear-modification factor were measured as a function of the average number of participating nuc…
▽ More
The PHENIX experiment at the Relativistic Heavy Ion Collider has measured low-mass vector-meson ($ω+ρ$ and $φ$) production through the dimuon decay channel at forward rapidity $(1.2<|\mbox{y}|<2.2)$ in $p$$+$$p$ and Au$+$Au collisions at $\sqrt{s_{_{NN}}}=200$~GeV. The low-mass vector-meson yield and nuclear-modification factor were measured as a function of the average number of participating nucleons, $\langle N_{\rm part}\rangle$, and the transverse momentum $p_T$. These results were compared with those obtained via the kaon decay channel in a similar $p_T$ range at midrapidity. The nuclear-modification factors in both rapidity regions are consistent within the uncertainties. A comparison of the $ω+ρ$ and $J/ψ$ mesons reveals that the light and heavy flavors are consistently suppressed across both $p_T$ and ${\langle}N_{\rm part}\rangle$. In contrast, the $φ$ meson displays a nuclear-modification factor consistent with unity, suggesting strangeness enhancement in the medium formed.
△ Less
Submitted 6 July, 2025;
originally announced July 2025.
-
iBreath: Usage Of Breathing Gestures as Means of Interactions
Authors:
Mengxi Liu,
Daniel Geißler,
Deepika Gurung,
Hymalai Bello,
Bo Zhou,
Sizhen Bian,
Paul Lukowicz,
Passant Elagroudy
Abstract:
Breathing is a spontaneous but controllable body function that can be used for hands-free interaction. Our work introduces "iBreath", a novel system to detect breathing gestures similar to clicks using bio-impedance. We evaluated iBreath's accuracy and user experience using two lab studies (n=34). Our results show high detection accuracy (F1-scores > 95.2%). Furthermore, the users found the gestur…
▽ More
Breathing is a spontaneous but controllable body function that can be used for hands-free interaction. Our work introduces "iBreath", a novel system to detect breathing gestures similar to clicks using bio-impedance. We evaluated iBreath's accuracy and user experience using two lab studies (n=34). Our results show high detection accuracy (F1-scores > 95.2%). Furthermore, the users found the gestures easy to use and comfortable. Thus, we developed eight practical guidelines for the future development of breathing gestures. For example, designers can train users on new gestures within just 50 seconds (five trials), and achieve robust performance with both user-dependent and user-independent models trained on data from 21 participants, each yielding accuracies above 90%. Users preferred single clicks and disliked triple clicks. The median gesture duration is 3.5-5.3 seconds. Our work provides solid ground for researchers to experiment with creating breathing gestures and interactions.
△ Less
Submitted 5 July, 2025;
originally announced July 2025.
-
Hyperbolic Kernel Graph Neural Networks for Neurocognitive Decline Analysis from Multimodal Brain Imaging
Authors:
Meimei Yang,
Yongheng Sun,
Qianqian Wang,
Andrea Bozoki,
Maureen Kohi,
Mingxia Liu
Abstract:
Multimodal neuroimages, such as diffusion tensor imaging (DTI) and resting-state functional MRI (fMRI), offer complementary perspectives on brain activities by capturing structural or functional interactions among brain regions. While existing studies suggest that fusing these multimodal data helps detect abnormal brain activity caused by neurocognitive decline, they are generally implemented in E…
▽ More
Multimodal neuroimages, such as diffusion tensor imaging (DTI) and resting-state functional MRI (fMRI), offer complementary perspectives on brain activities by capturing structural or functional interactions among brain regions. While existing studies suggest that fusing these multimodal data helps detect abnormal brain activity caused by neurocognitive decline, they are generally implemented in Euclidean space and can't effectively capture intrinsic hierarchical organization of structural/functional brain networks. This paper presents a hyperbolic kernel graph fusion (HKGF) framework for neurocognitive decline analysis with multimodal neuroimages. It consists of a multimodal graph construction module, a graph representation learning module that encodes brain graphs in hyperbolic space through a family of hyperbolic kernel graph neural networks (HKGNNs), a cross-modality coupling module that enables effective multimodal data fusion, and a hyperbolic neural network for downstream predictions. Notably, HKGNNs represent graphs in hyperbolic space to capture both local and global dependencies among brain regions while preserving the hierarchical structure of brain networks. Extensive experiments involving over 4,000 subjects with DTI and/or fMRI data suggest the superiority of HKGF over state-of-the-art methods in two neurocognitive decline prediction tasks. HKGF is a general framework for multimodal data analysis, facilitating objective quantification of structural/functional brain connectivity changes associated with neurocognitive decline.
△ Less
Submitted 24 June, 2025;
originally announced July 2025.
-
Partial Weakly-Supervised Oriented Object Detection
Authors:
Mingxin Liu,
Peiyuan Zhang,
Yuan Liu,
Wei Zhang,
Yue Zhou,
Ning Liao,
Ziyang Gong,
Junwei Luo,
Zhirui Wang,
Yi Yu,
Xue Yang
Abstract:
The growing demand for oriented object detection (OOD) across various domains has driven significant research in this area. However, the high cost of dataset annotation remains a major concern. Current mainstream OOD algorithms can be mainly categorized into three types: (1) fully supervised methods using complete oriented bounding box (OBB) annotations, (2) semi-supervised methods using partial O…
▽ More
The growing demand for oriented object detection (OOD) across various domains has driven significant research in this area. However, the high cost of dataset annotation remains a major concern. Current mainstream OOD algorithms can be mainly categorized into three types: (1) fully supervised methods using complete oriented bounding box (OBB) annotations, (2) semi-supervised methods using partial OBB annotations, and (3) weakly supervised methods using weak annotations such as horizontal boxes or points. However, these algorithms inevitably increase the cost of models in terms of annotation speed or annotation cost. To address this issue, we propose:(1) the first Partial Weakly-Supervised Oriented Object Detection (PWOOD) framework based on partially weak annotations (horizontal boxes or single points), which can efficiently leverage large amounts of unlabeled data, significantly outperforming weakly supervised algorithms trained with partially weak annotations, also offers a lower cost solution; (2) Orientation-and-Scale-aware Student (OS-Student) model capable of learning orientation and scale information with only a small amount of orientation-agnostic or scale-agnostic weak annotations; and (3) Class-Agnostic Pseudo-Label Filtering strategy (CPF) to reduce the model's sensitivity to static filtering thresholds. Comprehensive experiments on DOTA-v1.0/v1.5/v2.0 and DIOR datasets demonstrate that our PWOOD framework performs comparably to, or even surpasses, traditional semi-supervised algorithms.
△ Less
Submitted 3 July, 2025;
originally announced July 2025.
-
Fair Deepfake Detectors Can Generalize
Authors:
Harry Cheng,
Ming-Hui Liu,
Yangyang Guo,
Tianyi Wang,
Liqiang Nie,
Mohan Kankanhalli
Abstract:
Deepfake detection models face two critical challenges: generalization to unseen manipulations and demographic fairness among population groups. However, existing approaches often demonstrate that these two objectives are inherently conflicting, revealing a trade-off between them. In this paper, we, for the first time, uncover and formally define a causal relationship between fairness and generali…
▽ More
Deepfake detection models face two critical challenges: generalization to unseen manipulations and demographic fairness among population groups. However, existing approaches often demonstrate that these two objectives are inherently conflicting, revealing a trade-off between them. In this paper, we, for the first time, uncover and formally define a causal relationship between fairness and generalization. Building on the back-door adjustment, we show that controlling for confounders (data distribution and model capacity) enables improved generalization via fairness interventions. Motivated by this insight, we propose Demographic Attribute-insensitive Intervention Detection (DAID), a plug-and-play framework composed of: i) Demographic-aware data rebalancing, which employs inverse-propensity weighting and subgroup-wise feature normalization to neutralize distributional biases; and ii) Demographic-agnostic feature aggregation, which uses a novel alignment loss to suppress sensitive-attribute signals. Across three cross-domain benchmarks, DAID consistently achieves superior performance in both fairness and generalization compared to several state-of-the-art detectors, validating both its theoretical foundation and practical effectiveness.
△ Less
Submitted 3 July, 2025;
originally announced July 2025.
-
Midveins regulate the shape formation of drying leaves
Authors:
Kexin Guo,
Yafei Zhang,
Massimo Paradiso,
Yuchen Long,
K. Jimmy Hsia,
Mingchao Liu
Abstract:
Dried leaves in nature often exhibit curled and crumpled morphologies, typically attributed to internal strain gradients that produce dome-like shapes. However, the origin of these strain gradients remains poorly understood. Although leaf veins--particularly the midvein--have been suggested to influence shape formation, their mechanical role has not been systematically investigated. Here, we demon…
▽ More
Dried leaves in nature often exhibit curled and crumpled morphologies, typically attributed to internal strain gradients that produce dome-like shapes. However, the origin of these strain gradients remains poorly understood. Although leaf veins--particularly the midvein--have been suggested to influence shape formation, their mechanical role has not been systematically investigated. Here, we demonstrate that mechanical constraints imposed by the midvein play a crucial role in generating the diverse morphologies that emerge during leaf drying. Combining numerical simulations and theoretical analysis, we show that a uniformly shrinking leaf lamina constrained by a non-shrinking midvein gives rise to two distinct types of configurations: curling-dominated and folding-dominated morphologies. In the curling-dominated regime, both S-curled and C-curled shapes emerge, with C-curled configurations more commonly observed due to their lower elastic energy. In contrast, the folding-dominated regime features folding accompanied by edge waviness. Theoretical modeling reveals a linear relationship between midvein curvature and mismatch strain, consistent with simulation results. Moreover, we find that the morphological outcome is governed by the ratio of bending stiffnesses between the lamina and the midvein. We construct a comprehensive phase diagram for the transitions between different configurations. These findings provide a mechanical framework for understanding shape formation in drying leaves, offering new insights into natural morphing processes and informing the design of bio-inspired morphable structures.
△ Less
Submitted 2 July, 2025;
originally announced July 2025.
-
Some remarks on the uncolored versions of the original CFI-graphs
Authors:
Yijia Chen,
Jörg Flum,
Mingjun Liu
Abstract:
The CFI-graphs, named after Cai, Fürer, and Immerman, are central to the study of the graph isomorphism testing and of first-order logic with counting. They are colored graphs, and the coloring plays a role in many of their applications. As usual, it is not hard to remove the coloring by some extra graph gadgets, but at the cost of blowing up the size of the graphs and changing some parameters of…
▽ More
The CFI-graphs, named after Cai, Fürer, and Immerman, are central to the study of the graph isomorphism testing and of first-order logic with counting. They are colored graphs, and the coloring plays a role in many of their applications. As usual, it is not hard to remove the coloring by some extra graph gadgets, but at the cost of blowing up the size of the graphs and changing some parameters of them as well. This might lead to suboptimal combinatorial bounds important to their applications. Since then for some uncolored variants of the CFI-graphs it has been shown that they serve the same purposes. We show that this already applies to the graphs obtained from the original CFI-graphs by forgetting the colors. Moreover, we will see that there is a first-order formula $\varphi(x,y)$ expressing in almost all uncolored CFI-graphs that $x$ and $y$ have the same color in the corresponding colored graphs.
△ Less
Submitted 2 July, 2025;
originally announced July 2025.
-
Search for an Axion-Like Particle in $B\rightarrow K^{(*)} a (\rightarrowγγ)$ Decays at Belle
Authors:
Belle,
Belle II Collaborations,
:,
I. Adachi,
L. Aggarwal,
H. Ahmed,
Y. Ahn,
H. Aihara,
N. Akopov,
S. Alghamdi,
M. Alhakami,
A. Aloisio,
N. Althubiti,
K. Amos,
M. Angelsmark,
N. Anh Ky,
C. Antonioli,
D. M. Asner,
H. Atmacan,
T. Aushev,
V. Aushev,
M. Aversano,
R. Ayad,
V. Babu,
H. Bae
, et al. (400 additional authors not shown)
Abstract:
We report a search for an axion-like particle $a$ in $B\rightarrow K^{(*)} a (\rightarrowγγ)$ decays using data collected with the Belle detector at the KEKB asymmetric energy electron-positron collider. The search is based on a $711 \mathrm{fb^{-1}}$ data sample collected at the $Υ4S$ resonance energy, corresponding to a sample of $772\times10^6$ $Υ4S$ events. In this study, we search for the dec…
▽ More
We report a search for an axion-like particle $a$ in $B\rightarrow K^{(*)} a (\rightarrowγγ)$ decays using data collected with the Belle detector at the KEKB asymmetric energy electron-positron collider. The search is based on a $711 \mathrm{fb^{-1}}$ data sample collected at the $Υ4S$ resonance energy, corresponding to a sample of $772\times10^6$ $Υ4S$ events. In this study, we search for the decay of the axion-like particle into a pair of photons, $a \rightarrow γγ$. We scan the two-photon invariant mass in the range $0.16\ \mathrm{GeV/}c^2-4.50\ \mathrm{GeV}/c^2$ for the $K$ modes and $0.16\ \mathrm{GeV/}c^2-4.20\ \mathrm{GeV}/c^2$ for the $K^{*}$ modes. No significant signal is observed in any of the modes, and 90\% confidence level upper limits are established on the coupling to the $W$ boson, $g_aW$, as a function of $a$ mass. The limits range from $3 \times 10^{-6} \mathrm{GeV}^{-1}$ to $3 \times 10^{-5} \mathrm{GeV}^{-1}$, improving the current constraints on $g_aW$ by a factor of two over the most stringent previous experimental results.
△ Less
Submitted 3 July, 2025; v1 submitted 1 July, 2025;
originally announced July 2025.
-
VQ-VLA: Improving Vision-Language-Action Models via Scaling Vector-Quantized Action Tokenizers
Authors:
Yating Wang,
Haoyi Zhu,
Mingyu Liu,
Jiange Yang,
Hao-Shu Fang,
Tong He
Abstract:
In this paper, we introduce an innovative vector quantization based action tokenizer built upon the largest-scale action trajectory dataset to date, leveraging over 100 times more data than previous approaches. This extensive dataset enables our tokenizer to capture rich spatiotemporal dynamics, resulting in a model that not only accelerates inference but also generates smoother and more coherent…
▽ More
In this paper, we introduce an innovative vector quantization based action tokenizer built upon the largest-scale action trajectory dataset to date, leveraging over 100 times more data than previous approaches. This extensive dataset enables our tokenizer to capture rich spatiotemporal dynamics, resulting in a model that not only accelerates inference but also generates smoother and more coherent action outputs. Once trained, the tokenizer can be seamlessly adapted to a wide range of downstream tasks in a zero-shot manner, from short-horizon reactive behaviors to long-horizon planning. A key finding of our work is that the domain gap between synthetic and real action trajectories is marginal, allowing us to effectively utilize a vast amount of synthetic data during training without compromising real-world performance. To validate our approach, we conducted extensive experiments in both simulated environments and on real robotic platforms. The results demonstrate that as the volume of synthetic trajectory data increases, the performance of our tokenizer on downstream tasks improves significantly-most notably, achieving up to a 30% higher success rate on two real-world tasks in long-horizon scenarios. These findings highlight the potential of our action tokenizer as a robust and scalable solution for real-time embodied intelligence systems, paving the way for more efficient and reliable robotic control in diverse application domains.Project website: https://xiaoxiao0406.github.io/vqvla.github.io
△ Less
Submitted 1 July, 2025;
originally announced July 2025.
-
A Hierarchical and Evolvable Benchmark for Fine-Grained Code Instruction Following with Multi-Turn Feedback
Authors:
Guoliang Duan,
Mingwei Liu,
Yanlin Wang,
Chong Wang,
Xin Peng,
Zibin Zheng
Abstract:
Large language models (LLMs) have advanced significantly in code generation, yet their ability to follow complex programming instructions with layered and diverse constraints remains underexplored. Existing benchmarks often prioritize functional correctness, overlooking the nuanced requirements found in real-world development. We introduce MultiCodeIF, a comprehensive benchmark designed to evaluat…
▽ More
Large language models (LLMs) have advanced significantly in code generation, yet their ability to follow complex programming instructions with layered and diverse constraints remains underexplored. Existing benchmarks often prioritize functional correctness, overlooking the nuanced requirements found in real-world development. We introduce MultiCodeIF, a comprehensive benchmark designed to evaluate instruction-following in code generation across multiple dimensions: constraint type, hierarchical levels, and iterative refinement. Built upon a structured taxonomy of 9 categories and 27 constraint types, MultiCodeIF enables granular assessment of both functional and non-functional instruction adherence. Using an automated pipeline, ConstraGen, we synthesize and evolve 2,021 code tasks sourced from 14 programming languages, supporting multi-turn evaluation through feedback-driven task variants. Empirical evaluation of six state-of-the-art LLMs uncovers substantial performance disparities. The top-performing model, Claude-3-7-Sonnet, achieves 63.0% average constraint satisfaction, while smaller models like Qwen3-1.7B fall to 44.8%. Models perform well on explicit constraints, but struggle with implicit or abstract constraints. Tasks with multiple hierarchical constraints significantly reduce model success rates, from 54.5% in single-level to just 18.8% in multi-level scenarios. However, structured feedback enables progressive improvement: average constraint satisfaction rises from 63.0% to 83.4% over four iterative refinement rounds. MultiCodeIF provides a scalable, constraint-aware, and feedback-sensitive framework to benchmark LLMs under realistic code generation scenarios, bridging the gap between synthetic evaluations and real-world instruction complexity. The full benchmark dataset, evaluation pipeline, and source code are available at https://github.com/SYSUSELab/MultiCodeIF.
△ Less
Submitted 1 July, 2025;
originally announced July 2025.
-
WebANNS: Fast and Efficient Approximate Nearest Neighbor Search in Web Browsers
Authors:
Mugeng Liu,
Siqi Zhong,
Qi Yang,
Yudong Han,
Xuanzhe Liu,
Yun Ma
Abstract:
Approximate nearest neighbor search (ANNS) has become vital to modern AI infrastructure, particularly in retrieval-augmented generation (RAG) applications. Numerous in-browser ANNS engines have emerged to seamlessly integrate with popular LLM-based web applications, while addressing privacy protection and challenges of heterogeneous device deployments. However, web browsers present unique challeng…
▽ More
Approximate nearest neighbor search (ANNS) has become vital to modern AI infrastructure, particularly in retrieval-augmented generation (RAG) applications. Numerous in-browser ANNS engines have emerged to seamlessly integrate with popular LLM-based web applications, while addressing privacy protection and challenges of heterogeneous device deployments. However, web browsers present unique challenges for ANNS, including computational limitations, external storage access issues, and memory utilization constraints, which state-of-the-art (SOTA) solutions fail to address comprehensively. We propose WebANNS, a novel ANNS engine specifically designed for web browsers. WebANNS leverages WebAssembly to overcome computational bottlenecks, designs a lazy loading strategy to optimize data retrieval from external storage, and applies a heuristic approach to reduce memory usage. Experiments show that WebANNS is fast and memory efficient, achieving up to $743.8\times$ improvement in 99th percentile query latency over the SOTA engine, while reducing memory usage by up to 39\%. Note that WebANNS decreases query time from 10 seconds to the 10-millisecond range in browsers, making in-browser ANNS practical with user-acceptable latency.
△ Less
Submitted 1 July, 2025; v1 submitted 1 July, 2025;
originally announced July 2025.
-
On Mitigating Data Sparsity in Conversational Recommender Systems
Authors:
Sixiao Zhang,
Mingrui Liu,
Cheng Long,
Wei Yuan,
Hongxu Chen,
Xiangyu Zhao,
Hongzhi Yin
Abstract:
Conversational recommender systems (CRSs) capture user preference through textual information in dialogues. However, they suffer from data sparsity on two fronts: the dialogue space is vast and linguistically diverse, while the item space exhibits long-tail and sparse distributions. Existing methods struggle with (1) generalizing to varied dialogue expressions due to underutilization of rich textu…
▽ More
Conversational recommender systems (CRSs) capture user preference through textual information in dialogues. However, they suffer from data sparsity on two fronts: the dialogue space is vast and linguistically diverse, while the item space exhibits long-tail and sparse distributions. Existing methods struggle with (1) generalizing to varied dialogue expressions due to underutilization of rich textual cues, and (2) learning informative item representations under severe sparsity. To address these problems, we propose a CRS model named DACRS. It consists of three modules, namely Dialogue Augmentation, Knowledge-Guided Entity Modeling, and Dialogue-Entity Matching. In the Dialogue Augmentation module, we apply a two-stage augmentation pipeline to augment the dialogue context to enrich the data and improve generalizability. In the Knowledge-Guided Entity Modeling, we propose a knowledge graph (KG) based entity substitution and an entity similarity constraint to enhance the expressiveness of entity embeddings. In the Dialogue-Entity Matching module, we fuse the dialogue embedding with the mentioned entity embeddings through a dialogue-guided attention aggregation to acquire user embeddings that contain both the explicit and implicit user preferences. Extensive experiments on two public datasets demonstrate the state-of-the-art performance of DACRS.
△ Less
Submitted 1 July, 2025;
originally announced July 2025.
-
Causal Prompting for Implicit Sentiment Analysis with Large Language Models
Authors:
Jing Ren,
Wenhao Zhou,
Bowen Li,
Mujie Liu,
Nguyen Linh Dan Le,
Jiade Cen,
Liping Chen,
Ziqi Xu,
Xiwei Xu,
Xiaodong Li
Abstract:
Implicit Sentiment Analysis (ISA) aims to infer sentiment that is implied rather than explicitly stated, requiring models to perform deeper reasoning over subtle contextual cues. While recent prompting-based methods using Large Language Models (LLMs) have shown promise in ISA, they often rely on majority voting over chain-of-thought (CoT) reasoning paths without evaluating their causal validity, m…
▽ More
Implicit Sentiment Analysis (ISA) aims to infer sentiment that is implied rather than explicitly stated, requiring models to perform deeper reasoning over subtle contextual cues. While recent prompting-based methods using Large Language Models (LLMs) have shown promise in ISA, they often rely on majority voting over chain-of-thought (CoT) reasoning paths without evaluating their causal validity, making them susceptible to internal biases and spurious correlations. To address this challenge, we propose CAPITAL, a causal prompting framework that incorporates front-door adjustment into CoT reasoning. CAPITAL decomposes the overall causal effect into two components: the influence of the input prompt on the reasoning chains, and the impact of those chains on the final output. These components are estimated using encoder-based clustering and the NWGM approximation, with a contrastive learning objective used to better align the encoder's representation with the LLM's reasoning space. Experiments on benchmark ISA datasets with three LLMs demonstrate that CAPITAL consistently outperforms strong prompting baselines in both accuracy and robustness, particularly under adversarial conditions. This work offers a principled approach to integrating causal inference into LLM prompting and highlights its benefits for bias-aware sentiment reasoning. The source code and case study are available at: https://github.com/whZ62/CAPITAL.
△ Less
Submitted 30 June, 2025;
originally announced July 2025.
-
Customizable ROI-Based Deep Image Compression
Authors:
Jian Jin,
Fanxin Xia,
Feng Ding,
Xinfeng Zhang,
Meiqin Liu,
Yao Zhao,
Weisi Lin,
Lili Meng
Abstract:
Region of Interest (ROI)-based image compression optimizes bit allocation by prioritizing ROI for higher-quality reconstruction. However, as the users (including human clients and downstream machine tasks) become more diverse, ROI-based image compression needs to be customizable to support various preferences. For example, different users may define distinct ROI or require different quality trade-…
▽ More
Region of Interest (ROI)-based image compression optimizes bit allocation by prioritizing ROI for higher-quality reconstruction. However, as the users (including human clients and downstream machine tasks) become more diverse, ROI-based image compression needs to be customizable to support various preferences. For example, different users may define distinct ROI or require different quality trade-offs between ROI and non-ROI. Existing ROI-based image compression schemes predefine the ROI, making it unchangeable, and lack effective mechanisms to balance reconstruction quality between ROI and non-ROI. This work proposes a paradigm for customizable ROI-based deep image compression. First, we develop a Text-controlled Mask Acquisition (TMA) module, which allows users to easily customize their ROI for compression by just inputting the corresponding semantic \emph{text}. It makes the encoder controlled by text. Second, we design a Customizable Value Assign (CVA) mechanism, which masks the non-ROI with a changeable extent decided by users instead of a constant one to manage the reconstruction quality trade-off between ROI and non-ROI. Finally, we present a Latent Mask Attention (LMA) module, where the latent spatial prior of the mask and the latent Rate-Distortion Optimization (RDO) prior of the image are extracted and fused in the latent space, and further used to optimize the latent representation of the source image. Experimental results demonstrate that our proposed customizable ROI-based deep image compression paradigm effectively addresses the needs of customization for ROI definition and mask acquisition as well as the reconstruction quality trade-off management between the ROI and non-ROI.
△ Less
Submitted 2 July, 2025; v1 submitted 30 June, 2025;
originally announced July 2025.
-
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning
Authors:
Bo Liu,
Leon Guertler,
Simon Yu,
Zichen Liu,
Penghui Qi,
Daniel Balcells,
Mickel Liu,
Cheston Tan,
Weiyan Shi,
Min Lin,
Wee Sun Lee,
Natasha Jaques
Abstract:
Recent advances in reinforcement learning have shown that language models can develop sophisticated reasoning through training on tasks with verifiable rewards, but these approaches depend on human-curated problem-answer pairs and domain-specific reward engineering. We introduce SPIRAL, a self-play framework where models learn by playing multi-turn, zero-sum games against continuously improving ve…
▽ More
Recent advances in reinforcement learning have shown that language models can develop sophisticated reasoning through training on tasks with verifiable rewards, but these approaches depend on human-curated problem-answer pairs and domain-specific reward engineering. We introduce SPIRAL, a self-play framework where models learn by playing multi-turn, zero-sum games against continuously improving versions of themselves, eliminating the need for human supervision. Through self-play, SPIRAL generates an infinite curriculum of progressively challenging problems as models must constantly adapt to stronger opponents. To enable this self-play training at scale, We implement a fully online, multi-turn, multi-agent reinforcement learning system for LLMs and propose role-conditioned advantage estimation (RAE) to stabilize multi-agent training. Using SPIRAL, self-play on zero-sum games produces reasoning capabilities that transfer broadly. Training Qwen3-4B-Base on Kuhn Poker alone achieves 8.6% improvement on math and 8.4% on general reasoning, outperforming SFT on 25,000 expert game trajectories. Analysis reveals that this transfer occurs through three cognitive patterns: systematic decomposition, expected value calculation, and case-by-case analysis. Multi-game training (TicTacToe, Kuhn Poker, Simple Negotiation) further enhances performance as each game develops distinct reasoning strengths. Applying SPIRAL to a strong reasoning model (DeepSeek-R1-Distill-Qwen-7B) can still lead to 2.0% average improvement. These results demonstrate that zero-sum games naturally develop transferable reasoning capabilities, highlighting a promising direction for autonomous reasoning development.
△ Less
Submitted 30 June, 2025; v1 submitted 30 June, 2025;
originally announced June 2025.
-
Puzzles: Unbounded Video-Depth Augmentation for Scalable End-to-End 3D Reconstruction
Authors:
Jiahao Ma,
Lei Wang,
Miaomiao liu,
David Ahmedt-Aristizabal,
Chuong Nguyen
Abstract:
Multi-view 3D reconstruction remains a core challenge in computer vision. Recent methods, such as DUST3R and its successors, directly regress pointmaps from image pairs without relying on known scene geometry or camera parameters. However, the performance of these models is constrained by the diversity and scale of available training data. In this work, we introduce Puzzles, a data augmentation st…
▽ More
Multi-view 3D reconstruction remains a core challenge in computer vision. Recent methods, such as DUST3R and its successors, directly regress pointmaps from image pairs without relying on known scene geometry or camera parameters. However, the performance of these models is constrained by the diversity and scale of available training data. In this work, we introduce Puzzles, a data augmentation strategy that synthesizes an unbounded volume of high-quality posed video-depth data from a single image or video clip. By simulating diverse camera trajectories and realistic scene geometry through targeted image transformations, Puzzles significantly enhances data variety. Extensive experiments show that integrating Puzzles into existing video-based 3D reconstruction pipelines consistently boosts performance without modifying the underlying network architecture. Notably, models trained on only ten percent of the original data augmented with Puzzles still achieve accuracy comparable to those trained on the full dataset. Code is available at https://jiahao-ma.github.io/puzzles/.
△ Less
Submitted 30 June, 2025;
originally announced June 2025.
-
AutoEvoEval: An Automated Framework for Evolving Close-Ended LLM Evaluation Data
Authors:
JiaRu Wu,
Mingwei Liu
Abstract:
Large language models (LLMs) have shown remarkable performance on various tasks, but existing evaluation benchmarks are often static and insufficient to fully assess their robustness and generalization in realistic scenarios. Prior work using evolutionary or adversarial data augmentation has improved evaluation diversity but lacks systematic control over perturbation types and multi-step complexit…
▽ More
Large language models (LLMs) have shown remarkable performance on various tasks, but existing evaluation benchmarks are often static and insufficient to fully assess their robustness and generalization in realistic scenarios. Prior work using evolutionary or adversarial data augmentation has improved evaluation diversity but lacks systematic control over perturbation types and multi-step complexity, limiting comprehensive robustness analysis. To address these gaps, we propose AutoEvoEval, an evolution-based evaluation framework for close-ended tasks such as multi-choice question answering. AutoEvoEval introduces 22 interpretable atomic evolution operations and supports multi-round compositions, enabling controlled generation of diverse, challenging, and realistic test samples. We conduct extensive experiments addressing four research questions on a broad set of open- and closed-source LLMs. Our results show that atomic operations cause an average accuracy drop of 7.283\%, with structure-disrupting or misleading semantic edits causing the largest declines. Model sensitivities vary significantly for the same perturbation, and combining multiple evolution steps amplifies adversarial effects by up to 52.932\%. These findings suggest current benchmarks may overestimate true model generalization and emphasize the need for evolution-aware robustness evaluation. Code and resources are available at: https://github.com/SYSUSELab/AutoEvoEval.
△ Less
Submitted 30 June, 2025;
originally announced June 2025.
-
Probing the structure of exotic hadrons through correlation functions
Authors:
Yi-bo Shen,
Zhi-Wei Liu,
Jun-Xu Lu,
Ming-Zhu Liu,
Li-Sheng Geng
Abstract:
Over the past 20 years, many new hadron states have been discovered, but understanding their nature remains a key experimental and theoretical challenge. Recent studies have established that hadron-hadron interactions primarily govern the generation of new hadronic states, with their spectroscopy serving as a powerful tool for probing these interactions and determining the corresponding compositen…
▽ More
Over the past 20 years, many new hadron states have been discovered, but understanding their nature remains a key experimental and theoretical challenge. Recent studies have established that hadron-hadron interactions primarily govern the generation of new hadronic states, with their spectroscopy serving as a powerful tool for probing these interactions and determining the corresponding compositeness. In this work, we study four scenarios to determine the $DK$ interaction by reproducing the mass of the $D_{s0}^*(2317)$, i.e., assuming the $D_{s0}^*(2317)$ as a $DK$ molecule, a mixture of a $DK$ molecule and a bare state, a $DK-D_sη$ molecule, and a mixture of a $DK-D_sη$ molecule and a bare state. Using the $D^{0}K^{+}$ interactions derived from these scenarios, we predict the $D^{0}K^{+}$ correlation functions. Our results demonstrate that the inclusion of a bare state significantly alters the lineshape of the $D^{0}K^{+}$ correlation functions. These variations provide key physical observables to probe the internal structure of the $D_{s0}^*(2317)$. Using the shallow bound state candidate $X(3872)$ as input, we study the impact of including a bare state on the lineshape of the $D^0\bar{D}^{*0}$ correlation functions, revealing more pronounced variations. Our analysis provides critical insights into the internal structure of exotic hadronic states via momentum correlation measurements.
△ Less
Submitted 29 June, 2025;
originally announced June 2025.
-
The effectively optically thin accretion flow and its implication in supermassive black holes
Authors:
Mingjun Liu,
B. F. Liu,
Yilong Wang,
Huaqing Cheng,
Weimin Yuan
Abstract:
Based on a unified description of various accretion flows, we find a long-ignored solution - the effectively optically thin accretion flow, occurring at accretion rates around Eddington value. As a consequence of radiation-pressure dominance, the density in a standard thin disc (SSD) decreases with the increase of accretion rates, making the innermost region effectively optically thin. Further inc…
▽ More
Based on a unified description of various accretion flows, we find a long-ignored solution - the effectively optically thin accretion flow, occurring at accretion rates around Eddington value. As a consequence of radiation-pressure dominance, the density in a standard thin disc (SSD) decreases with the increase of accretion rates, making the innermost region effectively optically thin. Further increase in accretion rate leads to a rise of the temperature so that the Compton cooling is able to balance the accretion released energy. We demonstrate that the effectively optically thin flow is characterized by moderate temperature and large scattering optical depth, producing a multi-color Wien spectrum. For an appropriate accretion rate, the accretion flow transforms from an outer SSD to an inner effectively optically thin flow. Thus, the spectra of the whole accretion flow exhibit two components, i.e., a multi-color Wien spectrum at higher frequency and a multi-color blackbody, the former could provide an alternative origin of soft X-ray excess or formation of warm corona in active galactic nuclei (AGNs). Our stability analysis proves it is thermally stable and viscously unstable, indicating its existence in accreting systems. We show that effectively optically thin accretion flow exists in supermassive black holes for accretion rates around 0.1 to 10 times Eddington value, bridging the SSD at low accretion rates and slim disc at high rates. By comparing the predictions and average spectra of AGN, we constrain the viscosity parameter to be $α\sim 0.03$, in good agreement with that derived from observed variability.
△ Less
Submitted 28 June, 2025;
originally announced June 2025.
-
QuKAN: A Quantum Circuit Born Machine approach to Quantum Kolmogorov Arnold Networks
Authors:
Yannick Werner,
Akash Malemath,
Mengxi Liu,
Vitor Fortes Rey,
Nikolaos Palaiodimopoulos,
Paul Lukowicz,
Maximilian Kiefer-Emmanouilidis
Abstract:
Kolmogorov Arnold Networks (KANs), built upon the Kolmogorov Arnold representation theorem (KAR), have demonstrated promising capabilities in expressing complex functions with fewer neurons. This is achieved by implementing learnable parameters on the edges instead of on the nodes, unlike traditional networks such as Multi-Layer Perceptrons (MLPs). However, KANs potential in quantum machine learni…
▽ More
Kolmogorov Arnold Networks (KANs), built upon the Kolmogorov Arnold representation theorem (KAR), have demonstrated promising capabilities in expressing complex functions with fewer neurons. This is achieved by implementing learnable parameters on the edges instead of on the nodes, unlike traditional networks such as Multi-Layer Perceptrons (MLPs). However, KANs potential in quantum machine learning has not yet been well explored. In this work, we present an implementation of these KAN architectures in both hybrid and fully quantum forms using a Quantum Circuit Born Machine (QCBM). We adapt the KAN transfer using pre-trained residual functions, thereby exploiting the representational power of parametrized quantum circuits. In the hybrid model we combine classical KAN components with quantum subroutines, while the fully quantum version the entire architecture of the residual function is translated to a quantum model. We demonstrate the feasibility, interpretability and performance of the proposed Quantum KAN (QuKAN) architecture.
△ Less
Submitted 27 June, 2025;
originally announced June 2025.
-
RAUM-Net: Regional Attention and Uncertainty-aware Mamba Network
Authors:
Mingquan Liu
Abstract:
Fine Grained Visual Categorization (FGVC) remains a challenging task in computer vision due to subtle inter class differences and fragile feature representations. Existing methods struggle in fine grained scenarios, especially when labeled data is scarce. We propose a semi supervised method combining Mamba based feature modeling, region attention, and Bayesian uncertainty. Our approach enhances lo…
▽ More
Fine Grained Visual Categorization (FGVC) remains a challenging task in computer vision due to subtle inter class differences and fragile feature representations. Existing methods struggle in fine grained scenarios, especially when labeled data is scarce. We propose a semi supervised method combining Mamba based feature modeling, region attention, and Bayesian uncertainty. Our approach enhances local to global feature modeling while focusing on key areas during learning. Bayesian inference selects high quality pseudo labels for stability. Experiments show strong performance on FGVC benchmarks with occlusions, demonstrating robustness when labeled data is limited. Code is available at https://github.com/wxqnl/RAUM Net.
△ Less
Submitted 27 June, 2025;
originally announced June 2025.
-
Do Vision-Language Models Have Internal World Models? Towards an Atomic Evaluation
Authors:
Qiyue Gao,
Xinyu Pi,
Kevin Liu,
Junrong Chen,
Ruolan Yang,
Xinqi Huang,
Xinyu Fang,
Lu Sun,
Gautham Kishore,
Bo Ai,
Stone Tao,
Mengyang Liu,
Jiaxi Yang,
Chao-Jung Lai,
Chuanyang Jin,
Jiannan Xiang,
Benhao Huang,
Zeming Chen,
David Danks,
Hao Su,
Tianmin Shu,
Ziqiao Ma,
Lianhui Qin,
Zhiting Hu
Abstract:
Internal world models (WMs) enable agents to understand the world's state and predict transitions, serving as the basis for advanced deliberative reasoning. Recent large Vision-Language Models (VLMs), such as OpenAI o3, GPT-4o and Gemini, exhibit potential as general-purpose WMs. While the latest studies have evaluated and shown limitations in specific capabilities such as visual understanding, a…
▽ More
Internal world models (WMs) enable agents to understand the world's state and predict transitions, serving as the basis for advanced deliberative reasoning. Recent large Vision-Language Models (VLMs), such as OpenAI o3, GPT-4o and Gemini, exhibit potential as general-purpose WMs. While the latest studies have evaluated and shown limitations in specific capabilities such as visual understanding, a systematic evaluation of VLMs' fundamental WM abilities remains absent. Drawing on comparative psychology and cognitive science, we propose a two-stage framework that assesses Perception (visual, spatial, temporal, quantitative, and motion) and Prediction (mechanistic simulation, transitive inference, compositional inference) to provide an atomic evaluation of VLMs as WMs. Guided by this framework, we introduce WM-ABench, a large-scale benchmark comprising 23 fine-grained evaluation dimensions across 6 diverse simulated environments with controlled counterfactual simulations. Through 660 experiments on 15 latest commercial and open-source VLMs, we find that these models exhibit striking limitations in basic world modeling abilities. For instance, almost all models perform at near-random accuracy when distinguishing motion trajectories. Additionally, they lack disentangled understanding -- e.g., some models tend to believe blue objects move faster than green ones. More rich results and analyses reveal significant gaps between VLMs and human-level world modeling.
△ Less
Submitted 26 June, 2025;
originally announced June 2025.
-
Debunk and Infer: Multimodal Fake News Detection via Diffusion-Generated Evidence and LLM Reasoning
Authors:
Kaiying Yan,
Moyang Liu,
Yukun Liu,
Ruibo Fu,
Zhengqi Wen,
Jianhua Tao,
Xuefei Liu
Abstract:
The rapid spread of fake news across multimedia platforms presents serious challenges to information credibility. In this paper, we propose a Debunk-and-Infer framework for Fake News Detection(DIFND) that leverages debunking knowledge to enhance both the performance and interpretability of fake news detection. DIFND integrates the generative strength of conditional diffusion models with the collab…
▽ More
The rapid spread of fake news across multimedia platforms presents serious challenges to information credibility. In this paper, we propose a Debunk-and-Infer framework for Fake News Detection(DIFND) that leverages debunking knowledge to enhance both the performance and interpretability of fake news detection. DIFND integrates the generative strength of conditional diffusion models with the collaborative reasoning capabilities of multimodal large language models (MLLMs). Specifically, debunk diffusion is employed to generate refuting or authenticating evidence based on the multimodal content of news videos, enriching the evaluation process with diverse yet semantically aligned synthetic samples. To improve inference, we propose a chain-of-debunk strategy where a multi-agent MLLM system produces logic-grounded, multimodal-aware reasoning content and final veracity judgment. By jointly modeling multimodal features, generative debunking cues, and reasoning-rich verification within a unified architecture, DIFND achieves notable improvements in detection accuracy. Extensive experiments on the FakeSV and FVC datasets show that DIFND not only outperforms existing approaches but also delivers trustworthy decisions.
△ Less
Submitted 11 June, 2025;
originally announced June 2025.
-
Diverse polymorphs and phase transitions in van der Waals In$_2$Se$_3$
Authors:
Mingfeng Liu,
Jiantao Wang,
Peitao Liu,
Qiang Wang,
Zhibo Liu,
Yan Sun,
Xing-Qiu Chen
Abstract:
Van der Waals In$_2$Se$_3$ has garnered significant attention due to its unique properties and wide applications associated with its rich polymorphs and polymorphic phase transitions. Despite extensive studies, the vast complex polymorphic phase space remains largely unexplored, and the underlying microscopic mechanism for their phase transformations remains elusive. Here, we develop a highly accu…
▽ More
Van der Waals In$_2$Se$_3$ has garnered significant attention due to its unique properties and wide applications associated with its rich polymorphs and polymorphic phase transitions. Despite extensive studies, the vast complex polymorphic phase space remains largely unexplored, and the underlying microscopic mechanism for their phase transformations remains elusive. Here, we develop a highly accurate, efficient, and reliable machine-learning potential (MLP), which not only facilitates accurate exploration of the intricate potential energy surface (PES), but also enables us to conduct large-scale molecular dynamics (MD) simulations with first-principles accuracy. We identify the accurate structure of the $β''$ polymorph and uncover several previously unreported $β'$ polymorph variants exhibiting dynamic stability and competing energies, which are elucidated by characteristic flat imaginary phonon bands and the distinctive Mexican-hat-like PES in the $β$ polymorph. Through the MLP-accelerated MD simulations, we directly observe the polymorphic phase transformations among the $α$, $β$, $β'$, and $β''$ polymorphs under varying temperature and pressure conditions, and build for the first time an ab initio temperature-pressure phase diagram, showing good agreement with experiments. Furthermore, our MD simulations reveal a novel strain-induced reversible phase transition between the $β'$ and $β''$ polymorphs. This work not only unveils diverse polymorphs in van der Waals In$_2$Se$_3$, but also provides crucial atomic insights into their phase transitions, opening new avenues for the design of novel functional electronic devices.
△ Less
Submitted 26 June, 2025;
originally announced June 2025.
-
Transformer-Based Spatial-Temporal Counterfactual Outcomes Estimation
Authors:
He Li,
Haoang Chi,
Mingyu Liu,
Wanrong Huang,
Liyang Xu,
Wenjing Yang
Abstract:
The real world naturally has dimensions of time and space. Therefore, estimating the counterfactual outcomes with spatial-temporal attributes is a crucial problem. However, previous methods are based on classical statistical models, which still have limitations in performance and generalization. This paper proposes a novel framework for estimating counterfactual outcomes with spatial-temporal attr…
▽ More
The real world naturally has dimensions of time and space. Therefore, estimating the counterfactual outcomes with spatial-temporal attributes is a crucial problem. However, previous methods are based on classical statistical models, which still have limitations in performance and generalization. This paper proposes a novel framework for estimating counterfactual outcomes with spatial-temporal attributes using the Transformer, exhibiting stronger estimation ability. Under mild assumptions, the proposed estimator within this framework is consistent and asymptotically normal. To validate the effectiveness of our approach, we conduct simulation experiments and real data experiments. Simulation experiments show that our estimator has a stronger estimation capability than baseline methods. Real data experiments provide a valuable conclusion to the causal effect of conflicts on forest loss in Colombia. The source code is available at https://github.com/lihe-maxsize/DeppSTCI_Release_Version-master.
△ Less
Submitted 26 June, 2025;
originally announced June 2025.
-
Balmer Decrement and IRX Break in Tracing Dust Attenuation at Scales of Individual Star-forming Regions in NGC 628
Authors:
Man Qiao,
Mingfeng Liu,
Zongfei Lyu,
Shuang Liu,
Chao Yang,
Dong Dong Shi,
Fangxia An,
Zhizheng Pan,
Wenhao Liu,
Binyang Liu,
Run Wen,
Yu Heng Zhang,
Xian Zhong Zheng
Abstract:
We investigate the relationships between infrared excess (IRX=$L_{\rm IR}/L_{\rm UV}$) and Balmer decrement (${\rm H}α/{\rm H}β$) as indicators of dust attenuation for 609 ${\rm {H\,{\small II}}}$ regions at scales of $\sim 50-200$ pc in NGC 628, utilizing data from AstroSat, James Webb Space Telescope (JWST) and Multi Unit Spectroscopic Explorer (MUSE). Our findings indicate that about three fift…
▽ More
We investigate the relationships between infrared excess (IRX=$L_{\rm IR}/L_{\rm UV}$) and Balmer decrement (${\rm H}α/{\rm H}β$) as indicators of dust attenuation for 609 ${\rm {H\,{\small II}}}$ regions at scales of $\sim 50-200$ pc in NGC 628, utilizing data from AstroSat, James Webb Space Telescope (JWST) and Multi Unit Spectroscopic Explorer (MUSE). Our findings indicate that about three fifths of the sample ${\rm {H\,{\small II}}}$ regions reside within the regime occupied by local star-forming galaxies (SFGs) along the dust attenuation correlation described by their corresponding color excess parameters $E(B-V)_{\rm IRX} = 0.51\,E(B-V)_{{\rm H}α/{\rm H}β}$. Nearly 27$\%$ of the sample exhibits $E(B-V)_{\rm IRX}> E(B-V)_{{\rm H}α/{\rm H}β}$, while a small fraction ($\sim 13\%$) displays significantly lower $E(B-V)_{\rm IRX}$ compared to $E(B-V)_{{\rm H}α/{\rm H}β}$. These results suggest that the correlation between the two dust attenuation indicators no longer holds for spatially resolved ${\rm {H\,{\small II}}}$ regions. Furthermore, the ratio of $E(B-V)_{\rm IRX}$ to $E(B-V)_{{\rm H}α/{\rm H}β}$ remains unaffected by various physical parameters of the ${\rm {H\,{\small II}}}$ regions, including star formation rate (SFR), SFR surface density, infrared luminosity ($L_{\rm IR}$), $L_{\rm IR}$ surface density, stellar mass, gas-phase metallicity, circularized radius, and the distance to galactic center. We argue that the ratio is primarily influenced by the evolution of surrounding interstellar medium (ISM) of the star-forming regions, transitioning from an early dense and thick phase to the late blown-away stage.
△ Less
Submitted 26 June, 2025;
originally announced June 2025.
-
SuperSONIC: Cloud-Native Infrastructure for ML Inferencing
Authors:
Dmitry Kondratyev,
Benedikt Riedel,
Yuan-Tang Chou,
Miles Cochran-Branson,
Noah Paladino,
David Schultz,
Mia Liu,
Javier Duarte,
Philip Harris,
Shih-Chieh Hsu
Abstract:
The increasing computational demand from growing data rates and complex machine learning (ML) algorithms in large-scale scientific experiments has driven the adoption of the Services for Optimized Network Inference on Coprocessors (SONIC) approach. SONIC accelerates ML inference by offloading it to local or remote coprocessors to optimize resource utilization. Leveraging its portability to differe…
▽ More
The increasing computational demand from growing data rates and complex machine learning (ML) algorithms in large-scale scientific experiments has driven the adoption of the Services for Optimized Network Inference on Coprocessors (SONIC) approach. SONIC accelerates ML inference by offloading it to local or remote coprocessors to optimize resource utilization. Leveraging its portability to different types of coprocessors, SONIC enhances data processing and model deployment efficiency for cutting-edge research in high energy physics (HEP) and multi-messenger astrophysics (MMA). We developed the SuperSONIC project, a scalable server infrastructure for SONIC, enabling the deployment of computationally intensive tasks to Kubernetes clusters equipped with graphics processing units (GPUs). Using NVIDIA Triton Inference Server, SuperSONIC decouples client workflows from server infrastructure, standardizing communication, optimizing throughput, load balancing, and monitoring. SuperSONIC has been successfully deployed for the CMS and ATLAS experiments at the CERN Large Hadron Collider (LHC), the IceCube Neutrino Observatory (IceCube), and the Laser Interferometer Gravitational-Wave Observatory (LIGO) and tested on Kubernetes clusters at Purdue University, the National Research Platform (NRP), and the University of Chicago. SuperSONIC addresses the challenges of the Cloud-native era by providing a reusable, configurable framework that enhances the efficiency of accelerator-based inference deployment across diverse scientific domains and industries.
△ Less
Submitted 25 June, 2025;
originally announced June 2025.
-
VoxelOpt: Voxel-Adaptive Message Passing for Discrete Optimization in Deformable Abdominal CT Registration
Authors:
Hang Zhang,
Yuxi Zhang,
Jiazheng Wang,
Xiang Chen,
Renjiu Hu,
Xin Tian,
Gaolei Li,
Min Liu
Abstract:
Recent developments in neural networks have improved deformable image registration (DIR) by amortizing iterative optimization, enabling fast and accurate DIR results. However, learning-based methods often face challenges with limited training data, large deformations, and tend to underperform compared to iterative approaches when label supervision is unavailable. While iterative methods can achiev…
▽ More
Recent developments in neural networks have improved deformable image registration (DIR) by amortizing iterative optimization, enabling fast and accurate DIR results. However, learning-based methods often face challenges with limited training data, large deformations, and tend to underperform compared to iterative approaches when label supervision is unavailable. While iterative methods can achieve higher accuracy in such scenarios, they are considerably slower than learning-based methods. To address these limitations, we propose VoxelOpt, a discrete optimization-based DIR framework that combines the strengths of learning-based and iterative methods to achieve a better balance between registration accuracy and runtime. VoxelOpt uses displacement entropy from local cost volumes to measure displacement signal strength at each voxel, which differs from earlier approaches in three key aspects. First, it introduces voxel-wise adaptive message passing, where voxels with lower entropy receives less influence from their neighbors. Second, it employs a multi-level image pyramid with 27-neighbor cost volumes at each level, avoiding exponential complexity growth. Third, it replaces hand-crafted features or contrastive learning with a pretrained foundational segmentation model for feature extraction. In abdominal CT registration, these changes allow VoxelOpt to outperform leading iterative in both efficiency and accuracy, while matching state-of-the-art learning-based methods trained with label supervision. The source code will be available at https://github.com/tinymilky/VoxelOpt
△ Less
Submitted 24 June, 2025;
originally announced June 2025.
-
Follow-Up Exploration of the TWA 7 Planet-Disk System with JWST NIRCam
Authors:
Katie A. Crotts,
Aarynn L. Carter,
Kellen Lawson,
James Mang,
Beth Biller,
Mark Booth,
Rodrigo Ferrer-Chavez,
Julien H. Girard,
Anne-Marie Lagrange,
Michael C. Liu,
Sebastian Marino,
Maxwell A. Millar-Blanchaer,
Andy Skemer,
Giovanni M. Strampelli,
Jason Wang,
Olivier Absil,
William O. Balmer,
Raphaël Bendahan-West,
Ellis Bogat,
Rachel Bowens-Rubin,
Gaël Chauvin,
Clémence Fontanive,
Kyle Franson,
Jens Kammerer,
Jarron Leisenring
, et al. (17 additional authors not shown)
Abstract:
The young M-star TWA 7 hosts a bright and near face-on debris disk, which has been imaged from the optical to the submillimeter. The disk displays multiple complex substructures such as three disk components, a large dust clump, and spiral arms, suggesting the presence of planets to actively sculpt these features. The evidence for planets in this disk was further strengthened with the recent detec…
▽ More
The young M-star TWA 7 hosts a bright and near face-on debris disk, which has been imaged from the optical to the submillimeter. The disk displays multiple complex substructures such as three disk components, a large dust clump, and spiral arms, suggesting the presence of planets to actively sculpt these features. The evidence for planets in this disk was further strengthened with the recent detection of a point-source compatible with a Saturn-mass planet companion using JWST/MIRI at 11 $μ$m, at the location a planet was predicted to reside based on the disk morphology. In this paper, we present new observations of the TWA 7 system with JWST/NIRCam in the F200W and F444W filters. The disk is detected at both wavelengths and presents many of the same substructures as previously imaged, although we do not robustly detect the southern spiral arm. Furthermore, we detect two faint potential companions in the F444W filter at the 2-3$σ$ level. While one of these companions needs further followup to determine its nature, the other one coincides with the location of the planet candidate imaged with MIRI, providing further evidence that this source is a sub-Jupiter mass planet companion rather than a background galaxy. Such discoveries make TWA 7 only the second system, after $β$ Pictoris, in which a planet predicted by the debris disk morphology has been detected.
△ Less
Submitted 24 June, 2025;
originally announced June 2025.
-
What Matters in LLM-generated Data: Diversity and Its Effect on Model Fine-Tuning
Authors:
Yuchang Zhu,
Huazhen Zhong,
Qunshu Lin,
Haotong Wei,
Xiaolong Sun,
Zixuan Yu,
Minghao Liu,
Zibin Zheng,
Liang Chen
Abstract:
With the remarkable generative capabilities of large language models (LLMs), using LLM-generated data to train downstream models has emerged as a promising approach to mitigate data scarcity in specific domains and reduce time-consuming annotations. However, recent studies have highlighted a critical issue: iterative training on self-generated data results in model collapse, where model performanc…
▽ More
With the remarkable generative capabilities of large language models (LLMs), using LLM-generated data to train downstream models has emerged as a promising approach to mitigate data scarcity in specific domains and reduce time-consuming annotations. However, recent studies have highlighted a critical issue: iterative training on self-generated data results in model collapse, where model performance degrades over time. Despite extensive research on the implications of LLM-generated data, these works often neglect the importance of data diversity, a key factor in data quality. In this work, we aim to understand the implications of the diversity of LLM-generated data on downstream model performance. Specifically, we explore how varying levels of diversity in LLM-generated data affect downstream model performance. Additionally, we investigate the performance of models trained on data that mixes different proportions of LLM-generated data, which we refer to as synthetic data. Our experimental results show that, with minimal distribution shift, moderately diverse LLM-generated data can enhance model performance in scenarios with insufficient labeled data, whereas highly diverse generated data has a negative impact. We hope our empirical findings will offer valuable guidance for future studies on LLMs as data generators.
△ Less
Submitted 24 June, 2025; v1 submitted 23 June, 2025;
originally announced June 2025.
-
Precise Measurement of the $Λ$ Electric Dipole Moment through the Entangled Strange Baryon-Antibaryon System
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (696 additional authors not shown)
Abstract:
The dominance of matter over antimatter in the universe has consistently driven the pursuit of new physics beyond the Standard Model that violates charge-parity symmetry. Unlike the well-constrained electrons and neutrons, strange baryons (hyperons) remain a largely unexplored territory, in which interactions between hyperons and particles from new physics could induce a non-trivial electric dipol…
▽ More
The dominance of matter over antimatter in the universe has consistently driven the pursuit of new physics beyond the Standard Model that violates charge-parity symmetry. Unlike the well-constrained electrons and neutrons, strange baryons (hyperons) remain a largely unexplored territory, in which interactions between hyperons and particles from new physics could induce a non-trivial electric dipole moment (EDM). However, direct measurements of hyperon EDMs through spin precession are highly challenging due to their short lifetimes. In this paper, we present a novel method to extract the EDM of the lightest hyperon, $Λ$, using the entangled $Λ$$\overlineΛ$ system. Our result is consistent with zero, achieving a three-order-of-magnitude improvement over the previous upper limit established in the 1980s with comparable statistics, providing stringent constraints on potential new physics.
△ Less
Submitted 28 June, 2025; v1 submitted 23 June, 2025;
originally announced June 2025.
-
Phantom-Data : Towards a General Subject-Consistent Video Generation Dataset
Authors:
Zhuowei Chen,
Bingchuan Li,
Tianxiang Ma,
Lijie Liu,
Mingcong Liu,
Yi Zhang,
Gen Li,
Xinghui Li,
Siyu Zhou,
Qian He,
Xinglong Wu
Abstract:
Subject-to-video generation has witnessed substantial progress in recent years. However, existing models still face significant challenges in faithfully following textual instructions. This limitation, commonly known as the copy-paste problem, arises from the widely used in-pair training paradigm. This approach inherently entangles subject identity with background and contextual attributes by samp…
▽ More
Subject-to-video generation has witnessed substantial progress in recent years. However, existing models still face significant challenges in faithfully following textual instructions. This limitation, commonly known as the copy-paste problem, arises from the widely used in-pair training paradigm. This approach inherently entangles subject identity with background and contextual attributes by sampling reference images from the same scene as the target video. To address this issue, we introduce \textbf{Phantom-Data, the first general-purpose cross-pair subject-to-video consistency dataset}, containing approximately one million identity-consistent pairs across diverse categories. Our dataset is constructed via a three-stage pipeline: (1) a general and input-aligned subject detection module, (2) large-scale cross-context subject retrieval from more than 53 million videos and 3 billion images, and (3) prior-guided identity verification to ensure visual consistency under contextual variation. Comprehensive experiments show that training with Phantom-Data significantly improves prompt alignment and visual quality while preserving identity consistency on par with in-pair baselines.
△ Less
Submitted 23 June, 2025;
originally announced June 2025.
-
Benchmarking the Pedagogical Knowledge of Large Language Models
Authors:
Maxime Lelièvre,
Amy Waldock,
Meng Liu,
Natalia Valdés Aspillaga,
Alasdair Mackintosh,
María José Ogando Portela,
Jared Lee,
Paul Atherton,
Robin A. A. Ince,
Oliver G. B. Garrod
Abstract:
Benchmarks like Massive Multitask Language Understanding (MMLU) have played a pivotal role in evaluating AI's knowledge and abilities across diverse domains. However, existing benchmarks predominantly focus on content knowledge, leaving a critical gap in assessing models' understanding of pedagogy - the method and practice of teaching. This paper introduces The Pedagogy Benchmark, a novel dataset…
▽ More
Benchmarks like Massive Multitask Language Understanding (MMLU) have played a pivotal role in evaluating AI's knowledge and abilities across diverse domains. However, existing benchmarks predominantly focus on content knowledge, leaving a critical gap in assessing models' understanding of pedagogy - the method and practice of teaching. This paper introduces The Pedagogy Benchmark, a novel dataset designed to evaluate large language models on their Cross-Domain Pedagogical Knowledge (CDPK) and Special Education Needs and Disability (SEND) pedagogical knowledge. These benchmarks are built on a carefully curated set of questions sourced from professional development exams for teachers, which cover a range of pedagogical subdomains such as teaching strategies and assessment methods. Here we outline the methodology and development of these benchmarks. We report results for 97 models, with accuracies spanning a range from 28% to 89% on the pedagogical knowledge questions. We consider the relationship between cost and accuracy and chart the progression of the Pareto value frontier over time. We provide online leaderboards at https://rebrand.ly/pedagogy which are updated with new models and allow interactive exploration and filtering based on various model properties, such as cost per token and open-vs-closed weights, as well as looking at performance in different subjects. LLMs and generative AI have tremendous potential to influence education and help to address the global learning crisis. Education-focused benchmarks are crucial to measure models' capacities to understand pedagogical concepts, respond appropriately to learners' needs, and support effective teaching practices across diverse contexts. They are needed for informing the responsible and evidence-based deployment of LLMs and LLM-based tools in educational settings, and for guiding both development and policy decisions.
△ Less
Submitted 1 July, 2025; v1 submitted 23 June, 2025;
originally announced June 2025.
-
PERSCEN: Learning Personalized Interaction Pattern and Scenario Preference for Multi-Scenario Matching
Authors:
Haotong Du,
Yaqing Wang,
Fei Xiong,
Lei Shao,
Ming Liu,
Hao Gu,
Quanming Yao,
Zhen Wang
Abstract:
With the expansion of business scales and scopes on online platforms, multi-scenario matching has become a mainstream solution to reduce maintenance costs and alleviate data sparsity. The key to effective multi-scenario recommendation lies in capturing both user preferences shared across all scenarios and scenario-aware preferences specific to each scenario. However, existing methods often overloo…
▽ More
With the expansion of business scales and scopes on online platforms, multi-scenario matching has become a mainstream solution to reduce maintenance costs and alleviate data sparsity. The key to effective multi-scenario recommendation lies in capturing both user preferences shared across all scenarios and scenario-aware preferences specific to each scenario. However, existing methods often overlook user-specific modeling, limiting the generation of personalized user representations. To address this, we propose PERSCEN, an innovative approach that incorporates user-specific modeling into multi-scenario matching. PERSCEN constructs a user-specific feature graph based on user characteristics and employs a lightweight graph neural network to capture higher-order interaction patterns, enabling personalized extraction of preferences shared across scenarios. Additionally, we leverage vector quantization techniques to distil scenario-aware preferences from users' behavior sequence within individual scenarios, facilitating user-specific and scenario-aware preference modeling. To enhance efficient and flexible information transfer, we introduce a progressive scenario-aware gated linear unit that allows fine-grained, low-latency fusion. Extensive experiments demonstrate that PERSCEN outperforms existing methods. Further efficiency analysis confirms that PERSCEN effectively balances performance with computational cost, ensuring its practicality for real-world industrial systems.
△ Less
Submitted 23 June, 2025;
originally announced June 2025.
-
CEGA: A Cost-Effective Approach for Graph-Based Model Extraction and Acquisition
Authors:
Zebin Wang,
Menghan Lin,
Bolin Shen,
Ken Anderson,
Molei Liu,
Tianxi Cai,
Yushun Dong
Abstract:
Graph Neural Networks (GNNs) have demonstrated remarkable utility across diverse applications, and their growing complexity has made Machine Learning as a Service (MLaaS) a viable platform for scalable deployment. However, this accessibility also exposes GNN to serious security threats, most notably model extraction attacks (MEAs), in which adversaries strategically query a deployed model to const…
▽ More
Graph Neural Networks (GNNs) have demonstrated remarkable utility across diverse applications, and their growing complexity has made Machine Learning as a Service (MLaaS) a viable platform for scalable deployment. However, this accessibility also exposes GNN to serious security threats, most notably model extraction attacks (MEAs), in which adversaries strategically query a deployed model to construct a high-fidelity replica. In this work, we evaluate the vulnerability of GNNs to MEAs and explore their potential for cost-effective model acquisition in non-adversarial research settings. Importantly, adaptive node querying strategies can also serve a critical role in research, particularly when labeling data is expensive or time-consuming. By selectively sampling informative nodes, researchers can train high-performing GNNs with minimal supervision, which is particularly valuable in domains such as biomedicine, where annotations often require expert input. To address this, we propose a node querying strategy tailored to a highly practical yet underexplored scenario, where bulk queries are prohibited, and only a limited set of initial nodes is available. Our approach iteratively refines the node selection mechanism over multiple learning cycles, leveraging historical feedback to improve extraction efficiency. Extensive experiments on benchmark graph datasets demonstrate our superiority over comparable baselines on accuracy, fidelity, and F1 score under strict query-size constraints. These results highlight both the susceptibility of deployed GNNs to extraction attacks and the promise of ethical, efficient GNN acquisition methods to support low-resource research environments.
△ Less
Submitted 21 June, 2025;
originally announced June 2025.
-
Nuclear Cold QCD: Review and Future Strategy
Authors:
F. Arleo,
P. Caucal,
A. Deshpande,
J. M. Durham,
G. M. Innocenti,
J. Jalilian-Marian,
A. Kusina,
M. X. Liu,
Y. Mehtar-Tani,
C. -J. Naïm,
H. Paukkunen,
S. Platchkov,
F. Salazar,
I. Vitev,
R. Vogt
Abstract:
This review examines data from hadron-nucleus collisions, primarily focusing on hard processes like Drell-Yan, heavy flavor and quarkonium production. It highlights observed modifications of particle yields as functions of momentum and rapidity, aiming to clarify the underlying QCD effects in cold nuclear matter (CNM). The paper outlines strategies for future experiments, including the Electron-Io…
▽ More
This review examines data from hadron-nucleus collisions, primarily focusing on hard processes like Drell-Yan, heavy flavor and quarkonium production. It highlights observed modifications of particle yields as functions of momentum and rapidity, aiming to clarify the underlying QCD effects in cold nuclear matter (CNM). The paper outlines strategies for future experiments, including the Electron-Ion Collider (EIC), to distinguish between these effects. Key questions address the universality of suppression mechanisms and the role of non-perturbative physics, providing a road map for upcoming nuclear data.
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Enhancing Step-by-Step and Verifiable Medical Reasoning in MLLMs
Authors:
Haoran Sun,
Yankai Jiang,
Wenjie Lou,
Yujie Zhang,
Wenjie Li,
Lilong Wang,
Mianxin Liu,
Lei Liu,
Xiaosong Wang
Abstract:
Multimodal large language models (MLLMs) have begun to demonstrate robust reasoning capabilities on general tasks, yet their application in the medical domain remains in its early stages. Constructing chain-of-thought (CoT) training data is essential for bolstering the reasoning abilities of medical MLLMs. However, existing approaches exhibit a deficiency in offering a comprehensive framework for…
▽ More
Multimodal large language models (MLLMs) have begun to demonstrate robust reasoning capabilities on general tasks, yet their application in the medical domain remains in its early stages. Constructing chain-of-thought (CoT) training data is essential for bolstering the reasoning abilities of medical MLLMs. However, existing approaches exhibit a deficiency in offering a comprehensive framework for searching and evaluating effective reasoning paths towards critical diagnosis. To address this challenge, we propose Mentor-Intern Collaborative Search (MICS), a novel reasoning-path searching scheme to generate rigorous and effective medical CoT data. MICS first leverages mentor models to initialize the reasoning, one step at a time, then prompts each intern model to continue the thinking along those initiated paths, and finally selects the optimal reasoning path according to the overall reasoning performance of multiple intern models. The reasoning performance is determined by an MICS-Score, which assesses the quality of generated reasoning paths. Eventually, we construct MMRP, a multi-task medical reasoning dataset with ranked difficulty, and Chiron-o1, a new medical MLLM devised via a curriculum learning strategy, with robust visual question-answering and generalizable reasoning capabilities. Extensive experiments demonstrate that Chiron-o1, trained on our CoT dataset constructed using MICS, achieves state-of-the-art performance across a list of medical visual question answering and reasoning benchmarks. Codes are available at GitHub - manglu097/Chiron-o1: Enhancing Step-by-Step and Verifiable Medical Reasoning in MLLMs
△ Less
Submitted 20 June, 2025;
originally announced June 2025.
-
Identifying Ring Galaxies in DESI Legacy Imaging Surveys Using Machine Learning Methods
Authors:
Aina Zhang,
Xiaoming Kong,
Bowen Liu,
Nan Li,
Yude Bu,
Zhenping Yi,
Meng Liu
Abstract:
The formation and evolution of ring structures in galaxies are crucial for understanding the nature and distribution of dark matter, galactic interactions, and the internal secular evolution of galaxies. However, the limited number of existing ring galaxy catalogs has constrained deeper exploration in this field. To address this gap, we introduce a two-stage binary classification model based on th…
▽ More
The formation and evolution of ring structures in galaxies are crucial for understanding the nature and distribution of dark matter, galactic interactions, and the internal secular evolution of galaxies. However, the limited number of existing ring galaxy catalogs has constrained deeper exploration in this field. To address this gap, we introduce a two-stage binary classification model based on the Swin Transformer architecture to identify ring galaxies from the DESI Legacy Imaging Surveys. This model first selects potential candidates and then refines them in a second stage to improve classification accuracy. During model training, we investigated the impact of imbalanced datasets on the performance of the two-stage model. We experimented with various model combinations applied to the datasets of the DESI Legacy Imaging Surveys DR9, processing a total of 573,668 images with redshifts ranging from z_spec = 0.01-0.20 and magr <17.5. After applying the two-stage filtering and conducting visual inspections, the overall Precision of the models exceeded 64.87%, successfully identifying a total of 8052 newly discovered ring galaxies. With our catalog, the forthcoming spectroscopic data from DESI will facilitate a more comprehensive investigation into the formation and evolution of ring galaxies.
△ Less
Submitted 19 June, 2025;
originally announced June 2025.
-
DeepJ: Graph Convolutional Transformers with Differentiable Pooling for Patient Trajectory Modeling
Authors:
Deyi Li,
Zijun Yao,
Muxuan Liang,
Mei Liu
Abstract:
In recent years, graph learning has gained significant interest for modeling complex interactions among medical events in structured Electronic Health Record (EHR) data. However, existing graph-based approaches often work in a static manner, either restricting interactions within individual encounters or collapsing all historical encounters into a single snapshot. As a result, when it is necessary…
▽ More
In recent years, graph learning has gained significant interest for modeling complex interactions among medical events in structured Electronic Health Record (EHR) data. However, existing graph-based approaches often work in a static manner, either restricting interactions within individual encounters or collapsing all historical encounters into a single snapshot. As a result, when it is necessary to identify meaningful groups of medical events spanning longitudinal encounters, existing methods are inadequate in modeling interactions cross encounters while accounting for temporal dependencies. To address this limitation, we introduce Deep Patient Journey (DeepJ), a novel graph convolutional transformer model with differentiable graph pooling to effectively capture intra-encounter and inter-encounter medical event interactions. DeepJ can identify groups of temporally and functionally related medical events, offering valuable insights into key event clusters pertinent to patient outcome prediction. DeepJ significantly outperformed five state-of-the-art baseline models while enhancing interpretability, demonstrating its potential for improved patient risk stratification.
△ Less
Submitted 18 June, 2025;
originally announced June 2025.
-
OAgents: An Empirical Study of Building Effective Agents
Authors:
He Zhu,
Tianrui Qin,
King Zhu,
Heyuan Huang,
Yeyi Guan,
Jinxiang Xia,
Yi Yao,
Hanhao Li,
Ningning Wang,
Pai Liu,
Tianhao Peng,
Xin Gui,
Xiaowan Li,
Yuhui Liu,
Yuchen Eleanor Jiang,
Jun Wang,
Changwang Zhang,
Xiangru Tang,
Ge Zhang,
Jian Yang,
Minghao Liu,
Xitong Gao,
Jiaheng Liu,
Wangchunshu Zhou
Abstract:
Recently, Agentic AI has become an increasingly popular research field. However, we argue that current agent research practices lack standardization and scientific rigor, making it hard to conduct fair comparisons among methods. As a result, it is still unclear how different design choices in agent frameworks affect effectiveness, and measuring their progress remains challenging. In this work, we…
▽ More
Recently, Agentic AI has become an increasingly popular research field. However, we argue that current agent research practices lack standardization and scientific rigor, making it hard to conduct fair comparisons among methods. As a result, it is still unclear how different design choices in agent frameworks affect effectiveness, and measuring their progress remains challenging. In this work, we conduct a systematic empirical study on GAIA benchmark and BrowseComp to examine the impact of popular design choices in key agent components in a fair and rigorous manner. We find that the lack of a standard evaluation protocol makes previous works, even open-sourced ones, non-reproducible, with significant variance between random runs. Therefore, we introduce a more robust evaluation protocol to stabilize comparisons. Our study reveals which components and designs are crucial for effective agents, while others are redundant, despite seeming logical. Based on our findings, we build and open-source OAgents, a new foundation agent framework that achieves state-of-the-art performance among open-source projects. OAgents offers a modular design for various agent components, promoting future research in Agentic AI.
△ Less
Submitted 23 June, 2025; v1 submitted 17 June, 2025;
originally announced June 2025.
-
Measurements of the absolute branching fractions of the doubly Cabibbo-suppressed decays $D^+\to K^+π^0$, $D^+\to K^+η$ and $D^+\to K^+η^{\prime}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (697 additional authors not shown)
Abstract:
Using $20.3\,\rm fb^{-1}$ of $e^+e^-$ collision data collected at a center-of-mass energy of 3.773\,GeV with the BESIII detector, we present improved measurements of the absolute branching fractions of the doubly Cabibbo-suppressed decays $D^+\to K^+π^0$, $D^+\to K^+η$ and $ D^+ \to K^+ η^{\prime}$ with the double-tag method. The statistical significance of each signal decay exceeds $10σ$. The bra…
▽ More
Using $20.3\,\rm fb^{-1}$ of $e^+e^-$ collision data collected at a center-of-mass energy of 3.773\,GeV with the BESIII detector, we present improved measurements of the absolute branching fractions of the doubly Cabibbo-suppressed decays $D^+\to K^+π^0$, $D^+\to K^+η$ and $ D^+ \to K^+ η^{\prime}$ with the double-tag method. The statistical significance of each signal decay exceeds $10σ$. The branching fractions are determined to be ${\mathcal B}(D^+\to K^+ π^0) = (1.45 \pm 0.06 \pm 0.06)\times 10^{-4}$, ${\mathcal B}(D^+\to K^+ η) = (1.17 \pm 0.10 \pm 0.03)\times 10^{-4}$ and ${\mathcal B}(D^+\to K^+ η^{\prime}) = (1.88 \pm 0.15 \pm 0.06)\times 10^{-4}$, where the first uncertainties are statistical and the second systematic. These results are consistent with the world average values but with significantly improved precision.
△ Less
Submitted 18 June, 2025;
originally announced June 2025.
-
Determination of $|V_{cb}|$ using $B\to D\ellν_\ell$ Decays at Belle II
Authors:
Belle II Collaboration,
I. Adachi,
K. Adamczyk,
L. Aggarwal,
H. Ahmed,
Y. Ahn,
H. Aihara,
N. Akopov,
S. Alghamdi,
M. Alhakami,
A. Aloisio,
K. Amos,
M. Angelsmark,
N. Anh Ky,
C. Antonioli,
D. M. Asner,
H. Atmacan,
T. Aushev,
V. Aushev,
M. Aversano,
R. Ayad,
V. Babu,
H. Bae,
N. K. Baghel,
S. Bahinipati
, et al. (385 additional authors not shown)
Abstract:
We present a determination of the Cabibbo-Kobayashi-Maskawa matrix element $|V_{cb}|$ from the decay $B\to D\ellν_\ell$ using a $365~\mathrm{fb}^{-1}$ $e^+e^-\toΥ(4S)\to B\bar B$ data sample recorded by the Belle II experiment at the SuperKEKB collider. The semileptonic decay of one $B$ meson is reconstructed in the modes $B^0\to D^-(\to K^+π^-π^-)\ell^+ν_\ell$ and…
▽ More
We present a determination of the Cabibbo-Kobayashi-Maskawa matrix element $|V_{cb}|$ from the decay $B\to D\ellν_\ell$ using a $365~\mathrm{fb}^{-1}$ $e^+e^-\toΥ(4S)\to B\bar B$ data sample recorded by the Belle II experiment at the SuperKEKB collider. The semileptonic decay of one $B$ meson is reconstructed in the modes $B^0\to D^-(\to K^+π^-π^-)\ell^+ν_\ell$ and $B^+\to \bar D^0(\to K^+π^-)\ell^+ν_\ell$, where $\ell$ denotes either an electron or a muon. Charge conjugation is implied. The second $B$ meson in the $Υ(4S)$ event is not reconstructed explicitly. Using an inclusive reconstruction of the unobserved neutrino momentum, we determine the recoil variable $w=v_B\cdot v_D$, where $v_B$ and $v_D$ are the 4-velocities of the $B$ and $D$ mesons. We measure the total decay branching fractions to be $\mathcal{B}(B^0\to D^-\ell^+ν_\ell)=(2.06 \pm 0.05\,(\mathrm{stat.}) \pm 0.10\,(\mathrm{sys.}))\%$ and $\mathcal{B}(B^+\to\bar D^0\ell^+ν_\ell)=(2.31 \pm 0.04\,(\mathrm{stat.}) \pm 0.09\,(\mathrm{sys.}))\%$. We probe lepton flavor universality by measuring $\mathcal{B}(B\to Deν_e)/\mathcal{B}(B\to Dμν_μ)=1.020 \pm 0.020\,(\mathrm{stat.})\pm 0.022\,(\mathrm{sys.})$. Fitting the partial decay branching fraction as a function of $w$ and using the average of lattice QCD calculations of the $B\to D$ form factor, we obtain $ |V_{cb}|=(39.2\pm 0.4\,(\mathrm{stat.}) \pm 0.6\,(\mathrm{sys.}) \pm 0.5\,(\mathrm{th.})$.
△ Less
Submitted 18 June, 2025;
originally announced June 2025.
-
PFMBench: Protein Foundation Model Benchmark
Authors:
Zhangyang Gao,
Hao Wang,
Cheng Tan,
Chenrui Xu,
Mengdi Liu,
Bozhen Hu,
Linlin Chao,
Xiaoming Zhang,
Stan Z. Li
Abstract:
This study investigates the current landscape and future directions of protein foundation model research. While recent advancements have transformed protein science and engineering, the field lacks a comprehensive benchmark for fair evaluation and in-depth understanding. Since ESM-1B, numerous protein foundation models have emerged, each with unique datasets and methodologies. However, evaluations…
▽ More
This study investigates the current landscape and future directions of protein foundation model research. While recent advancements have transformed protein science and engineering, the field lacks a comprehensive benchmark for fair evaluation and in-depth understanding. Since ESM-1B, numerous protein foundation models have emerged, each with unique datasets and methodologies. However, evaluations often focus on limited tasks tailored to specific models, hindering insights into broader generalization and limitations. Specifically, researchers struggle to understand the relationships between tasks, assess how well current models perform across them, and determine the criteria in developing new foundation models. To fill this gap, we present PFMBench, a comprehensive benchmark evaluating protein foundation models across 38 tasks spanning 8 key areas of protein science. Through hundreds of experiments on 17 state-of-the-art models across 38 tasks, PFMBench reveals the inherent correlations between tasks, identifies top-performing models, and provides a streamlined evaluation protocol. Code is available at \href{https://github.com/biomap-research/PFMBench}{\textcolor{blue}{GitHub}}.
△ Less
Submitted 1 June, 2025;
originally announced June 2025.
-
Search for neutron decay into an antineutrino and a neutral kaon in 0.401 megaton-years exposure of Super-Kamiokande
Authors:
Super-Kamiokande Collaboration,
:,
K. Yamauchi,
K. Abe,
S. Abe,
Y. Asaoka,
M. Harada,
Y. Hayato,
K. Hiraide,
K. Hosokawa,
K. Ieki,
M. Ikeda,
J. Kameda,
Y. Kanemura,
Y. Kataoka,
S. Miki,
S. Mine,
M. Miura,
S. Moriyama,
M. Nakahata,
S. Nakayama,
Y. Noguchi,
G. Pronost,
K. Sato,
H. Sekiya
, et al. (240 additional authors not shown)
Abstract:
We searched for bound neutron decay via $n\to\barν+K^0$ predicted by the Grand Unified Theories in 0.401 Mton$\cdot$years exposure of all pure water phases in the Super-Kamiokande detector. About 4.4 times more data than in the previous search have been analyzed by a new method including a spectrum fit to kaon invariant mass distributions. No significant data excess has been observed in the signal…
▽ More
We searched for bound neutron decay via $n\to\barν+K^0$ predicted by the Grand Unified Theories in 0.401 Mton$\cdot$years exposure of all pure water phases in the Super-Kamiokande detector. About 4.4 times more data than in the previous search have been analyzed by a new method including a spectrum fit to kaon invariant mass distributions. No significant data excess has been observed in the signal regions. As a result of this analysis, we set a lower limit of $7.8\times10^{32}$ years on the neutron lifetime at a 90% confidence level.
△ Less
Submitted 17 June, 2025;
originally announced June 2025.