-
COSMIC: Clique-Oriented Semantic Multi-space Integration for Robust CLIP Test-Time Adaptation
Authors:
Fanding Huang,
Jingyan Jiang,
Qinting Jiang,
Hebei Li,
Faisal Nadeem Khan,
Zhi Wang
Abstract:
Recent vision-language models (VLMs) face significant challenges in test-time adaptation to novel domains. While cache-based methods show promise by leveraging historical information, they struggle with both caching unreliable feature-label pairs and indiscriminately using single-class information during querying, significantly compromising adaptation accuracy. To address these limitations, we propose COSMIC (Clique-Oriented Semantic Multi-space Integration for CLIP), a robust test-time adaptation framework that enhances adaptability through multi-granular, cross-modal semantic caching and graph-based querying mechanisms. Our framework introduces two key innovations: Dual Semantics Graph (DSG) and Clique Guided Hyper-class (CGH). The Dual Semantics Graph constructs complementary semantic spaces by incorporating textual features, coarse-grained CLIP features, and fine-grained DINOv2 features to capture rich semantic relationships. Building upon these dual graphs, the Clique Guided Hyper-class component leverages structured class relationships to enhance prediction robustness through correlated class selection. Extensive experiments demonstrate COSMIC's superior performance across multiple benchmarks, achieving significant improvements over state-of-the-art methods: 15.81% gain on out-of-distribution tasks and 5.33% on cross-domain generation with CLIP RN-50. Code is available at github.com/hf618/COSMIC.
Submitted 30 March, 2025;
originally announced March 2025.
-
ControlFusion: A Controllable Image Fusion Framework with Language-Vision Degradation Prompts
Authors:
Linfeng Tang,
Yeda Wang,
Zhanchuan Cai,
Junjun Jiang,
Jiayi Ma
Abstract:
Current image fusion methods struggle to address the composite degradations encountered in real-world imaging scenarios and lack the flexibility to accommodate user-specific requirements. In response to these challenges, we propose a controllable image fusion framework with language-vision prompts, termed ControlFusion, which adaptively neutralizes composite degradations. On the one hand, we develop a degraded imaging model that integrates physical imaging mechanisms, including the Retinex theory and atmospheric scattering principle, to simulate composite degradations, thereby providing potential for addressing real-world complex degradations from the data level. On the other hand, we devise a prompt-modulated restoration and fusion network that dynamically enhances features with degradation prompts, enabling our method to accommodate composite degradation of varying levels. Specifically, considering individual variations in quality perception of users, we incorporate a text encoder to embed user-specified degradation types and severity levels as degradation prompts. We also design a spatial-frequency collaborative visual adapter that autonomously perceives degradations in source images, thus eliminating the complete dependence on user instructions. Extensive experiments demonstrate that ControlFusion outperforms SOTA fusion methods in fusion quality and degradation handling, particularly in countering real-world and compound degradations with various levels.
Submitted 30 March, 2025;
originally announced March 2025.
-
The DUNE Phase II Detectors
Authors:
DUNE Collaboration,
A. Abed Abud,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
C. Adriano,
A. Aduszkiewicz,
J. Aguilar,
F. Akbar,
F. Alemanno,
N. S. Alex,
K. Allison,
M. Alrashed,
A. Alton,
R. Alvarez,
T. Alves,
A. Aman,
H. Amar,
P. Amedo,
J. Anderson,
D. A. Andrade
, et al. (1322 additional authors not shown)
Abstract:
The international collaboration designing and constructing the Deep Underground Neutrino Experiment (DUNE) at the Long-Baseline Neutrino Facility (LBNF) has developed a two-phase strategy for the implementation of this leading-edge, large-scale science project. The 2023 report of the US Particle Physics Project Prioritization Panel (P5) reaffirmed this vision and strongly endorsed DUNE Phase I and Phase II, as did the previous European Strategy for Particle Physics. The construction of DUNE Phase I is well underway. DUNE Phase II consists of a third and fourth far detector module, an upgraded near detector complex, and an enhanced > 2 MW beam. The fourth FD module is conceived as a 'Module of Opportunity', aimed at supporting the core DUNE science program while also expanding the physics opportunities with more advanced technologies. The DUNE collaboration is submitting four main contributions to the 2026 Update of the European Strategy for Particle Physics process. This submission to the 'Detector instrumentation' stream focuses on technologies and R&D for the DUNE Phase II detectors. Additional inputs related to the DUNE science program, DUNE software and computing, and European contributions to Fermilab accelerator upgrades and facilities for the DUNE experiment, are also being submitted to other streams.
Submitted 29 March, 2025;
originally announced March 2025.
-
The DUNE Science Program
Authors:
DUNE Collaboration,
A. Abed Abud,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
C. Adriano,
A. Aduszkiewicz,
J. Aguilar,
F. Akbar,
F. Alemanno,
N. S. Alex,
K. Allison,
M. Alrashed,
A. Alton,
R. Alvarez,
T. Alves,
A. Aman,
H. Amar,
P. Amedo,
J. Anderson,
D. A. Andrade
, et al. (1322 additional authors not shown)
Abstract:
The international collaboration designing and constructing the Deep Underground Neutrino Experiment (DUNE) at the Long-Baseline Neutrino Facility (LBNF) has developed a two-phase strategy for the implementation of this leading-edge, large-scale science project. The 2023 report of the US Particle Physics Project Prioritization Panel (P5) reaffirmed this vision and strongly endorsed DUNE Phase I and Phase II, as did the previous European Strategy for Particle Physics. The construction of DUNE Phase I is well underway. DUNE Phase II consists of a third and fourth far detector module, an upgraded near detector complex, and an enhanced > 2 MW beam. The fourth FD module is conceived as a 'Module of Opportunity', aimed at supporting the core DUNE science program while also expanding the physics opportunities with more advanced technologies. The DUNE collaboration is submitting four main contributions to the 2026 Update of the European Strategy for Particle Physics process. This submission to the 'Neutrinos and cosmic messengers', 'BSM physics' and 'Dark matter and dark sector' streams focuses on the physics program of DUNE. Additional inputs related to DUNE detector technologies and R&D, DUNE software and computing, and European contributions to Fermilab accelerator upgrades and facilities for the DUNE experiment, are also being submitted to other streams.
Submitted 29 March, 2025;
originally announced March 2025.
-
Updated model-independent measurement of the strong-phase differences between $D^0$ and $\bar{D}^0 \to K^{0}_{S/L}π^+π^-$ decays
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (696 additional authors not shown)
Abstract:
The strong-phase differences between $D^0\to K_{S/L}^0π^+π^-$ and $\bar{D}^0\to K_{S/L}^0π^+π^-$ decays are among the most important inputs in measuring the $C\!P$ violating angle $γ$ via $B^- \to D K^-$ decays. They also play a key role in studies of charm mixing and indirect $C\!P$ violation. In this paper, the strong-phase differences are determined in a model-independent way with quantum-correlated $D^0$-$\bar{D}^0$ decays from 7.93 fb$^{-1}$ of $e^+e^-$ annihilation data collected at $\sqrt{s}=3.773$ GeV by the BESIII experiment. These results are the most precise to date and are expected to significantly reduce associated uncertainties in determining the $C\!P$ violating angle $γ$ and related charm mixing parameters.
Submitted 18 April, 2025; v1 submitted 27 March, 2025;
originally announced March 2025.
-
OAEI-LLM-T: A TBox Benchmark Dataset for Understanding Large Language Model Hallucinations in Ontology Matching
Authors:
Zhangcheng Qiang,
Kerry Taylor,
Weiqing Wang,
Jing Jiang
Abstract:
Hallucinations are often inevitable in downstream tasks using large language models (LLMs). To tackle the substantial challenge of addressing hallucinations for LLM-based ontology matching (OM) systems, we introduce a new benchmark dataset OAEI-LLM-T. The dataset evolves from seven TBox datasets in the Ontology Alignment Evaluation Initiative (OAEI), capturing hallucinations of ten different LLMs performing OM tasks. These OM-specific hallucinations are organised into two primary categories and six sub-categories. We showcase the usefulness of the dataset in constructing an LLM leaderboard for OM tasks and for fine-tuning LLMs used in OM tasks.
Submitted 14 May, 2025; v1 submitted 25 March, 2025;
originally announced March 2025.
-
Machine Learning Assisted Modeling of Amorphous TiO$_2$-Doped GeO$_2$ for Advanced LIGO Mirror Coatings
Authors:
Jun Jiang,
Rui Zhang,
Kiran Prasai,
Riccardo Bassiri,
James N. Fry,
Martin M. Fejer,
Hai-Ping Cheng
Abstract:
The mechanical loss angle of amorphous TiO$_2$-doped GeO$_2$ can be lower than 10$^{-4}$, making it a candidate for Laser Interferometer Gravitational-wave Observatory (LIGO) mirror coatings. Amorphous oxides have complex atomic structures that are influenced by various factors, including doping concentration, preparation, and thermal history, resulting in different mass densities and physical properties. Modeling at the atomistic level captures these effects by generating atomic structure models according to experimental conditions. To obtain reliable and physical amorphous models at an affordable cost, we develop classical and machine-learning potentials (MLP) to speed up simulations. First-principles calculations are used to train and validate the MLP as well as to validate the structure models. To better reproduce properties such as the elastic modulus, the radial distribution function (RDF), and the variations in mass density of doped amorphous oxides, density functional theory (DFT) calculations are used to optimize the final models. We find that the mass densities of amorphous systems are correlated with the total void volume. The experimental mass density matches the models with the most symmetric potential energy wells under volume change. The elastic response of the metal-oxygen network is also studied. The 27\% TiO$_2$-doped GeO$_2$ system shows the fewest large changes in atom-atom distance, while for 44\% TiO$_2$-doped GeO$_2$, a majority of Ti-O distances change significantly. In response to strain, the metal-oxygen network at low mass densities prefers to adjust bond angles, while at high mass densities the adjustment occurs mainly through changes in atom-atom distances.
Submitted 27 March, 2025;
originally announced March 2025.
-
First observation of $Λ_{c}(2595)^{+} \to Λ^{+}_{c}π^0π^0$ and $Λ_{c}(2625)^{+}\to Λ^{+}_{c}π^0π^0$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (657 additional authors not shown)
Abstract:
By analysing $e^+e^-$ annihilation data corresponding to an integrated luminosity of 368.48 pb$^{-1}$ collected at the centre-of-mass energies of $\sqrt{s} = 4.918$ and $4.951$ GeV with the BESIII detector, we report the first observation of $Λ_{c}(2595)^{+}$ and $Λ_{c}(2625)^{+}\to Λ^{+}_{c}π^0π^0$ with statistical significances of 7.9$σ$ and 11.8$σ$, respectively. The branching fractions of $Λ_{c}(2595)^{+}$ and $Λ_{c}(2625)^{+}\to Λ^{+}_{c}π^0π^0$ are measured to be $(59.5 \pm 11.1_{\rm stat.} \pm 7.9_{\rm syst.}) \%$ and $(41.0 \pm 5.2_{\rm stat.} \pm 3.3_{\rm syst.}) \%$, respectively. The absolute branching fraction of $Λ_{c}(2595)^{+}$ is consistent, within uncertainty, with the expectation of the mechanism referred to as the threshold effect, proposed for the strong decays of $Λ_{c}(2595)^{+}$.
Submitted 27 March, 2025;
originally announced March 2025.
-
Numerical Study of Wheeler-Dewitt Equation beyond Slow-roll approximation
Authors:
Jie Jiang,
Deog Ki Hong,
Dong-han Yeom
Abstract:
The Wheeler-DeWitt (WDW) equation is analyzed using two boundary proposals: the Hartle-Hawking no-boundary condition and the tunneling condition. By compactifying the scale factor $a$ into $x = a/(1+a)$, we reformulate the WDW equation to find stable numerical solutions with clearer boundary conditions. The no-boundary wave function peaks at the horizon scale, indicating quantum nucleation of classical spacetime, while the tunneling solution shows exponential decay, reflecting vacuum decay from a classically forbidden state. These dynamics are explored separately under the slow-roll and non-slow-roll regimes of a periodic potential, with non-slow-roll scenarios amplifying quantum effects that delay the classical behavior. The results emphasize the role of boundary conditions in quantum cosmology, offering insights into the universe's origin and the interplay between quantum gravity and observable cosmology.
Submitted 27 March, 2025;
originally announced March 2025.
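The compactification quoted in the abstract above, $x = a/(1+a)$, maps the half-line $a \in [0, \infty)$ onto the finite interval $x \in [0, 1)$. The following is a minimal Python sketch of that change of variable only, assuming nothing beyond the formula itself; the function names and the chain-rule helper are illustrative and are not taken from the paper.

```python
def compactify(a: float) -> float:
    """Map a scale factor a in [0, inf) to x = a / (1 + a) in [0, 1)."""
    if a < 0:
        raise ValueError("scale factor must be non-negative")
    return a / (1.0 + a)


def decompactify(x: float) -> float:
    """Inverse map a = x / (1 - x), defined for 0 <= x < 1."""
    if not 0.0 <= x < 1.0:
        raise ValueError("x must lie in [0, 1)")
    return x / (1.0 - x)


def dx_da(x: float) -> float:
    """Chain-rule factor dx/da = (1 - x)**2, so that d/da = (1 - x)**2 * d/dx."""
    return (1.0 - x) ** 2


if __name__ == "__main__":
    # A uniform grid in x resolves small a finely and reaches large a in few points.
    n = 5
    for i in range(n):
        x = i / n
        print(f"x = {x:.2f}  ->  a = {decompactify(x):.4f}")
```

Because $1 - x = 1/(1+a)$, derivatives with respect to $a$ pick up the factor $(1-x)^2$, which is what any rewriting of a differential equation in $a$ in terms of $x$ would have to carry along.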
-
Percolation of both signs in a triangular-type 3D Ising model above $T_c$
Authors:
Jianping Jiang,
Sike Lang
Abstract:
Let $\mathbb{T}$ be the two-dimensional triangular lattice, and $\mathbb{Z}$ the one-dimensional integer lattice. Let $\mathbb{T}\times \mathbb{Z}$ denote the Cartesian product graph. Consider the Ising model defined on this graph with inverse temperature $β$ and external field $h$, and let $β_c$ be the critical inverse temperature when $h=0$. We prove that for each $β\in[0,β_c)$, there exists $h_c(β)>0$ such that both a unique infinite $+$cluster and a unique infinite $-$cluster coexist whenever $|h|<h_c(β)$. The same coexistence result also holds for the three-dimensional triangular lattice.
Submitted 27 March, 2025;
originally announced March 2025.
-
The Landé $g$ factors for the $6S_{1/2}$, $5D_{3/2,5/2}$ states of Ba$^{+}$ ions
Authors:
Bing-Bing Li,
Jun Jiang,
Lei Wu,
Deng-Hong Zhang,
Chen-Zhong Dong
Abstract:
The Landé $g$ factors of Ba$^+$ are very important in high-precision measurement physics. The wave functions, energy levels, and Landé $g$ factors for the $6s$ $^{2}S_{1/2}$ and $5d$ $^{2}D_{3/2,5/2}$ states of Ba$^{+}$ ions were calculated using the multi-configuration Dirac-Hartree-Fock (MCDHF) method and the Model-QED method. The contributions of the electron correlation effects and quantum electrodynamics (QED) effects were discussed in detail. The transition energies are in excellent agreement with the experimental results, with differences of approximately 5 cm$^{-1}$. The presently calculated $g$ factor of 2.0024905(16) for the $6S_{1/2}$ state agrees very well with the available experimental and theoretical results, with a difference at the level of 10$^{-6}$. For the $5D_{3/2, 5/2}$ states, the present results of 0.7993961(126) and 1.2003942(190) agree very well with the experimental results of 0.7993278(3) [Phys. Rev. A 54, 1199 (1996)] and 1.20036739(14) [Phys. Rev. Lett. 124, 193001 (2020)], with differences at the level of 10$^{-5}$.
Submitted 26 March, 2025;
originally announced March 2025.
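As a rough cross-check of the $g$ factors quoted in the abstract above, the textbook non-relativistic Landé formula in LS coupling, $g_J = 1 + \frac{J(J+1) + S(S+1) - L(L+1)}{2J(J+1)}$ (taking $g_L = 1$ and $g_S = 2$), gives the leading values that the MCDHF and QED corrections refine. The sketch below is not the paper's method; it only evaluates that formula and the differences between the calculated and experimental numbers quoted in the abstract. The function and variable names are illustrative.

```python
from fractions import Fraction


def lande_g(L, S, J) -> Fraction:
    """Non-relativistic Lande g factor in LS coupling, with g_L = 1 and g_S = 2."""
    L, S, J = Fraction(L), Fraction(S), Fraction(J)
    return 1 + (J * (J + 1) + S * (S + 1) - L * (L + 1)) / (2 * J * (J + 1))


half = Fraction(1, 2)

# (label, L, S, J, calculated value from the abstract, experimental value from the abstract)
# The abstract quotes no experimental number for 6S_1/2, hence None.
states = [
    ("6S_1/2", 0, half, half, 2.0024905, None),
    ("5D_3/2", 2, half, 3 * half, 0.7993961, 0.7993278),
    ("5D_5/2", 2, half, 5 * half, 1.2003942, 1.20036739),
]

if __name__ == "__main__":
    for label, L, S, J, calculated, experiment in states:
        g0 = lande_g(L, S, J)
        line = f"{label}: formula g = {g0} ({float(g0):.1f}), quoted calc = {calculated}"
        if experiment is not None:
            line += f", calc - exp = {calculated - experiment:.2e}"
        print(line)
```

The formula yields exactly 2, 4/5, and 6/5 for the three states, and the two calc-minus-experiment differences come out at about $6.8 \times 10^{-5}$ and $2.7 \times 10^{-5}$, in line with the "level of 10$^{-5}$" stated in the abstract.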
-
CTS-CBS: A New Approach for Multi-Agent Collaborative Task Sequencing and Path Finding
Authors:
Junkai Jiang,
Ruochen Li,
Yibin Yang,
Yihe Chen,
Yuning Wang,
Shaobing Xu,
Jianqiang Wang
Abstract:
This paper addresses a generalization of Multi-Agent Pathfinding (MAPF), called Collaborative Task Sequencing - Multi-Agent Pathfinding (CTS-MAPF), where agents must plan collision-free paths and visit a series of intermediate task locations in a specific order before reaching their final destinations. To address this problem, we propose a new approach, Collaborative Task Sequencing - Conflict-Based Search (CTS-CBS), which conducts a two-level search. At the high level, it generates a search forest, where each tree corresponds to a joint task sequence derived from the jTSP solution. At the low level, CTS-CBS performs constrained single-agent path planning to generate paths for each agent while adhering to high-level constraints. We also provide theoretical guarantees of its completeness and optimality (or sub-optimality with a bounded parameter). To evaluate the performance of CTS-CBS, we create two datasets, CTS-MAPF and MG-MAPF, and conduct comprehensive experiments. The results show that CTS-CBS adaptations for MG-MAPF outperform baseline algorithms in terms of success rate (up to 20 times larger) and runtime (up to 100 times faster), with less than a 10% sacrifice in solution quality. Furthermore, CTS-CBS offers flexibility by allowing users to adjust the sub-optimality bound omega to balance solution quality against efficiency. Finally, practical robot tests demonstrate the algorithm's applicability in real-world scenarios.
Submitted 26 March, 2025;
originally announced March 2025.
-
Search for the $B^+_c\rightarrow χ_{c1}(3872)π^+$ decay
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
M. Alexander,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis
, et al. (1124 additional authors not shown)
Abstract:
A search for the decay $B^+_c\rightarrow χ_{c1}(3872)π^+$ is reported using proton-proton collision data collected with the LHCb detector between 2011 and 2018 at centre-of-mass energies of 7, 8, and 13 TeV, corresponding to an integrated luminosity of 9 fb$^{-1}$. No significant signal is observed. Using the decay $B^+_c\rightarrow ψ(2S)π^+$ as a normalisation channel, an upper limit for the ratio of branching fractions $$ \mathcal{R}^{χ_{c1}(3872)}_{ψ(2S)} = \frac{\mathcal{B}_{B^+_c\rightarrow χ_{c1}(3872)π^+}}{\mathcal{B}_{B^+_c\rightarrow ψ(2S)π^+}} \times \frac{\mathcal{B}_{χ_{c1}(3872)\rightarrow J/ψπ^+π^-}}{\mathcal{B}_{ψ(2S)\rightarrow J/ψπ^+π^-}} < 0.05\,(0.06), $$ is set at the 90\,(95)\% confidence level.
Submitted 25 March, 2025;
originally announced March 2025.
-
FUSE: Label-Free Image-Event Joint Monocular Depth Estimation via Frequency-Decoupled Alignment and Degradation-Robust Fusion
Authors:
Pihai Sun,
Junjun Jiang,
Yuanqi Yao,
Youyu Chen,
Wenbo Zhao,
Kui Jiang,
Xianming Liu
Abstract:
Image-event joint depth estimation methods leverage complementary modalities for robust perception, yet face challenges in generalizability stemming from two factors: 1) limited annotated image-event-depth datasets causing insufficient cross-modal supervision, and 2) inherent frequency mismatches between static images and dynamic event streams with distinct spatiotemporal patterns, leading to ineffective feature fusion. To address this dual challenge, we propose Frequency-decoupled Unified Self-supervised Encoder (FUSE) with two synergistic components: The Parameter-efficient Self-supervised Transfer (PST) establishes cross-modal knowledge transfer through latent space alignment with image foundation models, effectively mitigating data scarcity by enabling joint encoding without depth ground truth. Complementing this, we propose the Frequency-Decoupled Fusion module (FreDFuse) to explicitly decouple high-frequency edge features from low-frequency structural components, resolving modality-specific frequency mismatches through physics-aware fusion. This combined approach enables FUSE to construct a universal image-event encoder that only requires lightweight decoder adaptation for target datasets. Extensive experiments demonstrate state-of-the-art performance with 14% and 24.9% improvements in Abs.Rel on MVSEC and DENSE datasets. The framework exhibits remarkable zero-shot adaptability to challenging scenarios including extreme lighting and motion blur, significantly advancing real-world deployment capabilities. The source code for our method is publicly available at: https://github.com/sunpihai-up/FUSE
Submitted 26 March, 2025; v1 submitted 25 March, 2025;
originally announced March 2025.
-
Measurement of the branching fractions of doubly Cabibbo-suppressed $D$ decays
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (648 additional authors not shown)
Abstract:
By analyzing $e^+e^-$ collision data collected at the center-of-mass energy of 3.773 GeV with the BESIII detector, corresponding to an integrated luminosity of 20.3 fb$^{-1}$, we measure the branching fractions of the doubly Cabibbo-suppressed (DCS) decays $D^0\to K^+π^-$, $D^0\to K^+π^-π^-π^+$, $D^0\to K^+π^-π^0$, $D^0\to K^+π^-π^0π^0$, $D^+\to K^+π^+π^-$, and $D^+\to K^+K^+K^-$. We also perform the first searches for $D^0\to K^+π^-η$, $D^0\to K^+π^-π^0η$, $D^+\to K^+π^+π^-η$, $D^{+} \to K^{+} \left(π^{+} π^{-} η\right)_{{\rm non}-η^{\prime}}$, and $D^+\to K^+ηη$ and report the first observations and evidence for some of these final states. Combining the measurements with the world averages of the corresponding Cabibbo-favored (CF) decays, we obtain the ratios of the DCS/CF branching fractions. For the $D^{+} \to K^{+} \left(π^{+} π^{-} η\right)_{{\rm non}-η^{\prime}}$ decay, the ratio is significantly larger than the corresponding ratios of the other DCS decays.
Submitted 25 March, 2025;
originally announced March 2025.
-
COB-GS: Clear Object Boundaries in 3DGS Segmentation Based on Boundary-Adaptive Gaussian Splitting
Authors:
Jiaxin Zhang,
Junjun Jiang,
Youyu Chen,
Kui Jiang,
Xianming Liu
Abstract:
Accurate object segmentation is crucial for high-quality scene understanding in the 3D vision domain. However, 3D segmentation based on 3D Gaussian Splatting (3DGS) struggles to delineate object boundaries accurately, as Gaussian primitives often span object edges due to their inherent volume and the lack of semantic guidance during training. To tackle these challenges, we introduce Clear Object Boundaries for 3DGS Segmentation (COB-GS), which aims to improve segmentation accuracy by clearly delineating blurry boundaries of interwoven Gaussian primitives within the scene. Unlike existing approaches that remove ambiguous Gaussians and sacrifice visual quality, COB-GS, as a 3DGS refinement method, jointly optimizes semantic and visual information, allowing the two different levels to cooperate with each other effectively. Specifically, for the semantic guidance, we introduce a boundary-adaptive Gaussian splitting technique that leverages semantic gradient statistics to identify and split ambiguous Gaussians, aligning them closely with object boundaries. For the visual optimization, we rectify the degraded suboptimal texture of the 3DGS scene, particularly along the refined boundary structures. Experimental results show that COB-GS substantially improves segmentation accuracy and robustness against inaccurate masks from pre-trained models, yielding clear boundaries while preserving high visual quality. Code is available at https://github.com/ZestfulJX/COB-GS.
Submitted 26 March, 2025; v1 submitted 25 March, 2025;
originally announced March 2025.
-
RGL: A Graph-Centric, Modular Framework for Efficient Retrieval-Augmented Generation on Graphs
Authors:
Yuan Li,
Jun Hu,
Jiaxin Jiang,
Zemin Liu,
Bryan Hooi,
Bingsheng He
Abstract:
Recent advances in graph learning have paved the way for innovative retrieval-augmented generation (RAG) systems that leverage the inherent relational structures in graph data. However, many existing approaches suffer from rigid, fixed settings and significant engineering overhead, limiting their adaptability and scalability. Additionally, the RAG community has largely overlooked the decades of research in the graph database community regarding the efficient retrieval of interesting substructures on large-scale graphs. In this work, we introduce the RAG-on-Graphs Library (RGL), a modular framework that seamlessly integrates the complete RAG pipeline (from efficient graph indexing and dynamic node retrieval to subgraph construction, tokenization, and final generation) into a unified system. RGL addresses key challenges by supporting a variety of graph formats and integrating optimized implementations for essential components, achieving speedups of up to 143x compared to conventional methods. Moreover, its flexible utilities, such as dynamic node filtering, allow for rapid extraction of pertinent subgraphs while reducing token consumption. Our extensive evaluations demonstrate that RGL not only accelerates the prototyping process but also enhances the performance and applicability of graph-based RAG systems across a range of tasks.
Submitted 24 March, 2025;
originally announced March 2025.
-
BIMII-Net: Brain-Inspired Multi-Iterative Interactive Network for RGB-T Road Scene Semantic Segmentation
Authors:
Hanshuo Qiu,
Jie Jiang,
Ruoli Yang,
Lixin Zhan,
Jizhao Liu
Abstract:
RGB-T road scene semantic segmentation enhances visual scene understanding in complex environments characterized by inadequate illumination or occlusion by fusing information from RGB and thermal images. Nevertheless, existing RGB-T semantic segmentation models typically depend on simple addition or concatenation strategies or ignore the differences between information at different levels. To address these issues, we proposed a novel RGB-T road scene semantic segmentation network called Brain-Inspired Multi-Iteration Interaction Network (BIMII-Net). First, to meet the requirements of accurate texture and local information extraction in road scenarios like autonomous driving, we proposed a deep continuous-coupled neural network (DCCNN) architecture based on a brain-inspired model. Second, to enhance the interaction and expression capabilities among multi-modal information, we designed a cross explicit attention-enhanced fusion module (CEAEF-Module) in the feature fusion stage of BIMII-Net to effectively integrate features at different levels. Finally, we constructed a complementary interactive multi-layer decoder structure, incorporating the shallow-level feature iteration module (SFI-Module), the deep-level feature iteration module (DFI-Module), and the multi-feature enhancement module (MFE-Module) to collaboratively extract texture details and global skeleton information, with multi-module joint supervision further optimizing the segmentation results. Experimental results demonstrate that BIMII-Net achieves state-of-the-art (SOTA) performance in the brain-inspired computing domain and outperforms most existing RGB-T semantic segmentation methods. It also exhibits strong generalization capabilities on multiple RGB-T datasets, proving the effectiveness of brain-inspired computer models in multi-modal image segmentation tasks.
Submitted 24 March, 2025;
originally announced March 2025.
-
RomanTex: Decoupling 3D-aware Rotary Positional Embedded Multi-Attention Network for Texture Synthesis
Authors:
Yifei Feng,
Mingxin Yang,
Shuhui Yang,
Sheng Zhang,
Jiaao Yu,
Zibo Zhao,
Yuhong Liu,
Jie Jiang,
Chunchao Guo
Abstract:
Painting textures for existing geometries is a critical yet labor-intensive process in 3D asset generation. Recent advancements in text-to-image (T2I) models have led to significant progress in texture generation. Most existing research approaches this task by first generating images in 2D spaces using image diffusion models, followed by a texture baking process to achieve UV texture. However, these methods often struggle to produce high-quality textures due to inconsistencies among the generated multi-view images, resulting in seams and ghosting artifacts. In contrast, 3D-based texture synthesis methods aim to address these inconsistencies, but they often neglect 2D diffusion model priors, making them challenging to apply to real-world objects. To overcome these limitations, we propose RomanTex, a multiview-based texture generation framework that integrates a multi-attention network with an underlying 3D representation, facilitated by our novel 3D-aware Rotary Positional Embedding. Additionally, we incorporate a decoupling characteristic in the multi-attention block to enhance the model's robustness in the image-to-texture task, enabling semantically-correct back-view synthesis. Furthermore, we introduce a geometry-related Classifier-Free Guidance (CFG) mechanism to further improve the alignment with both geometries and images. Quantitative and qualitative evaluations, along with comprehensive user studies, demonstrate that our method achieves state-of-the-art results in texture quality and consistency.
Submitted 24 March, 2025;
originally announced March 2025.
-
SplitFrozen: Split Learning with Device-side Model Frozen for Fine-Tuning LLM on Heterogeneous Resource-Constrained Devices
Authors:
Jian Ma,
Xinchen Lyu,
Jun Jiang,
Qimei Cui,
Haipeng Yao,
Xiaofeng Tao
Abstract:
Fine-tuning large language models (LLMs) on private, on-device data can empower tailored personalized AI agents. However, fine-tuning LLMs on resource-constrained edge devices faces significant challenges, including excessive computation overhead, device heterogeneity, and data imbalance. This paper proposes SplitFrozen, a split learning framework that enables efficient LLM fine-tuning by strategically freezing device-side model layers while centralizing parameter-efficient fine-tuning on the server. Our framework partitions LLMs into device-side frozen layers and server-side fine-tuning layers, where heterogeneous resource-constrained devices execute only forward propagation. To minimize server-side training costs, we integrate Low-Rank Adaptation (LoRA) into the server-side layers. A pipeline parallelism strategy further optimizes training efficiency by decoupling device-server computations and leveraging decomposed backward propagation. Experiments on GPT-2 with the MRPC, MNLI-matched, and SST-2 datasets demonstrate that SplitFrozen outperforms FedLoRA and SplitLoRA by 69.4\% model accuracy under extremely imbalanced data, while reducing device-side computations by up to 86.8\% and total training time by up to 50.2\%. Experiments also validate the scalability of SplitFrozen on a content generation task using the Llama-3.2 model on the GSM8K dataset.
Submitted 23 March, 2025;
originally announced March 2025.
-
Observation of the decay $ψ(3686)\rightarrow Σ^{0}\barΣ^{0}ω$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (695 additional authors not shown)
Abstract:
Using a dataset of $(27.12\pm 0.14)\times 10^{8}$ $ψ(3686)$ events collected by the BESIII detector operating at the BEPCII collider, we report the first observation of the decay $ψ(3686)\toΣ^{0}\barΣ^{0}ω$ with a statistical significance of 8.9$σ$. The measured branching fraction is $(1.24 \pm 0.16_{\textrm{stat}} \pm 0.11_{\textrm{sys}}) \times 10^{-5}$, where the first uncertainty is statistical and the second is systematic. Additionally, we investigate potential intermediate states in the invariant mass distributions of $Σ^{0}ω$, $\barΣ^{0}ω$ and $Σ^{0}\barΣ^{0}$. A hint of a resonance is observed in the invariant mass distribution of $M_{Σ^{0}(\barΣ^{0})ω}$, located around 2.06 GeV/$c^2$, with a significance of 2.5$σ$.
Submitted 24 March, 2025;
originally announced March 2025.
-
DashGaussian: Optimizing 3D Gaussian Splatting in 200 Seconds
Authors:
Youyu Chen,
Junjun Jiang,
Kui Jiang,
Xiao Tang,
Zhihao Li,
Xianming Liu,
Yinyu Nie
Abstract:
3D Gaussian Splatting (3DGS) renders pixels by rasterizing Gaussian primitives, where the rendering resolution and the primitive number, concluded as the optimization complexity, dominate the time cost in primitive optimization. In this paper, we propose DashGaussian, a scheduling scheme over the optimization complexity of 3DGS that strips redundant complexity to accelerate 3DGS optimization. Specifically, we formulate 3DGS optimization as progressively fitting 3DGS to higher levels of frequency components in the training views, and propose a dynamic rendering resolution scheme that largely reduces the optimization complexity based on this formulation. Besides, we argue that a specific rendering resolution should cooperate with a proper primitive number for a better balance between computing redundancy and fitting quality, where we schedule the growth of the primitives to synchronize with the rendering resolution. Extensive experiments show that our method accelerates the optimization of various 3DGS backbones by 45.7% on average while preserving the rendering quality.
Submitted 26 March, 2025; v1 submitted 24 March, 2025;
originally announced March 2025.
-
Cat-AIR: Content and Task-Aware All-in-One Image Restoration
Authors:
Jiachen Jiang,
Tianyu Ding,
Ke Zhang,
Jinxin Zhou,
Tianyi Chen,
Ilya Zharkov,
Zhihui Zhu,
Luming Liang
Abstract:
All-in-one image restoration seeks to recover high-quality images from various types of degradation using a single model, without prior knowledge of the corruption source. However, existing methods often struggle to effectively and efficiently handle multiple degradation types. We present Cat-AIR, a novel \textbf{C}ontent \textbf{A}nd \textbf{T}ask-aware framework for \textbf{A}ll-in-one \textbf{I}mage \textbf{R}estoration. Cat-AIR incorporates an alternating spatial-channel attention mechanism that adaptively balances the local and global information for different tasks. Specifically, we introduce cross-layer channel attentions and cross-feature spatial attentions that allocate computations based on content and task complexity. Furthermore, we propose a smooth learning strategy that allows for seamless adaptation to new restoration tasks while maintaining performance on existing ones. Extensive experiments demonstrate that Cat-AIR achieves state-of-the-art results across a wide range of restoration tasks, requiring fewer FLOPs than previous methods, establishing new benchmarks for efficient all-in-one image restoration.
Submitted 22 March, 2025;
originally announced March 2025.
-
TreeSynth: Synthesizing Diverse Data from Scratch via Tree-Guided Subspace Partitioning
Authors:
Sheng Wang,
Pengan Chen,
Jingqi Zhou,
Qintong Li,
Jingwei Dong,
Jiahui Gao,
Boyang Xue,
Jiyue Jiang,
Lingpeng Kong,
Chuan Wu
Abstract:
Model customization requires high-quality and diverse datasets, but acquiring such data remains challenging and costly. Although large language models (LLMs) can synthesize training data, current approaches are constrained by limited seed data, model bias and insufficient control over the generation process, resulting in limited diversity and biased distribution with the increase of data scales. To tackle this challenge, we present TreeSynth, a tree-guided subspace-based data synthesis framework that recursively partitions the entire data space into hierarchical subspaces, enabling comprehensive and diverse scaling of data synthesis. Briefly, given a task-specific description, we construct a data space partitioning tree by iteratively executing criteria determination and subspace coverage steps. This hierarchically divides the whole space (i.e., root node) into mutually exclusive and complementary atomic subspaces (i.e., leaf nodes). By collecting synthesized data according to the attributes of each leaf node, we obtain a diverse dataset that fully covers the data space. Empirically, our extensive experiments demonstrate that TreeSynth surpasses both human-designed datasets and the state-of-the-art data synthesis baselines, achieving maximum improvements of 45.2% in data diversity and 17.6% in downstream task performance across various models and tasks. Hopefully, TreeSynth provides a scalable solution to synthesize diverse and comprehensive datasets from scratch without human intervention.
Submitted 21 March, 2025;
originally announced March 2025.
-
Stringent test of $CP$ symmetry in $Σ^+$ hyperon decays
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (680 additional authors not shown)
Abstract:
The non-leptonic two-body weak decays $Σ^{+} \to p π^{0}$ and $\barΣ^{-} \to \bar{p} π^{0}$ are investigated, utilizing $(1.0087\pm0.0044)\times10^{10}$ $J/ψ$ events and $(2.7124\pm0.0143)\times10^{9}$ $ψ(3686)$ events collected by BESIII experiment. The precision of the weak-decay parameters for the decays $Σ^{+} \to p π^{0}$ ($α_{0}$) and $\barΣ^{-} \to \bar{p} π^{0}$ ($\barα_{0}$) is improved by a factor of three compared to the previous world average. Furthermore, the quantum-entangled $Σ^{+}\barΣ^{-}$ system enables the most precise test of $CP$ symmetry for the decay $Σ^+\to pπ^0$, through the asymmetry observable $A_{CP}=(α_{0}+\barα_{0})/(α_{0}-\barα_{0})$ that is measured to be $-0.0118\pm0.0083_{\rm stat}\pm0.0028_{\rm syst}$. Assuming $CP$ conservation, the average decay parameter is determined to be ${\left< α_{\rm 0}\right>} = (α_0-\barα_0)/2=-0.9869\pm0.0011_{\rm stat}\pm0.0016_{\rm syst}$, which is the most precise measurement of the asymmetry decay parameters in baryon sectors. The angular dependence of the ratio of the polarization of the $Σ^+$ in both $J/ψ$ and $ψ(3686)$ decays is studied for the first time.
Submitted 21 March, 2025;
originally announced March 2025.
-
Observation of charge-parity symmetry breaking in baryon decays
Authors:
LHCb collaboration,
R. Aaij,
A. S. W. Abdelmotteleb,
C. Abellan Beteta,
F. Abudinén,
T. Ackernley,
A. A. Adefisoye,
B. Adeva,
M. Adinolfi,
P. Adlarson,
C. Agapopoulou,
C. A. Aidala,
Z. Ajaltouni,
S. Akar,
K. Akiba,
P. Albicocco,
J. Albrecht,
F. Alessio,
Z. Aliouche,
P. Alvarez Cartelle,
R. Amalric,
S. Amato,
J. L. Amey,
Y. Amhis,
L. An
, et al. (1132 additional authors not shown)
Abstract:
The Standard Model of particle physics, the theory of particles and interactions at the smallest scale, predicts that matter and antimatter interact differently due to violation of the combined symmetry of charge conjugation ($C$) and parity ($P$). Charge conjugation transforms particles into their antimatter particles, while the parity transformation inverts spatial coordinates. This prediction applies to both mesons, which consist of a quark and an antiquark, and baryons, which are composed of three quarks. However, despite having been discovered in various meson decays, $CP$ violation has yet to be observed in baryons, the type of matter that makes up the observable Universe. This article reports a study of the decay of the beauty baryon $Λ^{0}_{b}$ to the $p K^{-} π^{+}π^{-}$ final state and its $CP$-conjugated process, using data collected by the LHCb (Large Hadron Collider beauty) experiment at CERN. The results reveal significant asymmetries between the decay rates of the $Λ^{0}_{b}$ baryon and its $CP$-conjugated antibaryon, marking the first observation of $CP$ violation in baryon decays, thus demonstrating the different behaviour of baryons and antibaryons. In the Standard Model, $CP$ violation arises from the Cabibbo-Kobayashi-Maskawa mechanism, while new forces or particles beyond the Standard Model could provide additional contributions. This discovery opens a new path to search for physics beyond the Standard Model.
Submitted 21 March, 2025;
originally announced March 2025.
-
Unleashing Vecset Diffusion Model for Fast Shape Generation
Authors:
Zeqiang Lai,
Yunfei Zhao,
Zibo Zhao,
Haolin Liu,
Fuyun Wang,
Huiwen Shi,
Xianghui Yang,
Qingxiang Lin,
Jingwei Huang,
Yuhong Liu,
Jie Jiang,
Chunchao Guo,
Xiangyu Yue
Abstract:
3D shape generation has greatly flourished through the development of so-called "native" 3D diffusion, particularly through the Vecset Diffusion Model (VDM). While recent advancements have shown promising results in generating high-resolution 3D shapes, VDM still struggles with high-speed generation. Challenges exist because of difficulties not only in accelerating diffusion sampling but also VAE decoding in VDM, areas under-explored in previous works. To address these challenges, we present FlashVDM, a systematic framework for accelerating both VAE and DiT in VDM. For DiT, FlashVDM enables flexible diffusion sampling with as few as 5 inference steps and comparable quality, which is made possible by stabilizing consistency distillation with our newly introduced Progressive Flow Distillation. For VAE, we introduce a lightning vecset decoder equipped with Adaptive KV Selection, Hierarchical Volume Decoding, and Efficient Network Design. By exploiting the locality of the vecset and the sparsity of shape surface in the volume, our decoder drastically lowers FLOPs, minimizing the overall decoding overhead. We apply FlashVDM to Hunyuan3D-2 to obtain Hunyuan3D-2 Turbo. Through systematic evaluation, we show that our model significantly outperforms existing fast 3D generation methods, achieving comparable performance to the state-of-the-art while reducing inference time by over 45x for reconstruction and 32x for generation. Code and models are available at https://github.com/Tencent/FlashVDM.
Submitted 26 March, 2025; v1 submitted 20 March, 2025;
originally announced March 2025.
-
Search for the radiative leptonic decay $D^+\toγe^+ν_e$ with Deep Learning
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (680 additional authors not shown)
Abstract:
Using 20.3$~\rm fb^{-1}$ of $e^+e^-$ annihilation data collected at a center-of-mass energy of 3.773$~\rm GeV$ with the BESIII detector, we report an improved search for the radiative leptonic decay $D^+\toγe^+ν_e$. An upper limit on its partial branching fraction for photon energies $E_γ>10~\rm MeV$ is determined to be $1.2\times10^{-5}$ at 90\% confidence level, which excludes most current theoretical predictions. A sophisticated deep learning approach with thorough validation, based on the Transformer architecture, is implemented to efficiently distinguish the signal from massive backgrounds.
Submitted 20 March, 2025;
originally announced March 2025.
-
Multispectral radiation temperature inversion based on Transformer-LSTM-SVM
Authors:
Ying Cui,
Kongxin Qiu,
Shan Gao,
Hailong Liu,
Rongyan Gao,
Liwei Chen,
Zezhan Zhang,
Jing Jiang,
Yi Niu,
Chao Wang
Abstract:
The key challenge in multispectral radiation thermometry is accurately measuring emissivity. Traditional constrained optimization methods often fail to meet practical requirements in terms of precision, efficiency, and noise resistance. However, the continuous advancement of neural networks in data processing offers a potential solution to this issue. This paper presents a multispectral radiation thermometry algorithm that combines Transformer, LSTM (Long Short-Term Memory), and SVM (Support Vector Machine) to mitigate the impact of emissivity, thereby enhancing accuracy and noise resistance. In simulations, compared to the BP neural network algorithm, GIM-LSTM, and Transformer-LSTM algorithms, the Transformer-LSTM-SVM algorithm demonstrates an improvement in accuracy of 1.23%, 0.46% and 0.13%, respectively, without noise. When 5% random noise is added, the accuracy increases by 1.39%, 0.51%, and 0.38%, respectively. Finally, experiments confirmed that the maximum temperature error using this method is less than 1%, indicating that the algorithm offers high accuracy, fast processing speed, and robust noise resistance. These characteristics make it well-suited for real-time high-temperature measurements with multi-wavelength thermometry equipment.
Submitted 19 March, 2025;
originally announced March 2025.
-
A constraint on superheavy elements of the GRB-kilonova AT 2023vfi
Authors:
Zhengyan Liu,
Ji-an Jiang,
Wen Zhao
Abstract:
The discovery of the kilonova (KN) AT 2017gfo, accompanying the gravitational wave event GW170817, provides crucial insight into the synthesis of heavy elements during binary neutron star (BNS) mergers. Following this landmark event, another KN was detected in association with the second-brightest gamma-ray burst (GRB) observed to date, GRB 230307A, and subsequently confirmed by observations of the James Webb Space Telescope (JWST). In this work, we conduct an end-to-end simulation to analyze the temporal evolution of the KN AT 2023vfi associated with GRB 230307A, and constrain the abundances of superheavy elements produced. We find that the temporal evolution of AT 2023vfi is similar to AT 2017gfo in the first week post-burst. Additionally, the \textit{r}-process nuclide abundances of lanthanide-rich ejecta, derived from numerical relativity simulations of BNS mergers, can also successfully interpret the temporal evolution of the KN with the lanthanide-rich ejecta mass of $0.02 M_\odot$, which is consistent with the mass range of dynamical ejecta from numerical simulations in literature. Both findings strongly suggest the hypothesis that GRB 230307A originated from a BNS merger, similar to AT 2017gfo. Based on the first time observation of the KN for JWST, we are able to constrain the superheavy elements of another KN following AT 2017gfo. The pre-radioactive-decay abundances of the superheavy nuclides: $^{222}$Rn, $^{223}$Ra, $^{224}$Ra and $^{225}$Ac, are estimated to be at least on the order of $1 \times 10^{-5}$. These abundance estimates provide valuable insight into the synthesis of superheavy elements in BNS mergers, contributing to our understanding of astrophysical \textit{r}-process nucleosynthesis.
Submitted 21 March, 2025; v1 submitted 19 March, 2025;
originally announced March 2025.
-
Machine learning predictions from unpredictable chaos
Authors:
Jian Jiang,
Long Chen,
Lu ke,
Bozheng Dou,
Yueying Zhu,
Yazhou Shi,
Huahai Qiu,
Bengong Zhang,
Tianshou Zhou,
Guo-Wei Wei
Abstract:
Chaos is omnipresent in nature, and its understanding provides enormous social and economic benefits. However, the unpredictability of chaotic systems is a textbook concept due to their sensitivity to initial conditions, aperiodic behavior, fractal dimensions, nonlinearity, and strange attractors. In this work, we introduce, for the first time, chaotic learning, a novel multiscale topological paradigm that enables accurate predictions from chaotic systems. We show that seemingly random and unpredictable chaotic dynamics counterintuitively offer unprecedented quantitative predictions. Specifically, we devise multiscale topological Laplacians to embed real-world data into a family of interactive chaotic dynamical systems, modulate their dynamical behaviors, and enable the accurate prediction of the input data. As a proof of concept, we consider 28 datasets from four categories of realistic problems: 10 brain waves, four benchmark protein datasets, 13 single-cell RNA sequencing datasets, and an image dataset, as well as two distinct chaotic dynamical systems, namely the Lorenz and Rossler attractors. We demonstrate chaotic learning predictions of the physical properties from chaos. Our new chaotic learning paradigm profoundly changes the textbook perception of chaos and bridges topology, chaos, and learning for the first time.
Submitted 19 March, 2025;
originally announced March 2025.
-
Towards More Economical Context-Augmented LLM Generation by Reusing Stored KV Cache
Authors:
Hanchen Li,
Yuhan Liu,
Yihua Cheng,
Kuntai Du,
Junchen Jiang
Abstract:
Across large language model (LLM) applications, we observe an emerging trend of reusing KV caches to save the prefill delays of processing repeated input texts in different LLM inputs. This has led to a broad design space, ranging from colocating stored KV caches with (or close to) GPUs to various forms of KV cache compression. However, a key question remains unanswered: can these delay reductions also be economically favorable? Specifically, we ask whether a developer can use public cloud services to store precomputed KV caches and reuse them to save delay without incurring more costs in terms of compute, storage, and network. To answer this question, we propose a validated analytical model for the cloud cost (in compute, storage, and network) of storing and reusing KV caches based on various workload parameters, such as reuse frequency, generated text lengths, model sizes, etc. Preliminary results show that KV cache reuse can save both delay and cloud cost across a range of workloads with long context. We call for more effort on building more economical context-augmented LLM generation through KV cache reuse.
Submitted 18 March, 2025;
originally announced March 2025.
-
A Comprehensive Survey on Cross-Domain Recommendation: Taxonomy, Progress, and Prospects
Authors:
Hao Zhang,
Mingyue Cheng,
Qi Liu,
Junzhe Jiang,
Xianquan Wang,
Rujiao Zhang,
Chenyi Lei,
Enhong Chen
Abstract:
Recommender systems (RS) have become crucial tools for information filtering in various real-world scenarios. Cross-domain recommendation (CDR) has been widely explored in recent years in order to provide better recommendation results in the target domain with the help of other domains. CDR technology has developed rapidly, yet there is a lack of a comprehensive survey summarizing recent works. Therefore, in this paper, we summarize the progress and prospects based on the main procedure of CDR, including Cross Domain Relevance, Cross Domain Interaction, Cross Domain Representation Enhancement and Model Optimization. To help researchers better understand and engage in this field, we also organize the applications and resources, and highlight several important current challenges and future directions of CDR. More details of the survey articles are available at https://github.com/USTCAGI/Awesome-Cross-Domain Recommendation-Papers-and-Resources.
Submitted 18 March, 2025;
originally announced March 2025.
-
DPC: Dual-Prompt Collaboration for Tuning Vision-Language Models
Authors:
Haoyang Li,
Liang Wang,
Chao Wang,
Jing Jiang,
Yan Peng,
Guodong Long
Abstract:
The Base-New Trade-off (BNT) problem universally exists during the optimization of CLIP-based prompt tuning, where continuous fine-tuning on base (target) classes leads to a simultaneous decrease of generalization ability on new (unseen) classes. Existing approaches attempt to regulate the prompt tuning process to balance BNT by appending constraints. However, imposed on the same target prompt, these constraints fail to fully avert the mutual exclusivity between the optimization directions for base and new. As a novel solution to this challenge, we propose the plug-and-play Dual-Prompt Collaboration (DPC) framework, the first to decouple the optimization processes of base and new tasks at the prompt level. Specifically, we clone a learnable parallel prompt based on the backbone prompt, and introduce a variable Weighting-Decoupling framework to independently control the optimization directions of dual prompts specific to base or new tasks, thus avoiding the conflict in generalization. Meanwhile, we propose a Dynamic Hard Negative Optimizer, utilizing dual prompts to construct a more challenging optimization task on base classes for enhancement. For interpretability, we prove the feature channel invariance of the prompt vector during the optimization process, providing theoretical support for the Weighting-Decoupling of DPC. Extensive experiments on multiple backbones demonstrate that DPC can significantly improve base performance without introducing any external knowledge beyond the base classes, while maintaining generalization to new classes. Code is available at: https://github.com/JREion/DPC.
Submitted 17 March, 2025;
originally announced March 2025.
-
Digital Beamforming Enhanced Radar Odometry
Authors:
Jingqi Jiang,
Shida Xu,
Kaicheng Zhang,
Jiyuan Wei,
Jingyang Wang,
Sen Wang
Abstract:
Radar has become an essential sensor for autonomous navigation, especially in challenging environments where camera and LiDAR sensors fail. 4D single-chip millimeter-wave radar systems, in particular, have drawn increasing attention thanks to their ability to provide spatial and Doppler information with low hardware cost and power consumption. However, most single-chip radar systems using traditional signal processing, such as Fast Fourier Transform, suffer from limited spatial resolution in radar detection, significantly limiting the performance of radar-based odometry and Simultaneous Localization and Mapping (SLAM) systems. In this paper, we develop a novel radar signal processing pipeline that integrates spatial domain beamforming techniques, and extend it to 3D Direction of Arrival estimation. Experiments using public datasets are conducted to evaluate and compare the performance of our proposed signal processing pipeline against traditional methodologies. These tests specifically focus on assessing structural precision across diverse scenes and measuring odometry accuracy in different radar odometry systems. This research demonstrates the feasibility of achieving more accurate radar odometry by simply replacing the standard FFT-based processing with the proposed pipeline. The codes are available at GitHub*.
Submitted 17 March, 2025;
originally announced March 2025.
-
Understanding Driver Cognition and Decision-Making Behaviors in High-Risk Scenarios: A Drift Diffusion Perspective
Authors:
Heye Huang,
Zheng Li,
Hao Cheng,
Haoran Wang,
Junkai Jiang,
Xiaopeng Li,
Arkady Zgonnikov
Abstract:
Ensuring safe interactions between autonomous vehicles (AVs) and human drivers in mixed traffic systems remains a major challenge, particularly in complex, high-risk scenarios. This paper presents a cognition-decision framework that integrates individual variability and commonalities in driver behavior to quantify risk cognition and model dynamic decision-making. First, a risk sensitivity model based on a multivariate Gaussian distribution is developed to characterize individual differences in risk cognition. Then, a cognitive decision-making model based on the drift diffusion model (DDM) is introduced to capture common decision-making mechanisms in high-risk environments. The DDM dynamically adjusts decision thresholds by integrating initial bias, drift rate, and boundary parameters, adapting to variations in speed, relative distance, and risk sensitivity to reflect diverse driving styles and risk preferences. By simulating high-risk scenarios with lateral, longitudinal, and multidimensional risk sources in a driving simulator, the proposed model accurately predicts cognitive responses and decision behaviors during emergency maneuvers. Specifically, by incorporating driver-specific risk sensitivity, the model enables dynamic adjustments of key DDM parameters, allowing for personalized decision-making representations in diverse scenarios. Comparative analysis with IDM, Gipps, and MOBIL demonstrates that DDM more precisely captures human cognitive processes and adaptive decision-making in high-risk scenarios. These findings provide a theoretical basis for modeling human driving behavior and offer critical insights for enhancing AV-human interaction in real-world traffic environments.
Submitted 16 March, 2025;
originally announced March 2025.
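The abstract above builds on the drift diffusion model (DDM), in which noisy evidence accumulates from an initial bias at a given drift rate until it crosses a decision boundary. The following is a minimal sketch of a generic, textbook-style DDM trial, not the authors' model: the function name `simulate_ddm`, the symmetric boundaries, the time step, and all parameter values are assumptions chosen for illustration, and the paper's dynamic adjustment of thresholds from speed, distance, and risk sensitivity is not reproduced here.

```python
import random


def simulate_ddm(drift, boundary, start_bias=0.0, noise=1.0,
                 dt=0.001, max_time=5.0, seed=None):
    """Simulate one trial of a basic drift diffusion model.

    Evidence starts at `start_bias` and evolves by Euler-Maruyama steps:
        evidence += drift * dt + noise * sqrt(dt) * N(0, 1)
    The trial ends when evidence reaches +boundary ("upper") or
    -boundary ("lower"), or when `max_time` elapses ("none").
    All names and default values are hypothetical.
    """
    rng = random.Random(seed)
    evidence = start_bias
    t = 0.0
    while t < max_time:
        evidence += drift * dt + noise * (dt ** 0.5) * rng.gauss(0.0, 1.0)
        t += dt
        if evidence >= boundary:
            return "upper", t
        if evidence <= -boundary:
            return "lower", t
    return "none", t


if __name__ == "__main__":
    # With noise, the choice and the decision time vary from trial to trial.
    choice, decision_time = simulate_ddm(drift=0.8, boundary=1.0, seed=0)
    print(choice, round(decision_time, 3))
```

In this sketch a larger drift rate shortens decision times, a larger boundary makes decisions slower but less sensitive to noise, and a nonzero initial bias favors one outcome; how these quantities map to braking or steering decisions is left open.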
-
FedGAI: Federated Style Learning with Cloud-Edge Collaboration for Generative AI in Fashion Design
Authors:
Mingzhu Wu,
Jianan Jiang,
Xinglin Li,
Hanhui Deng,
Di Wu
Abstract:
Collaboration can amalgamate diverse ideas, styles, and visual elements, fostering creativity and innovation among different designers. In collaborative design, sketches play a pivotal role as a means of expressing design creativity. However, designers often tend to not openly share these meticulously crafted sketches. This phenomenon of data island in the design area hinders its digital transformation under the third wave of AI. In this paper, we introduce a Federated Generative Artificial Intelligence Clothing system, namely FedGAI, employing federated learning to aid in sketch design. FedGAI is committed to establishing an ecosystem wherein designers can exchange sketch styles among themselves. Through FedGAI, designers can generate sketches that incorporate various designers' styles from their peers, drawing inspiration from collaboration without the need for data disclosure or upload. Extensive performance evaluations indicate that our FedGAI system can produce multi-styled sketches of comparable quality to human-designed ones while significantly enhancing efficiency compared to hand-drawn sketches.
Submitted 16 March, 2025;
originally announced March 2025.
-
PLM: Efficient Peripheral Language Models Hardware-Co-Designed for Ubiquitous Computing
Authors:
Cheng Deng,
Luoyang Sun,
Jiwen Jiang,
Yongcheng Zeng,
Xinjian Wu,
Wenxin Zhao,
Qingfa Xiao,
Jiachuan Wang,
Haoyang Li,
Lei Chen,
Lionel M. Ni,
Haifeng Zhang,
Jun Wang
Abstract:
While scaling laws have been continuously validated in large language models (LLMs) with increasing model parameters, the inherent tension between the inference demands of LLMs and the limited resources of edge devices poses a critical challenge to the development of edge intelligence. Recently, numerous small language models have emerged, aiming to distill the capabilities of LLMs into smaller footprints. However, these models often retain the fundamental architectural principles of their larger counterparts, still imposing considerable strain on the storage and bandwidth capacities of edge devices. In this paper, we introduce the PLM, a Peripheral Language Model, developed through a co-design process that jointly optimizes model architecture and edge system constraints. The PLM utilizes a Multi-head Latent Attention mechanism and employs the squared ReLU activation function to encourage sparsity, thereby reducing peak memory footprint during inference. During training, we collect and reorganize open-source datasets, implement a multi-phase training strategy, and empirically investigate the Warmup-Stable-Decay-Constant (WSDC) learning rate scheduler. Additionally, we incorporate Reinforcement Learning from Human Feedback (RLHF) by adopting the ARIES preference learning approach. Following a two-phase SFT process, this method yields performance gains of 2% in general tasks, 9% in the GSM8K task, and 11% in coding tasks. In addition to its novel architecture, evaluation results demonstrate that PLM outperforms existing small language models trained on publicly available data while maintaining the lowest number of activated parameters. Furthermore, deployment across various edge devices, including consumer-grade GPUs, mobile phones, and Raspberry Pis, validates PLM's suitability for peripheral applications. The PLM series models are publicly available at https://github.com/plm-team/PLM.
Submitted 19 March, 2025; v1 submitted 15 March, 2025;
originally announced March 2025.
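The PLM abstract above mentions the squared ReLU activation and its role in encouraging sparsity. As a minimal sketch, assuming the standard definition max(0, x)^2 and using hypothetical helper names (`squared_relu`, `zero_fraction`), the following shows how every non-positive pre-activation maps to exactly zero. It is an illustration in plain Python, not the PLM implementation.

```python
def squared_relu(x: float) -> float:
    """Squared ReLU: max(0, x) ** 2 (standard definition, assumed here)."""
    return max(0.0, x) ** 2


def zero_fraction(values):
    """Fraction of inputs whose squared-ReLU output is exactly zero."""
    outputs = [squared_relu(v) for v in values]
    return sum(1 for o in outputs if o == 0.0) / len(outputs)


if __name__ == "__main__":
    # Hypothetical pre-activation values, chosen only for illustration.
    pre_activations = [-1.5, -0.2, 0.0, 0.5, 2.0]
    print([squared_relu(v) for v in pre_activations])  # [0.0, 0.0, 0.0, 0.25, 4.0]
    print(zero_fraction(pre_activations))              # 0.6
```

Entries that are exactly zero contribute nothing to the following matrix multiplication, which is one way such an activation can relate to the reduced inference cost the abstract describes.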
-
Study of $φ\to K\bar{K}$ and $K_{S}^{0}-K_{L}^{0}$ asymmetry in the amplitude analysis of $D_{s}^{+} \to K_{S}^{0}K_{L}^{0}π^{+}$ decay
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann,
H. Cai
, et al. (701 additional authors not shown)
Abstract:
Using $e^+e^-$ annihilation data corresponding to a total integrated luminosity of 7.33 $\rm fb^{-1}$ collected at center-of-mass energies between 4.128 and 4.226~GeV with the BESIII detector, we provide the first amplitude analysis and absolute branching fraction measurement of the hadronic decay $D_{s}^{+} \to K_{S}^{0}K_{L}^{0}π^{+}$. The branching fraction of $D_{s}^{+} \to K_{S}^{0}K_{L}^{0}π^{+}$ is determined to be $(1.86\pm0.06_{\rm stat}\pm0.03_{\rm syst})\%$.
Combining the $\mathcal{B}(D_{s}^{+} \to φ(\to K_{S}^0K_{L}^0) π^+)$ obtained in this work and the world average of $\mathcal{B}(D_{s}^{+} \to φ(\to K^+K^-) π^+)$, we measure the relative branching fraction $\mathcal{B}(φ\to K_S^0K_L^0)/\mathcal{B}(φ\to K^+K^-)$=($0.597 \pm 0.023_{\rm stat} \pm 0.018_{\rm syst} \pm 0.016_{\rm PDG}$), which deviates from the PDG value by more than 3$σ$. Furthermore, the asymmetry of the branching fractions of $D^+_s\to K_{S}^0K^{*}(892)^{+}$ and $D^+_s\to K_{L}^0K^{*}(892)^{+}$, $\frac{\mathcal{B}(D_{s}^{+} \to K_{S}^0K^{*}(892)^{+})-\mathcal{B}(D_{s}^{+} \to K_{L}^0K^{*}(892)^{+})}{\mathcal{B}(D_{s}^{+} \to K_{S}^0K^{*}(892)^{+})+\mathcal{B}(D_{s}^{+} \to K_{L}^0K^{*}(892)^{+})}$, is determined to be $(-13.4\pm5.0_{\rm stat}\pm3.4_{\rm syst})\%$.
Submitted 23 March, 2025; v1 submitted 14 March, 2025;
originally announced March 2025.
-
Advancing Electronics Manufacturing Using Dynamically Programmable Micro-Transfer Printing System
Authors:
Qinhua Guo,
Lizhou Yang,
Yawen Gan,
Jingyang Zhang,
Jiajun Zhang,
Jiahao Jiang,
Weihan Lin,
Kaiqi Chen,
Chenchen Zhang,
Yunda Wang
Abstract:
Micro-transfer printing is an assembly technology that enables large-scale integration of diverse materials and components from micro- to nano-scale, crucial for developing advanced electronic and photonic systems. However, traditional micro-transfer printing technologies lack dynamic selectivity, limiting capabilities in sorting and repairing materials and components for effective yield management during large-scale manufacturing and integration processes. In this work, we introduce a dynamically programmable micro-transfer printing system utilizing a sharp phase-changing polymer and an independently addressable microheater array to modulate adhesion through localized heating. The system demonstrates dynamically programmable capabilities for selective transfer of various materials including semiconductors, polymers and metals, handling geometries from micro-scale chiplets to nanometer-thick films and micro-spheres. It also exhibits exceptional capabilities in 3D stacking and heterogeneous materials integration, significantly advancing the manufacturability of complex electronics. As a demonstration, we successfully perform dynamically programmable transfer of microLED chips to create arbitrarily specified patterns, offering a promising solution to the challenges of mass transfer and pixel repair in microLED display manufacturing.
Submitted 14 March, 2025;
originally announced March 2025.
-
Beyond A Single AI Cluster: A Survey of Decentralized LLM Training
Authors:
Haotian Dong,
Jingyan Jiang,
Rongwei Lu,
Jiajun Luo,
Jiajun Song,
Bowen Li,
Ying Shen,
Zhi Wang
Abstract:
The emergence of large language models (LLMs) has revolutionized AI development, yet their training demands computational resources beyond a single cluster or even datacenter, limiting accessibility to large organizations. Decentralized training has emerged as a promising paradigm to leverage dispersed resources across clusters, datacenters, and global regions, democratizing LLM development for broader communities. As the first comprehensive exploration of this emerging field, we present decentralized LLM training as a resource-driven paradigm and categorize it into community-driven and organizational approaches. Furthermore, our in-depth analysis clarifies decentralized LLM training, including: (1) position with related domain concepts comparison, (2) decentralized resource development trends, and (3) recent advances with discussion under a novel taxonomy. We also provide up-to-date case studies and explore future directions, contributing to the evolution of decentralized LLM training research.
Submitted 13 March, 2025;
originally announced March 2025.
-
Search for a $1^{-+}$ molecular state via $e^{+}e^{-} \to γD^{+}_{s} D_{s1}^{-}(2536) +c.c.$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (649 additional authors not shown)
Abstract:
We search, for the first time, for an exotic molecular state with quantum numbers $J^{PC}=1^{-+}$, called $X$, via the process $e^{+}e^{-} \to γD^{+}_{s} D_{s1}^{-}(2536) +c.c.$ using data samples corresponding to a luminosity of $5.8~\mathrm{fb^{-1}}$ across center-of-mass energies from 4.612 to 4.951~GeV, collected with the BESIII detector operating at the BEPCII collider. No statistically significant signal is observed. The upper limits on the product of cross-section and branching fraction $σ({e^{+}e^{-} \to γX}) \times \mathcal{B}(X \to D^{+}_{s} D_{s1}^{-}(2536) +c.c.)$ at 90\% confidence level are reported for each energy point, assuming the $X$ mass to be 4.503~GeV/$c^{2}$ and the width 25, 50, 75, and 100~MeV, respectively.
Submitted 13 March, 2025;
originally announced March 2025.
-
MaterialMVP: Illumination-Invariant Material Generation via Multi-view PBR Diffusion
Authors:
Zebin He,
Mingxin Yang,
Shuhui Yang,
Yixuan Tang,
Tao Wang,
Kaihao Zhang,
Guanying Chen,
Yuhong Liu,
Jie Jiang,
Chunchao Guo,
Wenhan Luo
Abstract:
Physically-based rendering (PBR) has become a cornerstone in modern computer graphics, enabling realistic material representation and lighting interactions in 3D scenes. In this paper, we present MaterialMVP, a novel end-to-end model for generating PBR textures from 3D meshes and image prompts, addressing key challenges in multi-view material synthesis. Our approach leverages Reference Attention to extract and encode informative latent from the input reference images, enabling intuitive and controllable texture generation. We also introduce a Consistency-Regularized Training strategy to enforce stability across varying viewpoints and illumination conditions, ensuring illumination-invariant and geometrically consistent results. Additionally, we propose Dual-Channel Material Generation, which separately optimizes albedo and metallic-roughness (MR) textures while maintaining precise spatial alignment with the input images through Multi-Channel Aligned Attention. Learnable material embeddings are further integrated to capture the distinct properties of albedo and MR. Experimental results demonstrate that our model generates PBR textures with realistic behavior across diverse lighting scenarios, outperforming existing methods in both consistency and quality for scalable 3D asset creation.
Submitted 13 March, 2025;
originally announced March 2025.
-
Modeling Thousands of Human Annotators for Generalizable Text-to-Image Person Re-identification
Authors:
Jiayu Jiang,
Changxing Ding,
Wentao Tan,
Junhong Wang,
Jin Tao,
Xiangmin Xu
Abstract:
Text-to-image person re-identification (ReID) aims to retrieve the images of a person of interest based on textual descriptions. One main challenge for this task is the high cost of manually annotating large-scale databases, which affects the generalization ability of ReID models. Recent works handle this problem by leveraging Multi-modal Large Language Models (MLLMs) to describe pedestrian images automatically. However, the captions produced by MLLMs lack diversity in description styles. To address this issue, we propose a Human Annotator Modeling (HAM) approach to enable MLLMs to mimic the description styles of thousands of human annotators. Specifically, we first extract style features from human textual descriptions and perform clustering on them. This allows us to group textual descriptions with similar styles into the same cluster. Then, we employ a prompt to represent each of these clusters and apply prompt learning to mimic the description styles of different human annotators. Furthermore, we define a style feature space and perform uniform sampling in this space to obtain more diverse clustering prototypes, which further enriches the diversity of the MLLM-generated captions. Finally, we adopt HAM to automatically annotate a massive-scale database for text-to-image ReID. Extensive experiments on this database demonstrate that it significantly improves the generalization ability of ReID models.
Submitted 12 March, 2025;
originally announced March 2025.
-
Unveiling Hidden Pivotal Players with GoalNet: A GNN-Based Soccer Player Evaluation System
Authors:
Jacky Hao Jiang,
Jerry Cai,
Anastasios Kyrillidis
Abstract:
Soccer analysis tools emphasize metrics such as expected goals, leading to an overrepresentation of attacking players' contributions and overlooking players who facilitate ball control and link attacks. Examples include Rodri from Manchester City and Palhinha who just transferred to Bayern Munich. To address this bias, we aim to identify players with pivotal roles in a soccer team, incorporating both spatial and temporal features.
In this work, we introduce a GNN-based framework that assigns individual credit for changes in expected threat (xT), thus capturing overlooked yet vital contributions in soccer. Our pipeline encodes both spatial and temporal features in event-centric graphs, enabling fair attribution of non-scoring actions such as defensive or transitional plays. We incorporate centrality measures into the learned player embeddings, ensuring that ball-retaining defenders and defensive midfielders receive due recognition for their overall impact. Furthermore, we explore diverse GNN variants, including Graph Attention Networks and Transformer-based models, to handle long-range dependencies and evolving match contexts, discussing their relative performance and computational complexity. Experiments on real match data confirm the robustness of our approach in highlighting pivotal roles that traditional attacking metrics typically miss, underscoring the model's utility for more comprehensive soccer analytics.
Submitted 12 March, 2025;
originally announced March 2025.
-
Evaluating the Generalizability of LLMs in Automated Program Repair
Authors:
Fengjie Li,
Jiajun Jiang,
Jiajun Sun,
Hongyu Zhang
Abstract:
LLM-based automated program repair methods have attracted significant attention for their state-of-the-art performance. However, they were primarily evaluated on a few well known datasets like Defects4J, raising questions about their effectiveness on new datasets. In this study, we evaluate 11 top-performing LLMs on DEFECTS4J-TRANS, a new dataset derived from transforming Defects4J while maintaining the original semantics. Results from experiments on both Defects4J and DEFECTS4J-TRANS show that all studied LLMs have limited generalizability in APR tasks, with the average number of correct and plausible patches decreasing by 49.48% and 42.90%, respectively, on DEFECTS4J-TRANS. Further investigation into incorporating additional repair-relevant information in repair prompts reveals that, although this information significantly enhances the LLMs' capabilities (increasing the number of correct and plausible patches by up to 136.67% and 121.82%, respectively), performance still falls short of their original results. This indicates that prompt engineering alone is insufficient to substantially enhance LLMs' repair capabilities. Based on our study, we also offer several recommendations for future research.
Submitted 12 March, 2025;
originally announced March 2025.
-
Rethinking Bimanual Robotic Manipulation: Learning with Decoupled Interaction Framework
Authors:
Jian-Jian Jiang,
Xiao-Ming Wu,
Yi-Xiang He,
Ling-An Zeng,
Yi-Lin Wei,
Dandan Zhang,
Wei-Shi Zheng
Abstract:
Bimanual robotic manipulation is an emerging and critical topic in the robotics community. Previous works primarily rely on integrated control models that take the perceptions and states of both arms as inputs to directly predict their actions. However, we argue that bimanual manipulation involves not only coordinated tasks but also various uncoordinated tasks that do not require explicit cooperation during execution, such as grasping objects with the closest hand, which integrated control frameworks fail to consider because they enforce cooperation in the early inputs. In this paper, we propose a novel decoupled interaction framework that considers the characteristics of different tasks in bimanual manipulation. The key insight of our framework is to assign an independent model to each arm to enhance the learning of uncoordinated tasks, while introducing a selective interaction module that adaptively learns weights from its own arm to improve the learning of coordinated tasks. Extensive experiments on seven tasks in the RoboTwin dataset demonstrate that: (1) Our framework achieves outstanding performance, with a 23.5% boost over the SOTA method. (2) Our framework is flexible and can be seamlessly integrated into existing methods. (3) Our framework can be effectively extended to multi-agent manipulation tasks, achieving a 28% boost over the integrated control SOTA. (4) The performance boost stems from the decoupled design itself, surpassing the SOTA by 16.5% in success rate with only 1/6 of the model size.
Submitted 12 March, 2025;
originally announced March 2025.
-
AffordDexGrasp: Open-set Language-guided Dexterous Grasp with Generalizable-Instructive Affordance
Authors:
Yi-Lin Wei,
Mu Lin,
Yuhao Lin,
Jian-Jian Jiang,
Xiao-Ming Wu,
Ling-An Zeng,
Wei-Shi Zheng
Abstract:
Language-guided robot dexterous generation enables robots to grasp and manipulate objects based on human commands. However, previous data-driven methods struggle to understand intention and to execute grasps on unseen categories in the open set. In this work, we explore a new task, Open-set Language-guided Dexterous Grasp, and find that the main challenge is the huge gap between high-level human language semantics and low-level robot actions. To solve this problem, we propose an Affordance Dexterous Grasp (AffordDexGrasp) framework, with the insight of bridging the gap with a new generalizable-instructive affordance representation. This affordance can generalize to unseen categories by leveraging the object's local structure and category-agnostic semantic attributes, thereby effectively guiding dexterous grasp generation. Built upon the affordance, our framework introduces Affordance Flow Matching (AFM) for affordance generation with language as input, and Grasp Flow Matching (GFM) for generating dexterous grasp with affordance as input. To evaluate our framework, we build an open-set table-top language-guided dexterous grasp dataset. Extensive experiments in simulation and the real world show that our framework surpasses all previous methods in open-set generalization.
Submitted 10 March, 2025;
originally announced March 2025.
-
Anomalous Meets Topological Hall Effect in Cr2Ge2Te6 Heterostructures
Authors:
Xiaofan Cai,
Yaqing Han,
Jiawei Jiang,
Renjun Du,
Di Zhang,
Jiabei Huang,
Siqi Jiang,
Jingkuan Xiao,
Zihao Wang,
Qian Guo,
Wanting Xu,
Fuzhuo Lian,
Siqing Wang,
Bingxian Ou,
Yongqiang Yang,
Kenji Watanabe,
Takashi Taniguchi,
Alexander S. Mayorov,
Konstantin S. Novoselov,
Baigeng Wang,
Kai Chang,
Hongxin Yang,
Lei Wang,
Geliang Yu
Abstract:
Introducing topologically protected skyrmions in graphene holds significant importance for developing high-speed, low-energy spintronic devices. Here, we present a centrosymmetric ferromagnetic graphene/trilayer Cr2Ge2Te6/graphene heterostructure, demonstrating the anomalous and topological Hall effect due to the magnetic proximity effect. Through gate voltage control, we effectively tune the emergence and size of skyrmions. Micromagnetic simulations reveal the formation of skyrmions and antiskyrmions, which respond differently to external magnetic fields, leading to oscillations in the topological Hall signal. Our findings provide a novel pathway for the formation and manipulation of skyrmions in centrosymmetric two-dimensional magnetic systems, offering significant insights for developing topological spintronics.
Submitted 11 March, 2025; v1 submitted 10 March, 2025;
originally announced March 2025.
-
First differential measurement of the single $\mathbf{π}^+$ production cross section in neutrino neutral-current scattering
Authors:
K. Abe,
S. Abe,
R. Akutsu,
H. Alarakia-Charles,
Y. I. Alj Hakim,
S. Alonso Monsalve,
L. Anthony,
S. Aoki,
K. A. Apte,
T. Arai,
T. Arihara,
S. Arimoto,
N. Babu,
V. Baranov,
G. J. Barker,
G. Barr,
D. Barrow,
P. Bates,
L. Bathe-Peters,
M. Batkiewicz-Kwasniak,
N. Baudis,
V. Berardi,
L. Berns,
S. Bhattacharjee,
A. Blanchet
, et al. (338 additional authors not shown)
Abstract:
Since its first observation in the 1970s, neutrino-induced neutral-current single positive pion production (NC1$π^+$) has remained an elusive and poorly understood interaction channel. This process is a significant background in neutrino oscillation experiments and studying it further is critical for the physics program of next-generation accelerator-based neutrino oscillation experiments. In this Letter we present the first double-differential cross-section measurement of NC1$π^+$ interactions using data from the ND280 detector of the T2K experiment collected in $ν$-beam mode. We compare the results on a hydrocarbon target to the predictions of several neutrino interaction generators and final-state interaction models. While model predictions agree with the differential results, the data shows a weak preference for a cross-section normalization approximately 30% higher than predicted by most models studied in this Letter.
Submitted 11 March, 2025; v1 submitted 9 March, 2025;
originally announced March 2025.