-
Measurement of charged-current muon neutrino-argon interactions without pions in the final state using the MicroBooNE detector
Authors:
MicroBooNE collaboration,
P. Abratenko,
D. Andrade Aldana,
L. Arellano,
J. Asaadi,
A. Ashkenazi,
S. Balasubramanian,
B. Baller,
A. Barnard,
G. Barr,
D. Barrow,
J. Barrow,
V. Basque,
J. Bateman,
O. Benevides Rodrigues,
S. Berkman,
A. Bhat,
M. Bhattacharya,
M. Bishai,
A. Blake,
B. Bogart,
T. Bolton,
M. B. Brunetti,
L. Camilleri,
D. Caratelli
, et al. (152 additional authors not shown)
Abstract:
We report a new measurement of flux-integrated differential cross sections for charged-current (CC) muon neutrino interactions with argon nuclei that produce no final state pions $(ν_μ\mathrm{CC}0π)$. These interactions are of particular importance as a topologically defined signal dominated by quasielastic-like interactions. This measurement was performed with the MicroBooNE liquid argon time projection chamber detector located at the Fermilab Booster Neutrino Beam (BNB), and uses an exposure of $1.3\times10^{21}$ protons on target collected between 2015 and 2020. The results are presented in terms of single and double-differential cross sections as a function of the final state muon momentum and angle. The data are compared with widely-used neutrino event generators. We find good agreement with the single-differential measurements, while only a subset of generators are also able to adequately describe the data in double-differential distributions. This work facilitates comparison with Cherenkov detector measurements, including those located at the BNB.
Submitted 1 July, 2025;
originally announced July 2025.
-
NaviAgent: Bilevel Planning on Tool Dependency Graphs for Function Calling
Authors:
Yan Jiang,
Hao Zhou,
LiZhong GU,
Ai Han,
TianLong Li
Abstract:
LLMs' reliance on static knowledge and fragile tool invocation severely hinders the orchestration of complex, heterogeneous toolchains, particularly at large scales. Existing methods typically use rigid single-path execution, resulting in poor error recovery and exponentially growing search spaces. We introduce NaviAgent, a graph-navigated bilevel planning architecture for robust function calling, comprising a Multi-Path Decider and Graph-Encoded Navigator. As an LLM-powered agent, the Multi-Path Decider defines a four-dimensional decision space and continuously perceives environmental states, dynamically selecting the optimal action to fully cover all tool invocation scenarios. The Graph-Encoded Navigator constructs a Tool Dependency Heterogeneous Graph (TDHG), where node embeddings explicitly fuse API schema structure with historical invocation behavior. It also integrates a novel heuristic search strategy that guides the Decider toward efficient and highly successful toolchains, even for unseen tool combinations. Experiments show that NaviAgent consistently achieves the highest task success rate (TSR) across all foundation models and task complexities, outperforming the average baselines (ReAct, ToolLLM, α-UMI) by 13.5%, 16.4%, and 19.0% on Qwen2.5-14B, Qwen2.5-32B, and Deepseek-V3, respectively. Its execution steps are typically within one step of the most efficient baseline, ensuring a strong balance between quality and efficiency. Notably, a fine-tuned Qwen2.5-14B model achieves a TSR of 49.5%, surpassing the much larger 32B model (44.9%) under our architecture. Incorporating the Graph-Encoded Navigator further boosts TSR by an average of 2.4 points, with gains of over 9 points on complex tasks for larger models (Deepseek-V3 and GPT-4o), highlighting its essential role in toolchain orchestration.
Submitted 24 June, 2025;
originally announced June 2025.
-
InternSpatial: A Comprehensive Dataset for Spatial Reasoning in Vision-Language Models
Authors:
Nianchen Deng,
Lixin Gu,
Shenglong Ye,
Yinan He,
Zhe Chen,
Songze Li,
Haomin Wang,
Xingguang Wei,
Tianshuo Yang,
Min Dou,
Tong He,
Wenqi Shao,
Kaipeng Zhang,
Yi Wang,
Botian Shi,
Yanting Zhang,
Jifeng Dai,
Yu Qiao,
Hongjie Zhang,
Wenhai Wang
Abstract:
Recent benchmarks and datasets have been proposed to improve spatial reasoning in vision-language models (VLMs), yet existing open resources remain limited in scale, visual diversity, and instruction expressiveness. In this work, we introduce InternSpatial, the largest open-source dataset for spatial reasoning in VLMs, along with InternSpatial-Bench, a corresponding evaluation benchmark designed to assess spatial understanding under diverse instruction formats. InternSpatial comprises 12 million QA pairs spanning both single-view and multi-view settings, drawn from diverse visual environments and supporting 19 instruction formats that reflect varied query styles. For evaluation, we propose InternSpatial-Bench for single-view tasks and expand multi-view reasoning by introducing a novel rotation angle prediction task that has not been explored in prior work. Experimental results show that models trained on InternSpatial achieve 12.1% improvement on InternSpatial-Bench and 10.7% on VSI-Bench, while maintaining strong performance on general-purpose benchmarks. We hope these resources will support the development of spatially capable VLMs in practical applications such as robotics and embodied AI.
Submitted 23 June, 2025;
originally announced June 2025.
-
Learning to Adapt Frozen CLIP for Few-Shot Test-Time Domain Adaptation
Authors:
Zhixiang Chi,
Li Gu,
Huan Liu,
Ziqiang Wang,
Yanan Wu,
Yang Wang,
Konstantinos N Plataniotis
Abstract:
Few-shot Test-Time Domain Adaptation focuses on adapting a model at test time to a specific domain using only a few unlabeled examples, addressing domain shift. Prior methods leverage CLIP's strong out-of-distribution (OOD) abilities by generating domain-specific prompts to guide its generalized, frozen features. However, since downstream datasets are not explicitly seen by CLIP, solely depending on the feature space knowledge is constrained by CLIP's prior knowledge. Notably, when using a less robust backbone like ViT-B/16, performance significantly drops on challenging real-world benchmarks. Departing from the state-of-the-art of inheriting the intrinsic OOD capability of CLIP, this work introduces learning directly on the input space to complement the dataset-specific knowledge for frozen CLIP. Specifically, an independent side branch is attached in parallel with CLIP and enforced to learn exclusive knowledge via revert attention. To better capture the dataset-specific label semantics for downstream adaptation, we propose to enhance the inter-dispersion among text features via greedy text ensemble and refinement. The text and visual features are then progressively fused in a domain-aware manner by a generated domain prompt to adapt toward a specific domain. Extensive experiments show our method's superiority on 5 large-scale benchmarks (WILDS and DomainNet), notably improving over smaller networks like ViT-B/16 with gains of +5.1 in F1 for iWildCam and +3.1% in WC Acc for FMoW.
Submitted 17 June, 2025;
originally announced June 2025.
-
Full-Gap Superconductivity in BaAs/Ferropnictide Heterostructures
Authors:
Ming-Qiang Ren,
Qiang-Jun Cheng,
Hui-Hui He,
Ze-Xian Deng,
Fang-Jun Cheng,
Yong-Wei Wang,
Cong-Cong Lou,
Qinghua Zhang,
Lin Gu,
Kai Liu,
Xu-Cun Ma,
Qi-Kun Xue,
Can-Li Song
Abstract:
Interfacial interactions often promote the emergence of unusual phenomena in two-dimensional systems, including high-temperature superconductivity. Here, we report the observation of full-gap superconductivity with a maximal spectroscopic temperature up to 26 K in a BaAs monolayer grown on ferropnictide Ba(Fe$_{1-x}$Co$_x$)$_2$As$_2$ (abbreviated as BFCA) epitaxial films. The superconducting gap remains robust even when the thickness of underlying BFCA is reduced to the monolayer limit, in contrast to the rapid suppression of $T_\textrm{c}$ in standalone BFCA thin films. We reveal that the exceptional crystallinity of the BaAs/BFCA heterostructures, featured by their remarkable electronic and geometric uniformities, is crucial for the emergent full-gap superconductivity with mean-field temperature dependence and pronounced bound states within magnetic vortices. Our findings open up new avenues to unravel the mysteries of unconventional superconductivity in ferropnictides and advance the development of FeAs-based heterostructures.
Submitted 19 June, 2025;
originally announced June 2025.
-
I Speak and You Find: Robust 3D Visual Grounding with Noisy and Ambiguous Speech Inputs
Authors:
Yu Qi,
Lipeng Gu,
Honghua Chen,
Liangliang Nan,
Mingqiang Wei
Abstract:
Existing 3D visual grounding methods rely on precise text prompts to locate objects within 3D scenes. Speech, as a natural and intuitive modality, offers a promising alternative. Real-world speech inputs, however, often suffer from transcription errors due to accents, background noise, and varying speech rates, limiting the applicability of existing 3DVG methods. To address these challenges, we propose SpeechRefer, a novel 3DVG framework designed to enhance performance in the presence of noisy and ambiguous speech-to-text transcriptions. SpeechRefer integrates seamlessly with existing 3DVG models and introduces two key innovations. First, the Speech Complementary Module captures acoustic similarities between phonetically related words and highlights subtle distinctions, generating complementary proposal scores from the speech signal. This reduces dependence on potentially erroneous transcriptions. Second, the Contrastive Complementary Module employs contrastive learning to align erroneous text features with corresponding speech features, ensuring robust performance even when transcription errors dominate. Extensive experiments on the SpeechRefer and SpeechNr3D datasets demonstrate that SpeechRefer improves the performance of existing 3DVG methods by a large margin, which highlights SpeechRefer's potential to bridge the gap between noisy speech inputs and reliable 3DVG, enabling more intuitive and practical multimodal systems.
Submitted 17 June, 2025;
originally announced June 2025.
-
Unified Representation Space for 3D Visual Grounding
Authors:
Yinuo Zheng,
Lipeng Gu,
Honghua Chen,
Liangliang Nan,
Mingqiang Wei
Abstract:
3D visual grounding (3DVG) is a critical task in scene understanding that aims to identify objects in 3D scenes based on text descriptions. However, existing methods rely on separately pre-trained vision and text encoders, resulting in a significant gap between the two modalities in terms of spatial geometry and semantic categories. This discrepancy often causes errors in object positioning and classification. The paper proposes UniSpace-3D, which innovatively introduces a unified representation space for 3DVG, effectively bridging the gap between visual and textual features. Specifically, UniSpace-3D incorporates three innovative designs: i) a unified representation encoder that leverages the pre-trained CLIP model to map visual and textual features into a unified representation space, effectively bridging the gap between the two modalities; ii) a multi-modal contrastive learning module that further reduces the modality gap; iii) a language-guided query selection module that utilizes the positional and semantic information to identify object candidate points aligned with textual descriptions. Extensive experiments demonstrate that UniSpace-3D outperforms baseline models by at least 2.24% on the ScanRefer and Nr3D/Sr3D datasets. The code will be made available upon acceptance of the paper.
Submitted 17 June, 2025;
originally announced June 2025.
-
Delving into the depths of NGC 3783 with XRISM. I. Kinematic and ionization structure of the highly ionized outflows
Authors:
Missagh Mehdipour,
Jelle S. Kaastra,
Megan E. Eckart,
Liyi Gu,
Ralf Ballhausen,
Ehud Behar,
Camille M. Diez,
Keigo Fukumura,
Matteo Guainazzi,
Kouichi Hagino,
Timothy R. Kallman,
Erin Kara,
Chen Li,
Jon M. Miller,
Misaki Mizumoto,
Hirofumi Noda,
Shoji Ogawa,
Christos Panagiotou,
Atsushi Tanimoto,
Keqin Zhao
Abstract:
We present our study of the XRISM observation of the Seyfert-1 galaxy NGC 3783. For the first time, XRISM's Resolve microcalorimeter enables a detailed characterization of the highly ionized outflows in this active galactic nucleus. Our analysis constrains their outflow and turbulent velocities, along with their ionization parameter $ξ$ and column density $N_{\rm H}$. The high-resolution Resolve spectrum reveals a distinct series of Fe absorption lines between 6.4 and 7.8 keV, ranging from Fe XVIII to Fe XXVI. At lower energies, absorption features from Si, S, and Ar are also detected. Our spectroscopy and photoionization modeling of the time-averaged Resolve spectrum uncover six outflow components, five of which exhibit relatively narrow absorption lines, with outflow velocities ranging from 560 to 1170 km/s. In addition, a broad absorption feature is detected, which is consistent with Fe XXVI outflowing at 14,300 km/s (0.05 $c$). This component carries a kinetic luminosity of 0.8-3% of the bolometric luminosity. Our analysis of the Resolve spectrum shows that more highly ionized absorption lines are intrinsically broader than those of lower ionization species, indicating that the turbulent velocity of the six outflow components (ranging from 0 to 3500 km/s) increases with $ξ$. Furthermore, we find that the $N_{\rm H}$ of the outflows generally declines with $ξ$ up to $\log ξ= 3.2$, but rises beyond this point, suggesting a complex ionization structure. Examination of the absorption profile of the Fe XXV resonance line reveals intriguing similarities to UV absorption lines (Ly$α$ and C IV) observed by the HST, from which we infer that the outflows are clumpy in nature. Our XRISM/Resolve results support a 'hybrid wind' scenario, in which the outflows have multiple origins and driving mechanisms. We explore various interpretations of our findings within AGN wind models.
Submitted 11 June, 2025;
originally announced June 2025.
-
Step-Audio-AQAA: a Fully End-to-End Expressive Large Audio Language Model
Authors:
Ailin Huang,
Bingxin Li,
Bruce Wang,
Boyong Wu,
Chao Yan,
Chengli Feng,
Heng Wang,
Hongyu Zhou,
Hongyuan Wang,
Jingbei Li,
Jianjian Sun,
Joanna Wang,
Mingrui Chen,
Peng Liu,
Ruihang Miao,
Shilei Jiang,
Tian Fei,
Wang You,
Xi Chen,
Xuerui Yang,
Yechang Huang,
Yuxiang Zhang,
Zheng Ge,
Zheng Gong,
Zhewei Huang
, et al. (51 additional authors not shown)
Abstract:
Large Audio-Language Models (LALMs) have significantly advanced intelligent human-computer interaction, yet their reliance on text-based outputs limits their ability to generate natural speech responses directly, hindering seamless audio interactions. To address this, we introduce Step-Audio-AQAA, a fully end-to-end LALM designed for Audio Query-Audio Answer (AQAA) tasks. The model integrates a dual-codebook audio tokenizer for linguistic and semantic feature extraction, a 130-billion-parameter backbone LLM, and a neural vocoder for high-fidelity speech synthesis. Our post-training approach employs interleaved token-output of text and audio to enhance semantic coherence and combines Direct Preference Optimization (DPO) with model merge to improve performance. Evaluations on the StepEval-Audio-360 benchmark demonstrate that Step-Audio-AQAA excels especially in speech control, outperforming the state-of-the-art LALMs in key areas. This work contributes a promising solution for end-to-end LALMs and highlights the critical role of token-based vocoder in enhancing overall performance for AQAA tasks.
Submitted 13 June, 2025; v1 submitted 10 June, 2025;
originally announced June 2025.
-
XRISM insights for interstellar Sulfur
Authors:
Lia Corrales,
Elisa Costantini,
Sascha Zeeger,
Liyi Gu,
Hiromitsu Takahashi,
David Moutard,
Megumi Shidatsu,
Jon M. Miller,
Misaki Mizumoto,
Randall K. Smith,
Ralf Ballhausen,
Priyanka Chakraborty,
Marua Diaz Trigo,
Renee Ludlam,
Takao Nakagawa,
Ioanna Psaradaki,
Shinya Yamada,
Caroline A. Kilbourne
Abstract:
The X-ray Imaging Spectroscopy Mission (XRISM) provides the best spectral resolution with which to study Sulfur (S) K-shell photoabsorption features from the interstellar medium (ISM). For the first time, we demonstrate the high-signal detection of interstellar atomic SII K-beta absorption in the spectrum of X-ray binaries (XRBs) 4U 1630-472 and GX 340+0. The persistence of this feature across multiple instruments, targets, and flux states implies that it is interstellar in nature. We measure the SII K-beta line centroid at 2470.8 +/- 1.1 eV after including systematic uncertainties. We also find that the most recently published high resolution SII absorption template requires a systematic energy scale shift of +7-8 eV, which is comparable to the level of disagreement among various atomic modeling procedures. The XRISM 300 ks observation of GX 340+0 provides unprecedented signal-to-noise in the S K region, and we find evidence of residual absorption from solid S in the spectra of GX 340+0. Absorption templates from three Fe-S compounds, troilite (FeS), pyrrhotite (Fe_7S_8) and pyrite (FeS_2), provide equally good fits to the residuals. Even though we are not able to distinguish among these three compounds, they provide equal estimates for the abundance of S locked in dust grains. Accounting for both the gaseous and solid S in the GX 340+0 sightline provides us with a direct measurement of S depletion, which is 40% +/- 15%. Our depletion measurement provides an upper limit to the fraction of interstellar Fe bound in Fe-S compounds of < 25%, which is consistent with prior studies of Fe-S compounds via Fe L-shell absorption. Both XRBs in this study are at a distance of approximately 11 kpc and on the opposite side of the Galactic disk, suggesting that this value could represent the average S depletion of the Milky Way when integrated across all phases of the ISM.
Submitted 10 June, 2025;
originally announced June 2025.
-
FLEX: A Large-Scale Multi-Modal Multi-Action Dataset for Fitness Action Quality Assessment
Authors:
Hao Yin,
Lijun Gu,
Paritosh Parmar,
Lin Xu,
Tianxiao Guo,
Weiwei Fu,
Yang Zhang,
Tianyou Zheng
Abstract:
With the increasing awareness of health and the growing desire for aesthetic physique, fitness has become a prevailing trend. However, the potential risks associated with fitness training, especially with weight-loaded fitness actions, cannot be overlooked. Action Quality Assessment (AQA), a technology that quantifies the quality of human action and provides feedback, holds the potential to assist fitness enthusiasts of varying skill levels in achieving better training outcomes. Nevertheless, current AQA methodologies and datasets are limited to single-view competitive sports scenarios and RGB modality and lack professional assessment and guidance of fitness actions. To address this gap, we propose the FLEX dataset, the first multi-modal, multi-action, large-scale dataset that incorporates surface electromyography (sEMG) signals into AQA. FLEX utilizes high-precision MoCap to collect 20 different weight-loaded actions performed by 38 subjects across 3 different skill levels for 10 repetitions each, containing 5 different views of the RGB video, 3D pose, sEMG, and physiological information. Additionally, FLEX incorporates knowledge graphs into AQA, constructing annotation rules in the form of penalty functions that map weight-loaded actions, action keysteps, error types, and feedback. We evaluated various baseline methodologies on FLEX, demonstrating that multimodal data, multiview data, and fine-grained annotations significantly enhance model performance. FLEX not only advances AQA methodologies and datasets towards multi-modal and multi-action scenarios but also fosters the integration of artificial intelligence within the fitness domain. Dataset and code are available at https://haoyin116.github.io/FLEX_Dataset.
Submitted 1 June, 2025;
originally announced June 2025.
-
A new shock in the pre-merging cluster pair 1E2215-2216
Authors:
Yanling Chen,
Liyi Gu,
Aurora Simionescu,
Chunyang Jiang,
Rui Huang,
Wei Cui
Abstract:
The galaxy cluster pair 1E2216.0-0401 and 1E2215.7-0404 represents a major cluster merger in its early stages, a phase that has been scarcely explored in previous studies. Within this system, both axial and equatorial merger shocks have been identified. Recent XMM-Newton observations of the southern region of the cluster pair have increased the total exposure time to approximately 300 ks, enhancing the sensitivity to detect faint shock features in the cluster outskirts. Through a combined analysis of XMM-Newton and Chandra data, including both imaging and spectral techniques, a new shock front has been identified at approximately 2'.3 south of the X-ray brightness peak of 1E2215. This shock front exhibits a surface brightness ratio of $1.33 \pm 0.07$ and a temperature ratio of $1.22^{+0.13}_{-0.14}$ in XMM-Newton, consistent with Chandra results. The Mach number, independently calculated from both the temperature and surface brightness discontinuities, yields consistent values of $\mathcal{M} \approx 1.2$. The age, velocity, and spatial distribution of this shock suggest that it shares a common physical origin with the previously identified equatorial shock.
Submitted 30 May, 2025;
originally announced May 2025.
-
Point or Line? Using Line-based Representation for Panoptic Symbol Spotting in CAD Drawings
Authors:
Xingguang Wei,
Haomin Wang,
Shenglong Ye,
Ruifeng Luo,
Yanting Zhang,
Lixin Gu,
Jifeng Dai,
Yu Qiao,
Wenhai Wang,
Hongjie Zhang
Abstract:
We study the task of panoptic symbol spotting, which involves identifying both individual instances of countable things and the semantic regions of uncountable stuff in computer-aided design (CAD) drawings composed of vector graphical primitives. Existing methods typically rely on image rasterization, graph construction, or point-based representation, but these approaches often suffer from high computational costs, limited generality, and loss of geometric structural information. In this paper, we propose VecFormer, a novel method that addresses these challenges through line-based representation of primitives. This design preserves the geometric continuity of the original primitive, enabling more accurate shape representation while maintaining a computation-friendly structure, making it well-suited for vector graphic understanding tasks. To further enhance prediction reliability, we introduce a Branch Fusion Refinement module that effectively integrates instance and semantic predictions, resolving their inconsistencies for more coherent panoptic outputs. Extensive experiments demonstrate that our method establishes a new state-of-the-art, achieving 91.1 PQ, with Stuff-PQ improved by 9.6 and 21.2 points over the second-best results under settings with and without prior information, respectively, highlighting the strong potential of line-based representation as a foundation for vector graphic understanding.
Submitted 29 May, 2025;
originally announced May 2025.
-
MetaWriter: Personalized Handwritten Text Recognition Using Meta-Learned Prompt Tuning
Authors:
Wenhao Gu,
Li Gu,
Ching Yee Suen,
Yang Wang
Abstract:
Recent advancements in handwritten text recognition (HTR) have enabled the effective conversion of handwritten text to digital formats. However, achieving robust recognition across diverse writing styles remains challenging. Traditional HTR methods lack writer-specific personalization at test time due to limitations in model architecture and training strategies. Existing attempts to bridge this gap, through gradient-based meta-learning, still require labeled examples and suffer from parameter-inefficient fine-tuning, leading to substantial computational and memory overhead. To overcome these challenges, we propose an efficient framework that formulates personalization as prompt tuning, incorporating an auxiliary image reconstruction task with a self-supervised loss to guide prompt adaptation with unlabeled test-time examples. To ensure that the self-supervised loss effectively minimizes text recognition error, we leverage meta-learning to learn the optimal initialization of the prompts. As a result, our method allows the model to efficiently capture unique writing styles by updating less than 1% of its parameters and eliminating the need for time-intensive annotation processes. We validate our approach on the RIMES and IAM Handwriting Database benchmarks, where it consistently outperforms previous state-of-the-art methods while using 20x fewer parameters. We believe this represents a significant advancement in personalized handwritten text recognition, paving the way for more reliable and practical deployment in resource-constrained scenarios.
Submitted 26 May, 2025;
originally announced May 2025.
-
JanusDNA: A Powerful Bi-directional Hybrid DNA Foundation Model
Authors:
Qihao Duan,
Bingding Huang,
Zhenqiao Song,
Irina Lehmann,
Lei Gu,
Roland Eils,
Benjamin Wild
Abstract:
Large language models (LLMs) have revolutionized natural language processing and are increasingly applied to other sequential data types, including genetic sequences. However, adapting LLMs to genomics presents significant challenges. Capturing complex genomic interactions requires modeling long-range dependencies within DNA sequences, where interactions often span over 10,000 base pairs, even within a single gene, posing substantial computational burdens under conventional model architectures and training paradigms. Moreover, standard LLM training approaches are suboptimal for DNA: autoregressive training, while efficient, supports only unidirectional understanding. However, DNA is inherently bidirectional, e.g., bidirectional promoters regulate transcription in both directions and account for nearly 11% of human gene expression. Masked language models (MLMs) allow bidirectional understanding but are inefficient, as only masked tokens contribute to the loss per step. To address these limitations, we introduce JanusDNA, the first bidirectional DNA foundation model built upon a novel pretraining paradigm that combines the optimization efficiency of autoregressive modeling with the bidirectional comprehension of masked modeling. JanusDNA adopts a hybrid Mamba, Attention and Mixture of Experts (MoE) architecture, combining long-range modeling of Attention with efficient sequential learning of Mamba. MoE layers further scale model capacity via sparse activation while keeping computational cost low. Notably, JanusDNA processes up to 1 million base pairs at single nucleotide resolution on a single 80GB GPU. Extensive experiments and ablations show JanusDNA achieves new SOTA results on three genomic representation benchmarks, outperforming models with 250x more activated parameters. Code: https://github.com/Qihao-Duan/JanusDNA
Submitted 2 June, 2025; v1 submitted 22 May, 2025;
originally announced May 2025.
-
Comprehensive Laboratory Benchmark of K-shell Dielectronic Satellites of Fe XXV-XXI Ions
Authors:
Chintan Shah,
Pedro Amaro,
Filipe Grilo,
Ming Feng Gu,
Liyi Gu,
José Paulo Santos,
F. Scott Porter,
Thomas Pfeifer,
Maurice A. Leutenegger,
José R. Crespo López-Urrutia
Abstract:
We report on comprehensive laboratory studies of the K-shell dielectronic recombination (DR) resonances of Fe XXV - XXI ions that prominently contribute to the hard X-ray spectrum of hot astrophysical plasmas. By scanning a monoenergetic electron beam to resonantly excite trapped Fe ions in an electron beam ion trap, and achieving a high electron-ion collision energy resolution of ~7 eV, we resolve their respective KL$n$ satellites up to n'=11. By normalization to known radiative recombination cross sections we also determine their excitation cross sections and that of the continuum with uncertainties below 15%, and verify our results with an independent normalization based on previous measurements. Our experimental data excellently confirm the accuracy and suitability of distorted-wave calculations obtained with the Flexible Atomic Code (FAC) for modeling astrophysical and fusion plasmas.
Submitted 20 May, 2025;
originally announced May 2025.
-
Ophora: A Large-Scale Data-Driven Text-Guided Ophthalmic Surgical Video Generation Model
Authors:
Wei Li,
Ming Hu,
Guoan Wang,
Lihao Liu,
Kaijin Zhou,
Junzhi Ning,
Xin Guo,
Zongyuan Ge,
Lixu Gu,
Junjun He
Abstract:
In ophthalmic surgery, developing an AI system capable of interpreting surgical videos and predicting subsequent operations requires numerous ophthalmic surgical videos with high-quality annotations, which are difficult to collect due to privacy concerns and labor consumption. Text-guided video generation (T2V) emerges as a promising solution to overcome this issue by generating ophthalmic surgical videos based on surgeon instructions. In this paper, we present Ophora, a pioneering model that can generate ophthalmic surgical videos following natural language instructions. To construct Ophora, we first propose a Comprehensive Data Curation pipeline to convert narrative ophthalmic surgical videos into a large-scale, high-quality dataset comprising over 160K video-instruction pairs, Ophora-160K. Then, we propose a Progressive Video-Instruction Tuning scheme to transfer rich spatial-temporal knowledge from a T2V model pre-trained on natural video-text datasets for privacy-preserved ophthalmic surgical video generation based on Ophora-160K. Experiments on video quality evaluation via quantitative analysis and ophthalmologist feedback demonstrate that Ophora can generate realistic and reliable ophthalmic surgical videos based on surgeon instructions. We also validate the capability of Ophora for empowering downstream tasks of ophthalmic surgical workflow understanding. Code is available at https://github.com/mar-cry/Ophora.
Submitted 26 June, 2025; v1 submitted 12 May, 2025;
originally announced May 2025.
-
Constraining gas motion and non-thermal pressure beyond the core of the Abell 2029 galaxy cluster with XRISM
Authors:
XRISM Collaboration,
Marc Audard,
Hisamitsu Awaki,
Ralf Ballhausen,
Aya Bamba,
Ehud Behar,
Rozenn Boissay-Malaquin,
Laura Brenneman,
Gregory Brown,
Lia Corrales,
Elisa Costantini,
Renata Cumbee,
Maria Diaz Trigo,
Chris Done,
Tadayasu Dotani,
Ken Ebisawa,
Megan Eckart,
Dominique Eckert,
Satoshi Eguchi,
Teruaki Enoto,
Yuichiro Ezoe,
Adam Foster,
Ryuichi Fujimoto,
Yutaka Fujita,
Yasushi Fukazawa
, et al. (115 additional authors not shown)
Abstract:
We report a detailed spectroscopic study of the gas dynamics and hydrostatic mass bias of the galaxy cluster Abell 2029, utilizing high-resolution observations from XRISM Resolve. Abell 2029, known for its cool core and relaxed X-ray morphology, provides an excellent opportunity to investigate the influence of gas motions beyond the central region. Expanding upon prior studies that revealed low turbulence and bulk motions within the core, our analysis covers regions out to the scale radius $R_{2500}$ (670~kpc) based on three radial pointings extending from the cluster center toward the northern side. We obtain accurate measurements of bulk and turbulent velocities along the line of sight. The results indicate that non-thermal pressure accounts for no more than 2% of the total pressure at all radii, with a gradual decrease outward. The observed radial trend differs from many numerical simulations, which often predict an increase in non-thermal pressure fraction at larger radii. These findings suggest that deviations from hydrostatic equilibrium are small, leading to a hydrostatic mass bias of around 2% across the observed area.
Submitted 10 May, 2025;
originally announced May 2025.
-
Optimal time-decay for Euler-Fourier system with damping in the critical $L^2$ framework
Authors:
Jing Liu,
Lianchao Gu
Abstract:
This paper is concerned with the large time behavior of solutions to the Euler-Fourier system with damping in $\mathbb{R}^{d}~(d\geq1)$. A time-weighted energy argument has been developed within the $L^2$ framework to derive the optimal time-decay rates, which enables us to remove the smallness of low-frequencies of initial data. A great part of our analysis relies on the study of a Lyapunov functional in the spirit of [13], which mainly depends on some elaborate use of non-classical Besov product estimates and interpolations. Exhibiting a damped mode with faster time decay than the whole solution also plays a key role.
Submitted 8 May, 2025;
originally announced May 2025.
-
Mapping Cassiopeia A's silicon/sulfur Doppler velocities with XRISM-Resolve
Authors:
Jacco Vink,
Manan Agarwal,
Aya Bamba,
Liyi Gu,
Paul Plucinsky,
Ehud Behar,
Lia Corrales,
Adam Foster,
Shin-ichiro Fujimoto,
Masahiro Ichihashi,
Kazuhiro Ichikawa,
Satoru Katsuda,
Hironori Matsumoto,
Kai Matsunaga,
Tsunefumi Mizuno,
Koji Mori,
Hiroshi Murakami,
Hiroshi Nakajima,
Toshiki Sato,
Makoto Sawada,
Haruto Sonoda,
Shunsuke Suzuki,
Dai Tateishi,
Yukikatsu Terada,
Hiroyuki Uchida
Abstract:
Young supernova remnants (SNRs) provide crucial insights into explosive nucleosynthesis products and their velocity distribution soon after the explosion. However, these velocities are influenced by the dynamics of the circumstellar medium (CSM), which originates from the progenitor's late-phase mass loss. Cas A, the youngest known Galactic core-collapse SNR, was studied to analyze the spatial distribution of Si and S radial velocities using two high-spectral resolution observations from the XRISM-Resolve imaging spectrometer. Resolve's capabilities enabled the detailed characterization of Si XIII, Si XIV, S XV, and S XVI lines, whose line shapes can be resolved and modeled using Gaussian radial-velocity components. The radial velocities measured generally align with previous CCD-based results, confirming that they were not artifacts caused by blended lines or ionization variations. Modeling line profiles with two-component Gaussians improved fits in some regions, revealing distinct redshifted (backside) and blueshifted (frontside) components only in a few specific areas. In most regions, however, both components were either both redshifted (northwest) or both blueshifted (southeast), consistent with the patchy ejecta shell morphology seen in optically emitting fast-moving knots. The individual line components revealed a line broadening ranging from $σ_v \approx 200$ to $σ_v \approx 2000$ km/s. Components with $1000 \lesssim σ_v \lesssim 2000$ km/s are consistent with previously determined reverse shock velocities, suggesting non-equilibrated or partially equilibrated ion temperatures. Narrow components with small radial velocities found near Cas A's projected center likely originate from shocked CSM plasma. However, the low radial velocity and small $σ_v$ defy identifying these components with either the frontside or backside of the SNR, or both.
Submitted 7 May, 2025;
originally announced May 2025.
-
XRISM forecast for the Coma cluster: stormy, with a steep power spectrum
Authors:
XRISM Collaboration,
Marc Audard,
Hisamitsu Awaki,
Ralf Ballhausen,
Aya Bamba,
Ehud Behar,
Rozenn Boissay-Malaquin,
Laura Brenneman,
Gregory V. Brown,
Lia Corrales,
Elisa Costantini,
Renata Cumbee,
Maria Diaz Trigo,
Chris Done,
Tadayasu Dotani,
Ken Ebisawa,
Megan E. Eckart,
Dominique Eckert,
Satoshi Eguchi,
Teruaki Enoto,
Yuichiro Ezoe,
Adam Foster,
Ryuichi Fujimoto,
Yutaka Fujita,
Yasushi Fukazawa
, et al. (120 additional authors not shown)
Abstract:
The XRISM Resolve microcalorimeter array measured the velocities of hot intracluster gas at two positions in the Coma galaxy cluster: 3'x3' squares at the center and at 6' (170 kpc) to the south. We find the line-of-sight velocity dispersions in those regions to be sigma_z=208+-12 km/s and 202+-24 km/s, respectively. The central value corresponds to a 3D Mach number of M=0.24+-0.015 and the ratio of the kinetic pressure of small-scale motions to thermal pressure in the intracluster plasma of only 3.1+-0.4%, at the lower end of predictions from cosmological simulations for merging clusters like Coma, and similar to that observed in the cool core of the relaxed cluster A2029. Meanwhile, the gas in both regions exhibits high line-of-sight velocity differences from the mean velocity of the cluster galaxies, Delta v_z=450+-15 km/s and 730+-30 km/s, respectively. A small contribution from an additional gas velocity component, consistent with the cluster optical mean, is detected along a sightline near the cluster center. The combination of the observed velocity dispersions and bulk velocities is not described by a Kolmogorov velocity power spectrum of steady-state turbulence; instead, the data imply a much steeper effective slope (i.e., relatively more power at larger linear scales). This may indicate either a very large dissipation scale resulting in the suppression of small-scale motions, or a transient dynamic state of the cluster, where large-scale gas flows generated by an ongoing merger have not yet cascaded down to small scales.
Submitted 29 April, 2025;
originally announced April 2025.
-
First study of neutrino angle reconstruction using quasielastic-like interactions in MicroBooNE
Authors:
MicroBooNE collaboration,
P. Abratenko,
D. Andrade Aldana,
L. Arellano,
J. Asaadi,
A. Ashkenazi,
S. Balasubramanian,
B. Baller,
A. Barnard,
G. Barr,
D. Barrow,
J. Barrow,
V. Basque,
J. Bateman,
O. Benevides Rodrigues,
S. Berkman,
A. Bhat,
M. Bhattacharya,
M. Bishai,
A. Blake,
B. Bogart,
T. Bolton,
M. B. Brunetti,
L. Camilleri,
D. Caratelli
, et al. (150 additional authors not shown)
Abstract:
We investigate the expected precision of the reconstructed neutrino direction using a $ν_μ$-argon quasielastic-like event topology with one muon and one proton in the final state and the reconstruction capabilities of the MicroBooNE liquid argon time projection chamber. This direction is of importance in the context of DUNE sub-GeV atmospheric oscillation studies. MicroBooNE allows for a data-driven quantification of this resolution by investigating the deviation of the reconstructed muon-proton system orientation with respect to the well-known direction of neutrinos originating from the Booster Neutrino Beam with an exposure of $1.3\times10^{21}$ protons on target. Using simulation studies, we derive the expected sub-GeV DUNE atmospheric-neutrino reconstructed simulated spectrum by developing a reweighting scheme as a function of the true neutrino energy. We further report flux-integrated single- and double-differential cross section measurements of charged-current $ν_μ$ quasielastic-like scattering on argon as a function of the muon-proton system angle using the full MicroBooNE data sets. We also demonstrate the sensitivity of these results to nuclear effects and final state hadronic reinteraction modeling.
Submitted 30 May, 2025; v1 submitted 24 April, 2025;
originally announced April 2025.
-
Fluctuated lattice-driven charge density wave far above the condensation temperature in kagome superconductor KV$_3$Sb$_5$
Authors:
Haoran Liu,
Shaofeng Duan,
Xiangqi Liu,
Zhihua Liu,
Shichong Wang,
Lingxiao Gu,
Jiongyu Huang,
Wenxuan Yang,
Jianzhe Liu,
Dong Qian,
Yanfeng Guo,
Wentao Zhang
Abstract:
The kagome material AV$_3$Sb$_5$ exhibits multiple exotic orders, including an unconventional charge density wave (CDW). Elucidating the underlying mechanism behind the CDW transition is crucial for unraveling the complex interactions among these phases. However, the driving force of the CDW remains a topic of debate due to the intertwined interactions among the system's various excitations. Here we investigated the CDW transition in KV$_3$Sb$_5$ by isolating the ultrafast electronic phase transition using time- and angle-resolved photoemission spectroscopy. An ultrafast electronic phase transition was observed at a critical photoexcitation fluence, F$_c$, without reduction in CDW lattice-distortion-induced band folding. This folded band persisted up to 150 K under equilibrium heating, well above the CDW condensation temperature of T$_c$ = 78 K. Notably, the pump-induced band shifts at F$_c$ were comparable to those caused by thermal effects at T$_c$. These findings suggest that in KV$_3$Sb$_5$, a fluctuating lattice-driven in-plane CDW emerges above 150 K, with out-of-plane electronic correlations leading to the $2\times2 \times 2$ CDW near T$_c$, offering key insights into the interplay between the electronic and structural dynamics in AV$_3$Sb$_5$.
Submitted 25 April, 2025; v1 submitted 23 April, 2025;
originally announced April 2025.
-
WT-BCP: Wavelet Transform based Bidirectional Copy-Paste for Semi-Supervised Medical Image Segmentation
Authors:
Mingya Zhang,
Liang Wang,
Limei Gu,
Tingsheng Ling,
Xianping Tao
Abstract:
Semi-supervised medical image segmentation (SSMIS) shows promise in reducing reliance on scarce labeled medical data. However, the SSMIS field confronts challenges such as distribution mismatches between labeled and unlabeled data, artificial perturbations causing training biases, and inadequate use of raw image information, especially low-frequency (LF) and high-frequency (HF) components. To address these challenges, we propose a Wavelet Transform based Bidirectional Copy-Paste SSMIS framework, named WT-BCP, which improves upon the Mean Teacher approach. Our method enhances unlabeled data understanding by copying random crops between labeled and unlabeled images and employs WT to extract LF and HF details. We propose a multi-input and multi-output model, named XNet-Plus, to receive the fused information after WT. Moreover, consistency training among multiple outputs helps to mitigate learning biases introduced by artificial perturbations. During consistency training, the mixed images resulting from WT are fed into both models, with the student model's output being supervised by pseudo-labels and ground-truth. Extensive experiments conducted on 2D and 3D datasets confirm the effectiveness of our model. Code: https://github.com/simzhangbest/WT-BCP.
Submitted 19 April, 2025;
originally announced April 2025.
-
Charge transfer induced insulating state at antiperovskite/perovskite heterointerfaces
Authors:
Ting Cui,
Ying Zhou,
Qianying Wang,
Dongke Rong,
Haitao Hong,
Axin Xie,
Jun-Jie Zhang,
Qinghua Zhang,
Can Wang,
Chen Ge,
Lin Gu,
Shanmin Wang,
Kuijuan Jin,
Shuai Dong,
Er-Jia Guo
Abstract:
Heterointerfaces have been pivotal in unveiling extraordinary interfacial properties and enabling multifunctional material platforms. Despite extensive research on all-oxide interfaces, heterointerfaces between different material classes, such as oxides and nitrides, remain underexplored. Here we present the fabrication of a high-quality Dirac metal antiperovskite Ni3InN, characterized by an extremely low temperature coefficient of resistivity, approximately 1.8*10^-8 Ω*cm/K, over a broad temperature range. Atomically sharp heterointerfaces between Ni3InN and SrVO3 were constructed, revealing intriguing interfacial phenomena. Leveraging layer-resolved scanning transmission electron microscopy and electron energy loss spectroscopy, we identified pronounced charge transfer across the well-ordered interface. Remarkably, this interfacial electron transfer from Ni3InN to SrVO3 induces an insulating interfacial layer and an emergent magnetic moment within the Ni3InN layer, consistent with first-principles calculations. These findings pave the way for novel electronic and spintronic applications by enabling tunable interfacial properties in nitride/oxide systems.
Submitted 17 April, 2025;
originally announced April 2025.
-
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models
Authors:
Jinguo Zhu,
Weiyun Wang,
Zhe Chen,
Zhaoyang Liu,
Shenglong Ye,
Lixin Gu,
Hao Tian,
Yuchen Duan,
Weijie Su,
Jie Shao,
Zhangwei Gao,
Erfei Cui,
Xuehui Wang,
Yue Cao,
Yangzhou Liu,
Xingguang Wei,
Hongjie Zhang,
Haomin Wang,
Weiye Xu,
Hao Li,
Jiahao Wang,
Nianchen Deng,
Songze Li,
Yinan He,
Tan Jiang
, et al. (26 additional authors not shown)
Abstract:
We introduce InternVL3, a significant advancement in the InternVL series featuring a native multimodal pre-training paradigm. Rather than adapting a text-only large language model (LLM) into a multimodal large language model (MLLM) that supports visual inputs, InternVL3 jointly acquires multimodal and linguistic capabilities from both diverse multimodal data and pure-text corpora during a single pre-training stage. This unified training paradigm effectively addresses the complexities and alignment challenges commonly encountered in conventional post-hoc training pipelines for MLLMs. To further improve performance and scalability, InternVL3 incorporates variable visual position encoding (V2PE) to support extended multimodal contexts, employs advanced post-training techniques such as supervised fine-tuning (SFT) and mixed preference optimization (MPO), and adopts test-time scaling strategies alongside an optimized training infrastructure. Extensive empirical evaluations demonstrate that InternVL3 delivers superior performance across a wide range of multi-modal tasks. In particular, InternVL3-78B achieves a score of 72.2 on the MMMU benchmark, setting a new state-of-the-art among open-source MLLMs. Its capabilities remain highly competitive with leading proprietary models, including ChatGPT-4o, Claude 3.5 Sonnet, and Gemini 2.5 Pro, while also maintaining strong pure-language proficiency. In pursuit of open-science principles, we will publicly release both the training data and model weights to foster further research and development in next-generation MLLMs.
Submitted 18 April, 2025; v1 submitted 14 April, 2025;
originally announced April 2025.
-
Spectroscopic diagnostics of high-temperature plasma in stellar corona using Fe XXIV--XXVI K-shell lines with XRISM
Authors:
Miki Kurihara,
Masahiro Tsujimoto,
Marc Audard,
Ehud Behar,
Liyi Gu,
Kenji Hamaguchi,
Natalie Hell,
Caroline A. Kilbourne,
Yoshitomo Maeda,
Frederick S. Porter,
Haruka Sugai,
Yohko Tsuboi
Abstract:
The RS CVn type binary star GT Mus was observed during its quiescence using the Resolve X-ray microcalorimeter spectrometer onboard XRISM. The main and satellite lines of the Fe XXIV--XXVI K-shell transitions were resolved for the first time from stellar sources. We conducted line ratio analysis to investigate any deviations from collisional ionization equilibrium (CIE) and Maxwell electron energy distribution with a single temperature. By using five combinations of direct excitation lines and dielectronic recombination satellite lines in three line complexes (Fe He$α$, Ly$α$, and He$β$), we found that the plasma is well characterized by two-temperature thermal plasmas with temperatures of 1.7 and 4.3 keV, which is consistent with a thermal broadening of Fe XXV and the broadband fitting results in the 1.7--10 keV band. Other forms of deviation from a single-temperature plasma, such as different ionization and electron temperatures or the $κ$ distribution for the electron energy distributions, are not favored, which is reasonable for stellar coronae at quiescence. This study demonstrates the utility of the Fe K-shell line ratio diagnostics to probe plasma conditions using X-ray microcalorimeters.
Submitted 9 April, 2025;
originally announced April 2025.
-
Investigating electron conductivity regimes in the bacterial cytochrome wire OmcS
Authors:
L. N. Mohanam,
R. Umeda,
L. Gu,
Y. Song,
D. J. Tobias,
A. I. Hochbaum,
R. Wu,
S. Sharifzadeh
Abstract:
The anaerobic bacterium \textit{Geobacter sulfurreducens} produces extracellular, electronically conductive cytochrome polymer wires that are conductive over micron length scales. Structure models from cryo-electron microscopy data show OmcS wires form a linear chain of hemes along the protein wire axis, which is proposed as the structural basis supporting their electronic properties. The geometric arrangement of heme along OmcS wires is conserved in many multiheme c-type cytochromes and other recently discovered microbial cytochrome wires. However, the mechanism by which this arrangement of heme molecules supports electron transport through proteins and supramolecular heme wires is unclear. Here, we investigate the site energies, inter-heme coupling, and long-range electronic conductivity within OmcS. We introduce an approach to extract charge carrier site information directly from Kohn-Sham density functional theory, without employing projector schemes. We show that site and coupling energies are highly sensitive to changes in inter-heme geometry and the surrounding electrostatic environment, as intuitively expected. These parameters serve as inputs for a quantum charge carrier model that includes decoherence corrections with which we predict a diffusion coefficient comparable with other organic-based electronic materials. Based on these simulations, we propose that dynamic disorder, particularly due to perturbative inter-heme vibrations, allows the carrier to overcome trapping due to the presence of static disorder \textit{via} small frequency-dependent fluctuations. These studies provide insights into molecular and electronic determinants of long-range electronic conductivity in microbial cytochrome wires and highlight design principles for bioinspired, heme-based conductive materials.
Submitted 7 April, 2025;
originally announced April 2025.
-
Measuring the asymmetric expansion of the Fe ejecta of Cassiopeia A with XRISM/Resolve
Authors:
Aya Bamba,
Manan Agarwal,
Jacco Vink,
Paul Plucinsky,
Yukikatsu Terada,
Ehud Behar,
Satoru Katsuda,
Koji Mori,
Makoto Sawada,
Hironori Matsumoto,
Lia Corrales,
Adam Foster,
Shin-ichiro Fujimoto,
Liyi Gu,
Kazuhiro Ichikawa,
Kai Matsunaga,
Tsunefumi Mizuno,
Hiroshi Murakami,
Hiroshi Nakajima,
Toshiki Sato,
Haruto Sonoda,
Shunsuke Suzuki,
Dai Tateishi,
Hiroyuki Uchida,
Masahiro Ichihashi
, et al. (2 additional authors not shown)
Abstract:
The expansion structure of supernova remnants (SNRs) is important for understanding not only how heavy elements are distributed into space, but also how supernovae explode. The ejecta expansion structure of the young core-collapse SNR Cas A is investigated, with Doppler parameter mapping of the Fe-K complex by the Resolve microcalorimeter onboard the X-ray Imaging and Spectroscopy Mission, XRISM. It is found that the Fe ejecta are blueshifted in the southeast (SE) and redshifted in the northwest (NW), indicating an incomplete shell structure, similar to the intermediate mass elements (IMEs), such as Si and S. The Fe has a velocity shift of $\sim1400$ km~s$^{-1}$ in the NW and $\sim2160$ km~s$^{-1}$ in the SE region, with an error range of a few hundred km~s$^{-1}$. These values are consistent with those for the IMEs in the NW region, but larger than those for the IMEs in the SE region, although the large error region prevented us from concluding which component has significantly higher velocity. The line broadening is larger in the center with values of $\sim$2000--3000~km~s$^{-1}$, and smaller near the edges of the remnant. The radial profiles of the Doppler shift and broadening of the IMEs and Fe indicate that the Fe ejecta may expand asymmetrically as IME ejecta, although the large error regions do not allow us to conclude it. Moreover, we see little bulk Doppler broadening of the Fe lines in the northeastern jet region whereas the IME lines exhibit significant broadening. No such narrow lines are detected in the NW region. These findings suggest an asymmetric expansion of the ejecta potentially driven by large-scale asymmetries originating from the supernova explosion. This interpretation aligns with the large-scale asymmetries predicted by models of neutrino-driven supernova explosions.
Submitted 9 April, 2025; v1 submitted 4 April, 2025;
originally announced April 2025.
-
Evidence for Charge Exchange Emission in Supernova Remnant N132D from XRISM/Resolve Observations
Authors:
Liyi Gu,
Hiroya Yamaguchi,
Adam Foster,
Satoru Katsuda,
Hiroyuki Uchida,
Makoto Sawada,
Frederick Scott Porter,
Brian J. Williams,
Robert Petre,
Aya Bamba,
Yukikatsu Terada,
Manan Agarwal,
Anne Decourchelle,
Matteo Guainazzi,
Richard Kelley,
Caroline Kilbourne,
Michael Loewenstein,
Hironori Matsumoto,
Eric D. Miller,
Yuken Ohshiro,
Paul Plucinsky,
Hiromasa Suzuki,
Makoto Tashiro,
Jacco Vink,
Yuichiro Ezoe
, et al. (2 additional authors not shown)
Abstract:
XRISM has delivered one of its first light observations on N132D, the X-ray brightest supernova remnant in the Large Magellanic Cloud. Utilizing 193 ks of high-resolution X-ray spectroscopy data, we conduct a comprehensive search for charge exchange emission. By incorporating a charge exchange model into our spectral analysis, we observe an improvement in the fits of two weak features at 2.41 keV and 2.63 keV. These features, with a combined significance of 99.6%, are consistent with transitions from highly ionized silicon ions in high Rydberg states, which are unique indicators of charge exchange. Our analysis constrains the charge exchange flux to no more than 4% of the total source flux within the 1.7-3.0 keV band, and places an upper limit on the charge exchange interaction velocity at 450 km/s. This result supports ongoing shock-cloud interactions within N132D and highlights the unique capabilities of XRISM to probe the complex physical processes at play.
Submitted 4 April, 2025;
originally announced April 2025.
-
Extended Hybridization Expansion Solver for Impurity Models with Retarded Interactions
Authors:
Lei Gu,
Jia Luo,
Ruqian Wu,
Guoping Zhao
Abstract:
We extend the continuous-time hybridization expansion solver to a general form, where the hybridization function and retarded interaction are treated on equal footing. Correlation functions can now be directly obtained via functional derivatives with respect to the bosonic propagators, similar to the measurement of Green's functions. We devise a combinatorial scheme of measuring the correlation function, whose efficiency partially emulates that of the Green's function measurement. The algorithm and numerical methods are validated through application to an impurity model involving both electron-phonon coupling and exchange interactions, a case where the previous hybridization expansion algorithm is not applicable. Our improvement of the hybridization expansion solver promotes its applicability in studies of electron-phonon coupling, the extended dynamical mean field theory, and the dual boson method.
Submitted 3 April, 2025;
originally announced April 2025.
-
Detection of Disease on Nasal Breath Sound by New Lightweight Architecture: Using COVID-19 as An Example
Authors:
Jiayuan She,
Lin Shi,
Peiqi Li,
Ziling Dong,
Renxing Li,
Shengkai Li,
Liping Gu,
Zhao Tong,
Zhuochang Yang,
Yajie Ji,
Liang Feng,
Jiangang Chen
Abstract:
Background. Infectious diseases, particularly COVID-19, continue to be a significant global health issue. Although many countries have reduced or stopped large-scale testing measures, the detection of such diseases remains a priority. Objective. This study aims to develop a novel, lightweight deep neural network for efficient, accurate, and cost-effective detection of COVID-19 using nasal breathing audio data collected via smartphones. Methodology. Nasal breathing audio from 128 patients diagnosed with the Omicron variant was collected. Mel-Frequency Cepstral Coefficients (MFCCs), a widely used feature in speech and sound analysis, were employed for extracting important characteristics from the audio signals. Additional feature selection was performed using Random Forest (RF) and Principal Component Analysis (PCA) for dimensionality reduction. A Dense-ReLU-Dropout model was trained with K-fold cross-validation (K=3), and performance metrics such as accuracy, precision, recall, and F1-score were used to evaluate the model. Results. The proposed model achieved 97% accuracy in detecting COVID-19 from nasal breathing sounds, outperforming state-of-the-art methods such as those by [23] and [13]. Our Dense-ReLU-Dropout model, using RF and PCA for feature selection, achieves high accuracy with greater computational efficiency compared to existing methods that require more complex models or larger datasets. Conclusion. The findings suggest that the proposed method holds significant potential for clinical implementation, advancing smartphone-based diagnostics in infectious diseases. The Dense-ReLU-Dropout model, combined with innovative feature processing techniques, offers a promising approach for efficient and accurate COVID-19 detection, showcasing the capabilities of mobile device-based diagnostics.
Submitted 19 April, 2025; v1 submitted 1 April, 2025;
originally announced April 2025.
-
European Contributions to Fermilab Accelerator Upgrades and Facilities for the DUNE Experiment
Authors:
DUNE Collaboration,
A. Abed Abud,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
C. Adriano,
A. Aduszkiewicz,
J. Aguilar,
F. Akbar,
F. Alemanno,
N. S. Alex,
K. Allison,
M. Alrashed,
A. Alton,
R. Alvarez,
T. Alves,
A. Aman,
H. Amar,
P. Amedo,
J. Anderson,
D. A. Andrade
, et al. (1322 additional authors not shown)
Abstract:
The Proton Improvement Plan (PIP-II) to the FNAL accelerator chain and the Long-Baseline Neutrino Facility (LBNF) will provide the world's most intense neutrino beam to the Deep Underground Neutrino Experiment (DUNE) enabling a wide-ranging physics program. This document outlines the significant contributions made by European national laboratories and institutes towards realizing the first phase of the project with a 1.2 MW neutrino beam. Construction of this first phase is well underway. For DUNE Phase II, this will be closely followed by an upgrade of the beam power to > 2 MW, for which the European groups again have a key role and which will require the continued support of the European community for machine aspects of neutrino physics. Beyond the neutrino beam aspects, LBNF is also responsible for providing unique infrastructure to install and operate the DUNE neutrino detectors at FNAL and at the Sanford Underground Research Facility (SURF). The cryostats for the first two Liquid Argon Time Projection Chamber detector modules at SURF, a contribution of CERN to LBNF, are central to the success of the ongoing execution of DUNE Phase I. Likewise, successful and timely procurement of cryostats for two additional detector modules at SURF will be critical to the success of DUNE Phase II and the overall physics program. The DUNE Collaboration is submitting four main contributions to the 2026 Update of the European Strategy for Particle Physics process. This paper is being submitted to the 'Accelerator technologies' and 'Projects and Large Experiments' streams. Additional inputs related to the DUNE science program, DUNE detector technologies and R&D, and DUNE software and computing, are also being submitted to other streams.
Submitted 31 March, 2025;
originally announced March 2025.
-
DUNE Software and Computing Research and Development
Authors:
DUNE Collaboration,
A. Abed Abud,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
C. Adriano,
A. Aduszkiewicz,
J. Aguilar,
F. Akbar,
F. Alemanno,
N. S. Alex,
K. Allison,
M. Alrashed,
A. Alton,
R. Alvarez,
T. Alves,
A. Aman,
H. Amar,
P. Amedo,
J. Anderson,
D. A. Andrade
, et al. (1322 additional authors not shown)
Abstract:
The international collaboration designing and constructing the Deep Underground Neutrino Experiment (DUNE) at the Long-Baseline Neutrino Facility (LBNF) has developed a two-phase strategy toward the implementation of this leading-edge, large-scale science project. The ambitious physics program of Phase I and Phase II of DUNE is dependent upon deployment and utilization of significant computing resources, and successful research and development of software (both infrastructure and algorithmic) in order to achieve these scientific goals. This submission discusses the computing resources projections, infrastructure support, and software development needed for DUNE during the coming decades as an input to the European Strategy for Particle Physics Update for 2026. The DUNE collaboration is submitting four main contributions to the 2026 Update of the European Strategy for Particle Physics process. This submission to the 'Computing' stream focuses on DUNE software and computing. Additional inputs related to the DUNE science program, DUNE detector technologies and R&D, and European contributions to Fermilab accelerator upgrades and facilities for the DUNE experiment, are also being submitted to other streams.
Submitted 31 March, 2025;
originally announced March 2025.
-
First measurement of $ν_e$ and $\barν_e$ charged current single charged pion production differential cross sections on argon using the MicroBooNE detector
Authors:
MicroBooNE collaboration,
P. Abratenko,
D. Andrade Aldana,
L. Arellano,
J. Asaadi,
A. Ashkenazi,
S. Balasubramanian,
B. Baller,
A. Barnard,
G. Barr,
D. Barrow,
J. Barrow,
V. Basque,
J. Bateman,
O. Benevides Rodrigues,
S. Berkman,
A. Bhat,
M. Bhattacharya,
M. Bishai,
A. Blake,
B. Bogart,
T. Bolton,
M. B. Brunetti,
L. Camilleri,
D. Caratelli
, et al. (150 additional authors not shown)
Abstract:
Understanding electron neutrino interactions is crucial for measurements of neutrino oscillations and searches for new physics in neutrino experiments. We present the first measurement of the flux-averaged $ν_e$ and $\barν_e$ charged current single charged pion production cross section on argon using the MicroBooNE detector and data from the NuMI neutrino beam. The total cross section is measured to be (0.93 $\pm$ 0.13 (stat.) $\pm$ 0.27 (syst.)) $\times 10^{-39}$ cm$^2$/nucleon at a mean $ν_e$ and $\barν_e$ energy of 730 MeV. Differential cross sections are also reported in electron energy, electron and pion angles, and electron-pion opening angle.
Submitted 30 March, 2025;
originally announced March 2025.
-
The DUNE Phase II Detectors
Authors:
DUNE Collaboration,
A. Abed Abud,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
C. Adriano,
A. Aduszkiewicz,
J. Aguilar,
F. Akbar,
F. Alemanno,
N. S. Alex,
K. Allison,
M. Alrashed,
A. Alton,
R. Alvarez,
T. Alves,
A. Aman,
H. Amar,
P. Amedo,
J. Anderson,
D. A. Andrade
, et al. (1322 additional authors not shown)
Abstract:
The international collaboration designing and constructing the Deep Underground Neutrino Experiment (DUNE) at the Long-Baseline Neutrino Facility (LBNF) has developed a two-phase strategy for the implementation of this leading-edge, large-scale science project. The 2023 report of the US Particle Physics Project Prioritization Panel (P5) reaffirmed this vision and strongly endorsed DUNE Phase I and Phase II, as did the previous European Strategy for Particle Physics. The construction of DUNE Phase I is well underway. DUNE Phase II consists of a third and fourth far detector module, an upgraded near detector complex, and an enhanced > 2 MW beam. The fourth FD module is conceived as a 'Module of Opportunity', aimed at supporting the core DUNE science program while also expanding the physics opportunities with more advanced technologies. The DUNE collaboration is submitting four main contributions to the 2026 Update of the European Strategy for Particle Physics process. This submission to the 'Detector instrumentation' stream focuses on technologies and R&D for the DUNE Phase II detectors. Additional inputs related to the DUNE science program, DUNE software and computing, and European contributions to Fermilab accelerator upgrades and facilities for the DUNE experiment, are also being submitted to other streams.
Submitted 29 March, 2025;
originally announced March 2025.
-
The DUNE Science Program
Authors:
DUNE Collaboration,
A. Abed Abud,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
C. Adriano,
A. Aduszkiewicz,
J. Aguilar,
F. Akbar,
F. Alemanno,
N. S. Alex,
K. Allison,
M. Alrashed,
A. Alton,
R. Alvarez,
T. Alves,
A. Aman,
H. Amar,
P. Amedo,
J. Anderson,
D. A. Andrade
, et al. (1322 additional authors not shown)
Abstract:
The international collaboration designing and constructing the Deep Underground Neutrino Experiment (DUNE) at the Long-Baseline Neutrino Facility (LBNF) has developed a two-phase strategy for the implementation of this leading-edge, large-scale science project. The 2023 report of the US Particle Physics Project Prioritization Panel (P5) reaffirmed this vision and strongly endorsed DUNE Phase I and Phase II, as did the previous European Strategy for Particle Physics. The construction of DUNE Phase I is well underway. DUNE Phase II consists of a third and fourth far detector module, an upgraded near detector complex, and an enhanced > 2 MW beam. The fourth FD module is conceived as a 'Module of Opportunity', aimed at supporting the core DUNE science program while also expanding the physics opportunities with more advanced technologies. The DUNE collaboration is submitting four main contributions to the 2026 Update of the European Strategy for Particle Physics process. This submission to the 'Neutrinos and cosmic messengers', 'BSM physics' and 'Dark matter and dark sector' streams focuses on the physics program of DUNE. Additional inputs related to DUNE detector technologies and R&D, DUNE software and computing, and European contributions to Fermilab accelerator upgrades and facilities for the DUNE experiment, are also being submitted to other streams.
Submitted 29 March, 2025;
originally announced March 2025.
-
Reflex: Speeding Up SMPC Query Execution through Efficient and Flexible Intermediate Result Size Trimming
Authors:
Long Gu,
Shaza Zeitouni,
Carsten Binnig,
Zsolt István
Abstract:
There is growing interest in Secure Analytics, but fully oblivious query execution in Secure Multi-Party Computation (MPC) settings is often prohibitively expensive. Recent related works propose different approaches to trimming the size of intermediate results between query operators, resulting in significant speedups at the cost of some information leakage. In this work, we generalize these ideas into a method of flexible and efficient trimming of operator outputs that can be added to MPC operators easily. This allows for precisely controlling the security/performance trade-off on a per-operator and per-query basis. We demonstrate that our work is practical by porting a state-of-the-art trimming approach to it, resulting in a faster runtime and increased security. Our work lays down the foundation for a future MPC query planner that can pick different performance and security targets when composing physical query plans.
Submitted 26 March, 2025;
originally announced March 2025.
-
Frequency Dynamic Convolution for Dense Image Prediction
Authors:
Linwei Chen,
Lin Gu,
Liang Li,
Chenggang Yan,
Ying Fu
Abstract:
While Dynamic Convolution (DY-Conv) has shown promising performance by enabling adaptive weight selection through multiple parallel weights combined with an attention mechanism, the frequency response of these weights tends to exhibit high similarity, resulting in high parameter costs but limited adaptability. In this work, we introduce Frequency Dynamic Convolution (FDConv), a novel approach that mitigates these limitations by learning a fixed parameter budget in the Fourier domain. FDConv divides this budget into frequency-based groups with disjoint Fourier indices, enabling the construction of frequency-diverse weights without increasing the parameter cost. To further enhance adaptability, we propose Kernel Spatial Modulation (KSM) and Frequency Band Modulation (FBM). KSM dynamically adjusts the frequency response of each filter at the spatial level, while FBM decomposes weights into distinct frequency bands in the frequency domain and modulates them dynamically based on local content. Extensive experiments on object detection, segmentation, and classification validate the effectiveness of FDConv. We demonstrate that when applied to ResNet-50, FDConv achieves superior performance with a modest increase of +3.6M parameters, outperforming previous methods that require substantial increases in parameter budgets (e.g., CondConv +90M, KW +76.5M). Moreover, FDConv seamlessly integrates into a variety of architectures, including ConvNeXt and Swin-Transformer, offering a flexible and efficient solution for modern vision tasks. The code is made publicly available at https://github.com/Linwei-Chen/FDConv.
Submitted 24 March, 2025; v1 submitted 24 March, 2025;
originally announced March 2025.
-
Computation-Efficient and Recognition-Friendly 3D Point Cloud Privacy Protection
Authors:
Haotian Ma,
Lin Gu,
Siyi Wu,
Yingying Zhu
Abstract:
3D point clouds have been widely used in applications such as self-driving cars, robotics, and CAD models. To the best of our knowledge, these applications raise the issue of privacy leakage in 3D point clouds, which has not been studied well. Unlike 2D image privacy, which is related to texture and 2D geometric structure, the 3D point cloud is texture-less and only relevant to 3D geometric structure. In this work, we defined the 3D point cloud privacy problem and proposed an efficient privacy-preserving framework named PointFlowGMM that can support downstream classification and segmentation tasks without seeing the original data. Using a flow-based generative model, the point cloud is projected into a latent Gaussian mixture distributed subspace. We further designed a novel angular similarity loss to obfuscate the original geometric structure and reduce the model size from 767MB to 120MB without a decrease in recognition performance. The projected point cloud in the latent space is orthogonally rotated randomly to further protect the original geometric structure. The class-to-class relationship is preserved after rotation; thus, the protected point cloud can support the recognition task. We evaluated our model on multiple datasets and achieved comparable recognition results on encrypted point clouds compared to the original point clouds.
Submitted 23 March, 2025; v1 submitted 19 March, 2025;
originally announced March 2025.
-
First Measurement of Charged Current Muon Neutrino-Induced $K^+$ Production on Argon using the MicroBooNE Detector
Authors:
MicroBooNE collaboration,
P. Abratenko,
D. Andrade Aldana,
L. Arellano,
J. Asaadi,
A. Ashkenazi,
S. Balasubramanian,
B. Baller,
A. Barnard,
G. Barr,
D. Barrow,
J. Barrow,
V. Basque,
J. Bateman,
O. Benevides Rodrigues,
S. Berkman,
A. Bhat,
M. Bhattacharya,
M. Bishai,
A. Blake,
B. Bogart,
T. Bolton,
M. B. Brunetti,
L. Camilleri,
D. Caratelli
, et al. (156 additional authors not shown)
Abstract:
The MicroBooNE experiment is an 85 tonne active mass liquid argon time projection chamber neutrino detector exposed to the on-axis Booster Neutrino Beam (BNB) at Fermilab. One of MicroBooNE's physics goals is the precise measurement of neutrino interactions on argon in the 1 GeV energy regime. Building on the capabilities of the MicroBooNE detector, this analysis identifies $K^{+}$ mesons, a key signature for the study of strange particle production in neutrino interactions. This measurement is furthermore valuable for background estimation for future nucleon decay searches and for improved reconstruction and particle identification capabilities in experiments such as the Deep Underground Neutrino Experiment (DUNE). In this letter, we present the first-ever measurement of a flux-integrated cross section for charged-current muon neutrino induced $K^{+}$ production on argon nuclei, determined to be 7.93 $\pm$ 3.27 (stat.) $\pm$ 2.92 (syst.) $\times~10^{-42}\;$ cm$^2$/nucleon based on an analysis of 6.88$\times10^{20}$ protons on target.
Submitted 4 March, 2025; v1 submitted 28 February, 2025;
originally announced March 2025.
-
InsTaG: Learning Personalized 3D Talking Head from Few-Second Video
Authors:
Jiahe Li,
Jiawei Zhang,
Xiao Bai,
Jin Zheng,
Jun Zhou,
Lin Gu
Abstract:
Despite exhibiting impressive performance in synthesizing lifelike personalized 3D talking heads, prevailing methods based on radiance fields suffer from high demands for training data and time for each new identity. This paper introduces InsTaG, a 3D talking head synthesis framework that allows fast learning of a realistic personalized 3D talking head from little training data. Built upon a lightweight 3DGS person-specific synthesizer with universal motion priors, InsTaG achieves high-quality and fast adaptation while preserving high-level personalization and efficiency. As preparation, we first propose an Identity-Free Pre-training strategy that enables the pre-training of the person-specific model and encourages the collection of universal motion priors from a long-video data corpus. To fully exploit the universal motion priors to learn an unseen new identity, we then present a Motion-Aligned Adaptation strategy to adaptively align the target head to the pre-trained field, and constrain a robust dynamic head structure with little training data. Experiments demonstrate our outstanding performance and efficiency under various data scenarios to render high-quality personalized talking heads.
Submitted 27 February, 2025;
originally announced February 2025.
-
Separated Contrastive Learning for Matching in Cross-domain Recommendation with Curriculum Scheduling
Authors:
Heng Chang,
Liang Gu,
Cheng Hu,
Zhinan Zhang,
Hong Zhu,
Yuhui Xu,
Yuan Fang,
Zhen Chen
Abstract:
Cross-domain recommendation (CDR) is a task that aims to improve the recommendation performance in a target domain by leveraging the information from source domains. Contrastive learning methods have been widely adopted among intra-domain (intra-CL) and inter-domain (inter-CL) users/items for their representation learning and knowledge transfer during the matching stage of CDR. However, we observe that directly employing contrastive learning on mixed-up intra-CL and inter-CL tasks ignores the difficulty of learning from inter-domain over learning from intra-domain, and thus could cause severe training instability. This instability in turn deteriorates the representation learning process and hurts the quality of generated embeddings. To this end, we propose a novel framework named SCCDR built upon a separated intra-CL and inter-CL paradigm and a stop-gradient operation to address this drawback. Specifically, SCCDR comprises two specialized curriculum stages: intra-inter separation and inter-domain curriculum scheduling. The former stage explicitly uses two distinct contrastive views for the intra-CL task in the source and target domains, respectively. Meanwhile, the latter stage deliberately tackles the inter-CL tasks with a curriculum scheduling strategy that derives effective curricula by accounting for the difficulty of negative samples anchored by overlapping users. Empirical experiments on various open-source datasets and an offline proprietary industrial dataset extracted from a real-world recommender system, together with an online A/B test, verify that SCCDR achieves state-of-the-art performance over multiple baselines.
Submitted 22 February, 2025;
originally announced February 2025.
-
Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction
Authors:
Ailin Huang,
Boyong Wu,
Bruce Wang,
Chao Yan,
Chen Hu,
Chengli Feng,
Fei Tian,
Feiyu Shen,
Jingbei Li,
Mingrui Chen,
Peng Liu,
Ruihang Miao,
Wang You,
Xi Chen,
Xuerui Yang,
Yechang Huang,
Yuxiang Zhang,
Zheng Gong,
Zixin Zhang,
Hongyu Zhou,
Jianjian Sun,
Brian Li,
Chengting Feng,
Changyi Wan,
Hanpeng Hu
, et al. (120 additional authors not shown)
Abstract:
Real-time speech interaction, serving as a fundamental interface for human-machine collaboration, holds immense potential. However, current open-source models face limitations such as high costs in voice data collection, weakness in dynamic control, and limited intelligence. To address these challenges, this paper introduces Step-Audio, the first production-ready open-source solution. Key contributions include: 1) a 130B-parameter unified speech-text multi-modal model that achieves unified understanding and generation, with the Step-Audio-Chat version open-sourced; 2) a generative speech data engine that establishes an affordable voice cloning framework and produces the open-sourced lightweight Step-Audio-TTS-3B model through distillation; 3) an instruction-driven fine control system enabling dynamic adjustments across dialects, emotions, singing, and RAP; 4) an enhanced cognitive architecture augmented with tool calling and role-playing abilities to manage complex tasks effectively. Based on our new StepEval-Audio-360 evaluation benchmark, Step-Audio achieves state-of-the-art performance in human evaluations, especially in terms of instruction following. On open-source benchmarks such as LLaMA Question, it shows a 9.3% average performance improvement, demonstrating our commitment to advancing the development of open-source multi-modal language technologies. Our code and models are available at https://github.com/stepfun-ai/Step-Audio.
Submitted 18 February, 2025; v1 submitted 17 February, 2025;
originally announced February 2025.
-
First Search for Dark Sector $e^+e^-$ Explanations of the MiniBooNE Anomaly at MicroBooNE
Authors:
MicroBooNE Collaboration,
A. M. Abdullahi,
P. Abratenko,
D. Andrade Aldana,
L. Arellano,
J. Asaadi,
A. Ashkenazi,
S. Balasubramanian,
B. Baller,
A. Barnard,
G. Barr,
D. Barrow,
J. Barrow,
V. Basque,
J. Bateman,
O. Benevides Rodrigues,
S. Berkman,
A. Bhat,
M. Bhattacharya,
M. Bishai,
A. Blake,
B. Bogart,
T. Bolton,
M. B. Brunetti,
L. Camilleri
, et al. (156 additional authors not shown)
Abstract:
We present MicroBooNE's first search for dark sector $e^+e^-$ explanations of the long-standing MiniBooNE anomaly. The MiniBooNE anomaly has garnered significant attention over the past 20 years including previous MicroBooNE investigations into both anomalous electron and photon excesses, but its origin still remains unclear. In this letter, we provide the first direct test of dark sector models in which dark neutrinos, produced through neutrino-induced scattering, decay into missing energy and visible $e^+e^-$ pairs comprising the MiniBooNE anomaly. Many such models have recently gained traction as a viable solution to the anomaly while evading past bounds. Using an exposure of $6.87 \times 10^{20}$ protons-on-target in the Booster Neutrino Beam, we implement a selection targeting forward-going, coherently produced $e^+e^-$ events. After unblinding, we observe 95 events, which we compare with the constrained background-only prediction of $69.7 \pm 17.3$. This analysis sets the world's first direct limits on these dark sector models and, at the 95\% confidence level, excludes the majority of the parameter space viable as a solution to the MiniBooNE anomaly.
Submitted 15 February, 2025;
originally announced February 2025.
-
The X-ray Integral Field Unit at the end of the Athena reformulation phase
Authors:
Philippe Peille,
Didier Barret,
Edoardo Cucchetti,
Vincent Albouys,
Luigi Piro,
Aurora Simionescu,
Massimo Cappi,
Elise Bellouard,
Céline Cénac-Morthé,
Christophe Daniel,
Alice Pradines,
Alexis Finoguenov,
Richard Kelley,
J. Miguel Mas-Hesse,
Stéphane Paltani,
Gregor Rauw,
Agata Rozanska,
Jiri Svoboda,
Joern Wilms,
Marc Audard,
Enrico Bozzo,
Elisa Costantini,
Mauro Dadina,
Thomas Dauser,
Anne Decourchelle
, et al. (257 additional authors not shown)
Abstract:
The Athena mission entered a redefinition phase in July 2022, driven by the imperative to reduce the mission cost at completion for the European Space Agency below an acceptable target, while maintaining the flagship nature of its science return. This notably called for a complete redesign of the X-ray Integral Field Unit (X-IFU) cryogenic architecture towards a simpler active cooling chain. Passive cooling via successive radiative panels at spacecraft level is now used to provide a 50 K thermal environment to an X-IFU-owned cryostat. Cooling to 4.5 K is achieved via a single remote active cryocooler unit, while a multi-stage Adiabatic Demagnetization Refrigerator ensures heat lift down to the 50 mK required by the detectors. Amidst these changes, the core concept of the readout chain remains robust, employing Transition Edge Sensor microcalorimeters and a SQUID-based Time-Division Multiplexing scheme. Noteworthy is the introduction of a slower pixel. This enables an increase in the multiplexing factor (from 34 to 48) without compromising the instrument energy resolution, hence keeping significant system margins to the new 4 eV resolution requirement. This allows reducing the number of channels by more than a factor of two, and thus the resource demands on the system, while keeping a 4' field of view (compared to 5' before). In this article, we will give an overview of this new architecture, before detailing its anticipated performance. Finally, we will present the new X-IFU schedule, with its short-term focus on demonstration activities towards a mission adoption in early 2027.
Submitted 15 February, 2025;
originally announced February 2025.
-
Position reconstruction and surface background model for the PandaX-4T detector
Authors:
Zhicheng Qian,
Linhui Gu,
Chen Cheng,
Zihao Bo,
Wei Chen,
Xun Chen,
Yunhua Chen,
Zhaokan Cheng,
Xiangyi Cui,
Yingjie Fan,
Deqing Fang,
Zhixing Gao,
Lisheng Geng,
Karl Giboni,
Xunan Guo,
Xuyuan Guo,
Zichao Guo,
Chencheng Han,
Ke Han,
Changda He,
Jinrong He,
Di Huang,
Houqi Huang,
Junting Huang,
Ruquan Hou
, et al. (78 additional authors not shown)
Abstract:
We report the position reconstruction methods and surface background model for the PandaX-4T dark matter direct search experiment. This work develops two position reconstruction algorithms: the template matching (TM) method and the photon acceptance function (PAF) method. Both methods determine the horizontal position of events based on the light pattern of secondary scintillation collected by the light sensors. After a comprehensive evaluation of resolution, uniformity, and robustness, the PAF method was selected for position reconstruction, while the TM method was employed for verification. The PAF method achieves a bulk event resolution of 1.0 mm and a surface event resolution of 4.4 mm for a typical $S2$ signal with a bottom charge of 1500 PE (about 14 keV). The uniformity is around 20\%. Robustness studies reveal average deviations of 5.1 mm and 8.8 mm for the commissioning run (Run0) and the first science run (Run1), respectively, due to the deactivation of certain PMTs. A data-driven surface background model is developed based on the PAF method. The surface background is estimated to be $0.09 \pm 0.06$ events for Run0 (0.54 tonne$\cdot$year) and $0.17 \pm 0.11$ events for Run1 (1.00 tonne$\cdot$year).
Submitted 11 February, 2025;
originally announced February 2025.
-
Neutrino Interaction Vertex Reconstruction in DUNE with Pandora Deep Learning
Authors:
DUNE Collaboration,
A. Abed Abud,
R. Acciarri,
M. A. Acero,
M. R. Adames,
G. Adamov,
M. Adamowski,
D. Adams,
M. Adinolfi,
C. Adriano,
A. Aduszkiewicz,
J. Aguilar,
F. Akbar,
F. Alemanno,
N. S. Alex,
K. Allison,
M. Alrashed,
A. Alton,
R. Alvarez,
T. Alves,
A. Aman,
H. Amar,
P. Amedo,
J. Anderson,
C. Andreopoulos
, et al. (1313 additional authors not shown)
Abstract:
The Pandora Software Development Kit and algorithm libraries perform reconstruction of neutrino interactions in liquid argon time projection chamber detectors. Pandora is the primary event reconstruction software used at the Deep Underground Neutrino Experiment, which will operate four large-scale liquid argon time projection chambers at the far detector site in South Dakota, producing high-resolution images of charged particles emerging from neutrino interactions. While these high-resolution images provide excellent opportunities for physics, the complex topologies require sophisticated pattern recognition capabilities to interpret signals from the detectors as physically meaningful objects that form the inputs to physics analyses. A critical component is the identification of the neutrino interaction vertex. Subsequent reconstruction algorithms use this location to identify the individual primary particles and ensure they each result in a separate reconstructed particle. A new vertex-finding procedure described in this article integrates a U-ResNet neural network performing hit-level classification into the multi-algorithm approach used by Pandora to identify the neutrino interaction vertex. The machine learning solution is seamlessly integrated into a chain of pattern-recognition algorithms. The technique substantially outperforms the previous BDT-based solution, with a more than 20\% increase in the efficiency of sub-1\,cm vertex reconstruction across all neutrino flavours.
Submitted 26 June, 2025; v1 submitted 10 February, 2025;
originally announced February 2025.
-
Identification of metastable lattice distortion free charge density wave at photoinduced interface via TRARPES
Authors:
Shaofeng Duan,
Binshuo Zhang,
Zihao Wang,
Shichong Wang,
Lingxiao Gu,
Haoran Liu,
Jiongyu Huang,
Jianzhe Liu,
Dong Qian,
Yanfeng Guo,
Wentao Zhang
Abstract:
The interplay between different degrees of freedom governs the emergence of correlated electronic states in quantum materials, with charge density waves (CDW) often coexisting with other exotic phases. Under thermal equilibrium, traditional CDW states are consequentially accompanied by structural phase transitions. In contrast, ultrafast photoexcitation allows access to exotic states where a single degree of freedom dominates in the time domain, enabling the study of underlying physics without interference. Here, we report the realization of a long-lived metastable CDW state without lattice distortion at the photoinduced interfaces in GdTe3 using time- and angle-resolved photoemission spectroscopy. After optical excitation above the CDW melting threshold, we identified emerged metastable interfaces through inverting the CDW-coupled lattice distortions, with lifetimes on the order of 10 picoseconds. These photoinduced interfaces represent a novel CDW state lacking the usual amplitude mode and lattice distortions, allowing quantification of the dominant role of electronic instabilities in CDW order. This work provides a new approach to disentangling electronic instabilities from electron-phonon coupling using a nonequilibrium method.
Submitted 10 February, 2025;
originally announced February 2025.
-
First Search for Neutral Current Coherent Single-Photon Production in MicroBooNE
Authors:
MicroBooNE Collaboration,
P. Abratenko,
D. Andrade Aldana,
L. Arellano,
J. Asaadi,
A. Ashkenazi,
S. Balasubramanian,
B. Baller,
A. Barnard,
G. Barr,
D. Barrow,
J. Barrow,
V. Basque,
J. Bateman,
O. Benevides Rodrigues,
S. Berkman,
A. Bhat,
M. Bhattacharya,
M. Bishai,
A. Blake,
B. Bogart,
T. Bolton,
M. B. Brunetti,
L. Camilleri,
D. Caratelli
, et al. (155 additional authors not shown)
Abstract:
This article presents the first search for neutrino-induced neutral current coherent single-photon production (NC coherent 1$γ$). The search makes use of data from the MicroBooNE 85-tonne active volume liquid argon time projection chamber detector, situated in the Fermilab Booster Neutrino Beam (BNB), with an average neutrino energy of $\langle E_ν\rangle \sim 0.8$ GeV. A targeted selection of candidate neutrino interactions with a single photon-like electromagnetic shower in the final state and no visible vertex activity was developed to search for the NC coherent 1$γ$ process, along with two auxiliary selections used to constrain the dominant background from NC$π^0$ production. With an integrated exposure of $6.87 \times 10^{20}$ protons on target delivered by the BNB, we set the world's first limit for this rare process, corresponding to an upper limit on the flux-averaged cross section of $σ<1.49 \times 10^{-41}\text{cm}^2$ at 90\% C.L.
Submitted 11 February, 2025; v1 submitted 9 February, 2025;
originally announced February 2025.