Search | arXiv e-print repository

An Inclusive Foundation Model for Generalizable Cytogenetics in Precision Oncology

Authors: Changchun Yang, Weiqian Dai, Yilan Zhang, Siyuan Chen, Jingdong Hu, Junkai Su, Yuxuan Chen, Ao Xu, Na Li, Xin Gao, Yongguo Yu

Abstract: Chromosome analysis is vital for diagnosing genetic disorders and guiding cancer therapy decisions through the identification of somatic clonal aberrations. However, developing an AI model are hindered by the overwhelming complexity and diversity of chromosomal abnormalities, requiring extensive annotation efforts, while automated methods remain task-specific and lack generalizability due to the s… ▽ More Chromosome analysis is vital for diagnosing genetic disorders and guiding cancer therapy decisions through the identification of somatic clonal aberrations. However, developing an AI model are hindered by the overwhelming complexity and diversity of chromosomal abnormalities, requiring extensive annotation efforts, while automated methods remain task-specific and lack generalizability due to the scarcity of comprehensive datasets spanning diverse resource conditions. Here, we introduce CHROMA, a foundation model for cytogenomics, designed to overcome these challenges by learning generalizable representations of chromosomal abnormalities. Pre-trained on over 84,000 specimens (~4 million chromosomal images) via self-supervised learning, CHROMA outperforms other methods across all types of abnormalities, even when trained on fewer labelled data and more imbalanced datasets. By facilitating comprehensive mapping of instability and clonal leisons across various aberration types, CHROMA offers a scalable and generalizable solution for reliable and automated clinical analysis, reducing the annotation workload for experts and advancing precision oncology through the early detection of rare genomic abnormalities, enabling broad clinical AI applications and making advanced genomic analysis more accessible. △ Less

Submitted 21 May, 2025; originally announced May 2025.

Comments: These authors contributed equally to this work: Changchun Yang, Weiqian Dai, Yilan Zhang

arXiv:2505.15620 [pdf, ps, other]

Observation of $χ_{cJ}\to 3K_S^0K^\pmπ^\mp$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (678 additional authors not shown)

Abstract: By analyzing $(2712.4\pm14.3)\times10^6$ $ψ(3686)$ events collected with the BESIII detector operating at the BEPCII collider, the decays $χ_{c0,1,2} \to 3K_S^0K^\pmπ^\mp$ are observed for the first time with statistical significances greater than $10σ$. The branching fractions of these decays are determined to be $\mathcal{B}(χ_{c0}\to 3K_S^0K^\pmπ^\mp )=(7.95\pm0.50\pm0.65)\times10^{-5},$… ▽ More By analyzing $(2712.4\pm14.3)\times10^6$ $ψ(3686)$ events collected with the BESIII detector operating at the BEPCII collider, the decays $χ_{c0,1,2} \to 3K_S^0K^\pmπ^\mp$ are observed for the first time with statistical significances greater than $10σ$. The branching fractions of these decays are determined to be $\mathcal{B}(χ_{c0}\to 3K_S^0K^\pmπ^\mp )=(7.95\pm0.50\pm0.65)\times10^{-5},$ $\mathcal{B}(χ_{c1}\to 3K_S^0K^\pmπ^\mp)=(2.62\pm0.08\pm0.19)\times10^{-4},$ and $\mathcal{B}(χ_{c2}\to 3K_S^0K^\pmπ^\mp)=(1.72\pm0.07\pm0.15)\times10^{-4},$ where the first uncertainties are statistical and the second systematic. △ Less

Submitted 21 May, 2025; originally announced May 2025.

Comments: 11 pages, 6 figures

arXiv:2505.15431 [pdf, ps, other]

Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought

Authors: Tencent Hunyuan Team, Ao Liu, Botong Zhou, Can Xu, Chayse Zhou, ChenChen Zhang, Chengcheng Xu, Chenhao Wang, Decheng Wu, Dengpeng Wu, Dian Jiao, Dong Du, Dong Wang, Feng Zhang, Fengzong Lian, Guanghui Xu, Guanwei Zhang, Hai Wang, Haipeng Luo, Han Hu, Huilin Xu, Jiajia Wu, Jianchen Zhu, Jianfeng Yan, Jiaqi Zhu , et al. (230 additional authors not shown)

Abstract: As Large Language Models (LLMs) rapidly advance, we introduce Hunyuan-TurboS, a novel large hybrid Transformer-Mamba Mixture of Experts (MoE) model. It synergistically combines Mamba's long-sequence processing efficiency with Transformer's superior contextual understanding. Hunyuan-TurboS features an adaptive long-short chain-of-thought (CoT) mechanism, dynamically switching between rapid response… ▽ More As Large Language Models (LLMs) rapidly advance, we introduce Hunyuan-TurboS, a novel large hybrid Transformer-Mamba Mixture of Experts (MoE) model. It synergistically combines Mamba's long-sequence processing efficiency with Transformer's superior contextual understanding. Hunyuan-TurboS features an adaptive long-short chain-of-thought (CoT) mechanism, dynamically switching between rapid responses for simple queries and deep "thinking" modes for complex problems, optimizing computational resources. Architecturally, this 56B activated (560B total) parameter model employs 128 layers (Mamba2, Attention, FFN) with an innovative AMF/MF block pattern. Faster Mamba2 ensures linear complexity, Grouped-Query Attention minimizes KV cache, and FFNs use an MoE structure. Pre-trained on 16T high-quality tokens, it supports a 256K context length and is the first industry-deployed large-scale Mamba model. Our comprehensive post-training strategy enhances capabilities via Supervised Fine-Tuning (3M instructions), a novel Adaptive Long-short CoT Fusion method, Multi-round Deliberation Learning for iterative improvement, and a two-stage Large-scale Reinforcement Learning process targeting STEM and general instruction-following. Evaluations show strong performance: overall top 7 rank on LMSYS Chatbot Arena with a score of 1356, outperforming leading models like Gemini-2.0-Flash-001 (1352) and o4-mini-2025-04-16 (1345). TurboS also achieves an average of 77.9% across 23 automated benchmarks. Hunyuan-TurboS balances high performance and efficiency, offering substantial capabilities at lower inference costs than many reasoning models, establishing a new paradigm for efficient large-scale pre-trained models. △ Less

Submitted 4 July, 2025; v1 submitted 21 May, 2025; originally announced May 2025.

arXiv:2505.15305 [pdf, ps, other]

Vacuum Tunneling from Conifold Transitions in IIB

Authors: Xin Gao, Qinjian Lou, Yi-Nan Wang

Abstract: We investigate the quantum tunneling process through a topology transition near a conifold singularity, in the setup of IIB CY3 orientifold compactification. We propose a novel method to do moduli stabilization in an extended moduli space, parametrized by both the geometric moduli and the light D3-brane wrapping modes arisen from the brane quantization. Assuming the absence of flux through the van… ▽ More We investigate the quantum tunneling process through a topology transition near a conifold singularity, in the setup of IIB CY3 orientifold compactification. We propose a novel method to do moduli stabilization in an extended moduli space, parametrized by both the geometric moduli and the light D3-brane wrapping modes arisen from the brane quantization. Assuming the absence of flux through the vanishing exceptional 3-cycle, we find two types of vacuum solutions, one corresponds to the resolved conifold and the other one is interpreted as a novel non-geometric phase. We compute the quantum tunneling rate between these two solutions and find that it is difficult to achieve a significantly large tunneling rate in the controllable regime. △ Less

Submitted 27 June, 2025; v1 submitted 21 May, 2025; originally announced May 2025.

Comments: 46 pages,4 figures

arXiv:2505.14619 [pdf, ps, other]

LaMET's Asymptotic Extrapolation vs. Inverse Problem

Authors: Jiunn-Wei Chen, Xiang Gao, Jinchen He, Jun Hua, Xiangdong Ji, Andreas Schäfer, Yushan Su, Wei Wang, Yi-Bo Yang, Jian-Hui Zhang, Qi-An Zhang, Rui Zhang, Yong Zhao

Abstract: Large-Momentum Effective Theory (LaMET) is a physics-guided systematic expansion to calculate light-cone parton distributions, including collinear (PDFs) and transverse-momentum-dependent ones, at any fixed momentum fraction $x$ within a range of $[x_{\rm min}, x_{\rm max}]$. It theoretically solves the ill-posed inverse problem that afflicts other theoretical approaches to collinear PDFs, such as… ▽ More Large-Momentum Effective Theory (LaMET) is a physics-guided systematic expansion to calculate light-cone parton distributions, including collinear (PDFs) and transverse-momentum-dependent ones, at any fixed momentum fraction $x$ within a range of $[x_{\rm min}, x_{\rm max}]$. It theoretically solves the ill-posed inverse problem that afflicts other theoretical approaches to collinear PDFs, such as short-distance factorizations. Recently, arXiv:2504.17706~\cite{Dutrieux:2025jed} raised practical concerns about whether current or even future lattice data will have sufficient precision in the sub-asymptotic correlation region to support an error-controlled extrapolation -- and if not, whether it becomes an inverse problem where the relevant uncertainties cannot be properly quantified. While we agree that not all current lattice data have the desired precision to qualify for an asymptotic extrapolation, some calculations do, and more are expected in the future. We comment on the analysis and results in Ref.~\cite{Dutrieux:2025jed} and argue that a physics-based systematic extrapolation still provides the most reliable error estimates, even when the data quality is not ideal. In contrast, re-framing the long-distance asymptotic extrapolation as a data-driven-only inverse problem with {\it ad hoc} mathematical conditioning could lead to unnecessarily conservative errors. △ Less