Skip to main content

Showing 101–150 of 2,004 results for author: Yang, K

.
  1. arXiv:2502.14739  [pdf, other

    cs.CL

    SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

    Authors: M-A-P Team, Xinrun Du, Yifan Yao, Kaijing Ma, Bingli Wang, Tianyu Zheng, King Zhu, Minghao Liu, Yiming Liang, Xiaolong Jin, Zhenlin Wei, Chujie Zheng, Kaixin Deng, Shawn Gavin, Shian Jia, Sichao Jiang, Yiyan Liao, Rui Li, Qinrui Li, Sirun Li, Yizhi Li, Yunwen Li, David Ma, Yuansheng Ni, Haoran Que , et al. (72 additional authors not shown)

    Abstract: Large language models (LLMs) have demonstrated remarkable proficiency in mainstream academic disciplines such as mathematics, physics, and computer science. However, human knowledge encompasses over 200 specialized disciplines, far exceeding the scope of existing benchmarks. The capabilities of LLMs in many of these specialized fields-particularly in light industry, agriculture, and service-orient… ▽ More

    Submitted 28 March, 2025; v1 submitted 20 February, 2025; originally announced February 2025.

  2. arXiv:2502.14383  [pdf, other

    cs.CL

    Rumor Detection by Multi-task Suffix Learning based on Time-series Dual Sentiments

    Authors: Zhiwei Liu, Kailai Yang, Eduard Hovy, Sophia Ananiadou

    Abstract: The widespread dissemination of rumors on social media has a significant impact on people's lives, potentially leading to public panic and fear. Rumors often evoke specific sentiments, resonating with readers and prompting sharing. To effectively detect and track rumors, it is essential to observe the fine-grained sentiments of both source and response message pairs as the rumor evolves over time.… ▽ More

    Submitted 20 February, 2025; originally announced February 2025.

    Comments: work in progress

  3. arXiv:2502.13834  [pdf, other

    cs.AI

    Proving Olympiad Inequalities by Synergizing LLMs and Symbolic Reasoning

    Authors: Zenan Li, Zhaoyu Li, Wen Tang, Xian Zhang, Yuan Yao, Xujie Si, Fan Yang, Kaiyu Yang, Xiaoxing Ma

    Abstract: Large language models (LLMs) can prove mathematical theorems formally by generating proof steps (\textit{a.k.a.} tactics) within a proof system. However, the space of possible tactics is vast and complex, while the available training data for formal proofs is limited, posing a significant challenge to LLM-based tactic generation. To address this, we introduce a neuro-symbolic tactic generator that… ▽ More

    Submitted 26 February, 2025; v1 submitted 19 February, 2025; originally announced February 2025.

    Comments: Published as a conference paper at ICLR 2025. Code is available at https://github.com/Lizn-zn/NeqLIPS/

  4. arXiv:2502.13472  [pdf, other

    cs.CL cs.HC

    FlexDuo: A Pluggable System for Enabling Full-Duplex Capabilities in Speech Dialogue Systems

    Authors: Borui Liao, Yulong Xu, Jiao Ou, Kaiyuan Yang, Weihua Jian, Pengfei Wan, Di Zhang

    Abstract: Full-Duplex Speech Dialogue Systems (Full-Duplex SDS) have significantly enhanced the naturalness of human-machine interaction by enabling real-time bidirectional communication. However, existing approaches face challenges such as difficulties in independent module optimization and contextual noise interference due to highly coupled architectural designs and oversimplified binary state modeling. T… ▽ More

    Submitted 19 February, 2025; originally announced February 2025.

  5. arXiv:2502.12536  [pdf, other

    cs.NE cs.AI

    An Algorithm Board in Neural Decoding

    Authors: Jingyi Feng, Kai Yang

    Abstract: Understanding the mechanisms of neural encoding and decoding has always been a highly interesting research topic in fields such as neuroscience and cognitive intelligence. In prior studies, some researchers identified a symmetry in neural data decoded by unsupervised methods in motor scenarios and constructed a cognitive learning system based on this pattern (i.e., symmetry). Nevertheless, the dis… ▽ More

    Submitted 17 February, 2025; originally announced February 2025.

    Comments: 16 pages, 10 figures, 2 tables

  6. arXiv:2502.12513  [pdf, other

    cs.CV

    RealSyn: An Effective and Scalable Multimodal Interleaved Document Transformation Paradigm

    Authors: Tiancheng Gu, Kaicheng Yang, Chaoyi Zhang, Yin Xie, Xiang An, Ziyong Feng, Dongnan Liu, Weidong Cai, Jiankang Deng

    Abstract: After pre-training on extensive image-text pairs, Contrastive Language-Image Pre-training (CLIP) demonstrates promising performance on a wide variety of benchmarks. However, a substantial volume of multimodal interleaved documents remains underutilized for contrastive vision-language representation learning. To fully leverage these unpaired documents, we initially establish a Real-World Data Extra… ▽ More

    Submitted 16 April, 2025; v1 submitted 17 February, 2025; originally announced February 2025.

    Comments: 15 pages, 12 figures, Webpage: https://garygutc.github.io/RealSyn

  7. arXiv:2502.12479  [pdf, other

    cs.LG q-bio.BM

    MotifBench: A standardized protein design benchmark for motif-scaffolding problems

    Authors: Zhuoqi Zheng, Bo Zhang, Kieran Didi, Kevin K. Yang, Jason Yim, Joseph L. Watson, Hai-Feng Chen, Brian L. Trippe

    Abstract: The motif-scaffolding problem is a central task in computational protein design: Given the coordinates of atoms in a geometry chosen to confer a desired biochemical function (a motif), the task is to identify diverse protein structures (scaffolds) that include the motif and maintain its geometry. Significant recent progress on motif-scaffolding has been made due to computational evaluation with re… ▽ More

    Submitted 19 February, 2025; v1 submitted 17 February, 2025; originally announced February 2025.

    Comments: Associated content available at github.com/blt2114/MotifBench

  8. arXiv:2502.12252  [pdf, other

    quant-ph cond-mat.supr-con

    Roadmap to fault tolerant quantum computation using topological qubit arrays

    Authors: David Aasen, Morteza Aghaee, Zulfi Alam, Mariusz Andrzejczuk, Andrey Antipov, Mikhail Astafev, Lukas Avilovas, Amin Barzegar, Bela Bauer, Jonathan Becker, Juan M. Bello-Rivas, Umesh Bhaskar, Alex Bocharov, Srini Boddapati, David Bohn, Jouri Bommer, Parsa Bonderson, Jan Borovsky, Leo Bourdet, Samuel Boutin, Tom Brown, Gary Campbell, Lucas Casparis, Srivatsa Chakravarthi, Rui Chao , et al. (157 additional authors not shown)

    Abstract: We describe a concrete device roadmap towards a fault-tolerant quantum computing architecture based on noise-resilient, topologically protected Majorana-based qubits. Our roadmap encompasses four generations of devices: a single-qubit device that enables a measurement-based qubit benchmarking protocol; a two-qubit device that uses measurement-based braiding to perform single-qubit Clifford operati… ▽ More

    Submitted 7 April, 2025; v1 submitted 17 February, 2025; originally announced February 2025.

    Comments: v2: 12+8 pages, 9+5 figures, significant main text revisions, added appendices discussing idle coherence times and non-Clifford operations v1:11+6 pages, 8+5 figures

  9. arXiv:2502.11573  [pdf, other

    cs.CL cs.AI

    InfiR : Crafting Effective Small Language Models and Multimodal Small Language Models in Reasoning

    Authors: Congkai Xie, Shuo Cai, Wenjun Wang, Pengxiang Li, Zhijie Sang, Kejing Yang, Yiming Zhang, Zhen Li, Guanghao Zhu, Zeyu Liu, Yang Yu, Yuhang Liu, Su Lu, Baoyi He, Qi Zhou, Xiaotian Han, Jianbo Yuan, Shengyu Zhang, Fei Wu, Hongxia Yang

    Abstract: Large Language Models (LLMs) and Multimodal Large Language Models (MLLMs) have made significant advancements in reasoning capabilities. However, they still face challenges such as high computational demands and privacy concerns. This paper focuses on developing efficient Small Language Models (SLMs) and Multimodal Small Language Models (MSLMs) that retain competitive reasoning abilities. We introd… ▽ More

    Submitted 17 February, 2025; originally announced February 2025.

  10. arXiv:2502.10812  [pdf, other

    eess.IV cs.IT

    ResiComp: Loss-Resilient Image Compression via Dual-Functional Masked Visual Token Modeling

    Authors: Sixian Wang, Jincheng Dai, Xiaoqi Qin, Ke Yang, Kai Niu, Ping Zhang

    Abstract: Recent advancements in neural image codecs (NICs) are of significant compression performance, but limited attention has been paid to their error resilience. These resulting NICs tend to be sensitive to packet losses, which are prevalent in real-time communications. In this paper, we investigate how to elevate the resilience ability of NICs to combat packet losses. We propose ResiComp, a pion… ▽ More

    Submitted 28 February, 2025; v1 submitted 15 February, 2025; originally announced February 2025.

    Comments: Accepted by IEEE TCSVT

  11. arXiv:2502.10291  [pdf, other

    hep-ex

    Angular analysis of $B^0\rightarrow K^{*0}e^{+}e^{-}$ decays

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1115 additional authors not shown)

    Abstract: An angular analysis of $B^0\rightarrow K^{*0}e^{+}e^{-}$ decays is presented using proton-proton collision data collected by the LHCb experiment at centre-of-mass energies of 7, 8 and 13 TeV, corresponding to an integrated luminosity of 9 fb$^{-1}$. The analysis is performed in the region of the dilepton invariant mass squared of 1.1-6.0 GeV$^{2}/c^{4}$. In addition, a test of lepton flavour unive… ▽ More

    Submitted 14 February, 2025; originally announced February 2025.

    Comments: All figures and tables, along with machine-readable versions and any supplementary material and additional information, are available at https://lbfence.cern.ch/alcm/public/analysis/full-details/1628/ (LHCb public pages)

    Report number: LHCb-PAPER-2024-022, CERN-EP-2025-001

  12. arXiv:2502.08836  [pdf, ps, other

    cs.CV

    Survey on Single-Image Reflection Removal using Deep Learning Techniques

    Authors: Kangning Yang, Huiming Sun, Jie Cai, Lan Fu, Jiaming Ding, Jinlong Li, Chiu Man Ho, Zibo Meng

    Abstract: The phenomenon of reflection is quite common in digital images, posing significant challenges for various applications such as computer vision, photography, and image processing. Traditional methods for reflection removal often struggle to achieve clean results while maintaining high fidelity and robustness, particularly in real-world scenarios. Over the past few decades, numerous deep learning-ba… ▽ More

    Submitted 12 February, 2025; originally announced February 2025.

  13. arXiv:2502.08794  [pdf, other

    cs.LG

    Spectral Journey: How Transformers Predict the Shortest Path

    Authors: Andrew Cohen, Andrey Gromov, Kaiyu Yang, Yuandong Tian

    Abstract: Decoder-only transformers lead to a step-change in capability of large language models. However, opinions are mixed as to whether they are really planning or reasoning. A path to making progress in this direction is to study the model's behavior in a setting with carefully controlled data. Then interpret the learned representations and reverse-engineer the computation performed internally. We stud… ▽ More

    Submitted 12 February, 2025; originally announced February 2025.

    Comments: 12 pages

  14. arXiv:2502.07862  [pdf, other

    cs.LG cs.AI cs.CV

    ADMN: A Layer-Wise Adaptive Multimodal Network for Dynamic Input Noise and Compute Resources

    Authors: Jason Wu, Kang Yang, Lance Kaplan, Mani Srivastava

    Abstract: Multimodal deep learning systems are deployed in dynamic scenarios due to the robustness afforded by multiple sensing modalities. Nevertheless, they struggle with varying compute resource availability (due to multi-tenancy, device heterogeneity, etc.) and fluctuating quality of inputs (from sensor feed corruption, environmental noise, etc.). Current multimodal systems employ static resource provis… ▽ More

    Submitted 11 February, 2025; originally announced February 2025.

  15. arXiv:2502.07640  [pdf, other

    cs.LG cs.AI

    Goedel-Prover: A Frontier Model for Open-Source Automated Theorem Proving

    Authors: Yong Lin, Shange Tang, Bohan Lyu, Jiayun Wu, Hongzhou Lin, Kaiyu Yang, Jia Li, Mengzhou Xia, Danqi Chen, Sanjeev Arora, Chi Jin

    Abstract: We introduce Goedel-Prover, an open-source language model that achieves state-of-the-art (as of April 5 2025) performance in automated formal proof generation for mathematical problems. A key challenge in this field is the scarcity of formalized mathematical statements and proofs, which we address through the following approaches. First, we train LLMs to convert natural language math problems from… ▽ More

    Submitted 19 April, 2025; v1 submitted 11 February, 2025; originally announced February 2025.

  16. Learning to Synthesize Compatible Fashion Items Using Semantic Alignment and Collocation Classification: An Outfit Generation Framework

    Authors: Dongliang Zhou, Haijun Zhang, Kai Yang, Linlin Liu, Han Yan, Xiaofei Xu, Zhao Zhang, Shuicheng Yan

    Abstract: The field of fashion compatibility learning has attracted great attention from both the academic and industrial communities in recent years. Many studies have been carried out for fashion compatibility prediction, collocated outfit recommendation, artificial intelligence (AI)-enabled compatible fashion design, and related topics. In particular, AI-enabled compatible fashion design can be used to s… ▽ More

    Submitted 5 February, 2025; originally announced February 2025.

    Comments: This paper was accepted by IEEE TNNLS

  17. arXiv:2502.06287  [pdf, other

    cs.RO

    CT-UIO: Continuous-Time UWB-Inertial-Odometer Localization Using Non-Uniform B-spline with Fewer Anchors

    Authors: Jian Sun, Wei Sun, Genwei Zhang, Kailun Yang, Song Li, Xiangqi Meng, Na Deng, Chongbin Tan

    Abstract: Ultra-wideband (UWB) based positioning with fewer anchors has attracted significant research interest in recent years, especially under energy-constrained conditions. However, most existing methods rely on discrete-time representations and smoothness priors to infer a robot's motion states, which often struggle with ensuring multi-sensor data synchronization. In this paper, we present an efficient… ▽ More

    Submitted 10 February, 2025; originally announced February 2025.

    Comments: The codebase and datasets will be open-sourced at https://github.com/JasonSun623/CT-UIO

  18. arXiv:2502.05708  [pdf, other

    cs.NI cs.LG

    GWRF: A Generalizable Wireless Radiance Field for Wireless Signal Propagation Modeling

    Authors: Kang Yang, Yuning Chen, Wan Du

    Abstract: We present Generalizable Wireless Radiance Fields (GWRF), a framework for modeling wireless signal propagation at arbitrary 3D transmitter and receiver positions. Unlike previous methods that adapt vanilla Neural Radiance Fields (NeRF) from the optical to the wireless signal domain, requiring extensive per-scene training, GWRF generalizes effectively across scenes. First, a geometry-aware Transfor… ▽ More

    Submitted 8 February, 2025; originally announced February 2025.

  19. arXiv:2502.05330  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    Multi-Class Segmentation of Aortic Branches and Zones in Computed Tomography Angiography: The AortaSeg24 Challenge

    Authors: Muhammad Imran, Jonathan R. Krebs, Vishal Balaji Sivaraman, Teng Zhang, Amarjeet Kumar, Walker R. Ueland, Michael J. Fassler, Jinlong Huang, Xiao Sun, Lisheng Wang, Pengcheng Shi, Maximilian Rokuss, Michael Baumgartner, Yannick Kirchhof, Klaus H. Maier-Hein, Fabian Isensee, Shuolin Liu, Bing Han, Bong Thanh Nguyen, Dong-jin Shin, Park Ji-Woo, Mathew Choi, Kwang-Hyun Uhm, Sung-Jea Ko, Chanwoong Lee , et al. (38 additional authors not shown)

    Abstract: Multi-class segmentation of the aorta in computed tomography angiography (CTA) scans is essential for diagnosing and planning complex endovascular treatments for patients with aortic dissections. However, existing methods reduce aortic segmentation to a binary problem, limiting their ability to measure diameters across different branches and zones. Furthermore, no open-source dataset is currently… ▽ More

    Submitted 7 February, 2025; originally announced February 2025.

  20. arXiv:2502.04735  [pdf, other

    eess.SP

    Affine Frequency Division Multiplexing: Extending OFDM for Scenario-Flexibility and Resilience

    Authors: Haoran Yin, Yanqun Tang, Ali Bemani, Marios Kountouris, Yu Zhou, Xingyao Zhang, Yuqing Liu, Gaojie Chen, Kai Yang, Fan Liu, Christos Masouros, Shuangyang Li, Giuseppe Caire, Pei Xiao

    Abstract: Next-generation wireless networks are conceived to provide reliable and high-data-rate communication services for diverse scenarios, such as vehicle-to-vehicle, unmanned aerial vehicles, and satellite networks. The severe Doppler spreads in the underlying time-varying channels induce destructive inter-carrier interference (ICI) in the extensively adopted orthogonal frequency division multiplexing… ▽ More

    Submitted 7 February, 2025; originally announced February 2025.

    Comments: Magazine paper submitted to IEEE

  21. arXiv:2502.04013  [pdf, other

    hep-ex

    Search for resonance-enhanced $CP$ and angular asymmetries in the $Λ^+_{c}\to pμ^+μ^-$ decay at LHCb

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1127 additional authors not shown)

    Abstract: The first measurement of the $CP$ asymmetry of the decay rate ($A_{CP}$) and the $CP$ average ($ΣA_{\text{FB}}$) and $CP$ asymmetry ($ΔA_{\text{FB}}$) of the forward-backward asymmetry in the muon system of $\mathitΛ^+_c\to pμ^+μ^-$ decays is reported. The measurement is performed using a data sample of proton-proton collisions, recorded by the LHCb experiment from 2016 to 2018 at a center-of-mass… ▽ More

    Submitted 6 February, 2025; originally announced February 2025.

    Comments: All figures and tables, along with machine-readable versions and any supplementary material and additional information, are available at https://lbfence.cern.ch/alcm/public/analysis/full-details/3473/ (LHCb public pages)

    Report number: LHCb-PAPER-2024-051, CERN-EP-2024-340

  22. ALMA observations of massive clouds in the central molecular zone: slim filaments tracing parsec-scale shocks

    Authors: Kai Yang, Xing Lu, Yichen Zhang, Xunchuan Liu, Adam Ginsburg, Hauyu Baobab Liu, Yu Cheng, Siyi Feng, Tie Liu, Qizhou Zhang, Elisabeth A. C. Mills, Daniel L. Walker, Shu-ichiro Inutsuka, Cara Battersby, Steven N. Longmore, Xindi Tang, Jens Kauffmann, Qilao Gu, Shanghuo Li, Qiuyi Luo, J. M. Diederik Kruijssen, Thushara Pillai, Hai-Hua Qiao, Keping Qiu, Zhiqiang Shen

    Abstract: The central molecular zone (CMZ) of our Galaxy exhibits widespread emission from SiO and various complex organic molecules (COMs), yet the exact origin of such emission is uncertain. Here we report the discovery of a unique class of long ($>$0.5 pc) and narrow ($<$0.03 pc) filaments in the emission of SiO 5$-$4 and eight additional molecular lines, including several COMs, in our ALMA 1.3 mm spectr… ▽ More

    Submitted 6 February, 2025; originally announced February 2025.

    Comments: 14 pages, 9 figures, 2 tables

    Journal ref: A&A, 694, A86 (2025)

  23. arXiv:2502.02334  [pdf, other

    cs.CV cs.RO eess.IV

    Event-aided Semantic Scene Completion

    Authors: Shangwei Guo, Hao Shi, Song Wang, Xiaoting Yin, Kailun Yang, Kaiwei Wang

    Abstract: Autonomous driving systems rely on robust 3D scene understanding. Recent advances in Semantic Scene Completion (SSC) for autonomous driving underscore the limitations of RGB-based approaches, which struggle under motion blur, poor lighting, and adverse weather. Event cameras, offering high dynamic range and low latency, address these challenges by providing asynchronous data that complements RGB i… ▽ More

    Submitted 4 February, 2025; originally announced February 2025.

    Comments: The established datasets and codebase will be made publicly at https://github.com/Pandapan01/EvSSC

  24. arXiv:2502.01826  [pdf, other

    cs.NI

    Scalable 3D Gaussian Splatting-Based RF Signal Spatial Propagation Modeling

    Authors: Kang Yang, Gaofeng Dong, Sijie Ji, Wan Du, Mani Srivastava

    Abstract: Effective network planning and sensing in wireless networks require resource-intensive site surveys for data collection. An alternative is Radio-Frequency (RF) signal spatial propagation modeling, which computes received signals given transceiver positions in a scene (e.g.s a conference room). We identify a fundamental trade-off between scalability and fidelity in the state-of-the-art method. To a… ▽ More

    Submitted 3 February, 2025; originally announced February 2025.

  25. arXiv:2502.00333  [pdf, other

    cs.CV

    BiMaCoSR: Binary One-Step Diffusion Model Leveraging Flexible Matrix Compression for Real Super-Resolution

    Authors: Kai Liu, Kaicheng Yang, Zheng Chen, Zhiteng Li, Yong Guo, Wenbo Li, Linghe Kong, Yulun Zhang

    Abstract: While super-resolution (SR) methods based on diffusion models (DM) have demonstrated inspiring performance, their deployment is impeded due to the heavy request of memory and computation. Recent researchers apply two kinds of methods to compress or fasten the DM. One is to compress the DM into 1-bit, aka binarization, alleviating the storage and computation pressure. The other distills the multi-s… ▽ More

    Submitted 3 February, 2025; v1 submitted 1 February, 2025; originally announced February 2025.

    Comments: 10 pages, 5 figures. The code and models will be available at https://github.com/Kai-Liu001/BiMaCoSR

  26. arXiv:2501.17149  [pdf, ps, other

    math.CO

    Colorful Helly via induced matchings

    Authors: Cosmin Pohoata, Kevin Yang, Shengtong Zhang

    Abstract: We establish a theorem regarding the maximum size of an {\it{induced}} matching in the bipartite complement of the incidence graph of a set system $(X,\mathcal{F})$. We show that this quantity plus one provides an upper bound on the colorful Helly number of this set system, i.e. the minimum positive integer $N$ for which the following statement holds: if finite subfamilies… ▽ More

    Submitted 29 January, 2025; v1 submitted 28 January, 2025; originally announced January 2025.

    Comments: 12 pages, 2 figures. Fix issue with a figure not displaying, and correct some typos

  27. arXiv:2501.16803  [pdf, other

    cs.RO cs.CV cs.NI eess.IV

    RG-Attn: Radian Glue Attention for Multi-modality Multi-agent Cooperative Perception

    Authors: Lantao Li, Kang Yang, Wenqi Zhang, Xiaoxue Wang, Chen Sun

    Abstract: Cooperative perception offers an optimal solution to overcome the perception limitations of single-agent systems by leveraging Vehicle-to-Everything (V2X) communication for data sharing and fusion across multiple agents. However, most existing approaches focus on single-modality data exchange, limiting the potential of both homogeneous and heterogeneous fusion across agents. This overlooks the opp… ▽ More

    Submitted 31 March, 2025; v1 submitted 28 January, 2025; originally announced January 2025.

  28. arXiv:2501.15383  [pdf, other

    cs.CL

    Qwen2.5-1M Technical Report

    Authors: An Yang, Bowen Yu, Chengyuan Li, Dayiheng Liu, Fei Huang, Haoyan Huang, Jiandong Jiang, Jianhong Tu, Jianwei Zhang, Jingren Zhou, Junyang Lin, Kai Dang, Kexin Yang, Le Yu, Mei Li, Minmin Sun, Qin Zhu, Rui Men, Tao He, Weijia Xu, Wenbiao Yin, Wenyuan Yu, Xiafei Qiu, Xingzhang Ren, Xinlong Yang , et al. (3 additional authors not shown)

    Abstract: We introduce Qwen2.5-1M, a series of models that extend the context length to 1 million tokens. Compared to the previous 128K version, the Qwen2.5-1M series have significantly enhanced long-context capabilities through long-context pre-training and post-training. Key techniques such as long data synthesis, progressive pre-training, and multi-stage supervised fine-tuning are employed to effectively… ▽ More

    Submitted 25 January, 2025; originally announced January 2025.

  29. arXiv:2501.14943  [pdf, other

    hep-ex

    Evidence for $B^-\rightarrow D^{**0}τ^-\overline{ν_τ}$ decays

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1127 additional authors not shown)

    Abstract: The first evidence for the decay $B^-\rightarrow D^{**0}τ^-\overline{ν_τ}$ is obtained using proton-proton collision data collected by the LHCb experiment, corresponding to an integrated luminosity of 9 fb$^{-1}$ , at centre-of-mass energies of 7, 8 and 13 Tev. Here, the $D^{**0}$ meson represents any of the three excited charm mesons $D_{1}(2420)^{0}$, $D_{2}^{*}(2460)^{0}$, and… ▽ More

    Submitted 21 March, 2025; v1 submitted 24 January, 2025; originally announced January 2025.

    Comments: All figures and tables, along with machine-readable versions and any supplementary material and additional information, are available at https://lbfence.cern.ch/alcm/public/analysis/full-details/3300/ (LHCb public pages)

    Report number: LHCb-PAPER-2024-037, CERN-EP-2024-341

  30. arXiv:2501.13629  [pdf, other

    cs.CL

    Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

    Authors: Zhenghao Lin, Zihao Tang, Xiao Liu, Yeyun Gong, Yi Cheng, Qi Chen, Hang Li, Ying Xin, Ziyue Yang, Kailai Yang, Yu Yan, Xiao Liang, Shuai Lu, Yiming Huang, Zheheng Luo, Lei Qu, Xuan Feng, Yaoxiang Wang, Yuqing Xia, Feiyang Chen, Yuting Jiang, Yasen Hu, Hao Ni, Binyang Li, Guoshuai Zhao , et al. (9 additional authors not shown)

    Abstract: We introduce Sigma, an efficient large language model specialized for the system domain, empowered by a novel architecture including DiffQKV attention, and pre-trained on our meticulously collected system domain data. DiffQKV attention significantly enhances the inference efficiency of Sigma by optimizing the Query (Q), Key (K), and Value (V) components in the attention mechanism differentially, b… ▽ More

    Submitted 10 February, 2025; v1 submitted 23 January, 2025; originally announced January 2025.

  31. arXiv:2501.12779  [pdf, other

    hep-ex

    Observation of the $Λ_b^0 \to J/ψΞ^- K^+$ and $Ξ_b^0 \to J/ψΞ^- π^+$ decays

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1126 additional authors not shown)

    Abstract: The first observation of the $Ξ_b^0 \to J/ψΞ^- π^+$ decay and the most precise measurement of the branching fraction of the $Λ_b^0 \to J/ψΞ^- K^+$ decay are reported, using proton-proton collision data from the LHCb experiment collected in 2016--2018 at a centre-of-mass energy of 13~TeV, corresponding to an integrated luminosity of 5.4~fb$^{-1}$. Using the $Λ_b^0 \to J/ψΛ$ and $Ξ_b^0 \to J/ψΞ^-$ d… ▽ More

    Submitted 22 January, 2025; originally announced January 2025.

    Comments: All figures and tables, along with machine-readable versions and any supplementary material and additional information, are available at https://lbfence.cern.ch/alcm/public/analysis/full-details/3479/ (LHCb public pages)

    Report number: CERN-EP-2024-337 LHCb-PAPER-2024-049

  32. arXiv:2501.12611  [pdf, other

    hep-ex nucl-ex

    Measurement of the multiplicity dependence of $\mitΥ$ production ratios in $pp$ collisions at $\sqrt{s}=13$ TeV

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1127 additional authors not shown)

    Abstract: The $\mitΥ(\mathrm{2}S)$ and $\mitΥ(\mathrm{3}S)$ production cross-sections are measured relative to that of the $\mitΥ(\mathrm{1}S)$ meson, as a function of charged-particle multiplicity in proton-proton collisions at a centre-of-mass energy of $13$ TeV. The measurement uses data collected by the LHCb experiment in 2018 corresponding to an integrated luminosity of 2 $\text{fb}^{-1}$. Both the… ▽ More

    Submitted 23 January, 2025; v1 submitted 21 January, 2025; originally announced January 2025.

    Comments: All figures and tables, along with machine-readable versions and any supplementary material and additional information, are available at https://lbfence.cern.ch/alcm/public/analysis/full-details/1782/ (LHCb public pages)

    Report number: LHCb-PAPER-2024-038, CERN-EP-2024-318

  33. arXiv:2501.11635  [pdf, other

    hep-ex

    Search for charge-parity violation in semileptonically tagged $D^{0} \to K^{+} π^{-}$ decays

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1127 additional authors not shown)

    Abstract: An analysis of the flavour oscillations of the charmed neutral meson is presented. The ratio of $D^{0} \to K^{+} π^{-}$ and $D^{0} \to K^{-} π^{+}$ decay rates is measured as a function of the decay time of the $D^{0}$ meson and compared with the charge-conjugated system to search for charge-parity violation. The meson flavour at production is double-tagged by the charges of the muon and pion in t… ▽ More

    Submitted 20 January, 2025; originally announced January 2025.

    Comments: All figures and tables, along with machine-readable versions and any supplementary material and additional information, are available at https://lbfence.cern.ch/alcm/public/analysis/full-details/3260/ (LHCb public pages)

    Report number: LHCb-PAPER-2024-044, CERN-EP-2024-319

  34. Design-Agnostic Distributed Timing Fault Injection Monitor With End-to-End Design Automation

    Authors: Yan He, Yumin Su, Kaiyuan Yang

    Abstract: Fault injection attacks induce hardware failures in circuits and exploit these faults to compromise the security of the system. It has been demonstrated that FIAs can bypass system security mechanisms, cause faulty outputs, and gain access to secret information. Certain types of FIAs can be mounted with little effort by tampering with clock signals and or the chip operating conditions. To mitigate… ▽ More

    Submitted 16 January, 2025; originally announced January 2025.

    Comments: 12 pages, 26 figures

    Journal ref: IEEE Journal of Solid-State Circuits, 04 December 2024

  35. arXiv:2501.09098  [pdf, other

    astro-ph.CO astro-ph.GA

    Boosting Supermassive Black Hole Growth in the Early Universe by Fuzzy Dark Matter Solitons

    Authors: H. -H. Sandy Chiu, Hsi-Yu Schive, Hsiang-Yi Karen Yang, Hsinhao Huang, Massimo Gaspari

    Abstract: Observations of massive supermassive black holes (SMBHs) in the early universe challenge existing black hole formation models. We propose that soliton cores in fuzzy dark matter (FDM) offer a potential solution to this timing problem. Our FDM cosmological zoom-in simulations confirm that for a particle mass $m_{\rm FDM}\sim 10^{-22}~{\rm eV}$, solitons are well developed at redshift $z \sim 7$ wit… ▽ More

    Submitted 15 January, 2025; originally announced January 2025.

    Comments: 9 pages, 4 figures, 1 table. Accepted for publication in Physical Review Letters

  36. arXiv:2501.09035  [pdf, other

    cs.SI cs.CY

    DomainDemo: a dataset of domain-sharing activities among different demographic groups on Twitter

    Authors: Kai-Cheng Yang, Pranav Goel, Alexi Quintana-Mathé, Luke Horgan, Stefan D. McCabe, Nir Grinberg, Kenneth Joseph, David Lazer

    Abstract: Social media play a pivotal role in disseminating web content, particularly during elections, yet our understanding of the association between demographic factors and political discourse online remains limited. Here, we introduce a unique dataset, DomainDemo, linking domains shared on Twitter (X) with the demographic characteristics of associated users, including age, gender, race, political affil… ▽ More

    Submitted 14 January, 2025; originally announced January 2025.

    Comments: 19 pages, 1 figure

  37. arXiv:2501.08379  [pdf, ps, other

    cond-mat.str-el hep-th

    Fermion liquids as quantum Hall liquids in phase space: A unified approach for anomalies and responses

    Authors: Jaychandran Padayasi, Ken K. W. Ma, Kun Yang

    Abstract: The discovery of many strongly correlated metallic phases has inspired different routes to generalize or go beyond the celebrated Landau Fermi liquid theory. To this end, from universal consideration of symmetries and anomalies, Else, Thorngren and Senthil (ETS) have introduced a class of theories called ersatz Fermi liquids which possess a Fermi surface and satisfy a generalized Luttinger's theor… ▽ More

    Submitted 11 April, 2025; v1 submitted 14 January, 2025; originally announced January 2025.

    Comments: Published version

    Journal ref: Phys. Rev. B 111, 125138 (2025)

  38. arXiv:2501.06700  [pdf, other

    cs.IT cs.LG cs.NI eess.SP

    Average Reward Reinforcement Learning for Wireless Radio Resource Management

    Authors: Kun Yang, Jing Yang, Cong Shen

    Abstract: In this paper, we address a crucial but often overlooked issue in applying reinforcement learning (RL) to radio resource management (RRM) in wireless communications: the mismatch between the discounted reward RL formulation and the undiscounted goal of wireless network optimization. To the best of our knowledge, we are the first to systematically investigate this discrepancy, starting with a discu… ▽ More

    Submitted 11 January, 2025; originally announced January 2025.

    Comments: Accepted by Asilomar 2024

  39. arXiv:2501.06483  [pdf, other

    hep-ex

    Study of light-meson resonances decaying to $K^0_{\rm S} K π$ in the $B \to (K^0_{\rm S} K π) K$ channels

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1127 additional authors not shown)

    Abstract: A study is presented of $B^+ \to K^0_{\rm S} K^- π^+ K^-$ and $B^+ \to K^0_{\rm S} K^+ π^- K^+$ decays based on the analysis of proton-proton collision data collected with the LHCb detector at centre-of-mass energies of 7, 8 and 13 TeV, corresponding to an integrated luminosity of $9 fb^{-1}$. The $K^0_{\rm S} K π$ invariant-mass distributions of both $B^+$ decay modes show, in the… ▽ More

    Submitted 11 January, 2025; originally announced January 2025.

    Comments: All figures and tables, along with any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2024-045.html (LHCb public pages)

    Report number: LHCb-PAPER-2024-045,CERN-EP-2024-329

  40. arXiv:2501.03880  [pdf, other

    eess.IV cs.CV cs.LG

    SELMA3D challenge: Self-supervised learning for 3D light-sheet microscopy image segmentation

    Authors: Ying Chen, Rami Al-Maskari, Izabela Horvath, Mayar Ali, Luciano Hoher, Kaiyuan Yang, Zengming Lin, Zhiwei Zhai, Mengzhe Shen, Dejin Xun, Yi Wang, Tony Xu, Maged Goubran, Yunheng Wu, Kensaku Mori, Johannes C. Paetzold, Ali Erturk

    Abstract: Recent innovations in light sheet microscopy, paired with developments in tissue clearing techniques, enable the 3D imaging of large mammalian tissues with cellular resolution. Combined with the progress in large-scale data analysis, driven by deep learning, these innovations empower researchers to rapidly investigate the morphological and functional properties of diverse biological samples. Segme… ▽ More

    Submitted 12 January, 2025; v1 submitted 7 January, 2025; originally announced January 2025.

    Comments: 2st version

  41. Search for continuous gravitational waves from known pulsars in the first part of the fourth LIGO-Virgo-KAGRA observing run

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, I. Abouelfettouh, F. Acernese, K. Ackley, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, D. Agarwal, M. Agathos, M. Aghaei Abchouyeh, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, R. A. Alfaidi, A. Al-Jodah, C. Alléné , et al. (1794 additional authors not shown)

    Abstract: Continuous gravitational waves (CWs) emission from neutron stars carries information about their internal structure and equation of state, and it can provide tests of General Relativity. We present a search for CWs from a set of 45 known pulsars in the first part of the fourth LIGO--Virgo--KAGRA observing run, known as O4a. We conducted a targeted search for each pulsar using three independent ana… ▽ More

    Submitted 2 January, 2025; originally announced January 2025.

    Comments: main paper: 12 pages, 6 figures, 4 tables

    Report number: LIGO-P2400315

    Journal ref: Astrophys.J. 983 (2025) 2, 99

  42. arXiv:2501.00525  [pdf, other

    cs.CV

    Is Segment Anything Model 2 All You Need for Surgery Video Segmentation? A Systematic Evaluation

    Authors: Cheng Yuan, Jian Jiang, Kunyi Yang, Lv Wu, Rui Wang, Zi Meng, Haonan Ping, Ziyu Xu, Yifan Zhou, Wanli Song, Hesheng Wang, Qi Dou, Yutong Ban

    Abstract: Surgery video segmentation is an important topic in the surgical AI field. It allows the AI model to understand the spatial information of a surgical scene. Meanwhile, due to the lack of annotated surgical data, surgery segmentation models suffer from limited performance. With the emergence of SAM2 model, a large foundation model for video segmentation trained on natural videos, zero-shot surgical… ▽ More

    Submitted 31 December, 2024; originally announced January 2025.

  43. arXiv:2501.00522  [pdf, other

    cs.CL cs.AI

    TinyHelen's First Curriculum: Training and Evaluating Tiny Language Models in a Simpler Language Environment

    Authors: Ke Yang, Volodymyr Kindratenko, ChengXiang Zhai

    Abstract: Training language models (LMs) and their application agents is increasingly costly due to large datasets and models, making test failures difficult to bear. Simplified language environments serve as primordial training and testing grounds, retaining essential commonsense and communication skills but in a more digestible form, potentially enhancing the learning efficiency of LMs, and thus reducing… ▽ More

    Submitted 31 December, 2024; originally announced January 2025.

  44. arXiv:2412.19979  [pdf, other

    cs.LG cs.CR cs.IT

    Explainable Semantic Federated Learning Enabled Industrial Edge Network for Fire Surveillance

    Authors: Li Dong, Yubo Peng, Feibo Jiang, Kezhi Wang, Kun Yang

    Abstract: In fire surveillance, Industrial Internet of Things (IIoT) devices require transmitting large monitoring data frequently, which leads to huge consumption of spectrum resources. Hence, we propose an Industrial Edge Semantic Network (IESN) to allow IIoT devices to send warnings through Semantic communication (SC). Thus, we should consider (1) Data privacy and security. (2) SC model adaptation for he… ▽ More

    Submitted 27 December, 2024; originally announced December 2024.

    Comments: 9 pages

    Journal ref: IEEE Transactions on Industrial Informatics, vol. 20, no. 12, pp. 14053-14061, Dec. 2024

  45. arXiv:2412.19123  [pdf, other

    cs.SD cs.MM eess.AS

    CoheDancers: Enhancing Interactive Group Dance Generation through Music-Driven Coherence Decomposition

    Authors: Kaixing Yang, Xulong Tang, Haoyu Wu, Qinliang Xue, Biao Qin, Hongyan Liu, Zhaoxin Fan

    Abstract: Dance generation is crucial and challenging, particularly in domains like dance performance and virtual gaming. In the current body of literature, most methodologies focus on Solo Music2Dance. While there are efforts directed towards Group Music2Dance, these often suffer from a lack of coherence, resulting in aesthetically poor dance performances. Thus, we introduce CoheDancers, a novel framework… ▽ More

    Submitted 26 December, 2024; originally announced December 2024.

  46. arXiv:2412.18342  [pdf, other

    cs.CV cs.LG eess.IV

    Mitigating Label Noise using Prompt-Based Hyperbolic Meta-Learning in Open-Set Domain Generalization

    Authors: Kunyu Peng, Di Wen, Sarfraz M. Saquib, Yufan Chen, Junwei Zheng, David Schneider, Kailun Yang, Jiamin Wu, Alina Roitberg, Rainer Stiefelhagen

    Abstract: Open-Set Domain Generalization (OSDG) is a challenging task requiring models to accurately predict familiar categories while minimizing confidence for unknown categories to effectively reject them in unseen domains. While the OSDG field has seen considerable advancements, the impact of label noise--a common issue in real-world datasets--has been largely overlooked. Label noise can mislead model op… ▽ More

    Submitted 24 December, 2024; originally announced December 2024.

    Comments: The source code of this work is released at https://github.com/KPeng9510/HyProMeta

  47. arXiv:2412.18244  [pdf, other

    physics.optics

    Observation of Thouless pumping of light in quasiperiodic photonic crystals

    Authors: Kai Yang, Qidong Fu, Henrique C. Prates, Peng Wang, Yaroslav V. Kartashov, Vladimir V. Konotop, Fangwei Ye

    Abstract: Topological transport is determined by global properties of physical media where it occurs and is characterized by quantized amounts of adiabatically transported quantities. Discovered for periodic potentials it was also explored in disordered and discrete quasi-periodic systems. Here we report on experimental observation of pumping of a light beam in a genuinely continuous incommensurate photoref… ▽ More

    Submitted 24 December, 2024; originally announced December 2024.

    Journal ref: Proceedings of the National Academy of Sciences, 121(47), e2411793121 (2024)

  48. arXiv:2412.16838  [pdf, other

    cs.CL

    Ask-Before-Detection: Identifying and Mitigating Conformity Bias in LLM-Powered Error Detector for Math Word Problem Solutions

    Authors: Hang Li, Tianlong Xu, Kaiqi Yang, Yucheng Chu, Yanling Chen, Yichi Song, Qingsong Wen, Hui Liu

    Abstract: The rise of large language models (LLMs) offers new opportunities for automatic error detection in education, particularly for math word problems (MWPs). While prior studies demonstrate the promise of LLMs as error detectors, they overlook the presence of multiple valid solutions for a single MWP. Our preliminary analysis reveals a significant performance gap between conventional and alternative s… ▽ More

    Submitted 21 December, 2024; originally announced December 2024.

    Comments: 12 pages, 4 figures

  49. arXiv:2412.16746  [pdf

    cs.CY cs.AI

    Beyond Partisan Leaning: A Comparative Analysis of Political Bias in Large Language Models

    Authors: Tai-Quan Peng, Kaiqi Yang, Sanguk Lee, Hang Li, Yucheng Chu, Yuping Lin, Hui Liu

    Abstract: As large language models (LLMs) become increasingly embedded in civic, educational, and political information environments, concerns about their potential political bias have grown. Prior research often evaluates such bias through simulated personas or predefined ideological typologies, which may introduce artificial framing effects or overlook how models behave in general use scenarios. This stud… ▽ More

    Submitted 10 May, 2025; v1 submitted 21 December, 2024; originally announced December 2024.

  50. arXiv:2412.16075  [pdf, other

    cs.AI cs.LG cs.LO

    Formal Mathematical Reasoning: A New Frontier in AI

    Authors: Kaiyu Yang, Gabriel Poesia, Jingxuan He, Wenda Li, Kristin Lauter, Swarat Chaudhuri, Dawn Song

    Abstract: AI for Mathematics (AI4Math) is not only intriguing intellectually but also crucial for AI-driven discovery in science, engineering, and beyond. Extensive efforts on AI4Math have mirrored techniques in NLP, in particular, training large language models on carefully curated math datasets in text form. As a complementary yet less explored avenue, formal mathematical reasoning is grounded in formal s… ▽ More

    Submitted 20 December, 2024; originally announced December 2024.