Skip to main content

Showing 1–50 of 2,862 results for author: Yang, W

.
  1. arXiv:2506.08777  [pdf, ps, other

    cs.CV

    Gaussian2Scene: 3D Scene Representation Learning via Self-supervised Learning with 3D Gaussian Splatting

    Authors: Keyi Liu, Weidong Yang, Ben Fei, Ying He

    Abstract: Self-supervised learning (SSL) for point cloud pre-training has become a cornerstone for many 3D vision tasks, enabling effective learning from large-scale unannotated data. At the scene level, existing SSL methods often incorporate volume rendering into the pre-training framework, using RGB-D images as reconstruction signals to facilitate cross-modal learning. This strategy promotes alignment bet… ▽ More

    Submitted 10 June, 2025; originally announced June 2025.

  2. arXiv:2506.08367  [pdf, ps, other

    astro-ph.IM astro-ph.GA astro-ph.HE astro-ph.SR

    Observatory Science with eXTP

    Authors: Ping Zhou, Jirong Mao, Liang Zhang, Alessandro Patruno, Enrico Bozzo, Yanjun Xu, Andrea Santangelo, Silvia Zane, Shuang-Nan Zhang, Hua Feng, Yuri Cavecchi, Barbara De Marco, Junhui Fan, Xian Hou, Pengfei Jiang, Patrizia Romano, Gloria Sala, Lian Tao, Alexandra Veledina, Jacco Vink, Song Wang, Junxian Wang, Yidi Wang, Shanshan Weng, Qingwen Wu , et al. (75 additional authors not shown)

    Abstract: Scheduled for launch in 2030, the enhanced X-ray Timing and Polarization (eXTP) telescope is a Chinese space-based mission aimed at studying extreme conditions and phenomena in astrophysics. eXTP will feature three main payloads: Spectroscopy Focusing Arrays (SFAs), Polarimetry Focusing Arrays (PFAs), and a Wide-field Camera (W2C). This white paper outlines observatory science, incorporating key s… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

    Comments: Submitted to the SCIENCE CHINA Physics, Mechanics & Astronomy

  3. arXiv:2506.07851  [pdf, ps, other

    cs.CL

    Learning to Focus: Causal Attention Distillation via Gradient-Guided Token Pruning

    Authors: Yiju Guo, Wenkai Yang, Zexu Sun, Ning Ding, Zhiyuan Liu, Yankai Lin

    Abstract: Large language models (LLMs) have demonstrated significant improvements in contextual understanding. However, their ability to attend to truly critical information during long-context reasoning and generation still falls behind the pace. Specifically, our preliminary experiments reveal that certain distracting patterns can misdirect the model's attention during inference, and removing these patter… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

  4. arXiv:2506.06877  [pdf, ps, other

    cs.CL

    Right Is Not Enough: The Pitfalls of Outcome Supervision in Training LLMs for Math Reasoning

    Authors: Jiaxing Guo, Wenjie Yang, Shengzhong Zhang, Tongshan Xu, Lun Du, Da Zheng, Zengfeng Huang

    Abstract: Outcome-rewarded Large Language Models (LLMs) have demonstrated remarkable success in mathematical problem-solving. However, this success often masks a critical issue: models frequently achieve correct answers through fundamentally unsound reasoning processes, a phenomenon indicative of reward hacking. We introduce MathOlympiadEval, a new dataset with fine-grained annotations, which reveals a sign… ▽ More

    Submitted 7 June, 2025; originally announced June 2025.

  5. arXiv:2506.05771  [pdf, ps, other

    astro-ph.HE astro-ph.SR hep-ph

    Detection of multiple X-ray quasi-periodic oscillations in IGR J19294+1816 with Insight-HXMT

    Authors: Wen Yang, Wei Wang

    Abstract: We report the timing results with Insight-HXMT observations of X-ray binary IGR J19294+1816 during its 2019 Type I outburst at the decline phase shortly following its peak. We analyze the light curves and power density spectrum (PDS) of the 2019 observations and reveal a peak at approximately $ν_{NS} \sim 80.2$ mHz, corresponding to X-ray pulsations from the neutron star. In addition, a significan… ▽ More

    Submitted 6 June, 2025; originally announced June 2025.

    Comments: 9 pages, 5 figures, 2 tables, accept for the publication in ApJ

  6. arXiv:2506.05055  [pdf, ps, other

    hep-ex

    Study of $f_1(1420)$ and $η(1405)$ in the decay $J/ψ\to γπ^{0}π^{0}π^{0}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (650 additional authors not shown)

    Abstract: A partial-wave analysis is performed on the decay $J/ψ\toγπ^{0}π^{0}π^{0}$ within the $π^{0}π^{0}π^{0}$ invariant-mass region below 1.6 GeV$/c^{2}$, using $(10.09~\pm~0.04)\times10^{9} ~J/ψ$ events collected with the BESIII detector. Significant isospin-violating decays of $η(1405)$ and $f_1(1420)$ into $f_0(980)π^{0}$ are observed. For the first time, three axial-vectors, $f_1(1285)$,… ▽ More

    Submitted 7 June, 2025; v1 submitted 5 June, 2025; originally announced June 2025.

  7. arXiv:2506.04955  [pdf, ps, other

    math.GR math.DS math.GT

    Hausdorff Dimension of non-conical and Myrberg limit sets

    Authors: Mahan Mj, Wenyuan Yang

    Abstract: In this paper, we develop techniques to study the Hausdorff dimensions of non-conical and Myrberg limit sets for groups acting on negatively curved spaces. We establish maximality of the Hausdorff dimension of the non-conical limit set of $G$ in the following cases. 1. $M$ is a finite volume complete Riemannian manifold of pinched negative curvature and $G$ is an infinite normal subgroups of infin… ▽ More

    Submitted 5 June, 2025; originally announced June 2025.

    Comments: 49 pages, 3 figures

  8. arXiv:2506.03928  [pdf, ps, other

    cs.CV

    Vision Remember: Alleviating Visual Forgetting in Efficient MLLM with Vision Feature Resample

    Authors: Ze Feng, Jiang-Jiang Liu, Sen Yang, Lingyu Xiao, Xiaofan Li, Wankou Yang, Jingdong Wang

    Abstract: In this work, we study the Efficient Multimodal Large Language Model. Redundant vision tokens consume a significant amount of computational memory and resources. Therefore, many previous works compress them in the Vision Projector to reduce the number of vision tokens. However, simply compressing in the Vision Projector can lead to the loss of visual information, especially for tasks that rely on… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

  9. arXiv:2506.03569  [pdf, ps, other

    cs.CL

    MiMo-VL Technical Report

    Authors: Xiaomi LLM-Core Team, :, Zihao Yue, Zhenru Lin, Yifan Song, Weikun Wang, Shuhuai Ren, Shuhao Gu, Shicheng Li, Peidian Li, Liang Zhao, Lei Li, Kainan Bao, Hao Tian, Hailin Zhang, Gang Wang, Dawei Zhu, Cici, Chenhong He, Bowen Ye, Bowen Shen, Zihan Zhang, Zihan Jiang, Zhixian Zheng, Zhichao Song , et al. (50 additional authors not shown)

    Abstract: We open-source MiMo-VL-7B-SFT and MiMo-VL-7B-RL, two powerful vision-language models delivering state-of-the-art performance in both general visual understanding and multimodal reasoning. MiMo-VL-7B-RL outperforms Qwen2.5-VL-7B on 35 out of 40 evaluated tasks, and scores 59.4 on OlympiadBench, surpassing models with up to 78B parameters. For GUI grounding applications, it sets a new standard with… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: 32 pages

  10. arXiv:2506.03532  [pdf, other

    cs.SI cs.CY

    GA-S$^3$: Comprehensive Social Network Simulation with Group Agents

    Authors: Yunyao Zhang, Zikai Song, Hang Zhou, Wenfeng Ren, Yi-Ping Phoebe Chen, Junqing Yu, Wei Yang

    Abstract: Social network simulation is developed to provide a comprehensive understanding of social networks in the real world, which can be leveraged for a wide range of applications such as group behavior emergence, policy optimization, and business strategy development. However, billions of individuals and their evolving interactions involved in social networks pose challenges in accurately reflecting re… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

    Comments: Accepted by Findings of ACL 2025

  11. arXiv:2506.03211  [pdf, ps, other

    cs.CV cs.NI

    Channel-adaptive Cross-modal Generative Semantic Communication for Point Cloud Transmission

    Authors: Wanting Yang, Zehui Xiong, Qianqian Yang, Ping Zhang, Merouane Debbah, Rahim Tafazolli

    Abstract: With the rapid development of autonomous driving and extended reality, efficient transmission of point clouds (PCs) has become increasingly important. In this context, we propose a novel channel-adaptive cross-modal generative semantic communication (SemCom) for PC transmission, called GenSeC-PC. GenSeC-PC employs a semantic encoder that fuses images and point clouds, where images serve as non-tra… ▽ More

    Submitted 2 June, 2025; originally announced June 2025.

  12. arXiv:2506.03109  [pdf, other

    cs.LG

    On Weak-to-Strong Generalization and f-Divergence

    Authors: Wei Yao, Gengze Xu, Huayi Tang, Wenkai Yang, Donglin Di, Ziqiao Wang, Yong Liu

    Abstract: Weak-to-strong generalization (W2SG) has emerged as a promising paradigm for stimulating the capabilities of strong pre-trained models by leveraging supervision from weaker supervisors. To improve the performance of the strong model, existing methods often require additional weak models or complex procedures, leading to substantial computational and memory overhead. Motivated by the effectiveness… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

  13. arXiv:2506.03007  [pdf, other

    cs.CV

    DFBench: Benchmarking Deepfake Image Detection Capability of Large Multimodal Models

    Authors: Jiarui Wang, Huiyu Duan, Juntong Wang, Ziheng Jia, Woo Yi Yang, Xiaorong Zhu, Yu Zhao, Jiaying Qian, Yuke Xing, Guangtao Zhai, Xiongkuo Min

    Abstract: With the rapid advancement of generative models, the realism of AI-generated images has significantly improved, posing critical challenges for verifying digital content authenticity. Current deepfake detection methods often depend on datasets with limited generation models and content diversity that fail to keep pace with the evolving complexity and increasing realism of the AI-generated content.… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

  14. arXiv:2506.02577  [pdf, ps, other

    cs.LG

    Reachability Weighted Offline Goal-conditioned Resampling

    Authors: Wenyan Yang, Joni Pajarinen

    Abstract: Offline goal-conditioned reinforcement learning (RL) relies on fixed datasets where many potential goals share the same state and action spaces. However, these potential goals are not explicitly represented in the collected trajectories. To learn a generalizable goal-conditioned policy, it is common to sample goals and state-action pairs uniformly using dynamic programming methods such as Q-learni… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

  15. arXiv:2506.01488  [pdf, ps, other

    cs.CL cs.IR

    Argument-Centric Causal Intervention Method for Mitigating Bias in Cross-Document Event Coreference Resolution

    Authors: Long Yao, Wenzhong Yang, Yabo Yin, Fuyuan Wei, Hongzhen Lv, Jiaren Peng, Liejun Wang, Xiaoming Tao

    Abstract: Cross-document Event Coreference Resolution (CD-ECR) is a fundamental task in natural language processing (NLP) that seeks to determine whether event mentions across multiple documents refer to the same real-world occurrence. However, current CD-ECR approaches predominantly rely on trigger features within input mention pairs, which induce spurious correlations between surface-level lexical feature… ▽ More

    Submitted 2 June, 2025; originally announced June 2025.

  16. arXiv:2506.01116  [pdf, ps, other

    cs.AI q-bio.QM

    ChemAU: Harness the Reasoning of LLMs in Chemical Research with Adaptive Uncertainty Estimation

    Authors: Xinyi Liu, Lipeng Ma, Yixuan Li, Weidong Yang, Qingyuan Zhou, Jiayi Song, Shuhao Li, Ben Fei

    Abstract: Large Language Models (LLMs) are widely used across various scenarios due to their exceptional reasoning capabilities and natural language understanding. While LLMs demonstrate strong performance in tasks involving mathematics and coding, their effectiveness diminishes significantly when applied to chemistry-related problems. Chemistry problems typically involve long and complex reasoning steps, w… ▽ More

    Submitted 1 June, 2025; originally announced June 2025.

  17. arXiv:2505.24586  [pdf, ps, other

    astro-ph.HE

    All-sky search for individual Primordial Black Hole bursts with LHAASO

    Authors: Zhen Cao, F. Aharonian, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, W. Bian, A. V. Bukevich, C. M. Cai, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, G. H. Chen, H. X. Chen, Liang Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. Chen, S. H. Chen , et al. (293 additional authors not shown)

    Abstract: Primordial Black Holes~(PBHs) are hypothetical black holes with a wide range of masses that formed in the early universe. As a result, they may play an important cosmological role and provide a unique probe of the early universe. A PBH with an initial mass of approximately $10^{15}$~g is expected to explode today in a final burst of Hawking radiation. In this work, we conduct an all-sky search for… ▽ More

    Submitted 2 June, 2025; v1 submitted 30 May, 2025; originally announced May 2025.

    Comments: 8 pages, 2 figures

  18. arXiv:2505.24208  [pdf, ps, other

    cs.AI

    Bootstrapping LLM Robustness for VLM Safety via Reducing the Pretraining Modality Gap

    Authors: Wenhan Yang, Spencer Stice, Ali Payani, Baharan Mirzasoleiman

    Abstract: Ensuring Vision-Language Models (VLMs) generate safe outputs is crucial for their reliable deployment. However, LVLMs suffer from drastic safety degradation compared to their LLM backbone. Even blank or irrelevant images can trigger LVLMs to generate harmful responses to prompts that would otherwise be refused in text-only contexts. The modality gap between image and text representations has been… ▽ More

    Submitted 30 May, 2025; originally announced May 2025.

  19. arXiv:2505.23677  [pdf, other

    astro-ph.HE

    Optical Photometric Monitoring of the Blazar OT 355 and Local Standard Stars' Calibration

    Authors: R. Bachev, Tushar Tripathi, Alok C. Gupta, A. Kurtenkov, Y. Nikolov, A. Strigachev, S. Boeva, G. Latev, B. Spassov, M. Minev, E. Ovcharov, W. -X. Yang, Yi Liu, J. -H. Fan

    Abstract: OT 355 (4FGL J1734.3 + 3858) is a relatively rarely studied but highly variable, moderate-redshift (z = 0.975) flat-spectrum radio quasar (blazar). With this work, we aim to study its optical variability on different timescales, which can help us to better understand the physical processes in relativistic jets operating in blazar-type active galactic nuclei. OT 355 was observed in four colors (BVR… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

    Comments: To be published in Universe

  20. arXiv:2505.23522  [pdf, ps, other

    cs.CV cs.LG

    OmniEarth-Bench: Towards Holistic Evaluation of Earth's Six Spheres and Cross-Spheres Interactions with Multimodal Observational Earth Data

    Authors: Fengxiang Wang, Mingshuo Chen, Xuming He, YiFan Zhang, Feng Liu, Zijie Guo, Zhenghao Hu, Jiong Wang, Jingyi Xu, Zhangrui Li, Fenghua Ling, Ben Fei, Weijia Li, Long Lan, Wenjing Yang, Wenlong Zhang, Lei Bai

    Abstract: Existing benchmarks for Earth science multimodal learning exhibit critical limitations in systematic coverage of geosystem components and cross-sphere interactions, often constrained to isolated subsystems (only in Human-activities sphere or atmosphere) with limited evaluation dimensions (less than 16 tasks). To address these gaps, we introduce OmniEarth-Bench, the first comprehensive multimodal b… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

  21. arXiv:2505.23453  [pdf, ps, other

    quant-ph

    Enhancing the sensitivity of quantum optomechanical gyroscope by optical Kerr effect

    Authors: Ying Liu, Rui Zhang, Wen-Quan Yang, Ya-Feng Jiao, Wang-Jun Lu, Qing-Shou Tan, Le-Man Kuang

    Abstract: We propose a theoretical scheme to enhance the sensitivity of a quantum optomechanical gyroscope (QOMG) by optical Kerr effect. We utilize quantum Fisher information (QFI) to evaluate the metrological potential of the QOMG scheme. It is found that the Kerr interaction can significantly enhances the sensitivity of the QOMG. We observe the super-Hesenberg scaling of parameter estimation precision. F… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

    Comments: 12 pages, 10 figures

  22. arXiv:2505.23186  [pdf, other

    cs.CV

    HiGarment: Cross-modal Harmony Based Diffusion Model for Flat Sketch to Realistic Garment Image

    Authors: Junyi Guo, Jingxuan Zhang, Fangyu Wu, Huanda Lu, Qiufeng Wang, Wenmian Yang, Eng Gee Lim, Dongming Lu

    Abstract: Diffusion-based garment synthesis tasks primarily focus on the design phase in the fashion domain, while the garment production process remains largely underexplored. To bridge this gap, we introduce a new task: Flat Sketch to Realistic Garment Image (FS2RG), which generates realistic garment images by integrating flat sketches and textual guidance. FS2RG presents two key challenges: 1) fabric cha… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

  23. arXiv:2505.23048  [pdf, ps, other

    cs.LG

    ProDiff: Prototype-Guided Diffusion for Minimal Information Trajectory Imputation

    Authors: Tianci Bu, Le Zhou, Wenchuan Yang, Jianhong Mou, Kang Yang, Suoyi Tan, Feng Yao, Jingyuan Wang, Xin Lu

    Abstract: Trajectory data is crucial for various applications but often suffers from incompleteness due to device limitations and diverse collection scenarios. Existing imputation methods rely on sparse trajectory or travel information, such as velocity, to infer missing points. However, these approaches assume that sparse trajectories retain essential behavioral patterns, which place significant demands on… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

  24. arXiv:2505.22882  [pdf, other

    cs.RO

    TwinTrack: Bridging Vision and Contact Physics for Real-Time Tracking of Unknown Dynamic Objects

    Authors: Wen Yang, Zhixian Xie, Xuechao Zhang, Heni Ben Amor, Shan Lin, Wanxin Jin

    Abstract: Real-time tracking of previously unseen, highly dynamic objects in contact-rich environments -- such as during dexterous in-hand manipulation -- remains a significant challenge. Purely vision-based tracking often suffers from heavy occlusions due to the frequent contact interactions and motion blur caused by abrupt motion during contact impacts. We propose TwinTrack, a physics-aware visual trackin… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

  25. arXiv:2505.21375  [pdf, ps, other

    cs.CV

    GeoLLaVA-8K: Scaling Remote-Sensing Multimodal Large Language Models to 8K Resolution

    Authors: Fengxiang Wang, Mingshuo Chen, Yueying Li, Di Wang, Haotian Wang, Zonghao Guo, Zefan Wang, Boqi Shan, Long Lan, Yulin Wang, Hongzhen Wang, Wenjing Yang, Bo Du, Jing Zhang

    Abstract: Ultra-high-resolution (UHR) remote sensing (RS) imagery offers valuable data for Earth observation but pose challenges for existing multimodal foundation models due to two key bottlenecks: (1) limited availability of UHR training data, and (2) token explosion caused by the large image size. To address data scarcity, we introduce SuperRS-VQA (avg. 8,376$\times$8,376) and HighRS-VQA (avg. 2,000… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

  26. arXiv:2505.21333  [pdf, other

    cs.CV

    MME-VideoOCR: Evaluating OCR-Based Capabilities of Multimodal LLMs in Video Scenarios

    Authors: Yang Shi, Huanqian Wang, Wulin Xie, Huanyao Zhang, Lijie Zhao, Yi-Fan Zhang, Xinfeng Li, Chaoyou Fu, Zhuoer Wen, Wenting Liu, Zhuoran Zhang, Xinlong Chen, Bohan Zeng, Sihan Yang, Yuanxing Zhang, Pengfei Wan, Haotian Wang, Wenjing Yang

    Abstract: Multimodal Large Language Models (MLLMs) have achieved considerable accuracy in Optical Character Recognition (OCR) from static images. However, their efficacy in video OCR is significantly diminished due to factors such as motion blur, temporal variations, and visual effects inherent in video content. To provide clearer guidance for training practical MLLMs, we introduce the MME-VideoOCR benchmar… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

    Comments: preprint

  27. arXiv:2505.21172  [pdf, ps, other

    cs.CL

    TAT-R1: Terminology-Aware Translation with Reinforcement Learning and Word Alignment

    Authors: Zheng Li, Mao Zheng, Mingyang Song, Wenjie Yang

    Abstract: Recently, deep reasoning large language models(LLMs) like DeepSeek-R1 have made significant progress in tasks such as mathematics and coding. Inspired by this, several studies have employed reinforcement learning(RL) to enhance models' deep reasoning capabilities and improve machine translation(MT) quality. However, the terminology translation, an essential task in MT, remains unexplored in deep r… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

  28. arXiv:2505.20777  [pdf, ps, other

    cs.CV

    TACO: Think-Answer Consistency for Optimized Long-Chain Reasoning and Efficient Data Learning via Reinforcement Learning in LVLMs

    Authors: Zhehan Kan, Yanlin Liu, Kun Yin, Xinghua Jiang, Xin Li, Haoyu Cao, Yinsong Liu, Deqiang Jiang, Xing Sun, Qingmin Liao, Wenming Yang

    Abstract: DeepSeek R1 has significantly advanced complex reasoning for large language models (LLMs). While recent methods have attempted to replicate R1's reasoning capabilities in multimodal settings, they face limitations, including inconsistencies between reasoning and final answers, model instability and crashes during long-chain exploration, and low data learning efficiency. To address these challenges… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

  29. arXiv:2505.20773  [pdf, ps, other

    cs.IR

    Cold-Start Recommendation with Knowledge-Guided Retrieval-Augmented Generation

    Authors: Wooseong Yang, Weizhi Zhang, Yuqing Liu, Yuwei Han, Yu Wang, Junhyun Lee, Philip S. Yu

    Abstract: Cold-start items remain a persistent challenge in recommender systems due to their lack of historical user interactions, which collaborative models rely on. While recent zero-shot methods leverage large language models (LLMs) to address this, they often struggle with sparse metadata and hallucinated or incomplete knowledge. We propose ColdRAG, a retrieval-augmented generation approach that builds… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

    Comments: 10 pages

    MSC Class: 68T05 68T05

  30. arXiv:2505.20114  [pdf

    physics.app-ph

    Tri-band Aperture-shared Antenna Array Using Scalable FSS-based Electromagnetic Transparent Structure

    Authors: Yongzheng Li, Wanchen Yang, Quan Xue, Wenquan Che

    Abstract: A novel dual-band electromagnetic transparent structure (DBTS) is firstly proposed and used to realize a tri-band aperture-shared antenna array. The DBTS achieves two tunable electromagnetic transparent frequency bands by periodically loading capacitive patches and meander lines to an inductive strip. Meanwhile, the DBTS features flexible frequency band scalability by loading additional serial L-C… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: This manuscript has been submitted to the IEEE Antennas and Wireless Propagation Letters. Under review currently

  31. arXiv:2505.19849  [pdf, other

    cs.IR

    HIT Model: A Hierarchical Interaction-Enhanced Two-Tower Model for Pre-Ranking Systems

    Authors: Haoqiang Yang, Congde Yuan, Kun Bai, Mengzhuo Guo, Wei Yang, Chao Zhou

    Abstract: Online display advertising platforms rely on pre-ranking systems to efficiently filter and prioritize candidate ads from large corpora, balancing relevance to users with strict computational constraints. The prevailing two-tower architecture, though highly efficient due to its decoupled design and pre-caching, suffers from cross-domain interaction and coarse similarity metrics, undermining its cap… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: 7 pages

  32. arXiv:2505.19505  [pdf, other

    cs.IR cs.AI

    Hierarchical Tree Search-based User Lifelong Behavior Modeling on Large Language Model

    Authors: Yu Xia, Rui Zhong, Hao Gu, Wei Yang, Chi Lu, Peng Jiang, Kun Gai

    Abstract: Large Language Models (LLMs) have garnered significant attention in Recommendation Systems (RS) due to their extensive world knowledge and robust reasoning capabilities. However, a critical challenge lies in enabling LLMs to effectively comprehend and extract insights from massive user behaviors. Current approaches that directly leverage LLMs for user interest learning face limitations in handling… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

  33. arXiv:2505.19491  [pdf, ps, other

    cs.LG stat.ML

    Discounted Online Convex Optimization: Uniform Regret Across a Continuous Interval

    Authors: Wenhao Yang, Sifan Yang, Lijun Zhang

    Abstract: Reflecting the greater significance of recent history over the distant past in non-stationary environments, $λ$-discounted regret has been introduced in online convex optimization (OCO) to gracefully forget past data as new information arrives. When the discount factor $λ$ is given, online gradient descent with an appropriate step size achieves an $O(1/\sqrt{1-λ})$ discounted regret. However, the… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

  34. arXiv:2505.19293  [pdf, ps, other

    cs.CL cs.AI cs.LG

    100-LongBench: Are de facto Long-Context Benchmarks Literally Evaluating Long-Context Ability?

    Authors: Wang Yang, Hongye Jin, Shaochen Zhong, Song Jiang, Qifan Wang, Vipin Chaudhary, Xiaotian Han

    Abstract: Long-context capability is considered one of the most important abilities of LLMs, as a truly long context-capable LLM enables users to effortlessly process many originally exhausting tasks -- e.g., digesting a long-form document to find answers vs. directly asking an LLM about it. However, existing real-task-based long-context evaluation benchmarks have two major shortcomings. First, benchmarks l… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

  35. arXiv:2505.18506  [pdf

    physics.app-ph

    Capacity Enhancement Analysis and Implementation of a 3D Array Based on Miniaturized Dipole Antennas

    Authors: Yongzheng Li, Wanchen Yang, Shuai S. A. Yuan, Zhitao Ye, Chongwen Huang, Xiaoming Chen, Wenquan Che, Wei E. I. Sha

    Abstract: Theoretically, the three-dimensional (3D) array architecture provides a higher communication degree of freedom (DoF) compared to the planar arrays, allowing for greater capacity potential in multiple-input multiple-output (MIMO) systems. However, in practical implementations, the upper elements of 3D arrays significantly degrade the performance of the lower elements, leading to increased inter-ele… ▽ More

    Submitted 24 May, 2025; originally announced May 2025.

    Comments: This manuscript hvae been submitted to IEEE Transactions on Antennas and Propagation. Under review currently

  36. arXiv:2505.18004  [pdf, ps, other

    hep-ex

    Measurement of branching fractions of $Λ_{c}^{+}$ decays to $Σ^{+} η$ and $Σ^{+} η'$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (644 additional authors not shown)

    Abstract: By analyzing $e^+e^-$ collision data taken at center-of-mass energies $\sqrt{s} = 4.600 \sim 4.699$ $\mbox{GeV}$ with the BESIII detector at the BEPCII collider, corresponding to an integrated luminosity of $\rm 4.5~fb^{-1}$, we study the hadronic decays $Λ_{c}^{+} \rightarrow Σ^{+} η$ and $Λ_{c}^{+} \rightarrow Σ^{+} η^{\prime}$ using the single-tag method. The branching fraction ratio of… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

  37. arXiv:2505.17614  [pdf, ps, other

    cs.CV

    PathoSCOPE: Few-Shot Pathology Detection via Self-Supervised Contrastive Learning and Pathology-Informed Synthetic Embeddings

    Authors: Sinchee Chin, Yinuo Ma, Xiaochen Yang, Jing-Hao Xue, Wenming Yang

    Abstract: Unsupervised pathology detection trains models on non-pathological data to flag deviations as pathologies, offering strong generalizability for identifying novel diseases and avoiding costly annotations. However, building reliable normality models requires vast healthy datasets, as hospitals' data is inherently biased toward symptomatic populations, while privacy regulations hinder the assembly of… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

  38. arXiv:2505.17315  [pdf, ps, other

    cs.AI cs.CL cs.LG

    Longer Context, Deeper Thinking: Uncovering the Role of Long-Context Ability in Reasoning

    Authors: Wang Yang, Zirui Liu, Hongye Jin, Qingyu Yin, Vipin Chaudhary, Xiaotian Han

    Abstract: Recent language models exhibit strong reasoning capabilities, yet the influence of long-context capacity on reasoning remains underexplored. In this work, we hypothesize that current limitations in reasoning stem, in part, from insufficient long-context capacity, motivated by empirical observations such as (1) higher context window length often leads to stronger reasoning performance, and (2) fail… ▽ More

    Submitted 22 May, 2025; originally announced May 2025.

  39. arXiv:2505.17296  [pdf, other

    cs.CL cs.LG

    SELF: Self-Extend the Context Length With Logistic Growth Function

    Authors: Phat Thanh Dang, Saahil Thoppay, Wang Yang, Qifan Wang, Vipin Chaudhary, Xiaotian Han

    Abstract: Large language models suffer issues when operated on long contexts that are larger than their training context length due to the standard position encoding for tokens in the attention layer. Tokens a long distance apart will rarely have an effect on each other and long prompts yield unexpected results. To solve this problem, we propose SELF (Self-Extend the Context Length With Logistic Growth Func… ▽ More

    Submitted 22 May, 2025; originally announced May 2025.

    Comments: 11 pages, 5 figures, 3 tables

  40. arXiv:2505.16826  [pdf, ps, other

    cs.AI cs.CL

    KTAE: A Model-Free Algorithm to Key-Tokens Advantage Estimation in Mathematical Reasoning

    Authors: Wei Sun, Wen Yang, Pu Jian, Qianlong Du, Fuwei Cui, Shuo Ren, Jiajun Zhang

    Abstract: Recent advances have demonstrated that integrating reinforcement learning with rule-based rewards can significantly enhance the reasoning capabilities of large language models, even without supervised fine-tuning. However, prevalent reinforcement learning algorithms such as GRPO and its variants like DAPO, suffer from a coarse granularity issue when computing the advantage. Specifically, they comp… ▽ More

    Submitted 22 May, 2025; originally announced May 2025.

  41. arXiv:2505.16637  [pdf, ps, other

    cs.CL cs.AI cs.LG

    SSR-Zero: Simple Self-Rewarding Reinforcement Learning for Machine Translation

    Authors: Wenjie Yang, Mao Zheng, Mingyang Song, Zheng Li

    Abstract: Large language models (LLMs) have recently demonstrated remarkable capabilities in machine translation (MT). However, most advanced MT-specific LLMs heavily rely on external supervision signals during training, such as human-annotated reference data or trained reward models (RMs), which are often expensive to obtain and challenging to scale. To overcome this limitation, we propose a Simple Self-Re… ▽ More

    Submitted 23 May, 2025; v1 submitted 22 May, 2025; originally announced May 2025.

  42. arXiv:2505.14988  [pdf, ps, other

    hep-ex

    Test of local realism via entangled $Λ\barΛ$ system

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (597 additional authors not shown)

    Abstract: The non-locality of quantum correlations is a fundamental feature of quantum theory. The Bell inequality serves as a benchmark for distinguishing between predictions made by quantum theory and local hidden variable theory (LHVT). Recent advancements in photon-entanglement experiments have addressed potential loopholes and have observed significant violations of variants of Bell inequality. However… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

  43. arXiv:2505.14552  [pdf, other

    cs.CL cs.AI cs.LG

    KORGym: A Dynamic Game Platform for LLM Reasoning Evaluation

    Authors: Jiajun Shi, Jian Yang, Jiaheng Liu, Xingyuan Bu, Jiangjie Chen, Junting Zhou, Kaijing Ma, Zhoufutu Wen, Bingli Wang, Yancheng He, Liang Song, Hualei Zhu, Shilong Li, Xingjian Wang, Wei Zhang, Ruibin Yuan, Yifan Yao, Wenjun Yang, Yunli Wang, Siyuan Fang, Siyu Yuan, Qianyu He, Xiangru Tang, Yingshui Tan, Wangchunshu Zhou , et al. (4 additional authors not shown)

    Abstract: Recent advancements in large language models (LLMs) underscore the need for more comprehensive evaluation methods to accurately assess their reasoning capabilities. Existing benchmarks are often domain-specific and thus cannot fully capture an LLM's general reasoning potential. To address this limitation, we introduce the Knowledge Orthogonal Reasoning Gymnasium (KORGym), a dynamic evaluation plat… ▽ More

    Submitted 21 May, 2025; v1 submitted 20 May, 2025; originally announced May 2025.

    Comments: 22 pages

  44. arXiv:2505.14447  [pdf, ps, other

    astro-ph.HE hep-ex

    First Identification and Precise Spectral Measurement of the Proton Component in the Cosmic-Ray `Knee'

    Authors: The LHAASO Collaboration, Zhen Cao, F. Aharonian, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, W. Bian, A. V. Bukevich, C. M. Cai, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, G. H. Chen, H. X. Chen, Liang Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. Chen , et al. (292 additional authors not shown)

    Abstract: We report the first high-purity identification of cosmic-ray (CR) protons and a precise measurement of their energy spectrum from 0.15 to 12 PeV using the Large High Altitude Air Shower Observatory (LHAASO). Abundant event statistics, combined with the simultaneous detection of electrons/photons, muons, and Cherenkov light in air showers, enable spectroscopic measurements with statistical and syst… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

  45. arXiv:2505.13350  [pdf, ps, other

    cs.RO

    Approximating Global Contact-Implicit MPC via Sampling and Local Complementarity

    Authors: Sharanya Venkatesh, Bibit Bianchini, Alp Aydinoglu, William Yang, Michael Posa

    Abstract: To achieve general-purpose dexterous manipulation, robots must rapidly devise and execute contact-rich behaviors. Existing model-based controllers are incapable of globally optimizing in real-time over the exponential number of possible contact sequences. Instead, recent progress in contact-implicit control has leveraged simpler models that, while still hybrid, make local approximations. However,… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

    Comments: S.V. and B.B. contributed equally to this work. Project page: https://approximating-global-ci-mpc.github.io

  46. arXiv:2505.13222  [pdf, ps, other

    hep-ex

    Partial Wave Analysis of $e^{+}e^{-} \rightarrow π^{+}π^{-}J/ψ$ and Cross Section Measurement of $e^{+}e^{-} \rightarrow π^{\pm}Z_{c}(3900)^{\mp}$ from 4.1271 to 4.3583 GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (639 additional authors not shown)

    Abstract: Based on 12.0 $\mathrm{fb^{-1}}$ of $e^{+}e^{-}$ collision data samples collected by the BESIII detector at center-of-mass energies from 4.1271 to 4.3583 GeV, a partial wave analysis is performed for the process $e^{+}e^{-} \rightarrow π^{+}π^{-}J/ψ$. The cross sections for the sub processes ${e^{+}e^{-}\rightarrowπ^{+}Z_{c}(3900)^{-}+c.c.\rightarrowπ^{+}π^{-}J/ψ}$,… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

  47. arXiv:2505.12860  [pdf, ps, other

    cs.CV eess.IV

    Towards a Universal Image Degradation Model via Content-Degradation Disentanglement

    Authors: Wenbo Yang, Zhongling Wang, Zhou Wang

    Abstract: Image degradation synthesis is highly desirable in a wide variety of applications ranging from image restoration to simulating artistic effects. Existing models are designed to generate one specific or a narrow set of degradations, which often require user-provided degradation parameters. As a result, they lack the generalizability to synthesize degradations beyond their initial design or adapt to… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

  48. arXiv:2505.12098  [pdf, other

    cs.CV

    LOVE: Benchmarking and Evaluating Text-to-Video Generation and Video-to-Text Interpretation

    Authors: Jiarui Wang, Huiyu Duan, Ziheng Jia, Yu Zhao, Woo Yi Yang, Zicheng Zhang, Zijian Chen, Juntong Wang, Yuke Xing, Guangtao Zhai, Xiongkuo Min

    Abstract: Recent advancements in large multimodal models (LMMs) have driven substantial progress in both text-to-video (T2V) generation and video-to-text (V2T) interpretation tasks. However, current AI-generated videos (AIGVs) still exhibit limitations in terms of perceptual quality and text-video alignment. Therefore, a reliable and scalable automatic model for AIGV evaluation is desirable, which heavily r… ▽ More

    Submitted 17 May, 2025; originally announced May 2025.

  49. Seasonal Forecasting of Pan-Arctic Sea Ice with State Space Model

    Authors: Wei Wang, Weidong Yang, Lei Wang, Guihua Wang, Ruibo Lei

    Abstract: The rapid decline of Arctic sea ice resulting from anthropogenic climate change poses significant risks to indigenous communities, ecosystems, and the global climate system. This situation emphasizes the immediate necessity for precise seasonal sea ice forecasts. While dynamical models perform well for short-term forecasts, they encounter limitations in long-term forecasts and are computationally… ▽ More

    Submitted 15 May, 2025; originally announced May 2025.

    Comments: This paper is published in npj Climate and Atmospheric Science: https://www.nature.com/articles/s41612-025-01058-0#Sec16 Supplementary information: https://static-content.springer.com/esm/art%3A10.1038%2Fs41612-025-01058-0/MediaObjects/41612_2025_1058_MOESM1_ESM.pdf

    Journal ref: npj Clim Atmos Sci 8, 172 (2025)

  50. arXiv:2505.09965  [pdf, ps, other

    cs.CV

    MambaControl: Anatomy Graph-Enhanced Mamba ControlNet with Fourier Refinement for Diffusion-Based Disease Trajectory Prediction

    Authors: Hao Yang, Tao Tan, Shuai Tan, Weiqin Yang, Kunyan Cai, Calvin Chen, Yue Sun

    Abstract: Modelling disease progression in precision medicine requires capturing complex spatio-temporal dynamics while preserving anatomical integrity. Existing methods often struggle with longitudinal dependencies and structural consistency in progressive disorders. To address these limitations, we introduce MambaControl, a novel framework that integrates selective state-space modelling with diffusion pro… ▽ More

    Submitted 15 May, 2025; originally announced May 2025.