Skip to main content

Showing 1–50 of 7,791 results for author: Xu, Y

.
  1. arXiv:2507.04978  [pdf, ps, other

    cs.CV

    Parameterized Diffusion Optimization enabled Autoregressive Ordinal Regression for Diabetic Retinopathy Grading

    Authors: Qinkai Yu, Wei Zhou, Hantao Liu, Yanyu Xu, Meng Wang, Yitian Zhao, Huazhu Fu, Xujiong Ye, Yalin Zheng, Yanda Meng

    Abstract: As a long-term complication of diabetes, diabetic retinopathy (DR) progresses slowly, potentially taking years to threaten vision. An accurate and robust evaluation of its severity is vital to ensure prompt management and care. Ordinal regression leverages the underlying inherent order between categories to achieve superior performance beyond traditional classification. However, there exist challe… ▽ More

    Submitted 7 July, 2025; originally announced July 2025.

    Comments: MICCAI 2025

  2. arXiv:2507.04618  [pdf, ps, other

    astro-ph.IM astro-ph.CO

    Introduction to the China Space Station Telescope (CSST)

    Authors: CSST Collaboration, Yan Gong, Haitao Miao, Hu Zhan, Zhao-Yu Li, Jinyi Shangguan, Haining Li, Chao Liu, Xuefei Chen, Haibo Yuan, Jilin Zhou, Hui-Gen Liu, Cong Yu, Jianghui Ji, Zhaoxiang Qi, Jiacheng Liu, Zigao Dai, Xiaofeng Wang, Zhenya Zheng, Lei Hao, Jiangpei Dou, Yiping Ao, Zhenhui Lin, Kun Zhang, Wei Wang , et al. (88 additional authors not shown)

    Abstract: The China Space Station Telescope (CSST) is a next-generation Stage-IV sky survey telescope, distinguished by its large field of view (FoV), high image quality, and multi-band observation capabilities. It can simultaneously conduct precise measurements of the Universe by performing multi-color photometric imaging and slitless spectroscopic surveys. The CSST is equipped with five scientific instrum… ▽ More

    Submitted 6 July, 2025; originally announced July 2025.

    Comments: 44 pages, 12 figures, 1 table

  3. arXiv:2507.04294  [pdf, ps, other

    cs.IR

    BiFair: A Fairness-aware Training Framework for LLM-enhanced Recommender Systems via Bi-level Optimization

    Authors: Jiaming Zhang, Yuyuan Li, Yiqun Xu, Li Zhang, Xiaohua Feng, Zhifei Ren, Chaochao Chen

    Abstract: Large Language Model-enhanced Recommender Systems (LLM-enhanced RSs) have emerged as a powerful approach to improving recommendation quality by leveraging LLMs to generate item representations. Despite these advancements, the integration of LLMs raises severe fairness concerns. Existing studies reveal that LLM-based RSs exhibit greater unfairness than traditional RSs, yet fairness issues in LLM-en… ▽ More

    Submitted 6 July, 2025; originally announced July 2025.

  4. arXiv:2507.04290  [pdf, ps, other

    cs.CV

    MPQ-DMv2: Flexible Residual Mixed Precision Quantization for Low-Bit Diffusion Models with Temporal Distillation

    Authors: Weilun Feng, Chuanguang Yang, Haotong Qin, Yuqi Li, Xiangqi Li, Zhulin An, Libo Huang, Boyu Diao, Fuzhen Zhuang, Michele Magno, Yongjun Xu, Yingli Tian, Tingwen Huang

    Abstract: Diffusion models have demonstrated remarkable performance on vision generation tasks. However, the high computational complexity hinders its wide application on edge devices. Quantization has emerged as a promising technique for inference acceleration and memory reduction. However, existing quantization methods do not generalize well under extremely low-bit (2-4 bit) quantization. Directly applyin… ▽ More

    Submitted 6 July, 2025; originally announced July 2025.

  5. arXiv:2507.04287  [pdf

    cond-mat.supr-con cond-mat.mes-hall cond-mat.str-el

    Signature of gate tunable superconducting network in twisted bilayer graphene

    Authors: Yingbo Wang, Yingzhuo Han, Lu Cao, Xun-Jiang Luo, Yucheng Xue, Jiefei Shi, Xiaomeng Wang, Xiangjia Bai, Junnan Jiang, Ziyi Tian, Kenji Watanabe, Takashi Taniguchi, Fengcheng Wu, Qing-feng Sun, Hong-Jun Gao, Yuhang Jiang, Jinhai Mao

    Abstract: Twisted van der Waals materials provide a tunable platform for investigating two-dimensional superconductivity and quantum phases. Using spectra-imaging scanning tunneling microscopy, we study the superconducting states in twisted bilayer graphene and track their evolution from insulating phases. Gate-dependent spectroscopic measurements reveal two distinct regimes: under-doped (ν = -2.3) and opti… ▽ More

    Submitted 6 July, 2025; originally announced July 2025.

    Comments: 18 pages, 4 figures

  6. arXiv:2507.04220  [pdf, ps, other

    math.CT

    Extriangulated factorization systems, $s$-torsion pairs and recollements

    Authors: Yan Xu, Haicheng Zhang, Zhiwei Zhu

    Abstract: We introduce extriangulated factorization systems in extriangulated categories and show that there exists a bijection between $s$-torsion pairs and extriangulated factorization systems. We also consider the gluing of $s$-torsion pairs and extriangulated factorization systems under recollements of extriangulated categories.

    Submitted 5 July, 2025; originally announced July 2025.

    Comments: 14 pages

  7. arXiv:2507.04067  [pdf, ps, other

    cs.AI cs.MA

    HAWK: A Hierarchical Workflow Framework for Multi-Agent Collaboration

    Authors: Yuyang Cheng, Yumiao Xu, Chaojia Yu, Yong Zhao

    Abstract: Contemporary multi-agent systems encounter persistent challenges in cross-platform interoperability, dynamic task scheduling, and efficient resource sharing. Agents with heterogeneous implementations often lack standardized interfaces; collaboration frameworks remain brittle and hard to extend; scheduling policies are static; and inter-agent state synchronization is insufficient. We propose Hierar… ▽ More

    Submitted 5 July, 2025; originally announced July 2025.

    Comments: AgentIR@SIGIR 2025

  8. arXiv:2507.03916  [pdf, ps, other

    cs.AI cs.CV

    Animation Needs Attention: A Holistic Approach to Slides Animation Comprehension with Visual-Language Models

    Authors: Yifan Jiang, Yibo Xue, Yukun Kang, Pin Zheng, Jian Peng, Feiran Wu, Changliang Xu

    Abstract: Slide animations, such as fade-ins, fly-ins, and wipes, are critical for audience engagement, efficient information delivery, and vivid visual expression. However, most AI-driven slide-generation tools still lack native animation support, and existing vision-language models (VLMs) struggle with animation tasks due to the absence of public datasets and limited temporal-reasoning capabilities. To ad… ▽ More

    Submitted 5 July, 2025; originally announced July 2025.

    Comments: Appendix at: https://github.com/PAMPAS-Lab/ANA-PPT-Anamation/blob/main/Appendix.pdf

    MSC Class: 68T01

  9. arXiv:2507.03407  [pdf

    cs.AI q-bio.QM

    Artificial intelligence in drug discovery: A comprehensive review with a case study on hyperuricemia, gout arthritis, and hyperuricemic nephropathy

    Authors: Junwei Su, Cheng Xin, Ao Shang, Shan Wu, Zhenzhen Xie, Ruogu Xiong, Xiaoyu Xu, Cheng Zhang, Guang Chen, Yau-Tuen Chan, Guoyi Tang, Ning Wang, Yong Xu, Yibin Feng

    Abstract: This paper systematically reviews recent advances in artificial intelligence (AI), with a particular focus on machine learning (ML), across the entire drug discovery pipeline. Due to the inherent complexity, escalating costs, prolonged timelines, and high failure rates of traditional drug discovery methods, there is a critical need to comprehensively understand how AI/ML can be effectively integra… ▽ More

    Submitted 4 July, 2025; originally announced July 2025.

  10. arXiv:2507.03282  [pdf

    astro-ph.HE astro-ph.IM nucl-ex physics.ins-det

    Observation and research on cosmic ray muons and solar modulation effect based on plastic scintillator detector

    Authors: Wang Dexin, Zhang Rui, Yu Dekang, Na Hui, Yao Zhangha, Wu Linghe, Zhang Suyalatu, Liang Tairan, Huang Meirong, Wang Zhilong, Bai Yu, Huang Yongshun, Yang Xue, Zhang Jiawen, Liu Mengdi, Ma Qiang, Yu Jing, Ji Xiuyan, Yu Yiliqi, Shao Xuepeng

    Abstract: Cosmic rays, originating from stars, supernovae, and other astrophysical sources, are composed of high-energy particles that enter Earths atmosphere. Upon interaction with atmospheric nuclei, these primary cosmic rays generate secondary particles, including neutrons, electrons, and muons, with muons constituting a dominant component at ground level. Muons, due to their relative abundance, stabilit… ▽ More

    Submitted 4 July, 2025; originally announced July 2025.

  11. User Location Disclosure Fails to Deter Overseas Criticism but Amplifies Regional Divisions on Chinese Social Media

    Authors: Leo Yang Yang, Yiqing Xu

    Abstract: We examine the behavioral impact of a user location disclosure policy implemented on Sina Weibo, China's largest microblogging platform, using a high-frequency, real-time dataset of uncensored user engagement with 165 leading government and media accounts. Leveraging a natural experiment result from the platform's sudden rollout of location tagging on April 28, 2022, we compare millions of time-st… ▽ More

    Submitted 3 July, 2025; originally announced July 2025.

    Comments: Main text: 8 pages, Supplement: 38 pages

  12. arXiv:2507.03063  [pdf

    physics.optics

    Hybrid Superscattering Driven by Toroidal Dipole

    Authors: D. Kislov, D. Borovkov, L. Huang, A. Kuznetsov, A. Canos Valero, A. Ipatovs, V. Bobrovs, V. Fedotov, L. Gao, S. Xie, Y. Xu, J. Luo, D. Baranov, A. Arsenin, A. Bolshakov, A. S. Shalin

    Abstract: The dynamic toroidal dipole is a unique radiation source beyond standard multipoles. Since its first demonstration 15 years ago, it has attracted growing theoretical and experimental interest. Research mainly aims to enhance its weak electromagnetic coupling to free space. Here we report on a surprising finding that the toroidal dipole can, in fact, be engaged in the enhancement of electromagnetic… ▽ More

    Submitted 3 July, 2025; originally announced July 2025.

    Comments: 31 pages, 12 figures

  13. arXiv:2507.02939  [pdf, ps, other

    cs.LG cs.AI cs.CV

    Frequency-Aligned Knowledge Distillation for Lightweight Spatiotemporal Forecasting

    Authors: Yuqi Li, Chuanguang Yang, Hansheng Zeng, Zeyu Dong, Zhulin An, Yongjun Xu, Yingli Tian, Hao Wu

    Abstract: Spatiotemporal forecasting tasks, such as traffic flow, combustion dynamics, and weather forecasting, often require complex models that suffer from low training efficiency and high memory consumption. This paper proposes a lightweight framework, Spectral Decoupled Knowledge Distillation (termed SDKD), which transfers the multi-scale spatiotemporal representations from a complex teacher model to a… ▽ More

    Submitted 27 June, 2025; originally announced July 2025.

    Comments: Accepted by ICCV-2025, 11 pages

  14. arXiv:2507.02870  [pdf, ps, other

    cs.CL

    Loki's Dance of Illusions: A Comprehensive Survey of Hallucination in Large Language Models

    Authors: Chaozhuo Li, Pengbo Wang, Chenxu Wang, Litian Zhang, Zheng Liu, Qiwei Ye, Yuanbo Xu, Feiran Huang, Xi Zhang, Philip S. Yu

    Abstract: Edgar Allan Poe noted, "Truth often lurks in the shadow of error," highlighting the deep complexity intrinsic to the interplay between truth and falsehood, notably under conditions of cognitive and informational asymmetry. This dynamic is strikingly evident in large language models (LLMs). Despite their impressive linguistic generation capabilities, LLMs sometimes produce information that appears… ▽ More

    Submitted 6 June, 2025; originally announced July 2025.

  15. arXiv:2507.02806  [pdf, ps, other

    astro-ph.HE

    GRB 240825A: Early Reverse Shock and Its Physical Implications

    Authors: Chao Wu, Yun Wang, Hua-Li Li, Li-Ping Xin, Dong Xu, Benjamin Schneider, Antonio de Ugarte Postigo, Gavin Lamb, Andrea Reguitti, Andrea Saccardi, Xing Gao, Xing-Ling Li, Qiu-Li Wang, Bing Zhang, Jian-Yan Wei, Shuang-Nan Zhang, Frédéric Daigne, Jean-Luc Atteia, Maria-Grazia Bernardini, Hong-bo Cai, Arnaud Claret, Bertrand Cordier, Jin-Song Deng, Olivier Godet, Diego Götz , et al. (62 additional authors not shown)

    Abstract: Early multi-wavelength observations offer crucial insights into the nature of the relativistic jets responsible for gamma-ray bursts and their interaction with the surrounding medium.We present data of GRB 240825A from 17 space- and ground-based telescopes/instruments, covering wavelengths from NIR/optical to X-ray and GeV, and spanning from the prompt emission to the afterglow phase triggered by… ▽ More

    Submitted 3 July, 2025; originally announced July 2025.

    Comments: 31 pages, 9 Figures, 10 Tables

  16. arXiv:2507.02418  [pdf, ps, other

    nucl-th

    Application of the microscopic optical potential of chiral effective field theory in astrophysical neutron-capture reactions

    Authors: Bing Wang, Dong Bai, Yi Xu

    Abstract: The microscopic global nucleon-nucleus optical potential proposed by Whitehead, Lim, and Holt (WLH) is a state-of-the-art potential developed within the framework of many-body perturbation theory using realistic nuclear interactions from chiral effective field theory. Given its potentially greater predictive power for reactions involving exotic isotopes, we apply it to the calculations of astrophy… ▽ More

    Submitted 3 July, 2025; originally announced July 2025.

  17. arXiv:2507.02348  [pdf, ps, other

    eess.SP eess.SY

    Joint Radiation Power, Antenna Position, and Beamforming Optimization for Pinching-Antenna Systems with Motion Power Consumption

    Authors: Yiming Xu, Dongfang Xu, Xianghao Yu, Shenghui Song, Zhiguo Ding, Robert Schober

    Abstract: Pinching-antenna systems (PASS) have been recently proposed to improve the performance of wireless networks by reconfiguring both the large-scale and small-scale channel conditions. However, existing studies ignore the physical constraints of antenna placement and assume fixed antenna radiation power. To fill this research gap, this paper investigates the design of PASS taking into account the mot… ▽ More

    Submitted 3 July, 2025; originally announced July 2025.

    Comments: 13 pages

  18. arXiv:2507.02294  [pdf, ps, other

    cs.CV

    ViRefSAM: Visual Reference-Guided Segment Anything Model for Remote Sensing Segmentation

    Authors: Hanbo Bi, Yulong Xu, Ya Li, Yongqiang Mao, Boyuan Tong, Chongyang Li, Chunbo Lang, Wenhui Diao, Hongqi Wang, Yingchao Feng, Xian Sun

    Abstract: The Segment Anything Model (SAM), with its prompt-driven paradigm, exhibits strong generalization in generic segmentation tasks. However, applying SAM to remote sensing (RS) images still faces two major challenges. First, manually constructing precise prompts for each image (e.g., points or boxes) is labor-intensive and inefficient, especially in RS scenarios with dense small objects or spatially… ▽ More

    Submitted 3 July, 2025; originally announced July 2025.

  19. arXiv:2507.02231  [pdf

    q-bio.BM

    Downregulation of aquaporin 3 promotes hyperosmolarity-induced apoptosis of nucleus pulposus cells through PI3K/Akt/mTOR pathway suppression

    Authors: Yuan Sang, Huiqing Zhao, Jiajun Wu, Ting Zhang, Wenbin Xu, Hui Yao, Kaihua Liu, Chang Liu, Junbin Zhang, Ping Li, Depeng Wu, Yichun Xu, Jianying Zhang, Gang Hou

    Abstract: Hyperosmolarity is a key contributor to nucleus pulposus cell (NPC) apoptosis during intervertebral disc degeneration (IVDD). Aquaporin 3 (AQP3), a membrane channel protein, regulates cellular osmotic balance by transporting water and osmolytes. Although AQP3 downregulation is associated with disc degeneration, its role in apoptosis under hyperosmotic conditions remains unclear. Here, we demonstra… ▽ More

    Submitted 2 July, 2025; originally announced July 2025.

  20. arXiv:2507.02215  [pdf, ps, other

    stat.ML cs.LG math.NA

    Hybrid least squares for learning functions from highly noisy data

    Authors: Ben Adcock, Bernhard Hientzsch, Akil Narayan, Yiming Xu

    Abstract: Motivated by the need for efficient estimation of conditional expectations, we consider a least-squares function approximation problem with heavily polluted data. Existing methods that are powerful in the small noise regime are suboptimal when large noise is present. We propose a hybrid approach that combines Christoffel sampling with certain types of optimal experimental design to address this is… ▽ More

    Submitted 2 July, 2025; originally announced July 2025.

    Comments: 30 pages

  21. arXiv:2507.02047  [pdf, ps, other

    cond-mat.mes-hall cond-mat.mtrl-sci cond-mat.str-el

    Quantum Geometry in the NbSe$_2$ Family I: Obstructed Compact Wannier Function and New Perturbation Theory

    Authors: Jiabin Yu, Yi Jiang, Yuanfeng Xu, Dumitru Călugăru, Haoyu Hu, Haojie Guo, Sandra Sajan, Yongsong Wang, Miguel M. Ugeda, Fernando De Juan, B. Andrei Bernevig

    Abstract: We revisit the electronic structure and band topology of monolayer 1H-NbSe$_2$, which hosts both superconductivity and charge density wave, and its related compounds 1H-MoS$_2$, NbS$_2$, TaS$_2$, TaSe$_2$ and WS$_2$. We construct a 6-band, a 3-band, and - simplest of all - a single-band model for this material family, by directly Wannierizing the ab initio bands. All host obstructed atomic isolate… ▽ More

    Submitted 2 July, 2025; originally announced July 2025.

    Comments: 9+38 pages, 2+11 figures, 0+2 tables. See previously posted arXiv:2501.09063

  22. arXiv:2507.02029  [pdf, ps, other

    cs.RO

    RoboBrain 2.0 Technical Report

    Authors: BAAI RoboBrain Team, Mingyu Cao, Huajie Tan, Yuheng Ji, Minglan Lin, Zhiyu Li, Zhou Cao, Pengwei Wang, Enshen Zhou, Yi Han, Yingbo Tang, Xiangqi Xu, Wei Guo, Yaoxu Lyu, Yijie Xu, Jiayu Shi, Mengfei Du, Cheng Chi, Mengdi Zhao, Xiaoshuai Hao, Junkai Zhao, Xiaojie Zhang, Sh/anyu Rong, Huaihai Lyu, Zhengliang Cai , et al. (26 additional authors not shown)

    Abstract: We introduce RoboBrain 2.0, our latest generation of embodied vision-language foundation models, designed to unify perception, reasoning, and planning for complex embodied tasks in physical environments. It comes in two variants: a lightweight 7B model and a full-scale 32B model, featuring a heterogeneous architecture with a vision encoder and a language model. Despite its compact size, RoboBrain… ▽ More

    Submitted 5 July, 2025; v1 submitted 2 July, 2025; originally announced July 2025.

  23. arXiv:2507.01915  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Gradient-Adaptive Policy Optimization: Towards Multi-Objective Alignment of Large Language Models

    Authors: Chengao Li, Hanyu Zhang, Yunkun Xu, Hongyan Xue, Xiang Ao, Qing He

    Abstract: Reinforcement Learning from Human Feedback (RLHF) has emerged as a powerful technique for aligning large language models (LLMs) with human preferences. However, effectively aligning LLMs with diverse human preferences remains a significant challenge, particularly when they are conflict. To address this issue, we frame human value alignment as a multi-objective optimization problem, aiming to maxim… ▽ More

    Submitted 2 July, 2025; originally announced July 2025.

    Comments: 19 pages, 3 figures. Accepted by ACL 2025 (main)

  24. arXiv:2507.01873  [pdf, ps, other

    cs.DS

    Breaking the $n^{1.5}$ Additive Error Barrier for Private and Efficient Graph Sparsification via Private Expander Decomposition

    Authors: Anders Aamand, Justin Y. Chen, Mina Dalirrooyfard, Slobodan Mitrović, Yuriy Nevmyvaka, Sandeep Silwal, Yinzhan Xu

    Abstract: We study differentially private algorithms for graph cut sparsification, a fundamental problem in algorithms, privacy, and machine learning. While significant progress has been made, the best-known private and efficient cut sparsifiers on $n$-node graphs approximate each cut within $\widetilde{O}(n^{1.5})$ additive error and $1+γ$ multiplicative error for any $γ> 0$ [Gupta, Roth, Ullman TCC'12]. I… ▽ More

    Submitted 2 July, 2025; originally announced July 2025.

    Comments: ICML 2025

  25. arXiv:2507.01679  [pdf, ps, other

    cs.LG cs.AI cs.CL

    Blending Supervised and Reinforcement Fine-Tuning with Prefix Sampling

    Authors: Zeyu Huang, Tianhao Cheng, Zihan Qiu, Zili Wang, Yinghui Xu, Edoardo M. Ponti, Ivan Titov

    Abstract: Existing post-training techniques for large language models are broadly categorized into Supervised Fine-Tuning (SFT) and Reinforcement Fine-Tuning (RFT). Each paradigm presents a distinct trade-off: SFT excels at mimicking demonstration data but can lead to problematic generalization as a form of behavior cloning. Conversely, RFT can significantly enhance a model's performance but is prone to lea… ▽ More

    Submitted 2 July, 2025; originally announced July 2025.

    Comments: Work in progress

  26. arXiv:2507.01006  [pdf, ps, other

    cs.CV cs.AI cs.LG

    GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

    Authors: GLM-V Team, :, Wenyi Hong, Wenmeng Yu, Xiaotao Gu, Guo Wang, Guobing Gan, Haomiao Tang, Jiale Cheng, Ji Qi, Junhui Ji, Lihang Pan, Shuaiqi Duan, Weihan Wang, Yan Wang, Yean Cheng, Zehai He, Zhe Su, Zhen Yang, Ziyang Pan, Aohan Zeng, Baoxu Wang, Boyan Shi, Changyu Pang, Chenhui Zhang , et al. (54 additional authors not shown)

    Abstract: We present GLM-4.1V-Thinking, a vision-language model (VLM) designed to advance general-purpose multimodal understanding and reasoning. In this report, we share our key findings in the development of the reasoning-centric training framework. We first develop a capable vision foundation model with significant potential through large-scale pre-training, which arguably sets the upper bound for the fi… ▽ More

    Submitted 2 July, 2025; v1 submitted 1 July, 2025; originally announced July 2025.

  27. arXiv:2507.00681  [pdf, ps, other

    math.AG

    Hilbert series of second order jets of determinantal varieties

    Authors: Yifan Chen, Yongxin Xu, Huaiqing Zuo

    Abstract: In this paper, we will investigate the jet schemes of determinantal varieties. It is quite often the case that the geometric information concerning the jet schemes of an algebraic variety can be described, but the more refined algebraic information is quite mysterious. For example, it is known that computing the Hilbert function associated to a natural grading on these jet schemes is a very hard p… ▽ More

    Submitted 1 July, 2025; originally announced July 2025.

    Comments: 20 pages

  28. arXiv:2507.00527  [pdf

    eess.IV

    Anti-aliasing Algorithm Based on Three-dimensional Display Image

    Authors: Ziyang Liu, Xingchen Xiao, Yueyang Xu

    Abstract: 3D-display technology has been a promising emerging area with potential to be the core of next-generation display technology. When directly observing unprocessed images and text through a naked-eye 3D display device, severe distortion and jaggedness will be displayed, which will make the display effect much worse. In this work, we try to settle down such degradation with spatial and frequency proc… ▽ More

    Submitted 1 July, 2025; originally announced July 2025.

  29. arXiv:2506.23966  [pdf, ps, other

    eess.SP cs.IT

    Pinching-Antenna Systems with In-Waveguide Attenuation: Performance Analysis and Algorithm Design

    Authors: Yanqing Xu, Zhiguo Ding, Robert Schober, Tsung-Hui Chang

    Abstract: Pinching-antenna systems have emerged as a promising flexible-antenna architecture for next-generation wireless networks, enabling enhanced adaptability and user-centric connectivity through antenna repositioning along waveguides. However, existing studies often overlook in-waveguide signal attenuation and in the literature, there is no comprehensive analysis on whether and under what conditions s… ▽ More

    Submitted 30 June, 2025; originally announced June 2025.

    Comments: This paper aims to address a fundamental question in pinching-antenna systems: Can in-waveguide attenuation be safely ignored without causing significant performance degradation? Our analytical results provide a clear answer -- YES, provided that certain mild and practically realizable conditions on the system parameters are satisfied

  30. arXiv:2506.23785  [pdf, ps, other

    cs.CV

    Visual Textualization for Image Prompted Object Detection

    Authors: Yongjian Wu, Yang Zhou, Jiya Saiyin, Bingzheng Wei, Yan Xu

    Abstract: We propose VisTex-OVLM, a novel image prompted object detection method that introduces visual textualization -- a process that projects a few visual exemplars into the text feature space to enhance Object-level Vision-Language Models' (OVLMs) capability in detecting rare categories that are difficult to describe textually and nearly absent from their pre-training data, while preserving their pre-t… ▽ More

    Submitted 30 June, 2025; originally announced June 2025.

    Comments: Accepted by ICCV 2025

  31. arXiv:2506.23569  [pdf, ps, other

    quant-ph eess.SY

    Alleviating CoD in Renewable Energy Profile Clustering Using an Optical Quantum Computer

    Authors: Chengjun Liu, Yijun Xu, Wei Gu, Bo Sun, Kai Wen, Shuai Lu, Lamine Mili

    Abstract: The traditional clustering problem of renewable energy profiles is typically formulated as a combinatorial optimization that suffers from the Curse of Dimensionality (CoD) on classical computers. To address this issue, this paper first proposed a kernel-based quantum clustering method. More specifically, the kernel-based similarity between profiles with minimal intra-group distance is encoded into… ▽ More

    Submitted 30 June, 2025; originally announced June 2025.

  32. arXiv:2506.23562  [pdf, ps, other

    quant-ph

    Realization of a functioning dual-type trapped-ion quantum network node

    Authors: Y. -Y. Huang, L. Feng, Y. -K. Wu, Y. -L. Xu, L. Zhang, Z. -B. Cui, C. -X. Huang, C. Zhang, S. -A. Guo, Q. -X. Mei, B. -X. Qi, Y. Xu, Y. -F. Pu, Z. -C. Zhou, L. -M. Duan

    Abstract: Trapped ions constitute a promising platform for implementation of a quantum network. Recently, a dual-type qubit scheme has been realized in a quantum network node where the communication qubits and the memory qubits are encoded in different energy levels of the same ion species, such that the generation of ion-photon entanglement on the communication qubits has negligible crosstalk error on the… ▽ More

    Submitted 30 June, 2025; originally announced June 2025.

  33. arXiv:2506.23538  [pdf, ps, other

    cs.CV cs.AI cs.LG

    Uncertainty-aware Diffusion and Reinforcement Learning for Joint Plane Localization and Anomaly Diagnosis in 3D Ultrasound

    Authors: Yuhao Huang, Yueyue Xu, Haoran Dou, Jiaxiao Deng, Xin Yang, Hongyu Zheng, Dong Ni

    Abstract: Congenital uterine anomalies (CUAs) can lead to infertility, miscarriage, preterm birth, and an increased risk of pregnancy complications. Compared to traditional 2D ultrasound (US), 3D US can reconstruct the coronal plane, providing a clear visualization of the uterine morphology for assessing CUAs accurately. In this paper, we propose an intelligent system for simultaneous automated plane locali… ▽ More

    Submitted 30 June, 2025; originally announced June 2025.

    Comments: Accepted by MICCAI 2025;10 pages, 3 figures

  34. arXiv:2506.23520  [pdf, ps, other

    cs.AI

    ChemActor: Enhancing Automated Extraction of Chemical Synthesis Actions with LLM-Generated Data

    Authors: Yu Zhang, Ruijie Yu, Jidong Tian, Feng Zhu, Jiapeng Liu, Xiaokang Yang, Yaohui Jin, Yanyan Xu

    Abstract: With the increasing interest in robotic synthesis in the context of organic chemistry, the automated extraction of chemical procedures from literature is critical. However, this task remains challenging due to the inherent ambiguity of chemical language and the high cost of human annotation required for developing reliable computer-aided extraction protocols. Here, we present ChemActor, a fully fi… ▽ More

    Submitted 1 July, 2025; v1 submitted 30 June, 2025; originally announced June 2025.

  35. arXiv:2506.23287  [pdf, ps, other

    cs.LG q-bio.QM

    Hierarchical Quantized Diffusion Based Tree Generation Method for Hierarchical Representation and Lineage Analysis

    Authors: Zelin Zang, WenZhe Li, Fei Chen, Yongjie Xu, Chang Yu, Zhen Lei, Stan Z. Li

    Abstract: In single-cell research, tracing and analyzing high-throughput single-cell differentiation trajectories is crucial for understanding complex biological processes. Key to this is the modeling and generation of hierarchical data that represents the intrinsic structure within datasets. Traditional methods face limitations in terms of computational cost, performance, generative capacity, and stability… ▽ More

    Submitted 29 June, 2025; originally announced June 2025.

    Comments: 9 pages, 6 figures, under review

  36. arXiv:2506.23101  [pdf, ps, other

    cs.CL cs.AI cs.CY

    From Individuals to Interactions: Benchmarking Gender Bias in Multimodal Large Language Models from the Lens of Social Relationship

    Authors: Yue Xu, Wenjie Wang

    Abstract: Multimodal large language models (MLLMs) have shown impressive capabilities across tasks involving both visual and textual modalities. However, growing concerns remain about their potential to encode and amplify gender bias, particularly in socially sensitive applications. Existing benchmarks predominantly evaluate bias in isolated scenarios, overlooking how bias may emerge subtly through interper… ▽ More

    Submitted 29 June, 2025; originally announced June 2025.

  37. arXiv:2506.22916  [pdf, ps, other

    math.CA

    Best approximation by polynomials on the conic domains

    Authors: Yan Ge, Yuan Xu

    Abstract: A new modulus of smoothness and its equivalent $K$-function are defined on the conic domains in $\mathbb{R}^d$, and used to characterize the weighted best approximation by polynomials. Both direct and weak inverse theorems of the characterization are established via the modulus of smoothness. For the conic surface $\mathbb{V}_0^{d+1} = \{(x,t): \|x\| = t\le 1\}$, the natural weight function is… ▽ More

    Submitted 28 June, 2025; originally announced June 2025.

    Comments: 31 pages

    MSC Class: 41A10; 41A63; 42C10; 42C40

  38. arXiv:2506.22902  [pdf, ps, other

    cs.CV eess.IV

    Point Cloud Compression and Objective Quality Assessment: A Survey

    Authors: Yiling Xu, Yujie Zhang, Shuting Xia, Kaifa Yang, He Huang, Ziyu Shan, Wenjie Huang, Qi Yang, Le Yang

    Abstract: The rapid growth of 3D point cloud data, driven by applications in autonomous driving, robotics, and immersive environments, has led to criticals demand for efficient compression and quality assessment techniques. Unlike traditional 2D media, point clouds present unique challenges due to their irregular structure, high data volume, and complex attributes. This paper provides a comprehensive survey… ▽ More

    Submitted 28 June, 2025; originally announced June 2025.

  39. arXiv:2506.22899  [pdf, ps, other

    cs.CV cs.GR cs.LG cs.MA eess.IV

    Neural Cellular Automata: From Cells to Pixels

    Authors: Ehsan Pajouheshgar, Yitao Xu, Ali Abbasi, Alexander Mordvintsev, Wenzel Jakob, Sabine Süsstrunk

    Abstract: Neural Cellular Automata (NCAs) are bio-inspired systems in which identical cells self-organize to form complex and coherent patterns by repeatedly applying simple local rules. NCAs display striking emergent behaviors including self-regeneration, generalization and robustness to unseen situations, and spontaneous motion. Despite their success in texture synthesis and morphogenesis, NCAs remain lar… ▽ More

    Submitted 28 June, 2025; originally announced June 2025.

    Comments: 6 pages, 5 figures, first draft

  40. arXiv:2506.22714  [pdf, ps, other

    cs.DC cs.LG cs.PF

    Libra: Synergizing CUDA and Tensor Cores for High-Performance Sparse Matrix Multiplication

    Authors: Jinliang Shi, Shigang Li, Youxuan Xu, Xueying Wang, Rongtian Fu, Zhi Ma, Tong Wu

    Abstract: Sparse matrix multiplication operators (i.e., SpMM and SDDMM) are widely used in deep learning and scientific computing. Modern accelerators are commonly equipped with Tensor cores and CUDA cores to accelerate sparse operators. The former brings superior computing power but only for structured matrix multiplication, while the latter has relatively lower performance but with higher programming flex… ▽ More

    Submitted 27 June, 2025; originally announced June 2025.

    ACM Class: C.1.4; I.2.11

  41. arXiv:2506.22606  [pdf, ps, other

    cs.CR cs.LG

    A User-Centric, Privacy-Preserving, and Verifiable Ecosystem for Personal Data Management and Utilization

    Authors: Osama Zafar, Mina Namazi, Yuqiao Xu, Youngjin Yoo, Erman Ayday

    Abstract: In the current paradigm of digital personalized services, the centralized management of personal data raises significant privacy concerns, security vulnerabilities, and diminished individual autonomy over sensitive information. Despite their efficiency, traditional centralized architectures frequently fail to satisfy rigorous privacy requirements and expose users to data breaches and unauthorized… ▽ More

    Submitted 27 June, 2025; originally announced June 2025.

  42. arXiv:2506.22465  [pdf, ps, other

    eess.SP cs.IT

    Preconditioned Conjugate Gradient for MIMO-AFDM System

    Authors: Jun Zhu, Yin Xu, Dazhi He, Haoyang Li, Yunfeng Guan, Wenjun Zhang

    Abstract: Affine frequency division multiplexing (AFDM) is a promising chirp-assisted multicarrier waveform for future high mobility communications. A significant challenge in MIMO-AFDM systems is the multi-user interference (MUI), which can be effectively addressed by employing precoding techniques. However, the complexity introduced by AFDM makes the precoding process computationally expensive and challen… ▽ More

    Submitted 18 June, 2025; originally announced June 2025.

    Comments: arXiv admin note: text overlap with arXiv:2503.10525

  43. arXiv:2506.22295  [pdf, ps, other

    cs.LG

    Score-Based Model for Low-Rank Tensor Recovery

    Authors: Zhengyun Cheng, Changhao Wang, Guanwen Zhang, Yi Xu, Wei Zhou, Xiangyang Ji

    Abstract: Low-rank tensor decompositions (TDs) provide an effective framework for multiway data analysis. Traditional TD methods rely on predefined structural assumptions, such as CP or Tucker decompositions. From a probabilistic perspective, these can be viewed as using Dirac delta distributions to model the relationships between shared factors and the low-rank tensor. However, such prior knowledge is rare… ▽ More

    Submitted 27 June, 2025; originally announced June 2025.

  44. arXiv:2506.22246  [pdf, ps, other

    cs.CV

    EAMamba: Efficient All-Around Vision State Space Model for Image Restoration

    Authors: Yu-Cheng Lin, Yu-Syuan Xu, Hao-Wei Chen, Hsien-Kai Kuo, Chun-Yi Lee

    Abstract: Image restoration is a key task in low-level computer vision that aims to reconstruct high-quality images from degraded inputs. The emergence of Vision Mamba, which draws inspiration from the advanced state space model Mamba, marks a significant advancement in this field. Vision Mamba demonstrates excellence in modeling long-range dependencies with linear complexity, a crucial advantage for image… ▽ More

    Submitted 27 June, 2025; originally announced June 2025.

    Comments: ICCV 2025

  45. arXiv:2506.22242  [pdf, ps, other

    cs.CV

    4D-VLA: Spatiotemporal Vision-Language-Action Pretraining with Cross-Scene Calibration

    Authors: Jiahui Zhang, Yurui Chen, Yueming Xu, Ze Huang, Yanpeng Zhou, Yu-Jie Yuan, Xinyue Cai, Guowei Huang, Xingyue Quan, Hang Xu, Li Zhang

    Abstract: Leveraging diverse robotic data for pretraining remains a critical challenge. Existing methods typically model the dataset's action distribution using simple observations as inputs. However, these inputs are often incomplete, resulting in a dispersed conditional action distribution-an issue we refer to as coordinate system chaos and state chaos. This inconsistency significantly hampers pretraining… ▽ More

    Submitted 27 June, 2025; originally announced June 2025.

  46. arXiv:2506.22213  [pdf, ps, other

    math.PR

    Function space induced by no arbitrage

    Authors: Kihun Nam, Yunxi Xu

    Abstract: In this article, we show necessary and sufficient conditions for a function to transform a continuous Markov semimartingale to a semimartingale. As a result, the no-arbitrage principle guarantees the differentiability of asset prices with respect to the underlying noise, if the asset prices are continuous and the underlying noise is a continuous Markov semimartingale.

    Submitted 27 June, 2025; originally announced June 2025.

  47. arXiv:2506.22134  [pdf, ps, other

    cs.CV

    Low-Rank Implicit Neural Representation via Schatten-p Quasi-Norm and Jacobian Regularization

    Authors: Zhengyun Cheng, Changhao Wang, Guanwen Zhang, Yi Xu, Wei Zhou, Xiangyang Ji

    Abstract: Higher-order tensors are well-suited for representing multi-dimensional data, such as color images and videos. Low-rank tensor representation has become essential in machine learning and computer vision, but existing methods like Tucker decomposition offer flexibility at the expense of interpretability. In contrast, while the CANDECOMP/PARAFAC (CP) decomposition provides a more natural and interpr… ▽ More

    Submitted 27 June, 2025; originally announced June 2025.

    Comments: Submitted to IEEE Transactions on Circuits and Systems for Video Technology

  48. arXiv:2506.22058  [pdf, ps, other

    cs.CL

    Lost at the Beginning of Reasoning

    Authors: Baohao Liao, Xinyi Chen, Sara Rajaee, Yuhui Xu, Christian Herold, Anders Søgaard, Maarten de Rijke, Christof Monz

    Abstract: Recent advancements in large language models (LLMs) have significantly advanced complex reasoning capabilities, particularly through extended chain-of-thought (CoT) reasoning that incorporates mechanisms such as backtracking, self-reflection and self-correction. Despite these developments, the self-correction abilities of LLMs during long CoT reasoning remain underexplored. And recent findings on… ▽ More

    Submitted 27 June, 2025; originally announced June 2025.

    Comments: 9 pages, 5 figures, 2 tables

  49. arXiv:2506.21933  [pdf, ps, other

    cs.NI cs.LG

    Joint Task Offloading and Resource Allocation in Low-Altitude MEC via Graph Attention Diffusion

    Authors: Yifan Xue, Ruihuai Liang, Bo Yang, Xuelin Cao, Zhiwen Yu, Mérouane Debbah, Chau Yuen

    Abstract: With the rapid development of the low-altitude economy, air-ground integrated multi-access edge computing (MEC) systems are facing increasing demands for real-time and intelligent task scheduling. In such systems, task offloading and resource allocation encounter multiple challenges, including node heterogeneity, unstable communication links, and dynamic task variations. To address these issues, t… ▽ More

    Submitted 27 June, 2025; originally announced June 2025.

  50. Lightweight Fingernail Haptic Device: Unobstructed Fingerpad Force and Vibration Feedback for Enhanced Virtual Dexterous Manipulation

    Authors: Yunxiu Xu, Siyu Wang, Shoichi Hasegawa

    Abstract: This study presents a lightweight, wearable fingertip haptic device that provides physics-based haptic feedback for dexterous manipulation in virtual environments without hindering real-world interactions. The device, designed with thin strings and actuators attached to the fingernails, ensures minimal weight (1.55 g per finger) and preserves finger flexibility. Integrating the software with a phy… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

    Comments: 14 pages, 15 figures, 2 tables. Published in IEEE Transactions on Haptics (Early Access)

    ACM Class: H.5.2; I.3.6

    Journal ref: IEEE Transactions on Haptics, Early Access, pp. 1-14, June 2025