Skip to main content

Showing 51–100 of 20,906 results for author: Liang

.
  1. arXiv:2506.13503  [pdf, ps, other

    astro-ph.HE

    Fast Transitions of X-ray Variability in the Neutron Star Low Mass X-ray Binary Cygnus X-2

    Authors: Liang Zhang, Mariano Méndez, Hua Feng, Diego Altamirano, Zi-xu Yang, Qing-chang Zhao, Shuang-nan Zhang, Lian Tao, Yue Huang, Xiang Ma, Shu-mei Jia, Ming-yu Ge, Li-ming Song, Jin-lu Qu, Shu Zhang

    Abstract: We present a spectral-timing analysis of two NICER observations of the weakly magnetized neutron star low-mass X-ray binary Cygnus X-2. During these observations, we detect a rapid transition from a narrow 50-Hz horizontal-branch oscillation to a broad 5-Hz normal-branch oscillation, accompanied by an increase in source flux and a decrease in spectral hardness. Thanks to the large effective area o… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

    Comments: 12 pages, 7 figures, accepted for publication in ApJ

  2. arXiv:2506.13415  [pdf, other

    eess.IV cs.AI cs.CV

    Simple is what you need for efficient and accurate medical image segmentation

    Authors: Xiang Yu, Yayan Chen, Guannan He, Qing Zeng, Yue Qin, Meiling Liang, Dandan Luo, Yimei Liao, Zeyu Ren, Cheng Kang, Delong Yang, Bocheng Liang, Bin Pu, Ying Yuan, Shengli Li

    Abstract: While modern segmentation models often prioritize performance over practicality, we advocate a design philosophy prioritizing simplicity and efficiency, and attempted high performance segmentation model design. This paper presents SimpleUNet, a scalable ultra-lightweight medical image segmentation model with three key innovations: (1) A partial feature selection mechanism in skip connections for r… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

    Comments: 15 pages, 11 figures

    ACM Class: I.4.6

  3. arXiv:2506.13402  [pdf, ps, other

    math.OC

    A Dynamic Relaxation Framework for Global Solution of ACOPF

    Authors: Yu-Yang Tang, Liang Chen, Sheng-Jie Chen, Yu-Hong Dai, Bo Zhou, Xiaomeng Ai

    Abstract: Solving the Alternating Current Optimal Power Flow (AC OPF) problem to global optimality remains challenging due to its nonconvex quadratic constraints. In this paper, we present a unified framework that combines static piecewise relaxations with dynamic cut-generation mechanism to systematically tighten the classic Second-Order Cone Programming (SOCP) relaxation to arbitrarily small conic violati… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

    Comments: Full version of a submission to IEEE Transactions on Power Systems. Includes all proofs and algorithm pseudocode

  4. arXiv:2506.13334  [pdf, ps, other

    hep-ex

    Measurement of the $Ω_c^0$ and $Ξ_c^0$ baryon lifetimes using hadronic $b$-baryon decays

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1141 additional authors not shown)

    Abstract: The lifetimes of the $Ω_c^0$ and $Ξ_c^0$ baryons are measured using a $pp$ collision dataset collected by the LHCb experiment, corresponding to an integrated luminosity of $9~\rm{fb^{-1}}$. The charm baryons are produced in the fully reconstructed decay chains $Ω_b^- \rightarrow Ω_c^0 (\rightarrow pK^-K^-π^+)~π^-$ and $Ξ_b^- \rightarrow Ξ_c^0 (\rightarrow pK^-K^-π^+)~π^-$. The measurement uses top… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

    Comments: All figures and tables, along with machine-readable versions and any supplementary material and additional information, are available at https://lbfence.cern.ch/alcm/public/analysis/full-details/3875/ (LHCb public pages)

    Report number: LHCb-PAPER-2025-013,CERN-EP-2025-117

  5. arXiv:2506.13274  [pdf, ps, other

    cs.LG cs.CL

    AdaLRS: Loss-Guided Adaptive Learning Rate Search for Efficient Foundation Model Pretraining

    Authors: Hongyuan Dong, Dingkang Yang, Xiao Liang, Chao Feng, Jiao Ran

    Abstract: Learning rate is widely regarded as crucial for effective foundation model pretraining. Recent research explores and demonstrates the transferability of learning rate configurations across varying model and dataset sizes, etc. Nevertheless, these approaches are constrained to specific training scenarios and typically necessitate extensive hyperparameter tuning on proxy models. In this work, we pro… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

  6. arXiv:2506.13205  [pdf, ps, other

    cs.CR cs.AI

    Screen Hijack: Visual Poisoning of VLM Agents in Mobile Environments

    Authors: Xuan Wang, Siyuan Liang, Zhe Liu, Yi Yu, Yuliang Lu, Xiaochun Cao, Ee-Chien Chang

    Abstract: With the growing integration of vision-language models (VLMs), mobile agents are now widely used for tasks like UI automation and camera-based user assistance. These agents are often fine-tuned on limited user-generated datasets, leaving them vulnerable to covert threats during the training process. In this work we present GHOST, the first clean-label backdoor attack specifically designed for mobi… ▽ More

    Submitted 19 June, 2025; v1 submitted 16 June, 2025; originally announced June 2025.

    Comments: 12 pages

  7. arXiv:2506.13201  [pdf, ps, other

    cs.CV

    A Comprehensive Survey on Deep Learning Solutions for 3D Flood Mapping

    Authors: Wenfeng Jia, Bin Liang, Yuxi Liu, Muhammad Arif Khan, Lihong Zheng

    Abstract: Flooding remains a major global challenge, worsened by climate change and urbanization, demanding advanced solutions for effective disaster management. While traditional 2D flood mapping techniques provide limited insights, 3D flood mapping, powered by deep learning (DL), offers enhanced capabilities by integrating flood extent and depth. This paper presents a comprehensive survey of deep learning… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

  8. arXiv:2506.13127  [pdf, ps, other

    cs.SD eess.AS

    I$^2$S-TFCKD: Intra-Inter Set Knowledge Distillation with Time-Frequency Calibration for Speech Enhancement

    Authors: Jiaming Cheng, Ruiyu Liang, Chao Xu, Ye Ni, Wei Zhou, Björn W. Schuller, Xiaoshuai Hao

    Abstract: In recent years, complexity compression of neural network (NN)-based speech enhancement (SE) models has gradually attracted the attention of researchers, especially in scenarios with limited hardware resources or strict latency requirements. The main difficulties and challenges lie in achieving a balance between complexity and performance according to the characteristics of the task. In this paper… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

    Comments: submitted to IEEE Transactions on Neural Networks and Learning Systems

  9. arXiv:2506.13094  [pdf, ps, other

    eess.IV

    MorphSAM: Learning the Morphological Prompts from Atlases for Spine Image Segmentation

    Authors: Dingwei Fan, Junyong Zhao, Chunlin Li, Xinlong Wang, Ronghan Zhang, Mingliang Wang, Qi Zhu, Haipeng Si, Daoqiang Zhang, Liang Sun

    Abstract: Spine image segmentation is crucial for clinical diagnosis and treatment of spine diseases. The complex structure of the spine and the high morphological similarity between individual vertebrae and adjacent intervertebral discs make accurate spine segmentation a challenging task. Although the Segment Anything Model (SAM) has been developed, it still struggles to effectively capture and utilize mor… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

  10. arXiv:2506.13093  [pdf, ps, other

    astro-ph.GA astro-ph.HE

    Acceleration and Collimation of the Two-Sided Jets in the Nearby Low-luminosity Active Galactic Nucleus NGC 4261 (3C 270)

    Authors: Xi Yan, Lang Cui, Kazuhiro Hada, Sandor Frey, Ru-sen Lu, Liang Chen, Wancheng Xu, Elika P. Fariyanto, Luis C. Ho

    Abstract: We study the acceleration and collimation of the two-sided jets in the nearby low-luminosity active galactic nucleus NGC 4261 (3C 270) using archival multifrequency, multi-epoch Very Long Baseline Array data. By applying multiple analysis methods and incorporating results from the literature, we robustly identify a parabolic-to-conical structural transition in both the jet and counterjet, with the… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

    Comments: A revision and a similar version will appear in ApJ. Comments are welcome

  11. arXiv:2506.13050  [pdf, ps, other

    cs.GR cs.CV

    NeuVAS: Neural Implicit Surfaces for Variational Shape Modeling

    Authors: Pengfei Wang, Qiujie Dong, Fangtian Liang, Hao Pan, Lei Yang, Congyi Zhang, Guying Lin, Caiming Zhang, Yuanfeng Zhou, Changhe Tu, Shiqing Xin, Alla Sheffer, Xin Li, Wenping Wang

    Abstract: Neural implicit shape representation has drawn significant attention in recent years due to its smoothness, differentiability, and topological flexibility. However, directly modeling the shape of a neural implicit surface, especially as the zero-level set of a neural signed distance function (SDF), with sparse geometric control is still a challenging task. Sparse input shape control typically incl… ▽ More

    Submitted 15 June, 2025; originally announced June 2025.

  12. arXiv:2506.13035  [pdf, ps, other

    gr-qc

    Probing Dark Matter's Gravitational Effects Locally with TianQin

    Authors: Zheng-Cheng Liang, Fa-Peng Huang, Xuefeng Zhang, Yi-Ming Hu

    Abstract: In this study, we explore the potential of using TianQin missions to probe the local gravitational effects of dark matter. The TianQin project plans to launch satellites at both low and high orbits. High-precision orbit determination is expected to assist in the Earth's gravity or gravitational waves detection. By comparing the derived masses in low and high orbits, it is possible to constrain the… ▽ More

    Submitted 15 June, 2025; originally announced June 2025.

    Comments: 5 pages, 1 figure

  13. arXiv:2506.13021  [pdf, ps, other

    cs.LG

    C-TLSAN: Content-Enhanced Time-Aware Long- and Short-Term Attention Network for Personalized Recommendation

    Authors: Siqi Liang, Yudi Zhang, Yubo Wang

    Abstract: Sequential recommender systems aim to model users' evolving preferences by capturing patterns in their historical interactions. Recent advances in this area have leveraged deep neural networks and attention mechanisms to effectively represent sequential behaviors and time-sensitive interests. In this work, we propose C-TLSAN (Content-Enhanced Time-Aware Long- and Short-Term Attention Network), an… ▽ More

    Submitted 15 June, 2025; originally announced June 2025.

  14. arXiv:2506.12909  [pdf, ps, other

    cs.CL

    SciDA: Scientific Dynamic Assessor of LLMs

    Authors: Junting Zhou, Tingjia Miao, Yiyan Liao, Qichao Wang, Zhoufutu Wen, Yanqin Wang, Yunjie Huang, Ge Yan, Leqi Wang, Yucheng Xia, Hongwan Gao, Yuansong Zeng, Renjie Zheng, Chen Dun, Yitao Liang, Tong Yang, Wenhao Huang, Ge Zhang

    Abstract: Advancement in Large Language Models (LLMs) reasoning capabilities enables them to solve scientific problems with enhanced efficacy. Thereby, a high-quality benchmark for comprehensive and appropriate assessment holds significance, while existing ones either confront the risk of data contamination or lack involved disciplines. To be specific, due to the data source overlap of LLMs training and sta… ▽ More

    Submitted 15 June, 2025; originally announced June 2025.

  15. arXiv:2506.12861  [pdf, ps, other

    physics.atom-ph

    Exceptional Point-enhanced Rydberg Atomic Electrometers

    Authors: Chao Liang, Ce Yang, Wei Huang

    Abstract: Rydberg atoms, with their large transition dipole moments and extreme sensitivity to electric fields, have attracted widespread attention as promising candidates for next-generation quantum precision electrometry. Meanwhile, exceptional points (EPs) in non-Hermitian systems have opened new avenues for ultrasensitive metrology. Despite increasing interest in non-Hermitian physics, EP-enhanced sensi… ▽ More

    Submitted 15 June, 2025; originally announced June 2025.

    Comments: Any comments are welcome

  16. arXiv:2506.12857  [pdf, ps, other

    quant-ph

    Experimental Observation of Purity-Like Invariants of Multi-photon States in Linear Optics

    Authors: Baichuan Yang, Hao Zhan, Minghao Mi, Aonan Zhang, Liang Xu, Lijian Zhang

    Abstract: Linear optical networks (LONs) with multi-photon inputs offer a powerful platform for advanced quantum technologies. However, the number of degrees of freedom of a LON is far fewer than the dimensionality of the multi-photon multi-mode Fock space, therefore it cannot implement arbitrary unitary evolutions on multi-photon states. Understanding these intrinsic constraints is essential for the prepar… ▽ More

    Submitted 15 June, 2025; originally announced June 2025.

    Comments: 3 figures, supplementary material included

  17. arXiv:2506.12815  [pdf, ps, other

    cs.LG

    TrojanTO: Action-Level Backdoor Attacks against Trajectory Optimization Models

    Authors: Yang Dai, Oubo Ma, Longfei Zhang, Xingxing Liang, Xiaochun Cao, Shouling Ji, Jiaheng Zhang, Jincai Huang, Li Shen

    Abstract: Recent advances in Trajectory Optimization (TO) models have achieved remarkable success in offline reinforcement learning. However, their vulnerabilities against backdoor attacks are poorly understood. We find that existing backdoor attacks in reinforcement learning are based on reward manipulation, which are largely ineffective against the TO model due to its inherent sequence modeling nature. Mo… ▽ More

    Submitted 15 June, 2025; originally announced June 2025.

    Comments: 23 pages, 6 figures

  18. arXiv:2506.12786  [pdf, ps, other

    cs.CV

    Semantic-Aware Visual Information Transmission With Key Information Extraction Over Wireless Networks

    Authors: Chen Zhu, Kang Liang, Jianrong Bao, Zhouxiang Zhao, Zhaohui Yang, Zhaoyang Zhang, Mohammad Shikh-Bahaei

    Abstract: The advent of 6G networks demands unprecedented levels of intelligence, adaptability, and efficiency to address challenges such as ultra-high-speed data transmission, ultra-low latency, and massive connectivity in dynamic environments. Traditional wireless image transmission frameworks, reliant on static configurations and isolated source-channel coding, struggle to balance computational efficienc… ▽ More

    Submitted 15 June, 2025; originally announced June 2025.

  19. arXiv:2506.12776  [pdf, ps, other

    cs.CV

    Native Visual Understanding: Resolving Resolution Dilemmas in Vision-Language Models

    Authors: Junbo Niu, Yuanhong Zheng, Ziyang Miao, Hejun Dong, Chunjiang Ge, Hao Liang, Ma Lu, Bohan Zeng, Qiahao Zheng, Conghui He, Wentao Zhang

    Abstract: Vision-Language Models (VLMs) face significant challenges when dealing with the diverse resolutions and aspect ratios of real-world images, as most existing models rely on fixed, low-resolution inputs. While recent studies have explored integrating native resolution visual encoding to improve model performance, such efforts remain fragmented and lack a systematic framework within the open-source c… ▽ More

    Submitted 15 June, 2025; originally announced June 2025.

  20. arXiv:2506.12760  [pdf, ps, other

    cs.SE

    IDOL: Improved Different Optimization Levels Testing for Solidity Compilers

    Authors: Lantian Li, Yejian Liang, Zhongxing Yu

    Abstract: As blockchain technology continues to evolve and mature, smart contracts have become a key driving force behind the digitization and automation of transactions. Smart contracts greatly simplify and refine the traditional business transaction processes, and thus have had a profound impact on various industries such as finance and supply chain management. However, because smart contracts cannot be m… ▽ More

    Submitted 15 June, 2025; originally announced June 2025.

    Comments: Accepted by QRS 2025 (Fast Abstracts track)

  21. arXiv:2506.12710  [pdf, ps, other

    cs.RO

    Multimodal Large Language Models-Enabled UAV Swarm: Towards Efficient and Intelligent Autonomous Aerial Systems

    Authors: Yuqi Ping, Tianhao Liang, Huahao Ding, Guangyu Lei, Junwei Wu, Xuan Zou, Kuan Shi, Rui Shao, Chiya Zhang, Weizheng Zhang, Weijie Yuan, Tingting Zhang

    Abstract: Recent breakthroughs in multimodal large language models (MLLMs) have endowed AI systems with unified perception, reasoning and natural-language interaction across text, image and video streams. Meanwhile, Unmanned Aerial Vehicle (UAV) swarms are increasingly deployed in dynamic, safety-critical missions that demand rapid situational understanding and autonomous adaptation. This paper explores pot… ▽ More

    Submitted 14 June, 2025; originally announced June 2025.

    Comments: 8 pages, 5 figures,submitted to IEEE wcm

  22. arXiv:2506.12708  [pdf, ps, other

    cs.DC cs.AI cs.AR cs.LG

    Serving Large Language Models on Huawei CloudMatrix384

    Authors: Pengfei Zuo, Huimin Lin, Junbo Deng, Nan Zou, Xingkun Yang, Yingyu Diao, Weifeng Gao, Ke Xu, Zhangyu Chen, Shirui Lu, Zhao Qiu, Peiyang Li, Xianyu Chang, Zhengzhong Yu, Fangzheng Miao, Jia Zheng, Ying Li, Yuan Feng, Bei Wang, Zaijian Zong, Mosong Zhou, Wenli Zhou, Houjiang Chen, Xingyu Liao, Yipeng Li , et al. (21 additional authors not shown)

    Abstract: The rapid evolution of large language models (LLMs), driven by growing parameter scales, adoption of mixture-of-experts (MoE) architectures, and expanding context lengths, imposes unprecedented demands on AI infrastructure. Traditional AI clusters face limitations in compute intensity, memory bandwidth, inter-chip communication, and latency, compounded by variable workloads and strict service-leve… ▽ More

    Submitted 19 June, 2025; v1 submitted 14 June, 2025; originally announced June 2025.

    Comments: 59 pages, 24 figures

  23. arXiv:2506.12700  [pdf, ps, other

    cs.LG

    Large Scalable Cross-Domain Graph Neural Networks for Personalized Notification at LinkedIn

    Authors: Shihai He, Julie Choi, Tianqi Li, Zhiwei Ding, Peng Du, Priya Bannur, Franco Liang, Fedor Borisyuk, Padmini Jaikumar, Xiaobing Xue, Viral Gupta

    Abstract: Notification recommendation systems are critical to driving user engagement on professional platforms like LinkedIn. Designing such systems involves integrating heterogeneous signals across domains, capturing temporal dynamics, and optimizing for multiple, often competing, objectives. Graph Neural Networks (GNNs) provide a powerful framework for modeling complex interactions in such environments.… ▽ More

    Submitted 14 June, 2025; originally announced June 2025.

    MSC Class: 68R10

  24. arXiv:2506.12479  [pdf, ps, other

    cs.AI cs.CL cs.CV cs.DC eess.SP

    AI Flow: Perspectives, Scenarios, and Approaches

    Authors: Hongjun An, Sida Huang, Siqi Huang, Ruanjun Li, Yuanzhi Liang, Jiawei Shao, Zihan Wang, Cheng Yuan, Chi Zhang, Hongyuan Zhang, Wenhao Zhuang, Xuelong Li

    Abstract: Pioneered by the foundational information theory by Claude Shannon and the visionary framework of machine intelligence by Alan Turing, the convergent evolution of information and communication technologies (IT/CT) has created an unbroken wave of connectivity and computation. This synergy has sparked a technological revolution, now reaching its peak with large artificial intelligence (AI) models th… ▽ More

    Submitted 14 June, 2025; originally announced June 2025.

    Comments: Authors are with Institute of Artificial Intelligence (TeleAI), China Telecom, China. Author names are listed alphabetically by surname. This work was conducted at TeleAI, facilitated by Dr. Jiawei Shao (e-mail: [email protected]) under the leadership of Prof. Xuelong Li. The corresponding author is Prof. Xuelong Li (e-mail: xuelong [email protected]), the CTO and Chief Scientist of China Telecom

  25. arXiv:2506.12441  [pdf, ps, other

    cs.CV cs.AI

    MS-UMamba: An Improved Vision Mamba Unet for Fetal Abdominal Medical Image Segmentation

    Authors: Caixu Xu, Junming Wei, Huizhen Chen, Pengchen Liang, Bocheng Liang, Ying Tan, Xintong Wei

    Abstract: Recently, Mamba-based methods have become popular in medical image segmentation due to their lightweight design and long-range dependency modeling capabilities. However, current segmentation methods frequently encounter challenges in fetal ultrasound images, such as enclosed anatomical structures, blurred boundaries, and small anatomical structures. To address the need for balancing local feature… ▽ More

    Submitted 14 June, 2025; originally announced June 2025.

  26. arXiv:2506.12430  [pdf, ps, other

    cs.CR cs.CV

    Pushing the Limits of Safety: A Technical Report on the ATLAS Challenge 2025

    Authors: Zonghao Ying, Siyang Wu, Run Hao, Peng Ying, Shixuan Sun, Pengyu Chen, Junze Chen, Hao Du, Kaiwen Shen, Shangkun Wu, Jiwei Wei, Shiyuan He, Yang Yang, Xiaohai Xu, Ke Ma, Qianqian Xu, Qingming Huang, Shi Lin, Xun Wang, Changting Lin, Meng Han, Yilei Jiang, Siqi Lai, Yaozhi Zheng, Yifei Song , et al. (22 additional authors not shown)

    Abstract: Multimodal Large Language Models (MLLMs) have enabled transformative advancements across diverse applications but remain susceptible to safety threats, especially jailbreak attacks that induce harmful outputs. To systematically evaluate and improve their safety, we organized the Adversarial Testing & Large-model Alignment Safety Grand Challenge (ATLAS) 2025}. This technical report presents finding… ▽ More

    Submitted 14 June, 2025; originally announced June 2025.

  27. arXiv:2506.12286  [pdf, ps, other

    cs.AI cs.SE

    The SWE-Bench Illusion: When State-of-the-Art LLMs Remember Instead of Reason

    Authors: Shanchao Liang, Spandan Garg, Roshanak Zilouchian Moghaddam

    Abstract: As large language models (LLMs) become increasingly capable and widely adopted, benchmarks play a central role in assessing their practical utility. For example, SWE-Bench Verified has emerged as a critical benchmark for evaluating LLMs' software engineering abilities, particularly their aptitude for resolving real-world GitHub issues. Recent LLMs show impressive performance on SWE-Bench, leading… ▽ More

    Submitted 13 June, 2025; originally announced June 2025.

  28. arXiv:2506.12264  [pdf

    cs.ET cs.AR

    A Novel Thermal Network Model and Electro-Thermal Coupling Study for NSFETs and CFETs Considering Thermal Crosstalk

    Authors: Tianci Miao, Qihang Zheng, Yangyang Hu, Xiaoyu Cheng, Jie Liang, Liang Chen, Aiying Guo, Jingjing Liu, Kailin Ren, Jianhua Zhang

    Abstract: As the technology node continues to shrink, nanosheet field effect transistors (NSFETs) and complementary FETs (CFETs) become valid candidates for the 3nm and sub-nanometre nodes. However, due to the shrinking device size, self-heating and inter-device thermal crosstalk of NSFETs and CFETs become more severe. It is important to accurately calculate the self-heating and thermal crosstalk of devices… ▽ More

    Submitted 9 March, 2025; originally announced June 2025.

  29. arXiv:2506.12103  [pdf, other

    cs.AI cs.CY cs.LG

    The Amazon Nova Family of Models: Technical Report and Model Card

    Authors: Amazon AGI, Aaron Langford, Aayush Shah, Abhanshu Gupta, Abhimanyu Bhatter, Abhinav Goyal, Abhinav Mathur, Abhinav Mohanty, Abhishek Kumar, Abhishek Sethi, Abi Komma, Abner Pena, Achin Jain, Adam Kunysz, Adam Opyrchal, Adarsh Singh, Aditya Rawal, Adok Achar Budihal Prasad, Adrià de Gispert, Agnika Kumar, Aishwarya Aryamane, Ajay Nair, Akilan M, Akshaya Iyengar, Akshaya Vishnu Kudlu Shanbhogue , et al. (761 additional authors not shown)

    Abstract: We present Amazon Nova, a new generation of state-of-the-art foundation models that deliver frontier intelligence and industry-leading price performance. Amazon Nova Pro is a highly-capable multimodal model with the best combination of accuracy, speed, and cost for a wide range of tasks. Amazon Nova Lite is a low-cost multimodal model that is lightning fast for processing images, video, documents… ▽ More

    Submitted 17 March, 2025; originally announced June 2025.

    Comments: 48 pages, 10 figures

    Report number: 20250317

  30. arXiv:2506.11991  [pdf, ps, other

    cs.CV cs.AI cs.CL

    VGR: Visual Grounded Reasoning

    Authors: Jiacong Wang, Zijian Kang, Haochen Wang, Haiyong Jiang, Jiawen Li, Bohong Wu, Ya Wang, Jiao Ran, Xiao Liang, Chao Feng, Jun Xiao

    Abstract: In the field of multimodal chain-of-thought (CoT) reasoning, existing approaches predominantly rely on reasoning on pure language space, which inherently suffers from language bias and is largely confined to math or science domains. This narrow focus limits their ability to handle complex visual reasoning tasks that demand comprehensive understanding of image details. To address these limitations,… ▽ More

    Submitted 16 June, 2025; v1 submitted 13 June, 2025; originally announced June 2025.

    Comments: 9 pages, 4 figures

  31. arXiv:2506.11881  [pdf, ps, other

    physics.atom-ph cond-mat.quant-gas quant-ph

    Continuously trapped matter-wave interferometry in magic Floquet-Bloch band structures

    Authors: Xiao Chai, Jeremy L. Tanlimco, Eber Nolasco-Martinez, Xuanwei Liang, E. Quinn Simmons, Eric Zhu, Roshan Sajjad, Hector Mas, S. Nicole Halawani, David M. Weld

    Abstract: Trapped matter-wave interferometry offers the promise of compact high-precision local force sensing. However, the trap itself can introduce new systematic errors which are absent in traditional free-fall interferometers. We describe and demonstrate a novel Floquet-engineered platform for compact, continuously trapped atom interferometry which is intrinsically robust against trap noise and beamspli… ▽ More

    Submitted 13 June, 2025; originally announced June 2025.

    Comments: 20 pages, 11 figures

  32. arXiv:2506.11870  [pdf, ps, other

    cs.DB

    LLM-based Dynamic Differential Testing for Database Connectors with Reinforcement Learning-Guided Prompt Selection

    Authors: Ce Lyu, Minghao Zhao, Yanhao Wang, Liang Jie

    Abstract: Database connectors are critical components enabling applications to interact with underlying database management systems (DBMS), yet their security vulnerabilities often remain overlooked. Unlike traditional software defects, connector vulnerabilities exhibit subtle behavioral patterns and are inherently challenging to detect. Besides, nonstandardized implementation of connectors leaves potential… ▽ More

    Submitted 13 June, 2025; originally announced June 2025.

    Comments: 5 pages

    MSC Class: 68N99 ACM Class: H.2.4; D.2.5

  33. arXiv:2506.11784  [pdf, ps, other

    cs.CV

    GPLQ: A General, Practical, and Lightning QAT Method for Vision Transformers

    Authors: Guang Liang, Xinyao Liu, Jianxin Wu

    Abstract: Vision Transformers (ViTs) are essential in computer vision but are computationally intensive, too. Model quantization, particularly to low bit-widths like 4-bit, aims to alleviate this difficulty, yet existing Post-Training Quantization (PTQ) and Quantization-Aware Training (QAT) methods exhibit significant limitations. PTQ often incurs substantial accuracy drop, while QAT achieves high accuracy… ▽ More

    Submitted 13 June, 2025; originally announced June 2025.

  34. arXiv:2506.11783  [pdf, ps, other

    hep-ex

    Holistic approach and Advanced Color Singlet Identification for physics measurements at high energy frontier

    Authors: Yongfeng Zhu, Hao Liang, Yuexin Wang, Yuzhi Che, Hengyu Wang, Chen Zhou, Huilin Qu, Manqi Ruan

    Abstract: To enhance the discovery power of high-energy colliders, we propose a holistic approach and Advanced Color Singlet Identification (ACSI), both of which utilize inclusive reconstructed information as input. The holistic approach is designed to simultaneously classify physics events, while ACSI focuses on associating final-state particles with their parent massive bosons. Implemented using state-of-… ▽ More

    Submitted 13 June, 2025; originally announced June 2025.

  35. arXiv:2506.11612  [pdf, ps, other

    cs.CR cs.SE

    KEENHash: Hashing Programs into Function-Aware Embeddings for Large-Scale Binary Code Similarity Analysis

    Authors: Zhijie Liu, Qiyi Tang, Sen Nie, Shi Wu, Liang Feng Zhang, Yutian Tang

    Abstract: Binary code similarity analysis (BCSA) is a crucial research area in many fields such as cybersecurity. Specifically, function-level diffing tools are the most widely used in BCSA: they perform function matching one by one for evaluating the similarity between binary programs. However, such methods need a high time complexity, making them unscalable in large-scale scenarios (e.g., 1/n-to-n search)… ▽ More

    Submitted 13 June, 2025; originally announced June 2025.

  36. arXiv:2506.11512  [pdf, ps, other

    cs.LG cs.AI

    Prioritizing Alignment Paradigms over Task-Specific Model Customization in Time-Series LLMs

    Authors: Wei Li, Yunyao Cheng, Xinli Hao, Chaohong Ma, Yuxuan Liang, Bin Yang, Christian S. Jensen, Xiaofeng Meng

    Abstract: Recent advances in Large Language Models (LLMs) have enabled unprecedented capabilities for time-series reasoning in diverse real-world applications, including medical, financial, and spatio-temporal domains. However, existing approaches typically focus on task-specific model customization, such as forecasting and anomaly detection, while overlooking the data itself, referred to as time-series pri… ▽ More

    Submitted 13 June, 2025; originally announced June 2025.

  37. arXiv:2506.11498  [pdf, ps, other

    cs.CL

    Lag-Relative Sparse Attention In Long Context Training

    Authors: Manlai Liang, Wanyi Huang, Mandi Liu, Huaijun Li, Jinlong Li

    Abstract: Large Language Models (LLMs) have made significant strides in natural language processing and generation, yet their ability to handle long-context input remains constrained by the quadratic complexity of attention computation and linear-increasing key-value memory footprint. To reduce computational costs and memory, key-value cache compression techniques are commonly applied at inference time, but… ▽ More

    Submitted 13 June, 2025; originally announced June 2025.

  38. arXiv:2506.11400  [pdf, ps, other

    cs.SE cs.RO

    A Step-by-Step Guide to Creating a Robust Autonomous Drone Testing Pipeline

    Authors: Yupeng Jiang, Yao Deng, Sebastian Schroder, Linfeng Liang, Suhaas Gambhir, Alice James, Avishkar Seth, James Pirrie, Yihao Zhang, Xi Zheng

    Abstract: Autonomous drones are rapidly reshaping industries ranging from aerial delivery and infrastructure inspection to environmental monitoring and disaster response. Ensuring the safety, reliability, and efficiency of these systems is paramount as they transition from research prototypes to mission-critical platforms. This paper presents a step-by-step guide to establishing a robust autonomous drone te… ▽ More

    Submitted 12 June, 2025; originally announced June 2025.

  39. arXiv:2506.11397  [pdf, ps, other

    math.NA math.OC

    On existence of a variational regularization parameter under Morozov's discrepancy principle

    Authors: Liang Ding, Long Li, Weimin Han, Wei Wang

    Abstract: Morozov's discrepancy principle is commonly adopted in Tikhonov regularization for choosing the regularization parameter. Nevertheless, for a general non-linear inverse problem, the discrepancy $\|F(x_α^δ)-y^δ\|_Y$ does not depend continuously on $α$ and it is questionable whether there exists a regularization parameter $α$ such that $τ_1δ\leq \|F(x_α^δ)-y^δ\|_Y\leq τ_2 δ$ $(1\le τ_1<τ_2)$. In thi… ▽ More

    Submitted 12 June, 2025; originally announced June 2025.

    Comments: 24 pages, 10 figures

    MSC Class: 47J06 ACM Class: G.1.6

  40. arXiv:2506.11379  [pdf, ps, other

    math.OC

    SVD method for sparse recovery

    Authors: Long Li, Liang Ding

    Abstract: Sparsity regularization has garnered significant interest across multiple disciplines, including statistics, imaging, and signal processing. Standard techniques for addressing sparsity regularization include iterative soft thresholding algorithms and their accelerated variants. However, these algorithms rely on Landweber iteration, which can be computationally intensive. Therefore, there is a pres… ▽ More

    Submitted 12 June, 2025; originally announced June 2025.

    Comments: 33 pages, 5 figures

    MSC Class: 47A52 ACM Class: G.1.6

  41. arXiv:2506.11372  [pdf, ps, other

    math.OC

    $\ell_{1}^{2}-η\ell_{2}^{2}$ regularization for sparse recovery

    Authors: Long Li, Liang Ding

    Abstract: This paper presents a regularization technique incorporating a non-convex and non-smooth term, $\ell_{1}^{2}-η\ell_{2}^{2}$, with parameters $0<η\leq 1$ designed to address ill-posed linear problems that yield sparse solutions. We explore the existence, stability, and convergence of the regularized solution, demonstrating that the $\ell_{1}^{2}-η\ell_{2}^{2}$ regularization is well-posed and resul… ▽ More

    Submitted 12 June, 2025; originally announced June 2025.

    Comments: 40 pages, 9 figures

    MSC Class: 47A52 ACM Class: G.1.6

  42. arXiv:2506.11343  [pdf, ps, other

    cs.CL

    From Replication to Redesign: Exploring Pairwise Comparisons for LLM-Based Peer Review

    Authors: Yaohui Zhang, Haijing Zhang, Wenlong Ji, Tianyu Hua, Nick Haber, Hancheng Cao, Weixin Liang

    Abstract: The advent of large language models (LLMs) offers unprecedented opportunities to reimagine peer review beyond the constraints of traditional workflows. Despite these opportunities, prior efforts have largely focused on replicating traditional review workflows with LLMs serving as direct substitutes for human reviewers, while limited attention has been given to exploring new paradigms that fundamen… ▽ More

    Submitted 12 June, 2025; originally announced June 2025.

  43. arXiv:2506.11337  [pdf

    physics.geo-ph

    Multiscale transform based seismic reflectivity inversion using convolutional neural network

    Authors: John Castagna, Oleg Portniaguine, Gabriel Gil, Arnold Oyem, Chen Liang

    Abstract: The Multiscale Fourier Transform of a seismic trace performs time-frequency analyses over a range of window lengths. The variation in window length captures local and global relative amplitudes between events, thereby allowing reflectivity inversion that is independent of the amplitude spectrum of the seismic wavelet. As the temporal and spatial variation of the actual seismic wavelet in seismic r… ▽ More

    Submitted 12 June, 2025; originally announced June 2025.

  44. arXiv:2506.11144  [pdf, ps, other

    cs.CV

    AlignHuman: Improving Motion and Fidelity via Timestep-Segment Preference Optimization for Audio-Driven Human Animation

    Authors: Chao Liang, Jianwen Jiang, Wang Liao, Jiaqi Yang, Zerong zheng, Weihong Zeng, Han Liang

    Abstract: Recent advancements in human video generation and animation tasks, driven by diffusion models, have achieved significant progress. However, expressive and realistic human animation remains challenging due to the trade-off between motion naturalness and visual fidelity. To address this, we propose \textbf{AlignHuman}, a framework that combines Preference Optimization as a post-training technique wi… ▽ More

    Submitted 11 June, 2025; originally announced June 2025.

    Comments: Homepage: https://alignhuman.github.io/

  45. arXiv:2506.11017  [pdf, ps, other

    cs.CL cs.AI cs.PF

    TeleEval-OS: Performance evaluations of large language models for operations scheduling

    Authors: Yanyan Wang, Yingying Wang, Junli Liang, Yin Xu, Yunlong Liu, Yiming Xu, Zhengwang Jiang, Zhehe Li, Fei Li, Long Zhao, Kuang Xu, Qi Song, Xiangyang Li

    Abstract: The rapid advancement of large language models (LLMs) has significantly propelled progress in artificial intelligence, demonstrating substantial application potential across multiple specialized domains. Telecommunications operation scheduling (OS) is a critical aspect of the telecommunications industry, involving the coordinated management of networks, services, risks, and human resources to opti… ▽ More

    Submitted 5 May, 2025; originally announced June 2025.

  46. arXiv:2506.10960  [pdf, other

    cs.CL cs.AI cs.CR cs.IR cs.LG

    ChineseHarm-Bench: A Chinese Harmful Content Detection Benchmark

    Authors: Kangwei Liu, Siyuan Cheng, Bozhong Tian, Xiaozhuan Liang, Yuyang Yin, Meng Han, Ningyu Zhang, Bryan Hooi, Xi Chen, Shumin Deng

    Abstract: Large language models (LLMs) have been increasingly applied to automated harmful content detection tasks, assisting moderators in identifying policy violations and improving the overall efficiency and accuracy of content review. However, existing resources for harmful content detection are predominantly focused on English, with Chinese datasets remaining scarce and often limited in scope. We prese… ▽ More

    Submitted 12 June, 2025; originally announced June 2025.

    Comments: Work in progress

  47. arXiv:2506.10938  [pdf, ps, other

    cond-mat.supr-con cond-mat.str-el

    From Fractionalization to Chiral Topological Superconductivity in Flat Chern Band

    Authors: Daniele Guerci, Ahmed Abouelkomsan, Liang Fu

    Abstract: We show that interacting electrons in a flat Chern band can form, in addition to fractional Chern insulators, a chiral $f$-wave topological superconductor that hosts neutral Majorana fermion edge modes. Superconductivity emerges from an interaction-induced metallic state that exhibits anomalous Hall effect, as observed in rhombohedral graphene and near the $ν=\frac{2}{3}$ fractional Chern insulato… ▽ More

    Submitted 12 June, 2025; originally announced June 2025.

    Comments: 7+9 pages, 5+6 figures

  48. MARS: Processing-In-Memory Acceleration of Raw Signal Genome Analysis Inside the Storage Subsystem

    Authors: Melina Soysal, Konstantina Koliogeorgi, Can Firtina, Nika Mansouri Ghiasi, Rakesh Nadig, Haiyu Mayo, Geraldo F. Oliveira, Yu Liang, Klea Zambaku, Mohammad Sadrosadati, Onur Mutlu

    Abstract: Raw signal genome analysis (RSGA) has emerged as a promising approach to enable real-time genome analysis by directly analyzing raw electrical signals. However, rapid advancements in sequencing technologies make it increasingly difficult for software-based RSGA to match the throughput of raw signal generation. This paper demonstrates that while hardware acceleration techniques can significantly ac… ▽ More

    Submitted 12 June, 2025; originally announced June 2025.

  49. arXiv:2506.10877  [pdf, ps, other

    cs.CL

    Enhancing Medical Dialogue Generation through Knowledge Refinement and Dynamic Prompt Adjustment

    Authors: Hongda Sun, Jiaren Peng, Wenzhong Yang, Liang He, Bo Du, Rui Yan

    Abstract: Medical dialogue systems (MDS) have emerged as crucial online platforms for enabling multi-turn, context-aware conversations with patients. However, existing MDS often struggle to (1) identify relevant medical knowledge and (2) generate personalized, medically accurate responses. To address these challenges, we propose MedRef, a novel MDS that incorporates knowledge refining and dynamic prompt adj… ▽ More

    Submitted 12 June, 2025; originally announced June 2025.

    Comments: ACL 2025 Findings

  50. arXiv:2506.10776  [pdf, ps, other

    cs.CR cs.AI

    ME: Trigger Element Combination Backdoor Attack on Copyright Infringement

    Authors: Feiyu Yang, Siyuan Liang, Aishan Liu, Dacheng Tao

    Abstract: The capability of generative diffusion models (DMs) like Stable Diffusion (SD) in replicating training data could be taken advantage of by attackers to launch the Copyright Infringement Attack, with duplicated poisoned image-text pairs. SilentBadDiffusion (SBD) is a method proposed recently, which shew outstanding performance in attacking SD in text-to-image tasks. However, the feasible data resou… ▽ More

    Submitted 12 June, 2025; originally announced June 2025.