
Showing 151–200 of 3,624 results for author: Hang

  1. arXiv:2504.18000  [pdf, other]

    astro-ph.CO gr-qc

    The Impact of Inhomogeneous Perturbations of the Inflaton on the Cosmological Primordial Magnetic Field

    Authors: Yu Li, Shuang Liu, Hang Wang, Yao-Chuan Wang

    Abstract: We investigate the impact of inhomogeneous inflaton perturbations on primordial magnetic fields within the framework of generalized inflationary magnetogenesis models. Extending the Ratra model to general spacetime backgrounds, we analyze the constraint structure of the electromagnetic field and demonstrate that the standard Coulomb gauge must be generalized to accommodate spatial inhomogeneities.…

    Submitted 24 April, 2025; originally announced April 2025.

    Comments: 13 pages, 1 figure

  2. arXiv:2504.17384  [pdf, other]

    physics.geo-ph cs.AI

    On the workflow, opportunities and challenges of developing foundation model in geophysics

    Authors: Hanlin Sheng, Xinming Wu, Hang Gao, Haibin Di, Sergey Fomel, Jintao Li, Xu Si

    Abstract: Foundation models, as a mainstream technology in artificial intelligence, have demonstrated immense potential across various domains in recent years, particularly in handling complex tasks and multimodal data. In the field of geophysics, although the application of foundation models is gradually expanding, there is currently a lack of comprehensive reviews discussing the full workflow of integrati…

    Submitted 25 April, 2025; v1 submitted 24 April, 2025; originally announced April 2025.

  3. arXiv:2504.16693  [pdf, ps, other]

    cs.LG cs.RO

    PIN-WM: Learning Physics-INformed World Models for Non-Prehensile Manipulation

    Authors: Wenxuan Li, Hang Zhao, Zhiyuan Yu, Yu Du, Qin Zou, Ruizhen Hu, Kai Xu

    Abstract: While non-prehensile manipulation (e.g., controlled pushing/poking) constitutes a foundational robotic skill, its learning remains challenging due to the high sensitivity to complex physical interactions involving friction and restitution. To achieve robust policy learning and generalization, we opt to learn a world model of the 3D rigid body dynamics involved in non-prehensile manipulations and u…

    Submitted 3 May, 2025; v1 submitted 23 April, 2025; originally announced April 2025.

    Comments: Robotics: Science and Systems 2025

  4. arXiv:2504.16385  [pdf, other]

    eess.SY

    Distributed Space Resource Logistics Architecture Optimization under Economies of Scale

    Authors: Evangelia Gkaravela, Hang Woon Lee, Hao Chen

    Abstract: This paper proposes an optimization framework for distributed resource logistics system design to support future multimission space exploration. The performance and impact of distributed In-Situ Resource Utilization (ISRU) systems in facilitating space transportation are analyzed. The proposed framework considers technology trade studies, deployment strategy, facility location evaluation, and reso…

    Submitted 22 April, 2025; originally announced April 2025.

    Comments: 27 pages, 13 figures, Journal of Spacecraft and Rockets (Accepted)

  5. arXiv:2504.16008  [pdf, other]

    quant-ph

    An Error Mitigated Non-Orthogonal Quantum Eigensolver via Shadow Tomography

    Authors: Hang Ren, Yipei Zhang, Wendy M. Billings, Rebecca Tomann, Nikolay V. Tkachenko, Martin Head-Gordon, K. Birgitta Whaley

    Abstract: We present a shadow-tomography-enhanced Non-Orthogonal Quantum Eigensolver (NOQE) for more efficient and accurate electronic structure calculations on near-term quantum devices. By integrating shadow tomography into the NOQE, the measurement cost scales linearly rather than quadratically with the number of reference states, while also reducing the required qubits and circuit depth by half. This ap…

    Submitted 23 April, 2025; v1 submitted 22 April, 2025; originally announced April 2025.

  6. arXiv:2504.15300  [pdf, other]

    cs.LG cs.DC cs.MA

    Collaborative Learning of On-Device Small Model and Cloud-Based Large Model: Advances and Future Directions

    Authors: Chaoyue Niu, Yucheng Ding, Junhui Lu, Zhengxiang Huang, Hang Zeng, Yutong Dai, Xuezhen Tu, Chengfei Lv, Fan Wu, Guihai Chen

    Abstract: The conventional cloud-based large model learning framework is increasingly constrained by latency, cost, personalization, and privacy concerns. In this survey, we explore an emerging paradigm: collaborative learning between an on-device small model and a cloud-based large model, which promises low-latency, cost-efficient, and personalized intelligent services while preserving user privacy. We provide…

    Submitted 17 April, 2025; originally announced April 2025.

  7. arXiv:2504.14594  [pdf, other]

    cs.HC cs.AI cs.CL

    HealthGenie: Empowering Users with Healthy Dietary Guidance through Knowledge Graph and Large Language Models

    Authors: Fan Gao, Xinjie Zhao, Ding Xia, Zhongyi Zhou, Rui Yang, Jinghui Lu, Hang Jiang, Chanjun Park, Irene Li

    Abstract: Seeking dietary guidance often requires navigating complex professional knowledge while accommodating individual health conditions. Knowledge Graphs (KGs) offer structured and interpretable nutritional information, whereas Large Language Models (LLMs) naturally facilitate conversational recommendation delivery. In this paper, we present HealthGenie, an interactive system that combines the strength…

    Submitted 20 April, 2025; originally announced April 2025.

  8. arXiv:2504.13914  [pdf, other]

    cs.CL

    Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement Learning

    Authors: ByteDance Seed: Jiaze Chen, Tiantian Fan, Xin Liu, Lingjun Liu, Zhiqi Lin, Mingxuan Wang, Chengyi Wang, Xiangpeng Wei, Wenyuan Xu, Yufeng Yuan, Yu Yue, Lin Yan, Qiying Yu, Xiaochen Zuo, Chi Zhang, Ruofei Zhu, Zhecheng An, Zhihao Bai, Yu Bao, Xingyan Bin, Jiangjie Chen, Feng Chen, Hongmin Chen, et al. (249 additional authors not shown)

    Abstract: We introduce Seed1.5-Thinking, capable of reasoning through thinking before responding, resulting in improved performance on a wide range of benchmarks. Seed1.5-Thinking achieves 86.7 on AIME 2024, 55.0 on Codeforces and 77.3 on GPQA, demonstrating excellent reasoning abilities in STEM and coding. Beyond reasoning tasks, the method demonstrates notable generalization across diverse domains. For in…

    Submitted 29 April, 2025; v1 submitted 10 April, 2025; originally announced April 2025.

  9. arXiv:2504.13865  [pdf, ps, other]

    cs.HC cs.AI cs.CL cs.CV

    A Survey on (M)LLM-Based GUI Agents

    Authors: Fei Tang, Haolei Xu, Hang Zhang, Siqi Chen, Xingyu Wu, Yongliang Shen, Wenqi Zhang, Guiyang Hou, Zeqi Tan, Yuchen Yan, Kaitao Song, Jian Shao, Weiming Lu, Jun Xiao, Yueting Zhuang

    Abstract: Graphical User Interface (GUI) Agents have emerged as a transformative paradigm in human-computer interaction, evolving from rule-based automation scripts to sophisticated AI-driven systems capable of understanding and executing complex interface operations. This survey provides a comprehensive examination of the rapidly advancing field of LLM-based GUI Agents, systematically analyzing their archi…

    Submitted 4 June, 2025; v1 submitted 27 March, 2025; originally announced April 2025.

  10. arXiv:2504.12892  [pdf, other]

    math.NA

    Manifold-valued function approximation from multiple tangent spaces

    Authors: Hang Wang, Raf Vandebril, Joeri Van der Veken, Nick Vannieuwenhoven

    Abstract: Approximating a manifold-valued function from samples of input-output pairs consists of modeling the relationship between an input from a vector space and an output on a Riemannian manifold. We propose a function approximation method that leverages and unifies two prior techniques: (i) approximating a pullback to the tangent space, and (ii) the Riemannian moving least squares method. The core idea…

    Submitted 17 April, 2025; originally announced April 2025.

    Comments: 25 pages, 7 figures

    MSC Class: 65D15; 65D40; 65J99; 46T20; 53B20; 58C25

  11. arXiv:2504.12711  [pdf, other]

    cs.CV cs.AI eess.IV

    NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images: Methods and Results

    Authors: Xin Li, Yeying Jin, Xin Jin, Zongwei Wu, Bingchen Li, Yufei Wang, Wenhan Yang, Yu Li, Zhibo Chen, Bihan Wen, Robby T. Tan, Radu Timofte, Qiyu Rong, Hongyuan Jing, Mengmeng Zhang, Jinglong Li, Xiangyu Lu, Yi Ren, Yuting Liu, Meng Zhang, Xiang Chen, Qiyuan Guan, Jiangxin Dong, Jinshan Pan, Conglin Gou, et al. (112 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images. This challenge received a wide range of impressive solutions, which are developed and evaluated using our collected real-world Raindrop Clarity dataset. Unlike existing deraining datasets, our Raindrop Clarity dataset is more diverse and challenging in degradation types and contents, which includ…

    Submitted 19 April, 2025; v1 submitted 17 April, 2025; originally announced April 2025.

    Comments: Challenge Report of CVPR NTIRE 2025; 26 pages; Methods from 32 teams

  12. arXiv:2504.12643  [pdf, ps, other]

    cs.CV

    RoPETR: Improving Temporal Camera-Only 3D Detection by Integrating Enhanced Rotary Position Embedding

    Authors: Hang Ji, Tao Ni, Xufeng Huang, Zhan Shi, Tao Luo, Xin Zhan, Junbo Chen

    Abstract: This technical report introduces a targeted improvement to the StreamPETR framework, specifically aimed at enhancing velocity estimation, a critical factor influencing the overall NuScenes Detection Score. While StreamPETR exhibits strong 3D bounding box detection performance, as reflected by its high mean Average Precision, our analysis identified velocity estimation as a substantial bottleneck whe…

    Submitted 6 June, 2025; v1 submitted 17 April, 2025; originally announced April 2025.

  13. arXiv:2504.12276  [pdf, other]

    cs.CV

    The Tenth NTIRE 2025 Image Denoising Challenge Report

    Authors: Lei Sun, Hang Guo, Bin Ren, Luc Van Gool, Radu Timofte, Yawei Li, Xiangyu Kong, Hyunhee Park, Xiaoxuan Yu, Suejin Han, Hakjae Jeon, Jia Li, Hyung-Ju Chun, Donghun Ryou, Inju Ha, Bohyung Han, Jingyu Ma, Zhijuan Huang, Huiyuan Fu, Hongyuan Yu, Boqi Zhang, Jiawei Shi, Heng Zhang, Huadong Ma, Deepak Kumar Tyagi, et al. (69 additional authors not shown)

    Abstract: This paper presents an overview of the NTIRE 2025 Image Denoising Challenge (σ = 50), highlighting the proposed methodologies and corresponding results. The primary objective is to develop a network architecture capable of achieving high-quality denoising performance, quantitatively evaluated using PSNR, without constraints on computational complexity or model size. The task assumes independent ad…

    Submitted 16 April, 2025; originally announced April 2025.

  14. arXiv:2504.12234  [pdf, other]

    cs.SE

    MOS: Towards Effective Smart Contract Vulnerability Detection through Mixture-of-Experts Tuning of Large Language Models

    Authors: Hang Yuan, Lei Yu, Zhirong Huang, Jingyuan Zhang, Junyi Lu, Shiqi Cheng, Li Yang, Fengjun Zhang, Jiajia Ma, Chun Zuo

    Abstract: Smart contract vulnerabilities pose significant security risks to blockchain systems, potentially leading to severe financial losses. Existing methods face several limitations: (1) Program analysis-based approaches rely on predefined patterns, lacking flexibility for new vulnerability types; (2) Deep learning-based methods lack explanations; (3) Large language model-based approaches suffer from hi…

    Submitted 16 April, 2025; originally announced April 2025.

  15. Multi-goal Rapidly Exploring Random Tree with Safety and Dynamic Constraints for UAV Cooperative Path Planning

    Authors: Thu Hang Khuat, Duy-Nam Bui, Hoa TT. Nguyen, Mien L. Trinh, Minh T. Nguyen, Manh Duong Phung

    Abstract: Cooperative path planning is gaining importance due to the increasing demand for using multiple unmanned aerial vehicles (UAVs) for complex missions. This work addresses the problem by introducing a new algorithm named MultiRRT that extends the rapidly exploring random tree (RRT) to generate paths for a group of UAVs to reach multiple goal locations at the same time. We first derive the dynamic…

    Submitted 16 April, 2025; originally announced April 2025.

    Journal ref: IEEE Transactions on Vehicular Technology, 2025

  16. arXiv:2504.11711  [pdf, ps, other]

    cs.SE cs.AI

    The Hitchhiker's Guide to Program Analysis, Part II: Deep Thoughts by LLMs

    Authors: Haonan Li, Hang Zhang, Kexin Pei, Zhiyun Qian

    Abstract: Static analysis plays a crucial role in software vulnerability detection, yet faces a persistent precision-scalability tradeoff. In large codebases like the Linux kernel, traditional static analysis tools often generate excessive false positives due to simplified vulnerability modeling and overapproximation of path and data constraints. While large language models (LLMs) demonstrate promising code…

    Submitted 31 May, 2025; v1 submitted 15 April, 2025; originally announced April 2025.

  17. arXiv:2504.10686  [pdf, other]

    cs.CV eess.IV

    The Tenth NTIRE 2025 Efficient Super-Resolution Challenge Report

    Authors: Bin Ren, Hang Guo, Lei Sun, Zongwei Wu, Radu Timofte, Yawei Li, Yao Zhang, Xinning Chai, Zhengxue Cheng, Yingsheng Qin, Yucai Yang, Li Song, Hongyuan Yu, Pufan Xu, Cheng Wan, Zhijuan Huang, Peng Guo, Shuyuan Cui, Chenjun Li, Xuehai Hu, Pan Pan, Xin Zhang, Heng Zhang, Qing Luo, Linyan Jiang, et al. (122 additional authors not shown)

    Abstract: This paper presents a comprehensive review of the NTIRE 2025 Challenge on Single-Image Efficient Super-Resolution (ESR). The challenge aimed to advance the development of deep models that optimize key computational metrics, i.e., runtime, parameters, and FLOPs, while achieving a PSNR of at least 26.90 dB on the $\operatorname{DIV2K\_LSDIR\_valid}$ dataset and 26.99 dB on the…

    Submitted 14 April, 2025; originally announced April 2025.

    Comments: Accepted by CVPR2025 NTIRE Workshop, Efficient Super-Resolution Challenge Report. 50 pages

  18. arXiv:2504.10416  [pdf, other]

    cs.RO

    Region Based SLAM-Aware Exploration: Efficient and Robust Autonomous Mapping Strategy That Can Scale

    Authors: Megha Maheshwari, Sadeigh Rabiee, He Yin, Martin Labrie, Hang Liu

    Abstract: Autonomous exploration for mapping unknown large scale environments is a fundamental challenge in robotics, with efficiency in time, stability against map corruption and computational resources being crucial. This paper presents a novel approach to indoor exploration that addresses these key issues in existing methods. We introduce a Simultaneous Localization and Mapping (SLAM)-aware region-based…

    Submitted 14 April, 2025; originally announced April 2025.

    Comments: 8 pages, 9 figures

  19. Bingo: Radix-based Bias Factorization for Random Walk on Dynamic Graphs

    Authors: Pinhuan Wang, Chengying Huan, Zhibin Wang, Chen Tian, Yuede Ji, Hang Liu

    Abstract: Random walks are a primary means for extracting information from large-scale graphs. While most real-world graphs are inherently dynamic, state-of-the-art random walk engines fail to efficiently support such a critical use case. This paper takes the initiative to build a general random walk engine for dynamically changing graphs with two key principles: (i) This system should support both low-la…

    Submitted 14 April, 2025; originally announced April 2025.

    Comments: 17 pages, Published in EuroSys'25

    Journal ref: Proceedings of the Twentieth European Conference on Computer Systems, 2025, pp. 605-620

  20. arXiv:2504.10076  [pdf, other]

    cs.LG stat.ML

    Towards Scalable Bayesian Optimization via Gradient-Informed Bayesian Neural Networks

    Authors: Georgios Makrygiorgos, Joshua Hang Sai Ip, Ali Mesbah

    Abstract: Bayesian optimization (BO) is a widely used method for data-driven optimization that generally relies on zeroth-order data of the objective function to construct probabilistic surrogate models. These surrogates guide the exploration-exploitation process toward finding the global optimum. While Gaussian processes (GPs) are commonly employed as surrogates of the unknown objective function, recent studies ha…

    Submitted 14 April, 2025; originally announced April 2025.

  21. arXiv:2504.10014  [pdf, other]

    cs.LG cs.AI cs.CV

    Air Quality Prediction with A Meteorology-Guided Modality-Decoupled Spatio-Temporal Network

    Authors: Hang Yin, Yan-Ming Zhang, Jian Xu, Jian-Long Chang, Yin Li, Cheng-Lin Liu

    Abstract: Air quality prediction plays a crucial role in public health and environmental protection. Accurate air quality prediction is a complex multivariate spatiotemporal problem that involves interactions across temporal patterns, pollutant correlations, spatial station dependencies, and particularly meteorological influences that govern pollutant dispersion and chemical transformations. Existing works…

    Submitted 14 April, 2025; originally announced April 2025.

  22. arXiv:2504.09280  [pdf, ps, other]

    math.CA math-ph

    Full asymptotic expansions of the Humbert function $Φ_1$

    Authors: Peng-Cheng Hang, Liangjian Hu

    Abstract: We derive full asymptotic expansions for the Humbert function $Φ_1$ in different limiting regimes of its variables. Our derivation employs various asymptotic methods and relies on key transformation formulae established by Erdélyi (1940), and Tuan and Kalla (1987). The efficiency of our asymptotic results is also illustrated through two applications: (1) analytic continuations of Saran's function…

    Submitted 12 April, 2025; originally announced April 2025.

    Comments: 12 pages

    MSC Class: 33C65; 41A60; 82C20

  23. arXiv:2504.08694  [pdf, other]

    cs.CL

    TP-RAG: Benchmarking Retrieval-Augmented Large Language Model Agents for Spatiotemporal-Aware Travel Planning

    Authors: Hang Ni, Fan Liu, Xinyu Ma, Lixin Su, Shuaiqiang Wang, Dawei Yin, Hui Xiong, Hao Liu

    Abstract: Large language models (LLMs) have shown promise in automating travel planning, yet they often fall short in addressing nuanced spatiotemporal rationality. While existing benchmarks focus on basic plan validity, they neglect critical aspects such as route efficiency, POI appeal, and real-time adaptability. This paper introduces TP-RAG, the first benchmark tailored for retrieval-augmented, spatiotem…

    Submitted 11 April, 2025; originally announced April 2025.

  24. arXiv:2504.08672  [pdf, other]

    cs.CL cs.AI cs.LG

    Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning

    Authors: Fangzhi Xu, Hang Yan, Chang Ma, Haiteng Zhao, Qiushi Sun, Kanzhi Cheng, Junxian He, Jun Liu, Zhiyong Wu

    Abstract: Advancing LLM reasoning skills has captivated wide interest. However, current post-training techniques rely heavily on supervisory signals, such as outcome supervision or auxiliary reward models, which face the problem of scalability and high annotation costs. This motivates us to enhance LLM reasoning without the need for external supervision. We introduce a generalizable and purely unsupervised…

    Submitted 11 April, 2025; originally announced April 2025.

    Comments: 14 pages, 7 figures

  25. arXiv:2504.08619  [pdf, other]

    cs.DL cs.CL

    Analyzing 16,193 LLM Papers for Fun and Profits

    Authors: Zhiqiu Xia, Lang Zhu, Bingzhe Li, Feng Chen, Qiannan Li, Chunhua Liao, Feiyi Wang, Hang Liu

    Abstract: Large Language Models (LLMs) are reshaping the landscape of computer science research, driving significant shifts in research priorities across diverse conferences and fields. This study provides a comprehensive analysis of the publication trend of LLM-related papers in 77 top-tier computer science conferences over the past six years (2019-2024). We approach this analysis from four distinct perspe…

    Submitted 22 April, 2025; v1 submitted 11 April, 2025; originally announced April 2025.

  26. arXiv:2504.08334  [pdf, other]

    cs.AR cs.DC

    Efficient Architecture for RISC-V Vector Memory Access

    Authors: Hongyi Guan, Yichuan Gao, Chenlu Miao, Haoyang Wu, Hang Zhu, Mingfeng Lin, Huayue Liang

    Abstract: Vector processors frequently suffer from inefficient memory accesses, particularly for strided and segment patterns. While coalescing strided accesses is a natural solution, effectively gathering or scattering elements at fixed strides remains challenging. Naive approaches rely on high-overhead crossbars that remap any byte between memory and registers, leading to physical design issues. Segment o…

    Submitted 16 April, 2025; v1 submitted 11 April, 2025; originally announced April 2025.

  27. arXiv:2504.06330  [pdf, other]

    cs.CV cs.AI

    Analyzing the Impact of Low-Rank Adaptation for Cross-Domain Few-Shot Object Detection in Aerial Images

    Authors: Hicham Talaoubrid, Anissa Mokraoui, Ismail Ben Ayed, Axel Prouvost, Sonimith Hang, Monit Korn, Rémi Harvey

    Abstract: This paper investigates the application of Low-Rank Adaptation (LoRA) to small models for cross-domain few-shot object detection in aerial images. Originally designed for large-scale models, LoRA helps mitigate overfitting, making it a promising approach for resource-constrained settings. We integrate LoRA into DiffusionDet, and evaluate its performance on the DOTA and DIOR datasets. Our results s…

    Submitted 8 April, 2025; originally announced April 2025.

  28. arXiv:2504.05810  [pdf, other]

    cs.CV

    PaMi-VDPO: Mitigating Video Hallucinations by Prompt-Aware Multi-Instance Video Preference Learning

    Authors: Xinpeng Ding, Kui Zhang, Jianhua Han, Lanqing Hong, Hang Xu, Xiaomeng Li

    Abstract: Direct Preference Optimization (DPO) helps reduce hallucinations in Video Multimodal Large Language Models (VLLMs), but its reliance on offline preference data limits adaptability and fails to capture true video-response misalignment. We propose Video Direct Preference Optimization (VDPO), an online preference learning framework that eliminates the need for preference annotation by leveraging vide…

    Submitted 15 April, 2025; v1 submitted 8 April, 2025; originally announced April 2025.

  29. arXiv:2504.05541  [pdf, other]

    cs.CV

    Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting

    Authors: Yunlong Tang, Jing Bi, Chao Huang, Susan Liang, Daiki Shimada, Hang Hua, Yunzhong Xiao, Yizhi Song, Pinxin Liu, Mingqian Feng, Junjia Guo, Zhuo Liu, Luchuan Song, Ali Vosoughi, Jinxi He, Liu He, Zeliang Zhang, Jiebo Luo, Chenliang Xu

    Abstract: We present CAT-V (Caption AnyThing in Video), a training-free framework for fine-grained object-centric video captioning that enables detailed descriptions of user-selected objects through time. CAT-V integrates three key components: a Segmenter based on SAMURAI for precise object segmentation across frames, a Temporal Analyzer powered by TRACE-Uni for accurate event boundary detection and tempora…

    Submitted 8 April, 2025; v1 submitted 7 April, 2025; originally announced April 2025.

  30. arXiv:2504.05276  [pdf, ps, other]

    cs.CL

    Enhancing LLM-Based Short Answer Grading with Retrieval-Augmented Generation

    Authors: Yucheng Chu, Peng He, Hang Li, Haoyu Han, Kaiqi Yang, Yu Xue, Tingting Li, Joseph Krajcik, Jiliang Tang

    Abstract: Short answer assessment is a vital component of science education, allowing evaluation of students' complex three-dimensional understanding. Large language models (LLMs) that possess human-like ability in linguistic tasks are increasingly popular in assisting human graders to reduce their workload. However, LLMs' limitations in domain knowledge restrict their understanding in task-specific require…

    Submitted 3 June, 2025; v1 submitted 7 April, 2025; originally announced April 2025.

    Comments: EDM 2025 Short Paper

  31. arXiv:2504.05239  [pdf, other]

    cs.CL

    LLM-based Automated Grading with Human-in-the-Loop

    Authors: Hang Li, Yucheng Chu, Kaiqi Yang, Yasemin Copur-Gencturk, Jiliang Tang

    Abstract: The rise of artificial intelligence (AI) technologies, particularly large language models (LLMs), has brought significant advancements to the field of education. Among various applications, automatic short answer grading (ASAG), which focuses on evaluating open-ended textual responses, has seen remarkable progress with the introduction of LLMs. These models not only enhance grading performance com…

    Submitted 28 April, 2025; v1 submitted 7 April, 2025; originally announced April 2025.

  32. arXiv:2504.05199  [pdf, ps, other]

    hep-th

    Equivalence Theorems and Double-Copy Structure in Scattering Amplitudes of Massive Kaluza-Klein States with Matter Interactions

    Authors: Kezhu Guo, Yanfeng Hang

    Abstract: We investigate the scattering amplitudes of massive Kaluza-Klein (KK) states in compactified five-dimensional warped gauge and gravity theories. Focusing on tree-level $2\to2$ processes, we analyze the leading-order amplitudes involving bulk KK matter fields and KK gauge/gravitational Goldstone bosons. By imposing the gauge theory equivalence theorem (GAET) and the gravitational equivalence theore…

    Submitted 14 May, 2025; v1 submitted 7 April, 2025; originally announced April 2025.

    Comments: 31 pages. Incorporating general N-point discussion and new references. The typos have been corrected and the conclusion remains unchanged

  33. arXiv:2504.05118  [pdf, other]

    cs.AI

    VAPO: Efficient and Reliable Reinforcement Learning for Advanced Reasoning Tasks

    Authors: Yu Yue, Yufeng Yuan, Qiying Yu, Xiaochen Zuo, Ruofei Zhu, Wenyuan Xu, Jiaze Chen, Chengyi Wang, TianTian Fan, Zhengyin Du, Xiangpeng Wei, Xiangyu Yu, Gaohong Liu, Juncai Liu, Lingjun Liu, Haibin Lin, Zhiqi Lin, Bole Ma, Chi Zhang, Mofan Zhang, Wang Zhang, Hang Zhu, Ru Zhang, Xin Liu, Mingxuan Wang, et al. (2 additional authors not shown)

    Abstract: We present VAPO (Value-based Augmented Proximal Policy Optimization), a novel framework tailored for reasoning models within the value-based paradigm. Benchmarked on the AIME 2024 dataset, VAPO, built on the Qwen 32B pre-trained model, attains a state-of-the-art score of $\mathbf{60.4}$. In direct comparison under identical experimental settings, VAPO outperforms the pr…

    Submitted 10 April, 2025; v1 submitted 7 April, 2025; originally announced April 2025.

  34. arXiv:2504.05041  [pdf, ps, other]

    cs.RO

    Segmented Trajectory Optimization for Autonomous Parking in Unstructured Environments

    Authors: Hang Yu, Renjie Li

    Abstract: This paper presents a Segmented Trajectory Optimization (STO) method for autonomous parking, which refines an initial trajectory into a dynamically feasible and collision-free one using an iterative SQP-based approach. STO maintains the maneuver strategy of the high-level global planner while allowing curvature discontinuities at switching points to improve maneuver efficiency. To ensure safety, a…

    Submitted 7 April, 2025; originally announced April 2025.

    Comments: 8 pages, 6 figures, submitted to IROS 2025

  35. arXiv:2504.04748  [pdf, other]

    math.PR

    Sharp threshold for network recovery from voter model dynamics

    Authors: Hang Du, Seokmin Ha, Oriol Solé-Pi

    Abstract: We investigate the problem of recovering a latent directed Erdős-Rényi graph $G^*\sim \mathcal G(n,p)$ from observations of discrete voter model trajectories on $G^*$, where $np$ grows polynomially in $n$. Given access to $M$ independent voter model trajectories evolving up to time $T$, we establish that $G^*$ can be recovered \emph{exactly} with probability at least $0.9$ by an \emph{efficient} a…

    Submitted 7 April, 2025; originally announced April 2025.

    Comments: 58 pages, 3 figures

    MSC Class: 60K35; 68Q87

  36. arXiv:2504.04598  [pdf, other]

    cs.RO

    B4P: Simultaneous Grasp and Motion Planning for Object Placement via Parallelized Bidirectional Forests and Path Repair

    Authors: Benjamin H. Leebron, Kejia Ren, Yiting Chen, Kaiyu Hang

    Abstract: Robot pick and place systems have traditionally decoupled grasp, placement, and motion planning to build sequential optimization pipelines with the assumption that the individual components will be able to work together. However, this separation introduces sub-optimality, as grasp choices may limit or even prohibit feasible motions for a robot to reach the target placement pose, particularly in cl…

    Submitted 6 April, 2025; originally announced April 2025.

  37. arXiv:2504.04421  [pdf, other]

    cs.RO cs.LG

    Deliberate Planning of 3D Bin Packing on Packing Configuration Trees

    Authors: Hang Zhao, Juzhan Xu, Kexiong Yu, Ruizhen Hu, Chenyang Zhu, Kai Xu

    Abstract: Online 3D Bin Packing Problem (3D-BPP) has widespread applications in industrial automation. Existing methods usually solve the problem with limited resolution of spatial discretization, and/or cannot deal with complex practical constraints well. We propose to enhance the practical applicability of online 3D-BPP via learning on a novel hierarchical representation, packing configuration tree (PCT).…

    Submitted 29 April, 2025; v1 submitted 6 April, 2025; originally announced April 2025.

  38. arXiv:2504.04381  [pdf, other]

    math.NA

    Error analysis of a Euler finite element scheme for Natural convection model with variable density

    Authors: Li Hang, Chenyang Li

    Abstract: In this paper, we derive first-order Euler finite element discretization schemes for a time-dependent natural convection model with variable density (NCVD). The model is governed by the variable density Navier-Stokes equations coupled with a parabolic partial differential equation that describes the evolution of temperature. Stability and error estimate for the velocity, pressure, density and temp…

    Submitted 19 May, 2025; v1 submitted 6 April, 2025; originally announced April 2025.

  39. arXiv:2504.03687  [pdf, other]

    eess.SP cs.AI cs.CV

    Process Optimization and Deployment for Sensor-Based Human Activity Recognition Based on Deep Learning

    Authors: Hanyu Liu, Ying Yu, Hang Xiao, Siyao Li, Xuze Li, Jiarui Li, Haotian Tang

    Abstract: Sensor-based human activity recognition is a key technology for many human-centered intelligent applications. However, this research is still in its infancy and faces many unresolved challenges. To address these, we propose a comprehensive optimization process approach centered on multi-attention interaction. We first utilize unsupervised statistical feature-guided diffusion models for highly adap…

    Submitted 22 March, 2025; originally announced April 2025.

  40. arXiv:2504.03026  [pdf, other]

    cs.CV

    HALO: Human-Aligned End-to-end Image Retargeting with Layered Transformations

    Authors: Yiran Xu, Siqi Xie, Zhuofang Li, Harris Shadmany, Yinxiao Li, Luciano Sbaiz, Miaosen Wang, Junjie Ke, Jose Lezama, Hang Qi, Han Zhang, Jesse Berent, Ming-Hsuan Yang, Irfan Essa, Jia-Bin Huang, Feng Yang

    Abstract: Image retargeting aims to change the aspect-ratio of an image while maintaining its content and structure with fewer visual artifacts. Existing methods still generate many artifacts or fail to maintain original content or structure. To address this, we introduce HALO, an end-to-end trainable solution for image retargeting. Since humans are more sensitive to distortions in salient areas than non-sal…

    Submitted 3 April, 2025; originally announced April 2025.

  41. arXiv:2504.02296  [pdf, other]

    stat.ME

    Exceedance and force of centrality for functional data

    Authors: Poorbita Kundu, Hang Zhou, Hans-Georg Müller

    Abstract: Exceedance refers to instances where a dynamic process surpasses given thresholds, e.g., the occurrence of a heat wave. We propose a novel exceedance framework for functional data, where each observed random trajectory is transformed into an exceedance function, which quantifies exceedance durations as a function of threshold levels. An inherent relationship between exceedance functions and probab…

    Submitted 3 April, 2025; originally announced April 2025.

  42. arXiv:2504.01934  [pdf, other]

    cs.CV

    ILLUME+: Illuminating Unified MLLM with Dual Visual Tokenization and Diffusion Refinement

    Authors: Runhui Huang, Chunwei Wang, Junwei Yang, Guansong Lu, Yunlong Yuan, Jianhua Han, Lu Hou, Wei Zhang, Lanqing Hong, Hengshuang Zhao, Hang Xu

    Abstract: We present ILLUME+ that leverages dual visual tokenization and a diffusion decoder to improve both deep semantic understanding and high-fidelity image generation. Existing unified models have struggled to simultaneously handle the three fundamental capabilities in a unified model: understanding, generation, and editing. Models like Chameleon and EMU3 utilize VQGAN for image discretization, due to…

    Submitted 3 April, 2025; v1 submitted 2 April, 2025; originally announced April 2025.

  43. A Two-Timescale Approach for Wireless Federated Learning with Parameter Freezing and Power Control

    Authors: Jinhao Ouyang, Yuan Liu, Hang Liu

    Abstract: Federated learning (FL) enables distributed devices to train a shared machine learning (ML) model collaboratively while protecting their data privacy. However, the resource-limited mobile devices suffer from intensive computation-and-communication costs of model parameters. In this paper, we observe the phenomenon that the model parameters tend to be stabilized long before convergence during train…

    Submitted 2 April, 2025; originally announced April 2025.

    Comments: 2025 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, including reprinting, republishing, or reuse in other works. This work has been accepted to IEEE Transactions on Mobile Computing

  44. arXiv:2504.01448  [pdf, ps, other]

    cs.IR cs.LG

    LLM-VPRF: Large Language Model Based Vector Pseudo Relevance Feedback

    Authors: Hang Li, Shengyao Zhuang, Bevan Koopman, Guido Zuccon

    Abstract: Vector Pseudo Relevance Feedback (VPRF) has shown promising results in improving BERT-based dense retrieval systems through iterative refinement of query representations. This paper investigates the generalizability of VPRF to Large Language Model (LLM) based dense retrievers. We introduce LLM-VPRF and evaluate its effectiveness across multiple benchmark datasets, analyzing how different LLMs impa…

    Submitted 2 April, 2025; originally announced April 2025.

  45. arXiv:2504.01391  [pdf]

    physics.acc-ph physics.optics

    Enabling Continuous THz Band Coverage via Precise Electron Beam Tailoring in Free-electron Lasers

    Authors: Yin Kang, Tong Li, Zhen Wang, Yue Wang, Cheng Yu, Weiyi Yin, Zhangfeng Gao, Hanghua Xu, Hang Luo, Xiaofan Wang, Jian Chen, Taihe Lan, Xiaoqing Liu, Jinguo Wang, Huan Zhao, Fei Gao, Liping Sun, YanYan Zhu, Yongmei Wen, Qili Tian, Chenye Xu, Xingtao Wang, Jiaqiang Xu, Zheng Qi, Tao Liu, et al. (6 additional authors not shown)

    Abstract: High-power, continuously tunable narrowband terahertz (THz) sources are essential for advancing nonlinear optics, THz-driven material dynamics, and ultrafast spectroscopy. Conventional techniques typically impose a trade-off between pulse energy and frequency tunability. Here, we introduce a novel free-electron laser approach that overcomes these limitations by pre-modulating a relativistic electr…

    Submitted 2 April, 2025; originally announced April 2025.

  46. arXiv:2504.00562  [pdf, other]

    cs.MM

    Diffusion Model-Based Size Variable Virtual Try-On Technology and Evaluation Method

    Authors: Shufang Zhang, Hang Qian, Minxue Ni, Yaxuan Li, Wenxin Ding, Jun Liu

    Abstract: With the rapid development of e-commerce, virtual try-on technology has become an essential tool to satisfy consumers' personalized clothing preferences. Diffusion-based virtual try-on systems aim to naturally align garments with target individuals, generating realistic and detailed try-on images. However, existing methods overlook the importance of garment size variations in meeting personalized…

    Submitted 1 April, 2025; originally announced April 2025.

  47. arXiv:2504.00521  [pdf, other]

    cs.SE cs.AI

    Automated detection of atomicity violations in large-scale systems

    Authors: Hang He, Yixing Luo, Chengcheng Wan, Ting Su, Haiying Sun, Geguang Pu

    Abstract: Atomicity violations in interrupt-driven programs pose a significant threat to software safety in critical systems. These violations occur when the execution sequence of operations on shared resources is disrupted by asynchronous interrupts. Detecting atomicity violations is challenging due to the vast program state space, application-level code dependencies, and complex domain-specific knowledge.…

    Submitted 1 April, 2025; originally announced April 2025.

  48. Contextual Preference Collaborative Measure Framework Based on Belief System

    Authors: Hang Yu, Wei Wei, Zheng Tan, Jing-lei Liu

    Abstract: To reduce human intervention in the preference measure process, this article proposes a preference collaborative measure framework based on an updated belief system, which is also capable of improving the accuracy and efficiency of preference measure algorithms. First, the distance of rules and the average internal distance of rulesets are proposed for specifying the relationship between the ru…

    Submitted 31 March, 2025; originally announced March 2025.

    Comments: in Chinese language

  49. arXiv:2503.23496  [pdf, other]

    cs.AR

    FlexMem: High-Parallel Near-Memory Architecture for Flexible Dataflow in Fully Homomorphic Encryption

    Authors: Shangyi Shi, Husheng Han, Jianan Mu, Xinyao Zheng, Ling Liang, Hang Lu, Zidong Du, Xiaowei Li, Xing Hu, Qi Guo

    Abstract: Fully Homomorphic Encryption (FHE) imposes substantial memory bandwidth demands, presenting significant challenges for efficient hardware acceleration. Near-memory Processing (NMP) has emerged as a promising architectural solution to alleviate the memory bottleneck. However, the irregular memory access patterns and flexible dataflows inherent to FHE limit the effectiveness of existing NMP accelera…

    Submitted 30 March, 2025; originally announced March 2025.

    Comments: 9 pages, ICCAD

  50. arXiv:2503.23367  [pdf, other]

    cs.CV

    FastVAR: Linear Visual Autoregressive Modeling via Cached Token Pruning

    Authors: Hang Guo, Yawei Li, Taolin Zhang, Jiangshan Wang, Tao Dai, Shu-Tao Xia, Luca Benini

    Abstract: Visual Autoregressive (VAR) modeling has gained popularity for its shift towards next-scale prediction. However, existing VAR paradigms process the entire token map at each scale step, leading to the complexity and runtime scaling dramatically with image resolution. To address this challenge, we propose FastVAR, a post-training acceleration method for efficient resolution scaling with VARs. Our ke…

    Submitted 6 April, 2025; v1 submitted 30 March, 2025; originally announced March 2025.

    Comments: Technical Report