
Showing 101–150 of 3,624 results for author: Hang

  1. arXiv:2505.10963  [pdf, ps, other]

    quant-ph physics.chem-ph

    Beyond real: Alternative unitary cluster Jastrow models for molecular electronic structure calculations on near-term quantum computers

    Authors: Nikolay V. Tkachenko, Hang Ren, Wendy M. Billings, Rebecca Tomann, K. Birgitta Whaley, Martin Head-Gordon

    Abstract: Near-term quantum devices require wavefunction ansätze that are expressive while also of shallow circuit depth in order to both accurately and efficiently simulate molecular electronic structure. While unitary coupled cluster (e.g., UCCSD) has become a standard, the high gate count associated with its implementation limits its feasibility on noisy intermediate-scale quantum (NISQ) hardware…

    Submitted 16 May, 2025; originally announced May 2025.

  2. arXiv:2505.10442  [pdf, ps, other]

    cs.RO cs.AI

    IN-RIL: Interleaved Reinforcement and Imitation Learning for Policy Fine-Tuning

    Authors: Dechen Gao, Hang Wang, Hanchu Zhou, Nejib Ammar, Shatadal Mishra, Ahmadreza Moradipari, Iman Soltani, Junshan Zhang

    Abstract: Imitation learning (IL) and reinforcement learning (RL) each offer distinct advantages for robotics policy learning: IL provides stable learning from demonstrations, and RL promotes generalization through exploration. While existing robot learning approaches using IL-based pre-training followed by RL-based fine-tuning are promising, this two-step learning paradigm often suffers from instability an…

    Submitted 15 May, 2025; originally announced May 2025.

  3. arXiv:2505.10207  [pdf, other]

    cs.DM

    How to Color Temporal Graphs to Ensure Proper Transitions

    Authors: Allen Ibiapina, Minh Hang Nguyen, Mikaël Rabie, Cléophée Robin

    Abstract: Graph Coloring consists in assigning colors to vertices ensuring that two adjacent vertices do not have the same color. In dynamic graphs, this notion is not well defined, as we need to decide if different colors for adjacent vertices must happen all the time or not, and how to go from a coloring in one time to the next one. In this paper, we define a coloring notion for Temporal Graphs where at…

    Submitted 15 May, 2025; originally announced May 2025.

    Comments: 20 pages, 9 figures

  4. arXiv:2505.10166  [pdf, ps, other]

    cond-mat.mtrl-sci physics.optics quant-ph

    Cavity-Mediated Electron-Electron Interactions: Renormalizing Dirac States in Graphene

    Authors: Hang Liu, Francesco Troisi, Hannes Hübener, Simone Latini, Angel Rubio

    Abstract: Embedding materials in optical cavities has emerged as a strategy for tuning material properties. Accurate simulations of electrons in materials interacting with quantum photon fluctuations of a cavity are crucial for understanding and predicting cavity-induced phenomena. In this article, we develop a non-perturbative quantum electrodynamical approach based on a photon-free self-consistent Hartree…

    Submitted 15 May, 2025; originally announced May 2025.

    Comments: 20 pages, 10 figures

  5. arXiv:2505.10039  [pdf, other]

    cs.LG

    Rethinking Circuit Completeness in Language Models: AND, OR, and ADDER Gates

    Authors: Hang Chen, Jiaying Zhu, Xinyu Yang, Wenya Wang

    Abstract: Circuit discovery has gradually become one of the prominent methods for mechanistic interpretability, and research on circuit completeness has also garnered increasing attention. Methods of circuit discovery that do not guarantee completeness not only result in circuits that are not fixed across different runs but also cause key mechanisms to be omitted. The nature of incompleteness arises from th…

    Submitted 15 May, 2025; originally announced May 2025.

    Comments: 10 pages

  6. arXiv:2505.09684  [pdf, ps, other]

    quant-ph

    Demonstration of low-overhead quantum error correction codes

    Authors: Ke Wang, Zhide Lu, Chuanyu Zhang, Gongyu Liu, Jiachen Chen, Yanzhe Wang, Yaozu Wu, Shibo Xu, Xuhao Zhu, Feitong Jin, Yu Gao, Ziqi Tan, Zhengyi Cui, Ning Wang, Yiren Zou, Aosai Zhang, Tingting Li, Fanhao Shen, Jiarun Zhong, Zehang Bao, Zitian Zhu, Yihang Han, Yiyang He, Jiayuan Shen, Han Wang, et al. (17 additional authors not shown)

    Abstract: Quantum computers hold the potential to surpass classical computers in solving complex computational problems. However, the fragility of quantum information and the error-prone nature of quantum operations make building large-scale, fault-tolerant quantum computers a prominent challenge. To combat errors, pioneering experiments have demonstrated a variety of quantum error correction codes. Yet, mo…

    Submitted 14 May, 2025; originally announced May 2025.

  7. arXiv:2505.09665  [pdf, other]

    cs.SI cs.CL

    Tales of the 2025 Los Angeles Fire: Hotwash for Public Health Concerns in Reddit via LLM-Enhanced Topic Modeling

    Authors: Sulong Zhou, Qunying Huang, Shaoheng Zhou, Yun Hang, Xinyue Ye, Aodong Mei, Kathryn Phung, Yuning Ye, Uma Govindswamy, Zehan Li

    Abstract: Wildfires have become increasingly frequent, irregular, and severe in recent years. Understanding how affected populations perceive and respond during wildfire crises is critical for timely and empathetic disaster response. Social media platforms offer a crowd-sourced channel to capture evolving public discourse, providing hyperlocal information and insight into public sentiment. This study analyz…

    Submitted 15 May, 2025; v1 submitted 14 May, 2025; originally announced May 2025.

    Comments: Corrected capitalization errors in the section subtitle 3.4, 4.3, step 1 in section 3.3.2, and Supplementary Information. Fix typo with "Weighting" for step 4 in section 3.3.2

  8. arXiv:2505.09201  [pdf]

    physics.optics

    Photoswitchable exceptional points derived from bound states in the continuum

    Authors: Lei Wang, Hang Liu, Junwei Liu, Aoxuan Liu, Jialiang Huang, Qiannan Li, Hui Dai, Caihong Zhang, Jingbo Wu, Kebin Fan, Huabing Wang, Biaobing Jin, Jian Chen, Peiheng Wu

    Abstract: Bound states in the continuum (BICs) and exceptional points (EPs), as two distinct physical singularities represented by complex frequencies in non-Hermitian systems, have garnered significant attention and clear definitions in their respective fields in recent years. They share overlapping applications in areas such as high-sensitivity sensing and laser emission. However, the transition between t…

    Submitted 14 May, 2025; originally announced May 2025.

  9. arXiv:2505.08690  [pdf, ps, other]

    cs.CL

    Adaptive Schema-aware Event Extraction with Retrieval-Augmented Generation

    Authors: Sheng Liang, Hang Lv, Zhihao Wen, Yaxiong Wu, Yongyue Zhang, Hao Wang, Yong Liu

    Abstract: Event extraction (EE) is a fundamental task in natural language processing (NLP) that involves identifying and extracting event information from unstructured text. Effective EE in real-world scenarios requires two key steps: selecting appropriate schemas from hundreds of candidates and executing the extraction process. Existing research exhibits two critical gaps: (1) the rigid schema fixation in…

    Submitted 13 May, 2025; originally announced May 2025.

    Comments: 15 pages, 3 figures

    ACM Class: I.2.7

  10. arXiv:2505.08293  [pdf, ps, other]

    cs.GR cs.AI cs.CV cs.SD eess.AS

    M3G: Multi-Granular Gesture Generator for Audio-Driven Full-Body Human Motion Synthesis

    Authors: Zhizhuo Yin, Yuk Hang Tsui, Pan Hui

    Abstract: Generating full-body human gestures encompassing face, body, hands, and global movements from audio is a valuable yet challenging task in virtual avatar creation. Previous systems focused on tokenizing the human gestures frame by frame and predicting the tokens of each frame from the input audio. However, one observation is that the number of frames required for a complete expressive human gesture, d…

    Submitted 19 May, 2025; v1 submitted 13 May, 2025; originally announced May 2025.

    Comments: 9 pages, 4 figures

    ACM Class: I.3.6

  11. arXiv:2505.08291  [pdf, ps, other]

    quant-ph cond-mat.str-el physics.chem-ph physics.comp-ph

    Multireference error mitigation for quantum computation of chemistry

    Authors: Hang Zou, Erika Magnusson, Hampus Brunander, Werner Dobrautz, Martin Rahm

    Abstract: Quantum error mitigation (QEM) strategies are essential for improving the precision and reliability of quantum chemistry algorithms on noisy intermediate-scale quantum devices. Reference-state error mitigation (REM) is a cost-effective chemistry-inspired QEM method that performs exceptionally well for weakly correlated problems. However, the effectiveness of REM is often limited when applied to st…

    Submitted 13 May, 2025; originally announced May 2025.

  12. arXiv:2505.08265  [pdf, ps, other]

    cs.LG cs.AI

    LLM Enhancers for GNNs: An Analysis from the Perspective of Causal Mechanism Identification

    Authors: Hang Gao, Wenxuan Huang, Fengge Wu, Junsuo Zhao, Changwen Zheng, Huaping Liu

    Abstract: The use of large language models (LLMs) as feature enhancers to optimize node representations, which are then used as inputs for graph neural networks (GNNs), has shown significant potential in graph representation learning. However, the fundamental properties of this approach remain underexplored. To address this issue, we propose conducting a more in-depth analysis of this issue based on the int…

    Submitted 11 June, 2025; v1 submitted 13 May, 2025; originally announced May 2025.

    Comments: Accepted by ICML 2025

  13. arXiv:2505.08155  [pdf, other]

    cs.AI

    Efficient and Scalable Neural Symbolic Search for Knowledge Graph Complex Query Answering

    Authors: Weizhi Fei, Zihao Wang, Hang Yin, Shukai Zhao, Wei Zhang, Yangqiu Song

    Abstract: Complex Query Answering (CQA) aims to retrieve answer sets for complex logical formulas from incomplete knowledge graphs, which is a crucial yet challenging task in knowledge graph reasoning. While neuro-symbolic search methods that utilize neural link predictions achieve superior accuracy, they encounter significant complexity bottlenecks: (i) Data complexity typically scales quadratically with the number of…

    Submitted 20 May, 2025; v1 submitted 12 May, 2025; originally announced May 2025.

  14. arXiv:2505.07916  [pdf, ps, other]

    eess.AS cs.SD

    MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder

    Authors: Bowen Zhang, Congchao Guo, Geng Yang, Hang Yu, Haozhe Zhang, Heidi Lei, Jialong Mai, Junjie Yan, Kaiyue Yang, Mingqi Yang, Peikai Huang, Ruiyang Jin, Sitan Jiang, Weihua Cheng, Yawei Li, Yichen Xiao, Yiying Zhou, Yongmao Zhang, Yuan Lu, Yucen He

    Abstract: We introduce MiniMax-Speech, an autoregressive Transformer-based Text-to-Speech (TTS) model that generates high-quality speech. A key innovation is our learnable speaker encoder, which extracts timbre features from a reference audio without requiring its transcription. This enables MiniMax-Speech to produce highly expressive speech with timbre consistent with the reference in a zero-shot manner, w…

    Submitted 12 May, 2025; originally announced May 2025.

  15. TrumorGPT: Graph-Based Retrieval-Augmented Large Language Model for Fact-Checking

    Authors: Ching Nam Hang, Pei-Duo Yu, Chee Wei Tan

    Abstract: In the age of social media, the rapid spread of misinformation and rumors has led to the emergence of infodemics, where false information poses a significant threat to society. To combat this issue, we introduce TrumorGPT, a novel generative artificial intelligence solution designed for fact-checking in the health domain. TrumorGPT aims to distinguish "trumors", which are health-related rumors th…

    Submitted 11 May, 2025; originally announced May 2025.

  16. arXiv:2505.07680  [pdf, other]

    cs.LG cs.DC

    SpecRouter: Adaptive Routing for Multi-Level Speculative Decoding in Large Language Models

    Authors: Hang Wu, Jianian Zhu, Yinghui Li, Haojie Wang, Biao Hou, Jidong Zhai

    Abstract: Large Language Models (LLMs) present a critical trade-off between inference quality and computational cost: larger models offer superior capabilities but incur significant latency, while smaller models are faster but less powerful. Existing serving strategies often employ fixed model scales or static two-stage speculative decoding, failing to dynamically adapt to the varying complexities of user r…

    Submitted 12 May, 2025; originally announced May 2025.

    Comments: 10 pages

  17. arXiv:2505.06875  [pdf, ps, other]

    cs.RO

    Towards Human-Centric Autonomous Driving: A Fast-Slow Architecture Integrating Large Language Model Guidance with Reinforcement Learning

    Authors: Chengkai Xu, Jiaqi Liu, Yicheng Guo, Yuhang Zhang, Peng Hang, Jian Sun

    Abstract: Autonomous driving has made significant strides through data-driven techniques, achieving robust performance in standardized tasks. However, existing methods frequently overlook user-specific preferences, offering limited scope for interaction and adaptation with users. To address these challenges, we propose a "fast-slow" decision-making framework that integrates a Large Language Model (LLM) for…

    Submitted 11 May, 2025; originally announced May 2025.

  18. arXiv:2505.06512  [pdf, other]

    cs.CV

    HCMA: Hierarchical Cross-model Alignment for Grounded Text-to-Image Generation

    Authors: Hang Wang, Zhi-Qi Cheng, Chenhao Lin, Chao Shen, Lei Zhang

    Abstract: Text-to-image synthesis has progressed to the point where models can generate visually compelling images from natural language prompts. Yet, existing methods often fail to reconcile high-level semantic fidelity with explicit spatial control, particularly in scenes involving multiple objects, nuanced relations, or complex layouts. To bridge this gap, we propose a Hierarchical Cross-Modal Alignment…

    Submitted 14 May, 2025; v1 submitted 10 May, 2025; originally announced May 2025.

    Comments: 10 pages, 4 figures

  19. arXiv:2505.06455  [pdf, ps, other]

    quant-ph

    Reconstructing Real-Valued Quantum States

    Authors: Zhixin Song, Hang Ren, Melody Lee, Bryan Gard, Nicolas Renaud, Spencer H. Bryngelson

    Abstract: Quantum tomography is a crucial tool for characterizing quantum states and devices and estimating nonlinear properties of the systems. Performing full quantum state tomography (FQST) on an $N_\mathrm{q}$ qubit system requires an exponentially increasing overhead with $O(3^{N_\mathrm{q}})$ distinct Pauli measurement settings to resolve all complex phases and reconstruct the density matrix. However,…

    Submitted 9 May, 2025; originally announced May 2025.

  20. arXiv:2505.06321  [pdf, other]

    cs.LG cs.AI

    Learn to Think: Bootstrapping LLM Reasoning Capability Through Graph Representation Learning

    Authors: Hang Gao, Chenhao Zhang, Tie Wang, Junsuo Zhao, Fengge Wu, Changwen Zheng, Huaping Liu

    Abstract: Large Language Models (LLMs) have achieved remarkable success across various domains. However, they still face significant challenges, including high computational costs for training and limitations in solving complex reasoning problems. Although existing methods have extended the reasoning capabilities of LLMs through structured paradigms, these approaches often rely on task-specific prompts and…

    Submitted 16 May, 2025; v1 submitted 8 May, 2025; originally announced May 2025.

    Comments: Accepted by IJCAI 2025

  21. arXiv:2505.05822  [pdf, ps, other]

    physics.bio-ph cond-mat.stat-mech physics.flu-dyn

    Self-reorganization and Information Transfer in Massive Schools of Fish

    Authors: Haotian Hang, Chenchen Huang, Alex Barnett, Eva Kanso

    Abstract: The remarkable cohesion and coordination observed in moving animal groups and their collective responsiveness to threats are thought to be mediated by scale-free correlations, where changes in the behavior of one animal influence others in the group, regardless of the distance between them. But are these features independent of group size? Here, we investigate group cohesiveness and collective res…

    Submitted 3 June, 2025; v1 submitted 9 May, 2025; originally announced May 2025.

  22. arXiv:2505.05579   

    cs.AR

    LaZagna: An Open-Source Framework for Flexible 3D FPGA Architectural Exploration

    Authors: Ismael Youssef, Hang Yang, Cong Hao

    Abstract: While 3D IC technology has been extensively explored for ASICs, their application to FPGAs remains limited. Existing studies on 3D FPGAs are often constrained to fixed prototypes, narrow architectural templates, and simulation-only evaluations. In this work, we present LaZagna, the first open-source framework for automated, end-to-end 3D FPGA architecture generation and evaluation. LaZagna support…

    Submitted 11 June, 2025; v1 submitted 8 May, 2025; originally announced May 2025.

    Comments: Withdrawn due to an error in experimental setup that affected the results. A corrected version is in progress

  23. arXiv:2505.05061  [pdf, other]

    physics.geo-ph

    Seismic first-arrival traveltime simulation based on reciprocity-constrained PINN

    Authors: Hang Geng, Chao Song, Umair bin Waheed, Cai Liu

    Abstract: Simulating seismic first-arrival traveltime plays a crucial role in seismic tomography. First-arrival traveltime simulation relies on solving the eikonal equation. The accuracy of conventional numerical solvers is limited to a finite-difference approximation. In recent years, physics-informed neural networks (PINNs) have been applied to achieve this task. However, traditional PINNs encounter chall…

    Submitted 8 May, 2025; originally announced May 2025.

  24. arXiv:2505.04996  [pdf, other]

    cs.GR cs.CV cs.SD eess.AS

    Inter-Diffusion Generation Model of Speakers and Listeners for Effective Communication

    Authors: Jinhe Huang, Yongkang Cheng, Yuming Hang, Gaoge Han, Jinewei Li, Jing Zhang, Xingjian Gu

    Abstract: Full-body gestures play a pivotal role in natural interactions and are crucial for achieving effective communication. Nevertheless, most existing studies primarily focus on the gesture generation of speakers, overlooking the vital role of listeners in the interaction process and failing to fully explore the dynamic interaction between them. This paper innovatively proposes an Inter-Diffusion Gener…

    Submitted 8 May, 2025; originally announced May 2025.

    Comments: Accepted by ICMR 2025

  25. Crafting Physical Adversarial Examples by Combining Differentiable and Physically Based Renders

    Authors: Yuqiu Liu, Huanqian Yan, Xiaopei Zhu, Xiaolin Hu, Liang Tang, Hang Su, Chen Lv

    Abstract: Recently we have witnessed progress in hiding road vehicles against object detectors through adversarial camouflage in the digital world. The extension of this technique to the physical world is crucial for testing the robustness of autonomous driving systems. However, existing methods do not perform well when applied to the physical world. This is partly due to insufficient photorealism…

    Submitted 6 May, 2025; originally announced May 2025.

    Comments: 13 pages, 15 figures; this paper has been accepted by IEEE/CAA Journal of Automatica Sinica

  26. arXiv:2505.04519  [pdf, other]

    cs.CL

    Pangu Ultra MoE: How to Train Your Big MoE on Ascend NPUs

    Authors: Yehui Tang, Yichun Yin, Yaoyuan Wang, Hang Zhou, Yu Pan, Wei Guo, Ziyang Zhang, Miao Rang, Fangcheng Liu, Naifu Zhang, Binghan Li, Yonghan Dong, Xiaojun Meng, Yasheng Wang, Dong Li, Yin Li, Dandan Tu, Can Chen, Youliang Yan, Fisher Yu, Ruiming Tang, Yunhe Wang, Botian Huang, Bo Wang, Boxiao Liu, et al. (49 additional authors not shown)

    Abstract: Sparse large language models (LLMs) with Mixture of Experts (MoE) and close to a trillion parameters are dominating the realm of most capable language models. However, the massive model scale poses significant challenges for the underlying software and hardware systems. In this paper, we aim to uncover a recipe to harness such scale on Ascend NPUs. The key goals are better usage of the computing r…

    Submitted 7 May, 2025; originally announced May 2025.

  27. arXiv:2505.04212  [pdf, other]

    astro-ph.GA

    MAMMOTH-MOSFIRE: Environmental Effects on Galaxy Interstellar Medium at $z\sim2$

    Authors: Hang Zhou, Xin Wang, Matthew A. Malkan, Tommaso Treu, Yiming Yang, Zheng Cai, Xiaohui Fan, Mengting Ju, Dong Dong Shi, Anahita Alavi, Fuyan Bian, James Colbert, Alaina L. Henry, Sijia Li, Zihao Li, Harry I. Teplitz, Hu Zhan, Xian Zhong Zheng, Zheng Zheng

    Abstract: The MAMMOTH-MOSFIRE program is a deep Keck MOSFIRE K-band spectroscopic follow-up of emission-line galaxies identified in the MAMMOTH-Grism HST WFC3/G141 slitless spectroscopic survey, targeting the core regions of three most massive galaxy protoclusters at cosmic noon. To introduce this program, we present a comprehensive analysis of the emission-line diagnostics for a unique sample of 43 protocl…

    Submitted 7 May, 2025; originally announced May 2025.

    Comments: 20 pages, 9 figures, 6 tables, submitted to ApJ

  28. arXiv:2505.03756  [pdf, other]

    cs.AR cs.AI cs.LG cs.PF

    Improving the Serving Performance of Multi-LoRA Large Language Models via Efficient LoRA and KV Cache Management

    Authors: Hang Zhang, Jiuchen Shi, Yixiao Wang, Quan Chen, Yizhou Shan, Minyi Guo

    Abstract: Multiple Low-Rank Adapters (Multi-LoRAs) are gaining popularity for task-specific Large Language Model (LLM) applications. For multi-LoRA serving, caching hot KV caches and LoRA adapters in high bandwidth memory of accelerators can improve inference performance. However, existing Multi-LoRA inference systems fail to optimize serving performance like Time-To-First-Token (TTFT), neglecting usage dep…

    Submitted 19 April, 2025; originally announced May 2025.

  29. arXiv:2505.03739  [pdf, other]

    cs.CL cs.AI

    VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model

    Authors: Zuwei Long, Yunhang Shen, Chaoyou Fu, Heting Gao, Lijiang Li, Peixian Chen, Mengdan Zhang, Hang Shao, Jian Li, Jinlong Peng, Haoyu Cao, Ke Li, Rongrong Ji, Xing Sun

    Abstract: With the growing requirement for natural human-computer interaction, speech-based systems receive increasing attention as speech is one of the most common forms of daily communication. However, the existing speech models still experience high latency when generating the first audio token during streaming, which poses a significant bottleneck for deployment. To address this issue, we propose VITA-A…

    Submitted 6 May, 2025; originally announced May 2025.

    Comments: Training and Inference Codes: https://github.com/VITA-MLLM/VITA-Audio

  30. arXiv:2505.03469  [pdf, other]

    cs.CL

    Long-Short Chain-of-Thought Mixture Supervised Fine-Tuning Eliciting Efficient Reasoning in Large Language Models

    Authors: Bin Yu, Hang Yuan, Haotian Li, Xueyin Xu, Yuliang Wei, Bailing Wang, Weizhen Qi, Kai Chen

    Abstract: Recent advances in large language models have demonstrated that Supervised Fine-Tuning (SFT) with Chain-of-Thought (CoT) reasoning data distilled from large reasoning models (e.g., DeepSeek R1) can effectively transfer reasoning capabilities to non-reasoning models. However, models fine-tuned with this approach inherit the "overthinking" problem from teacher models, producing verbose and redundant…

    Submitted 21 May, 2025; v1 submitted 6 May, 2025; originally announced May 2025.

    Comments: 12 pages, 5 figures

  31. arXiv:2505.02928  [pdf, other]

    astro-ph.IM astro-ph.CO astro-ph.GA

    Redshift Assessment Infrastructure Layers (RAIL): Rubin-era photometric redshift stress-testing and at-scale production

    Authors: The RAIL Team, Jan Luca van den Busch, Eric Charles, Johann Cohen-Tanugi, Alice Crafford, John Franklin Crenshaw, Sylvie Dagoret, Josue De-Santiago, Juan De Vicente, Qianjun Hang, Benjamin Joachimi, Shahab Joudaki, J. Bryce Kalmbach, Shuang Liang, Olivia Lynn, Alex I. Malz, Rachel Mandelbaum, Grant Merz, Irene Moskowitz, Drew Oldag, Jaime Ruiz-Zapatero, Mubdi Rahman, Samuel J. Schmidt, Jennifer Scora, Raphael Shirley, et al. (6 additional authors not shown)

    Abstract: Virtually all extragalactic use cases of the Vera C. Rubin Observatory's Legacy Survey of Space and Time (LSST) require the use of galaxy redshift information, yet the vast majority of its sample of tens of billions of galaxies will lack high-fidelity spectroscopic measurements thereof, instead relying on photometric redshifts (photo-$z$) subject to systematic imprecision and inaccuracy best encap…

    Submitted 5 May, 2025; originally announced May 2025.

    Comments: Submitted to OJA, 21 pages, 6 figures, 5 tables. Comments welcomed!

  32. arXiv:2505.02825  [pdf, ps, other]

    cs.CV

    Towards Application-Specific Evaluation of Vision Models: Case Studies in Ecology and Biology

    Authors: Alex Hoi Hang Chan, Otto Brookes, Urs Waldmann, Hemal Naik, Iain D. Couzin, Majid Mirmehdi, Noël Adiko Houa, Emmanuelle Normand, Christophe Boesch, Lukas Boesch, Mimi Arandjelovic, Hjalmar Kühl, Tilo Burghardt, Fumihiro Kano

    Abstract: Computer vision methods have demonstrated considerable potential to streamline ecological and biological workflows, with a growing number of datasets and models becoming available to the research community. However, these resources focus predominantly on evaluation using machine learning metrics, with relatively little emphasis on how their application impacts downstream analysis. We argue that mo…

    Submitted 6 May, 2025; v1 submitted 5 May, 2025; originally announced May 2025.

    Comments: Accepted at CVPR Workshop, CV4Animals 2025

  33. arXiv:2505.02498  [pdf, ps, other]

    math.KT math.DG math.OA

    A higher index and rapidly decaying kernels

    Authors: Hao Guo, Peter Hochs, Hang Wang

    Abstract: We construct an index of first-order, self-adjoint, elliptic differential operators in the $K$-theory of a Fréchet algebra of smooth kernels with faster than exponential off-diagonal decay. We show that this index can be represented by an idempotent involving heat operators. The rapid decay of the kernels in the algebra used is helpful in proving convergence of pairings with cyclic cocycles. Repre…

    Submitted 5 May, 2025; originally announced May 2025.

    Comments: The preprint with arXiv number 2407.16275 was split into two parts; this is the first part. arXiv admin note: substantial text overlap with arXiv:2407.16275

  34. arXiv:2505.01983  [pdf, ps, other]

    stat.ME

    Association and Independence Test for Random Objects

    Authors: Hang Zhou, Hans-Georg Müller

    Abstract: We develop a unified framework for testing independence and quantifying association between random objects that are located in general metric spaces. Special cases include functional and high-dimensional data as well as networks, covariance matrices and data on Riemannian manifolds, among other metric space-valued data. A key concept is the profile association, a measure based on distance profiles…

    Submitted 10 June, 2025; v1 submitted 4 May, 2025; originally announced May 2025.

  35. arXiv:2505.01950  [pdf, other]

    cs.CV cs.AI

    Segment Any RGB-Thermal Model with Language-aided Distillation

    Authors: Dong Xing, Xianxun Zhu, Wei Zhou, Qika Lin, Hang Yang, Yuqing Wang

    Abstract: The recent Segment Anything Model (SAM) demonstrates strong instance segmentation performance across various downstream tasks. However, SAM is trained solely on RGB data, limiting its direct applicability to RGB-thermal (RGB-T) semantic segmentation. Given that RGB-T provides a robust solution for scene understanding in adverse weather and lighting conditions, such as low light and overexposure, w…

    Submitted 3 May, 2025; originally announced May 2025.

    Comments: arXiv admin note: text overlap with arXiv:2412.04220 by other authors

  36. arXiv:2505.01458  [pdf, other]

    cs.RO cs.AI

    A Survey of Robotic Navigation and Manipulation with Physics Simulators in the Era of Embodied AI

    Authors: Lik Hang Kenny Wong, Xueyang Kang, Kaixin Bai, Jianwei Zhang

    Abstract: Navigation and manipulation are core capabilities in Embodied AI, yet training agents with these capabilities in the real world faces high costs and time complexity. Therefore, sim-to-real transfer has emerged as a key approach, yet the sim-to-real gap persists. This survey examines how physics simulators address this gap by analyzing their properties overlooked in previous surveys. We also analyz…

    Submitted 1 May, 2025; originally announced May 2025.

  37. arXiv:2505.01383  [pdf, other]

    cs.RO cs.AI

    FalconWing: An Open-Source Platform for Ultra-Light Fixed-Wing Aircraft Research

    Authors: Yan Miao, Will Shen, Hang Cui, Sayan Mitra

    Abstract: We present FalconWing -- an open-source, ultra-lightweight (150 g) fixed-wing platform for autonomy research. The hardware platform integrates a small camera, a standard airframe, offboard computation, and radio communication for manual overrides. We demonstrate FalconWing's capabilities by developing and deploying a purely vision-based control policy for autonomous landing (without IMU or motion…

    Submitted 2 May, 2025; originally announced May 2025.

  38. arXiv:2505.00565  [pdf]

    cond-mat.mes-hall cond-mat.mtrl-sci

    Differentiating anomalous and topological Hall effects using first-order reversal curve measurements

    Authors: Gregory M. Stephen, Ryan T. Van Haren, Vinay Sharma, Lixuan Tai, Bingqian Dai, Hang Chi, Kang L. Wang, Aubrey T. Hanbicki, Adam L. Friedman

    Abstract: Next generation magnetic memories rely on novel magnetic phases for information storage. Novel spin textures such as skyrmions provide one possible avenue forward due to their topological protection and controllability via electric fields. However, the common signature of these spin textures, the topological Hall effect (THE), can be mimicked by other trivial effects. Competing anomalous Hall effe…

    Submitted 1 May, 2025; originally announced May 2025.

    Comments: 10 pages, 4 figures

  39. arXiv:2504.21741  [pdf, ps, other]

    math.PR math.CO

    Asymptotic diameter of preferential attachment model

    Authors: Hang Du, Shuyang Gong, Zhangsong Li, Haodong Zhu

    Abstract: We study the asymptotic diameter of the preferential attachment model $\operatorname{PA}\!_n^{(m,\delta)}$ with parameters $m \ge 2$ and $\delta > 0$. Building on the recent work \cite{VZ25}, we prove that the diameter of $G_n \sim \operatorname{PA}\!_n^{(m,\delta)}$ is $(1+o(1))\log_\nu n$ with high probability, where $\nu$ is the exponential growth rate of the local weak limit of $G_n$. Our result confirms the conje…

    Submitted 30 April, 2025; originally announced April 2025.

    Comments: 11 pages

    MSC Class: 05C80; 05C82

  40. arXiv:2504.21622  [pdf, other]

    cs.RO

    Path Planning on Multi-level Point Cloud with a Weighted Traversability Graph

    Authors: Yujie Tang, Quan Li, Hao Geng, Yangmin Xie, Hang Shi, Yusheng Yang

    Abstract: This article proposes a new path planning method for addressing multi-level terrain situations. The proposed method includes innovations in three aspects: 1) the pre-processing of point cloud maps with a multi-level skip-list structure and data-slimming algorithm for well-organized and simplified map formalization and management, 2) the direct acquisition of local traversability indexes through ve…

    Submitted 30 April, 2025; originally announced April 2025.

  41. arXiv:2504.21055  [pdf, ps, other]

    cs.LG cs.AI

    Modeling and Performance Analysis for Semantic Communications Based on Empirical Results

    Authors: Shuai Ma, Bin Shen, Chuanhui Zhang, Youlong Wu, Hang Li, Shiyin Li, Guangming Shi, Naofal Al-Dhahir

    Abstract: Due to the black-box characteristics of deep learning based semantic encoders and decoders, finding a tractable method for the performance analysis of semantic communications is a challenging problem. In this paper, we propose an Alpha-Beta-Gamma (ABG) formula to model the relationship between the end-to-end measurement and SNR, which can be applied for both image reconstruction tasks and inferenc…

    Submitted 29 April, 2025; originally announced April 2025.

  42. arXiv:2504.21017  [pdf, ps, other]

    cs.CL cs.LG

    ViQA-COVID: COVID-19 Machine Reading Comprehension Dataset for Vietnamese

    Authors: Hai-Chung Nguyen-Phung, Ngoc C. Lê, Van-Chien Nguyen, Hang Thi Nguyen, Thuy Phuong Thi Nguyen

    Abstract: After two years of appearance, COVID-19 has negatively affected people and normal life around the world. As in May 2022, there are more than 522 million cases and six million deaths worldwide (including nearly ten million cases and over forty-three thousand deaths in Vietnam). Economy and society are both severely affected. The variant of COVID-19, Omicron, has broken disease prevention measures o…

    Submitted 14 June, 2025; v1 submitted 21 April, 2025; originally announced April 2025.

    Comments: 8 pages. Technical report

  43. arXiv:2504.20681  [pdf, other]

    cs.CR

    Data Encryption Battlefield: A Deep Dive into the Dynamic Confrontations in Ransomware Attacks

    Authors: Arash Mahboubi, Hamed Aboutorab, Seyit Camtepe, Hang Thanh Bui, Khanh Luong, Keyvan Ansari, Shenlu Wang, Bazara Barry

    Abstract: In the rapidly evolving landscape of cybersecurity threats, ransomware represents a significant challenge. Attackers increasingly employ sophisticated encryption methods, such as entropy reduction through Base64 encoding, and partial or intermittent encryption to evade traditional detection methods. This study explores the dynamic battle between adversaries who continuously refine encryption strat…

    Submitted 29 April, 2025; originally announced April 2025.

    MSC Class: 68M25

  44. arXiv:2504.20468  [pdf, other]

    cs.CV

    Antidote: A Unified Framework for Mitigating LVLM Hallucinations in Counterfactual Presupposition and Object Perception

    Authors: Yuanchen Wu, Lu Zhang, Hang Yao, Junlong Du, Ke Yan, Shouhong Ding, Yunsheng Wu, Xiaoqiang Li

    Abstract: Large Vision-Language Models (LVLMs) have achieved impressive results across various cross-modal tasks. However, hallucinations, i.e., the models generating counterfactual responses, remain a challenge. Though recent studies have attempted to alleviate object perception hallucinations, they focus on the models' response generation, and overlooking the task question itself. This paper discusses the…

    Submitted 7 May, 2025; v1 submitted 29 April, 2025; originally announced April 2025.

    Comments: Accepted to CVPR 2025

  45. arXiv:2504.19860  [pdf, other]

    cs.CV

    CoherenDream: Boosting Holistic Text Coherence in 3D Generation via Multimodal Large Language Models Feedback

    Authors: Chenhan Jiang, Yihan Zeng, Hang Xu, Dit-Yan Yeung

    Abstract: Score Distillation Sampling (SDS) has achieved remarkable success in text-to-3D content generation. However, SDS-based methods struggle to maintain semantic fidelity for user prompts, particularly when involving multiple objects with intricate interactions. While existing approaches often address 3D consistency through multiview diffusion model fine-tuning on 3D datasets, this strategy inadvertent…

    Submitted 28 April, 2025; originally announced April 2025.

  46. arXiv:2504.19478  [pdf, other]

    cs.CV

    CasaGPT: Cuboid Arrangement and Scene Assembly for Interior Design

    Authors: Weitao Feng, Hang Zhou, Jing Liao, Li Cheng, Wenbo Zhou

    Abstract: We present a novel approach for indoor scene synthesis, which learns to arrange decomposed cuboid primitives to represent 3D objects within a scene. Unlike conventional methods that use bounding boxes to determine the placement and scale of 3D objects, our approach leverages cuboids as a straightforward yet highly effective alternative for modeling objects. This allows for compact scene generation…

    Submitted 28 April, 2025; originally announced April 2025.

  47. arXiv:2504.18842  [pdf]

    cs.RO

    A Microgravity Simulation Experimental Platform For Small Space Robots In Orbit

    Authors: Hang Luo, Nanlin Zhou, Haoxiang Zhang, Kai Han, Ning Zhao, Zhiyuan Yang, Jian Qi, Sikai Zhao, Jie Zhao, Yanhe Zhu

    Abstract: This study describes the development and validation of a novel microgravity experimental platform that is mainly applied to small robots such as modular self-reconfigurable robots. This platform mainly consists of an air supply system, a microporous platform and glass. By supplying air to the microporous platform to form an air film, the influence of the weight of the air foot and the ventilation…

    Submitted 26 April, 2025; originally announced April 2025.

  48. arXiv:2504.18782  [pdf, other]

    cs.CV cs.MM

    CAMeL: Cross-modality Adaptive Meta-Learning for Text-based Person Retrieval

    Authors: Hang Yu, Jiahao Wen, Zhedong Zheng

    Abstract: Text-based person retrieval aims to identify specific individuals within an image database using textual descriptions. Due to the high cost of annotation and privacy protection, researchers resort to synthesized data for the paradigm of pretraining and fine-tuning. However, these generated data often exhibit domain biases in both images and textual annotations, which largely compromise the scalabi…

    Submitted 25 April, 2025; originally announced April 2025.

  49. arXiv:2504.18391  [pdf, other]

    cs.CV cs.LG

    Fast Autoregressive Models for Continuous Latent Generation

    Authors: Tiankai Hang, Jianmin Bao, Fangyun Wei, Dong Chen

    Abstract: Autoregressive models have demonstrated remarkable success in sequential data generation, particularly in NLP, but their extension to continuous-domain image generation presents significant challenges. Recent work, the masked autoregressive model (MAR), bypasses quantization by modeling per-token distributions in continuous spaces using a diffusion head but suffers from slow inference due to the h…

    Submitted 24 April, 2025; originally announced April 2025.

  50. arXiv:2504.18005  [pdf, ps, other]

    astro-ph.CO gr-qc

    The equivalence between Einstein and Jordan frames: a study based on the inflationary magnetogenesis model

    Authors: Hang Wang, Shuang Liu, Yu Li, Yao-chuan Wang

    Abstract: The equivalence of the Jordan and Einstein frames has been a subject of considerable interest in the field. In this paper, within the context of $f(R)$ gravity, we explore the inflationary magnetogenesis model, focusing on the magnetic field energy density and its spectrum in both the Jordan and Einstein frames to elucidate the equivalence between these two reference frames. Our analysis reveals t…

    Submitted 24 April, 2025; originally announced April 2025.

    Comments: 15 pages, no figure