
Showing 1–50 of 505 results for author: Zhu, P

  1. arXiv:2507.05248  [pdf, ps, other]

    cs.CL

    Response Attack: Exploiting Contextual Priming to Jailbreak Large Language Models

    Authors: Ziqi Miao, Lijun Li, Yuan Xiong, Zhenhua Liu, Pengyu Zhu, Jing Shao

    Abstract: Contextual priming, where earlier stimuli covertly bias later judgments, offers an unexplored attack surface for large language models (LLMs). We uncover a contextual priming vulnerability in which the previous response in the dialogue can steer its subsequent behavior toward policy-violating content. Building on this insight, we propose Response Attack, which uses an auxiliary LLM to generate a m… ▽ More

    Submitted 7 July, 2025; originally announced July 2025.

    Comments: 21 pages, 9 figures. Code and data available at https://github.com/Dtc7w3PQ/Response-Attack

  2. arXiv:2507.00690  [pdf, ps, other]

    cs.CV cs.CR

    Cage-Based Deformation for Transferable and Undefendable Point Cloud Attack

    Authors: Keke Tang, Ziyong Du, Weilong Peng, Xiaofei Wang, Peican Zhu, Ligang Liu, Zhihong Tian

    Abstract: Adversarial attacks on point clouds often impose strict geometric constraints to preserve plausibility; however, such constraints inherently limit transferability and undefendability. While deformation offers an alternative, existing unstructured approaches may introduce unnatural distortions, making adversarial point clouds conspicuous and undermining their plausibility. In this paper, we propose… ▽ More

    Submitted 1 July, 2025; originally announced July 2025.

  3. arXiv:2506.22642  [pdf, ps, other]

    astro-ph.GA

    DeepDive: A deep dive into the physics of the first massive quiescent galaxies in the Universe

    Authors: K. Ito, F. Valentino, G. Brammer, M. L. Hamadouche, K. E. Whitaker, V. Kokorev, P. Zhu, T. Kakimoto, P. -F. Wu, J. Antwi-Danso, W. M. Baker, D. Ceverino, A. L. Faisst, M. Farcy, S. Fujimoto, A. Gallazzi, S. Gillman, R. Gottumukkala, K. E. Heintz, M. Hirschmann, C. K. Jespersen, M. Kubo, M. Lee, G. Magdis, M. Onodera , et al. (4 additional authors not shown)

    Abstract: We present the DeepDive program, in which we obtained deep ($1-3$ hours) JWST/NIRSpec G235M/F170LP spectra for 10 primary massive ($\log{(M_\star/M_\odot)}=10.8-11.5$) quiescent galaxies at $z\sim3-4$. A novel reduction procedure extends the nominal wavelength coverage of G235M beyond H$α$ and [NII] at $z\sim4$, revealing weak, narrow H$α$ lines indicative of low star formation rates (… ▽ More

    Submitted 27 June, 2025; originally announced June 2025.

    Comments: 14 pages, 9 figures, and 1 table+Appendix. submitted to A&A. All photometric and spectroscopic data in this paper will be made publicly available after the acceptance of the paper

  4. arXiv:2506.21047  [pdf, ps, other]

    physics.chem-ph physics.atm-clus physics.atom-ph physics.optics

    Probing valence electron and hydrogen dynamics using charge-pair imaging with ultrafast electron diffraction

    Authors: Tianyu Wang, Hui Jiang, Ming Zhang, Xiao Zou, Pengfei Zhu, Feng He, Zheng Li, Dao Xiang

    Abstract: A key challenge in ultrafast science has been to directly track the coupled motions of electrons and nuclei in real-space and real-time. This study presents a significant step towards this goal by demonstrating the feasibility of time-resolved real-space tracking of valence electron and hydrogen dynamics during the photodissociation of ammonia (NH3) using MeV ultrafast electron diffraction. It is… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

  5. arXiv:2506.20892  [pdf, ps, other]

    physics.plasm-ph

    Nonlinear edge localized mode with impurity seeding in CFETR hybrid scenario

    Authors: Shiyong Zeng, Ping Zhu

    Abstract: A critical challenge for operating fusion burning plasma in high confinement mode lies in mitigating damage caused by edge localized modes (ELMs). While impurity seeding has been experimentally validated as a reliable and effective ELM mitigation technique, its underlying physics remains insufficiently understood and requires further clarification. Through nonlinear magnetohydrodynamic (MHD) simul… ▽ More

    Submitted 25 June, 2025; originally announced June 2025.

    Comments: 29 pages, 17 figures

  6. arXiv:2506.20887  [pdf, ps, other]

    physics.optics

    Dual Synchronization Effects in Light Scattering by Spherical Particle Systems

    Authors: Guanglang Xu, Bingqiang Sun, Ping Zhu, Huizeng Liu, Ye Zhou, Chen Zhou

    Abstract: We report the discovery of a novel and fundamental dual synchronization relationship between the scattering efficiency (Q$_{\text{sca}}$) and a specifically formulated angular distribution complexity parameter ($\widetilde{C}_{\text{p}}$) in spherical particle systems. Through extensive numerical simulations using the rigorous Multiple Sphere T-Matrix (MSTM) method, we found that Q$_{\text{sca}}$… ▽ More

    Submitted 25 June, 2025; originally announced June 2025.

  7. arXiv:2506.20443  [pdf, ps, other]

    physics.plasm-ph

    MHD simulation of tilt instability during the dynamic FRC magnetic compression process

    Authors: Yiming Ma, Ping Zhu, Bo Rao, Haolong Li

    Abstract: The nonlinear evolution of the tilt instability in a field reversed configuration (FRC) during the dynamic magnetic compression process has been investigated using magnetohydrodynamic (MHD) simulations with the NIMROD code [C. R. Sovinec et al., J. Comput. Phys. 195, 355 (2004)]. The tilt mode induces significant deformations in the linear growth phase and results in complete con…

    Submitted 25 June, 2025; originally announced June 2025.

  8. arXiv:2506.18962  [pdf, ps, other]

    cs.HC

    UniMind: Unleashing the Power of LLMs for Unified Multi-Task Brain Decoding

    Authors: Weiheng Lu, Chunfeng Song, Jiamin Wu, Pengyu Zhu, Yuchen Zhou, Weijian Mai, Qihao Zheng, Wanli Ouyang

    Abstract: Decoding human brain activity from electroencephalography (EEG) signals is a central challenge at the intersection of neuroscience and artificial intelligence, enabling diverse applications in mental state assessment, clinical monitoring, and human-machine interaction. Recent efforts have extensively explored EEG-based brain foundation models for generalized brain decoding, employing large-scale t… ▽ More

    Submitted 23 June, 2025; originally announced June 2025.

    Comments: 19 pages, 4 figures

  9. Mapping the Excitation Mechanisms in the LINER I Active Galactic Nucleus NGC 5005: Positive Feedback and a Thin LINER Cocoon

    Authors: Anna Trindade Falcão, G. Fabbiano, M. Elvis, P. Zhu, S. Kraemer, W. P. Maksym, R. Middei, D. L. Król

    Abstract: We present a spatially resolved Baldwin-Phillips-Terlevich analysis of the narrow-line region (NLR) in the low-ionization nuclear emission-line region (LINER) I galaxy NGC 5005 using Hubble Space Telescope narrowband imaging of [O III]$λ$5007, H$β$, H$α$, and [S II]$λλ$6717,6731. With a resolution of ${\lesssim}$0.1$''$ (${\lesssim}$10 pc at z = 0.003), we dissect the NLR into H II (star-forming), Sey…

    Submitted 17 June, 2025; originally announced June 2025.

    Journal ref: ApJ 986 175 (2025)

  10. arXiv:2506.12796  [pdf, ps, other]

    cs.CL

    Surprise Calibration for Better In-Context Learning

    Authors: Zhihang Tan, Jingrui Hou, Ping Wang, Qibiao Hu, Peng Zhu

    Abstract: In-context learning (ICL) has emerged as a powerful paradigm for task adaptation in large language models (LLMs), where models infer underlying task structures from a few demonstrations. However, ICL remains susceptible to biases that arise from prior knowledge and contextual demonstrations, which can degrade the performance of LLMs. Existing bias calibration methods typically apply fixed class pr… ▽ More

    Submitted 17 June, 2025; v1 submitted 15 June, 2025; originally announced June 2025.

    Comments: 16 pages, 11 figures

    MSC Class: I.2.7

  11. arXiv:2506.12708  [pdf, ps, other]

    cs.DC cs.AI cs.AR cs.LG

    Serving Large Language Models on Huawei CloudMatrix384

    Authors: Pengfei Zuo, Huimin Lin, Junbo Deng, Nan Zou, Xingkun Yang, Yingyu Diao, Weifeng Gao, Ke Xu, Zhangyu Chen, Shirui Lu, Zhao Qiu, Peiyang Li, Xianyu Chang, Zhengzhong Yu, Fangzheng Miao, Jia Zheng, Ying Li, Yuan Feng, Bei Wang, Zaijian Zong, Mosong Zhou, Wenli Zhou, Houjiang Chen, Xingyu Liao, Yipeng Li , et al. (21 additional authors not shown)

    Abstract: The rapid evolution of large language models (LLMs), driven by growing parameter scales, adoption of mixture-of-experts (MoE) architectures, and expanding context lengths, imposes unprecedented demands on AI infrastructure. Traditional AI clusters face limitations in compute intensity, memory bandwidth, inter-chip communication, and latency, compounded by variable workloads and strict service-leve… ▽ More

    Submitted 19 June, 2025; v1 submitted 14 June, 2025; originally announced June 2025.

    Comments: 59 pages, 24 figures

  12. arXiv:2506.09962  [pdf, ps, other]

    astro-ph.GA

    A Theoretical Three-Dimensional Diagram to Separate Star Formation, Active Galactic Nuclei, and Shocks in Galaxies

    Authors: Peixin Zhu, Lisa J. Kewley, Ralph S. Sutherland, Kathryn Grasha

    Abstract: The excitation sources in galaxies are frequently mixed due to AGN and stellar feedback, including star formation, active galactic nuclei (AGNs), and shock excitation. Disentangling the star formation, AGN, and shocks in galaxy integral-field spectra (IFU) at optical wavelengths is crucial to expanding the galaxy sample for AGN and stellar feedback studies, given the lack of multiwavelength observ… ▽ More

    Submitted 11 June, 2025; originally announced June 2025.

    Comments: 20 pages, 7 figures (2 interactive figures available in the online journal), accepted for publication in ApJ

  13. arXiv:2506.09113  [pdf, ps, other]

    cs.CV

    Seedance 1.0: Exploring the Boundaries of Video Generation Models

    Authors: Yu Gao, Haoyuan Guo, Tuyen Hoang, Weilin Huang, Lu Jiang, Fangyuan Kong, Huixia Li, Jiashi Li, Liang Li, Xiaojie Li, Xunsong Li, Yifu Li, Shanchuan Lin, Zhijie Lin, Jiawei Liu, Shu Liu, Xiaonan Nie, Zhiwu Qing, Yuxi Ren, Li Sun, Zhi Tian, Rui Wang, Sen Wang, Guoqiang Wei, Guohong Wu , et al. (19 additional authors not shown)

    Abstract: Notable breakthroughs in diffusion modeling have propelled rapid improvements in video generation, yet current foundational models still face critical challenges in simultaneously balancing prompt following, motion plausibility, and visual quality. In this report, we introduce Seedance 1.0, a high-performance and inference-efficient video foundation generation model that integrates several core tec…

    Submitted 28 June, 2025; v1 submitted 10 June, 2025; originally announced June 2025.

    Comments: Seedance 1.0 Technical Report

  14. arXiv:2506.06818  [pdf, ps, other]

    cs.CV

    Stepwise Decomposition and Dual-stream Focus: A Novel Approach for Training-free Camouflaged Object Segmentation

    Authors: Chao Yin, Hao Li, Kequan Yang, Jide Li, Pinpin Zhu, Xiaoqiang Li

    Abstract: While promptable segmentation (e.g., SAM) has shown promise for various segmentation tasks, it still requires manual visual prompts for each object to be segmented. In contrast, task-generic promptable segmentation aims to reduce the need for such detailed prompts by employing only a task-generic prompt to guide segmentation across all test samples. However, when applied to Camouflaged Ob…

    Submitted 6 July, 2025; v1 submitted 7 June, 2025; originally announced June 2025.

    Comments: accepted by ACM MM2025

  15. arXiv:2506.04968  [pdf, other]

    eess.SY

    En Route Path-planning for Partially Occupied Vehicles in Ride-pooling Systems

    Authors: Pengbo Zhu, Giancarlo Ferrari-Trecate, Nikolas Geroliminis

    Abstract: Ride-pooling services, such as UberPool and Lyft Shared Saver, enable a single vehicle to serve multiple customers within one shared trip. Efficient path-planning algorithms are crucial for improving the performance of such systems. For partially occupied vehicles with available capacity, we introduce a novel routing algorithm designed to maximize the likelihood of picking up additional passengers… ▽ More

    Submitted 5 June, 2025; originally announced June 2025.

    Comments: Accepted by European Control Conference (ECC 2025), 24-27 June, Thessaloniki, Greece

  16. arXiv:2506.02925  [pdf, ps, other]

    astro-ph.HE

    Normal Distribution of Crab Pulsar Glitch Activity from a Glitch Cluster Perspective

    Authors: Pei-Xin Zhu, Xiao-Ping Zheng

    Abstract: As the most extensively and continuously monitored neutron star, the Crab pulsar serves as a representative of the earliest evolutionary stage. Its unique and complex glitch phenomenology provides an unparalleled testing ground for theoretical models of neutron star interior dynamics. Within the self-organized criticality paradigm, Crab pulsar glitch sizes are modeled by a power-law distribution and…

    Submitted 3 June, 2025; originally announced June 2025.

  17. arXiv:2506.01786  [pdf, ps, other]

    astro-ph.HE astro-ph.IM

    Science Prospects for the Southern Wide-field Gamma-ray Observatory: SWGO

    Authors: SWGO Collaboration, P. Abreu, R. Alfaro, A. Alfonso, M. Andrade, E. O. Angüner, E. A. Anita-Rangel, O. Aquines-Gutiérrez, C. Arcaro, R. Arceo, J. C. Arteaga-Velázquez, P. Assis, H. A. Ayala Solares, A. Bakalova, E. M. Bandeira, P. Bangale, U. Barres de Almeida, P. Batista, I. Batković, J. Bazo, E. Belmont, J. Bennemann, S. Y. BenZvi, A. Bernal, W. Bian , et al. (295 additional authors not shown)

    Abstract: Ground-based gamma-ray astronomy is now well established as a key observational approach to address critical topics at the frontiers of astroparticle physics and high-energy astrophysics. Whilst the field of TeV astronomy was once dominated by arrays of atmospheric Cherenkov Telescopes, ground-level particle detection has now been demonstrated to be an equally viable and strongly complementary app… ▽ More

    Submitted 25 June, 2025; v1 submitted 2 June, 2025; originally announced June 2025.

    Comments: Revised version

  18. arXiv:2505.24810  [pdf, ps, other]

    hep-ex hep-ph

    New Physics Search at the CEPC: a General Perspective

    Authors: Stefan Antusch, Peter Athron, Daniele Barducci, Long Chen, Mingshui Chen, Xiang Chen, Huajie Cheng, Kingman Cheung, Joao Guimaraes da Costa, Arindam Das, Frank F. Deppisch, P. S. Bhupal Dev, Xiaokang Du, Yong Du, Yaquan Fang, Andrew Fowlie, Yu Gao, Bruce Mellado Garcia, Shao-Feng Ge, Jiayin Gu, Yu-Chen Guo, Jan Hajer, Chengcheng Han, Tao Han, Sven Heinemeyer , et al. (68 additional authors not shown)

    Abstract: The Circular Electron-Positron Collider (CEPC), a proposed next-generation Higgs factory, provides new opportunities to explore physics beyond the Standard Model (SM). With its clean electron-positron collision environment and the ability to collect large samples of Higgs, W, and Z bosons, the CEPC enables precision measurements and searches for new physics. This white paper outlines the CEPC's di… ▽ More

    Submitted 30 May, 2025; originally announced May 2025.

  19. arXiv:2505.22995  [pdf, ps, other]

    eess.AS cs.SD

    LLM-Synth4KWS: Scalable Automatic Generation and Synthesis of Confusable Data for Custom Keyword Spotting

    Authors: Pai Zhu, Quan Wang, Dhruuv Agarwal, Kurt Partridge

    Abstract: Custom keyword spotting (KWS) allows detecting user-defined spoken keywords from streaming audio. This is achieved by comparing the embeddings from voice enrollments and input audio. State-of-the-art custom KWS models are typically trained contrastively using utterances whose keywords are randomly sampled from training dataset. These KWS models often struggle with confusing keywords, such as "blue… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

  20. arXiv:2505.20511  [pdf, other]

    cs.CL

    Multimodal Emotion Recognition in Conversations: A Survey of Methods, Trends, Challenges and Prospects

    Authors: Chengyan Wu, Yiqiang Cai, Yang Liu, Pengxu Zhu, Yun Xue, Ziwei Gong, Julia Hirschberg, Bolei Ma

    Abstract: While text-based emotion recognition methods have achieved notable success, real-world dialogue systems often demand a more nuanced emotional understanding than any single modality can offer. Multimodal Emotion Recognition in Conversations (MERC) has thus emerged as a crucial direction for enhancing the naturalness and emotional understanding of human-computer interaction. Its goal is to accuratel… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

  21. arXiv:2505.16364  [pdf]

    physics.optics physics.ins-det

    Single-shot 3D characterization the spatiotemporal optical vortex via a spatiotemporal wavefront sensor (STWFS)

    Authors: Xiuyu Yao, Ping Zhu, Youjian Yi, Zezhao Gong, Dongjun Zhang, Ailin Guo, Fucai Ding, Xiao Liang, Xuejie Zhang, Meizhi Sun, Qiang Zhang, Miaoyan Tong, Lijie Cui, Hailun Zen, Xinglong Xie, Jianqiang Zhu

    Abstract: The advent of spatiotemporal wave packets (STWPs), represented by spatiotemporal optical vortices (STOVs), has paved the way for exploration in optics and photonics. To date, despite considerable efforts, a comprehensive and efficient practical means of characterizing wave packets with such complex structures is still lacking. In this study, we introduced a new method designed to achieve high-…

    Submitted 22 May, 2025; originally announced May 2025.

  22. arXiv:2505.14814  [pdf, ps, other]

    cs.SD cs.CL eess.AS

    GraphemeAug: A Systematic Approach to Synthesized Hard Negative Keyword Spotting Examples

    Authors: Harry Zhang, Kurt Partridge, Pai Zhu, Neng Chen, Hyun Jin Park, Dhruuv Agarwal, Quan Wang

    Abstract: Spoken Keyword Spotting (KWS) is the task of distinguishing between the presence and absence of a keyword in audio. The accuracy of a KWS model hinges on its ability to correctly classify examples close to the keyword and non-keyword boundary. These boundary examples are often scarce in training data, limiting model performance. In this paper, we propose a method to systematically generate adversa… ▽ More

    Submitted 24 May, 2025; v1 submitted 20 May, 2025; originally announced May 2025.

    Comments: Accepted at Interspeech 2025

  23. arXiv:2505.14085  [pdf, ps, other]

    cs.NI

    CE-LSLM: Efficient Large-Small Language Model Inference and Communication via Cloud-Edge Collaboration

    Authors: Pengyan Zhu, Tingting Yang

    Abstract: Emerging intelligent service scenarios in 6G communication impose stringent requirements for low latency, high reliability, and privacy preservation. Generative large language models (LLMs) are gradually becoming key enablers for the integration of semantic communication and computation. However, due to the limited computational resources of edge devices and the increasing complexity of heterogene… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

    Comments: 14 pages, 7 figures including subplots

  24. arXiv:2505.13990  [pdf, ps, other]

    cs.CL

    DecIF: Improving Instruction-Following through Meta-Decomposition

    Authors: Tingfeng Hui, Pengyu Zhu, Bowen Ping, Ling Tang, Guanting Dong, Yaqi Zhang, Sen Su

    Abstract: Instruction-following has emerged as a crucial capability for large language models (LLMs). However, existing approaches often rely on pre-existing documents or external resources to synthesize instruction-following data, which limits their flexibility and generalizability. In this paper, we introduce DecIF, a fully autonomous, meta-decomposition guided framework that generates diverse and high-qu… ▽ More

    Submitted 10 June, 2025; v1 submitted 20 May, 2025; originally announced May 2025.

    Comments: We release the source code and SFT data in this version

  25. arXiv:2505.12910  [pdf, ps, other]

    cs.SI cs.AI

    SourceDetMamba: A Graph-aware State Space Model for Source Detection in Sequential Hypergraphs

    Authors: Le Cheng, Peican Zhu, Yangming Guo, Chao Gao, Zhen Wang, Keke Tang

    Abstract: Source detection on graphs has demonstrated high efficacy in identifying rumor origins. Despite advances in machine learning-based methods, many fail to capture intrinsic dynamics of rumor propagation. In this work, we present SourceDetMamba: A Graph-aware State Space Model for Source Detection in Sequential Hypergraphs, which harnesses the recent success of the state space model Mamba, known for… ▽ More

    Submitted 4 June, 2025; v1 submitted 19 May, 2025; originally announced May 2025.

    Comments: Accepted by IJCAI25

  26. arXiv:2505.12894  [pdf, ps, other]

    cs.SI cs.AI

    HyperDet: Source Detection in Hypergraphs via Interactive Relationship Construction and Feature-rich Attention Fusion

    Authors: Le Cheng, Peican Zhu, Yangming Guo, Keke Tang, Chao Gao, Zhen Wang

    Abstract: Hypergraphs offer superior modeling capabilities for social networks, particularly in capturing group phenomena that extend beyond pairwise interactions in rumor propagation. Existing approaches in rumor source detection predominantly focus on dyadic interactions, which inadequately address the complexity of more intricate relational structures. In this study, we present a novel approach for Sourc… ▽ More

    Submitted 4 June, 2025; v1 submitted 19 May, 2025; originally announced May 2025.

    Comments: Accepted by IJCAI25

  27. arXiv:2505.12308  [pdf]

    stat.ME math.ST

    A Hybrid Prior Bayesian Method for Combining Domestic Real-World Data and Overseas Data in Global Drug Development

    Authors: Keer Chen, Zengyue Zheng, Pengfei Zhu, Shuping Jiang, Nan Li, Jumin Deng, Pingyan Chen, Zhenyu Wu, Ying Wu

    Abstract: Background Hybrid clinical trial design integrates randomized controlled trials (RCTs) with real-world data (RWD) to enhance efficiency through dynamic incorporation of external data. Existing methods like the Meta-Analytic Predictive Prior (MAP) inadequately control data heterogeneity, adjust baseline discrepancies, or optimize dynamic borrowing proportions, introducing bias and limiting applicat… ▽ More

    Submitted 18 May, 2025; originally announced May 2025.

    Comments: 10 figures

  28. arXiv:2505.09323  [pdf, ps, other]

    eess.IV cs.CV

    Q-space Guided Collaborative Attention Translation Network for Flexible Diffusion-Weighted Images Synthesis

    Authors: Pengli Zhu, Yingji Fu, Nanguang Chen, Anqi Qiu

    Abstract: In this study, we propose a novel Q-space Guided Collaborative Attention Translation Network (Q-CATN) for multi-shell, high-angular resolution DWI (MS-HARDI) synthesis from flexible q-space sampling, leveraging the commonly acquired structural MRI data. Q-CATN employs a collaborative attention mechanism to effectively extract complementary information from multiple modalities and dynamically adjust…

    Submitted 14 May, 2025; originally announced May 2025.

    Comments: MICCAI 2025

  29. arXiv:2505.08822  [pdf, other]

    cs.CY cs.LG physics.soc-ph

    The Geography of Transportation Cybersecurity: Visitor Flows, Industry Clusters, and Spatial Dynamics

    Authors: Yuhao Wang, Kailai Wang, Songhua Hu, Yunpeng Zhang, Gino Lim, Pengyu Zhu

    Abstract: The rapid evolution of the transportation cybersecurity ecosystem, encompassing cybersecurity, automotive, and transportation and logistics sectors, will lead to the formation of distinct spatial clusters and visitor flow patterns across the US. This study examines the spatiotemporal dynamics of visitor flows, analyzing how socioeconomic factors shape industry clustering and workforce distribution… ▽ More

    Submitted 12 May, 2025; originally announced May 2025.

  30. arXiv:2505.06975  [pdf, ps, other]

    cs.CV

    High-Frequency Prior-Driven Adaptive Masking for Accelerating Image Super-Resolution

    Authors: Wei Shang, Dongwei Ren, Wanying Zhang, Pengfei Zhu, Qinghua Hu, Wangmeng Zuo

    Abstract: The primary challenge in accelerating image super-resolution lies in reducing computation while maintaining performance and adaptability. Motivated by the observation that high-frequency regions (e.g., edges and textures) are most critical for reconstruction, we propose a training-free adaptive masking module for acceleration that dynamically focuses computation on these challenging areas. Specifi… ▽ More

    Submitted 11 May, 2025; originally announced May 2025.

    Comments: 10 pages, 6 figures, 5 tables

    ACM Class: I.4.3

  31. arXiv:2505.06920  [pdf, ps, other]

    cs.CV

    Bi-directional Self-Registration for Misaligned Infrared-Visible Image Fusion

    Authors: Timing Li, Bing Cao, Pengfei Zhu, Bin Xiao, Qinghua Hu

    Abstract: Acquiring accurately aligned multi-modal image pairs is fundamental for achieving high-quality multi-modal image fusion. To address the lack of ground truth in current multi-modal image registration and fusion methods, we propose a novel self-supervised Bi-directional Self-Registration framework (B-SR). Specifically, B-SR utilizes a proxy data generator (PDG) an…

    Submitted 11 May, 2025; originally announced May 2025.

  32. arXiv:2505.05840  [pdf, ps, other]

    cs.RO eess.SY

    Versatile Distributed Maneuvering with Generalized Formations using Guiding Vector Fields

    Authors: Yang Lu, Sha Luo, Pengming Zhu, Weijia Yao, Hector Garcia de Marina, Xinglong Zhang, Xin Xu

    Abstract: This paper presents a unified approach to realize versatile distributed maneuvering with generalized formations. Specifically, we decompose the robots' maneuvers into two independent components, i.e., interception and enclosing, which are parameterized by two independent virtual coordinates. Treating these two virtual coordinates as dimensions of an abstract manifold, we derive the corresponding s… ▽ More

    Submitted 9 May, 2025; originally announced May 2025.

  33. arXiv:2505.05456  [pdf, other]

    cs.CV

    SITE: towards Spatial Intelligence Thorough Evaluation

    Authors: Wenqi Wang, Reuben Tan, Pengyue Zhu, Jianwei Yang, Zhengyuan Yang, Lijuan Wang, Andrey Kolobov, Jianfeng Gao, Boqing Gong

    Abstract: Spatial intelligence (SI) represents a cognitive ability encompassing the visualization, manipulation, and reasoning about spatial relationships, underpinning disciplines from neuroscience to robotics. We introduce SITE, a benchmark dataset towards SI Thorough Evaluation in a standardized format of multi-choice visual question-answering, designed to assess large vision-language models' spatial int… ▽ More

    Submitted 8 May, 2025; originally announced May 2025.

  34. arXiv:2505.02710  [pdf, other]

    physics.plasm-ph

    Increasing the density limit with ECRH-assisted Ohmic start-up on EAST

    Authors: Jiaxing Liu, Ping Zhu, Dominique Franck Escande, Wenbin Liu, Shiwei Xue, Xin Lin, Panjun Tang, Liang Wang, Ning Yan, Jinju Yang, Yanmin Duan, Kai Jia, Zhenwei Wu, Yunxin Cheng, Ling Zhang, Jinping Qian, Rui Ding, Ruijie Zhou, the EAST team

    Abstract: High plasma density operation is crucial for a tokamak to achieve energy breakeven and a burning plasma. However, there is often an empirical upper limit of electron density in tokamak operation, namely the Greenwald density limit $n_G$, above which tokamaks generally disrupt. Achieving high-density operations above the density limit has been a long-standing challenge in magnetic confinement fusio… ▽ More

    Submitted 5 May, 2025; originally announced May 2025.

    Comments: 20 pages, 5 figures

  35. arXiv:2504.17462  [pdf, other]

    nucl-ex hep-ph

    Measuring short-range correlations and quasi-elastic cross sections in A(e,e') at x>1 and modest Q$^2$

    Authors: Y. P. Zhang, Z. H. Ye, D. Nguyen, P. Aguilera, Z. Ahmed, H. Albataineh, K. Allada, B. Anderson, D. Anez, K. Aniol, J. Annand, J. Arrington, T. Averett, H. Baghdasaryan, X. Bai, A. Beck, S. Beck, V. Bellini, F. Benmokhtar, A. Camsonne, C. Chen, J. -P. Chen, K. Chirapatpimol, E. Cisbani, S. Covrig Dusa , et al. (74 additional authors not shown)

    Abstract: We present results from the Jefferson Lab E08-014 experiment, investigating short-range correlations (SRC) through measurements of absolute inclusive quasi-elastic cross sections and their ratios. This study utilized 3.356 GeV electrons scattered off targets including $^2$H, $^3$He, $^4$He, $^{12}$C, $^{40}$Ca, and $^{48}$Ca, at modest momentum transfers ($1.3 < Q^2 \leq 2$ GeV$^2$). Kinematics we… ▽ More

    Submitted 24 April, 2025; originally announced April 2025.

  36. arXiv:2504.13853  [pdf, other]

    q-bio.BM cs.AI

    GenShin:geometry-enhanced structural graph embodies binding pose can better predicting compound-protein interaction affinity

    Authors: Pingfei Zhu, Chenyang Zhao, Haishi Zhao, Bo Yang

    Abstract: AI-powered drug discovery typically relies on the successful prediction of compound-protein interactions, which are pivotal for the evaluation of designed compound molecules in structure-based drug design and represent a core challenge in the field. However, accurately predicting compound-protein affinity via regression models usually requires adequate binding poses, which are derived from costly…

    Submitted 16 March, 2025; originally announced April 2025.

    Comments: 11 pages, 3 figures

  37. arXiv:2504.12080  [pdf, other]

    cs.CV

    DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency

    Authors: Mengshi Qi, Pengfei Zhu, Xiangtai Li, Xiaoyang Bi, Lu Qi, Huadong Ma, Ming-Hsuan Yang

    Abstract: Given a single labeled example, in-context segmentation aims to segment corresponding objects. This setting, known as one-shot segmentation in few-shot learning, explores the segmentation model's generalization ability and has been applied to various vision tasks, including scene understanding and image/video editing. While recent Segment Anything Models have achieved state-of-the-art results in i… ▽ More

    Submitted 17 April, 2025; v1 submitted 16 April, 2025; originally announced April 2025.

    Comments: V1 has been withdrawn due to a template issue; because of arXiv policy, we can't delete it. Please refer to the newest version, v2

  38. arXiv:2504.11729  [pdf, other]

    eess.SP

    EdgePrompt: A Distributed Key-Value Inference Framework for LLMs in 6G Networks

    Authors: Jiahong Ning, Pengyan Zhu, Ce Zheng, Gary Lee, Sumei Sun, Tingting Yang

    Abstract: As sixth-generation (6G) networks advance, large language models (LLMs) are increasingly integrated into 6G infrastructure to enhance network management and intelligence. However, traditional LLM architectures struggle to meet the stringent latency and security requirements of 6G, especially as the increase in sequence length leads to greater task complexity. This paper proposes Edge-Prompt, a c…

    Submitted 15 April, 2025; originally announced April 2025.

  39. arXiv:2504.09820  [pdf, other]

    eess.SP cs.IT

    Finite-Precision Conjugate Gradient Method for Massive MIMO Detection

    Authors: Yiming Fang, Li Chen, Changsheng You, Dingzhu Wen, Pengcheng Zhu

    Abstract: The implementation of the conjugate gradient (CG) method for massive MIMO detection is computationally challenging, especially for a large number of users and correlated channels. In this paper, we propose a low computational complexity CG detection from a finite-precision perspective. First, we develop a finite-precision CG (FP-CG) detection to mitigate the computational bottleneck of each CG ite… ▽ More

    Submitted 13 April, 2025; originally announced April 2025.

    Comments: 13 pages, 7 figures

  40. arXiv:2504.08685  [pdf, other]

    cs.CV cs.AI

    Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

    Authors: Team Seawead, Ceyuan Yang, Zhijie Lin, Yang Zhao, Shanchuan Lin, Zhibei Ma, Haoyuan Guo, Hao Chen, Lu Qi, Sen Wang, Feng Cheng, Feilong Zuo, Xuejiao Zeng, Ziyan Yang, Fangyuan Kong, Meng Wei, Zhiwu Qing, Fei Xiao, Tuyen Hoang, Siyu Zhang, Peihao Zhu, Qi Zhao, Jiangqiao Yan, Liangke Gui, Sheng Bi , et al. (30 additional authors not shown)

    Abstract: This technical report presents a cost-efficient strategy for training a video generation foundation model. We present a mid-sized research model with approximately 7 billion parameters (7B) called Seaweed-7B trained from scratch using 665,000 H100 GPU hours. Despite being trained with moderate computational resources, Seaweed-7B demonstrates highly competitive performance compared to contemporary…

    Submitted 4 May, 2025; v1 submitted 11 April, 2025; originally announced April 2025.

    Comments: Technical report (some typos fixed)

  41. arXiv:2504.04224  [pdf, other]

    cs.SE eess.SY

    Exploration of Approaches for Robustness and Safety in a Low Code Open Environment for Factory Automation

    Authors: Gustavo Quiros A., Yi Peng Zhu, Tao Cui, Shaokai Lin, Marten Lohstroh, Edward A. Lee

    Abstract: This report is a compilation of technical knowledge and concepts that were produced by the authors and additional contributors in the context of the collaboration projects "Abstraction Requirements for Language of Choice in Industrial Automation" (FY21-22) and "Approaches for Robust and Safe Low-Code" (FY23-24) from Siemens Technology and the University of California, Berkeley. The primary objecti…

    Submitted 5 April, 2025; originally announced April 2025.

    Comments: 15 pages, 4 figures, technical report

  42. arXiv:2504.03342  [pdf, other]

    cs.CV cs.AI

    EOOD: Entropy-based Out-of-distribution Detection

    Authors: Guide Yang, Chao Hou, Weilong Peng, Xiang Fang, Yongwei Nie, Peican Zhu, Keke Tang

    Abstract: Deep neural networks (DNNs) often exhibit overconfidence when encountering out-of-distribution (OOD) samples, posing significant challenges for deployment. Since DNNs are trained on in-distribution (ID) datasets, the information flow of ID samples through DNNs inevitably differs from that of OOD samples. In this paper, we propose an Entropy-based Out-Of-distribution Detection (EOOD) framework. EOO…

    Submitted 4 April, 2025; originally announced April 2025.

    Comments: IJCNN 2025

  43. arXiv:2504.01010  [pdf, other]

    cs.CV eess.IV

    A YOLO-Based Semi-Automated Labeling Approach to Improve Fault Detection Efficiency in Railroad Videos

    Authors: Dylan Lester, James Gao, Samuel Sutphin, Pingping Zhu, Husnu Narman, Ammar Alzarrad

    Abstract: Manual labeling for large-scale image and video datasets is often time-intensive, error-prone, and costly, posing a significant barrier to efficient machine learning workflows in fault detection from railroad videos. This study introduces a semi-automated labeling method that utilizes a pre-trained You Only Look Once (YOLO) model to streamline the labeling process and enhance fault detection accur…

    Submitted 1 April, 2025; originally announced April 2025.

    Comments: Published at the American Society for Engineering Education (ASEE) North Central Section Conference, 2025

  44. arXiv:2503.18407  [pdf, other]

    cs.CV

    VTD-CLIP: Video-to-Text Discretization via Prompting CLIP

    Authors: Wencheng Zhu, Yuexin Wang, Hongxuan Li, Pengfei Zhu, Qinghua Hu

    Abstract: Vision-language models bridge visual and linguistic understanding and have proven to be powerful for video recognition tasks. Existing approaches primarily rely on parameter-efficient fine-tuning of image-text pre-trained models, yet they often suffer from limited interpretability and poor generalization due to inadequate temporal modeling. To address these, we propose a simple yet effective video…

    Submitted 24 March, 2025; v1 submitted 24 March, 2025; originally announced March 2025.

  45. arXiv:2503.17717  [pdf, other]

    cs.CV

    BackMix: Regularizing Open Set Recognition by Removing Underlying Fore-Background Priors

    Authors: Yu Wang, Junxian Mu, Hongzhi Huang, Qilong Wang, Pengfei Zhu, Qinghua Hu

    Abstract: Open set recognition (OSR) requires models to classify known samples while detecting unknown samples for real-world applications. Existing studies show impressive progress using unknown samples from auxiliary datasets to regularize OSR models, but they have proved to be sensitive to selecting such known outliers. In this paper, we discuss the aforementioned problem from a new perspective: Can we r…

    Submitted 22 March, 2025; originally announced March 2025.

    Comments: 20 pages, 11 figures. Accepted by TPAMI

  46. arXiv:2503.10109  [pdf, other]

    cs.CV

    Dream-IF: Dynamic Relative EnhAnceMent for Image Fusion

    Authors: Xingxin Xu, Bing Cao, Yinan Xia, Pengfei Zhu, Qinghua Hu

    Abstract: Image fusion aims to integrate comprehensive information from images acquired through multiple sources. However, images captured by diverse sensors often encounter various degradations that can negatively affect fusion quality. Traditional fusion methods generally treat image enhancement and fusion as separate processes, overlooking the inherent correlation between them; notably, the dominant regi…

    Submitted 13 March, 2025; originally announced March 2025.

  47. arXiv:2503.07952  [pdf, other]

    cs.CV cs.RO

    NeRF-VIO: Map-Based Visual-Inertial Odometry with Initialization Leveraging Neural Radiance Fields

    Authors: Yanyu Zhang, Dongming Wang, Jie Xu, Mengyuan Liu, Pengxiang Zhu, Wei Ren

    Abstract: A prior map serves as a foundational reference for localization in context-aware applications such as augmented reality (AR). Providing valuable contextual information about the environment, the prior map is a vital tool for mitigating drift. In this paper, we propose a map-based visual-inertial localization algorithm (NeRF-VIO) with initialization using neural radiance fields (NeRF). Our algorith…

    Submitted 10 March, 2025; originally announced March 2025.

  48. arXiv:2503.03227  [pdf, other]

    quant-ph

    SSR: A Swapping-Sweeping-and-Rewriting Optimizer for Quantum Circuit Transformation

    Authors: Yunqi Huang, Xiangzhen Zhou, Fanxu Meng, Pengcheng Zhu, Yu Luo, Zhenlong Du

    Abstract: Quantum circuit transformation (QCT), necessary for adapting any quantum circuit to the qubit connectivity constraints of the NISQ device, often introduces numerous additional SWAP gates into the original circuit, increasing the circuit depth and thus reducing the success rate of computation. To minimize the depth of QCT circuits, we propose a Swapping-Sweeping-and-Rewriting optimizer. This optimi…

    Submitted 27 April, 2025; v1 submitted 5 March, 2025; originally announced March 2025.

    Comments: We optimized the program, so the experimental data have changed

  49. arXiv:2503.01990  [pdf, ps, other]

    astro-ph.GA astro-ph.CO

    Gas outflows in two recently quenched galaxies at z = 4 and 7

    Authors: F. Valentino, K. E. Heintz, G. Brammer, K. Ito, V. Kokorev, K. E. Whitaker, A. Gallazzi, A. de Graaff, A. Weibel, B. L. Frye, P. S. Kamieneski, S. Jin, D. Ceverino, A. Faisst, M. Farcy, S. Fujimoto, S. Gillman, R. Gottumukkala, M. Hamadouche, K. C. Harrington, M. Hirschmann, C. K. Jespersen, T. Kakimoto, M. Kubo, C. d. P. Lagos , et al. (11 additional authors not shown)

    Abstract: Outflows are a key element in the baryon cycle of galaxies, and their properties provide a fundamental test for our models of how star formation quenches in galaxies. Here we report the detection of outflowing gas in two recently quenched, massive ($M_\star\sim10^{10.2}M_\odot$) galaxies at z=4.106 (NS_274) and z=7.276 (RUBIES-UDS-QG-z7) observed with JWST/NIRSpec. The outflows are traced by blue-…

    Submitted 3 July, 2025; v1 submitted 3 March, 2025; originally announced March 2025.

    Comments: 12 pages, 7 figures + Appendix. Accepted in A&A on May 19, 2025. Data available at the links in the paper

  50. A merging pair of massive quiescent galaxies at $z=3.44$ in the Cosmic Vine

    Authors: K. Ito, F. Valentino, M. Farcy, G. De Lucia, C. D. P. Lagos, M. Hirschmann, G. Brammer, A. de Graaff, D. Blánquez-Sesé, D. Ceverino, A. L. Faisst, F. Fontanot, S. Gillman, M. L. Hamadouche, K. E. Heintz, S. Jin, C. K. Jespersen, M. Kubo, M. Lee, G. Magdis, A. W. S. Man, M. Onodera, F. Rizzo, R. Shimakawa, M. Tanaka , et al. (4 additional authors not shown)

    Abstract: We report the spectroscopic confirmation of a merging pair of massive quiescent galaxies at $z=3.44$. Using JWST observations, we confirm that the two galaxies lie at a projected separation of 4.5 kpc with a velocity offset of $\sim 680\, {\rm km\, s^{-1}}\ (\delta_z \sim 0.01)$. The pair resides in the core of a known rich overdensity of galaxies, dubbed the "Cosmic Vine". For both pair members, model…

    Submitted 3 March, 2025; originally announced March 2025.

    Comments: 8 pages, 5 figures, 1 table + Appendix. Accepted for publication in A&A on Feb 28, 2025. Spectra and photometry used in this paper are available at https://doi.org/10.5281/zenodo.14883519. See Valentino et al. (2025) on arXiv today for another result from the JWST "DeepDive" program

    Journal ref: A&A 697, A111 (2025)