Skip to main content

Showing 51–100 of 505 results for author: Zhu, P

.
  1. arXiv:2503.01710  [pdf, other

    cs.SD cs.AI eess.AS

    Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens

    Authors: Xinsheng Wang, Mingqi Jiang, Ziyang Ma, Ziyu Zhang, Songxiang Liu, Linqin Li, Zheng Liang, Qixi Zheng, Rui Wang, Xiaoqin Feng, Weizhen Bian, Zhen Ye, Sitong Cheng, Ruibin Yuan, Zhixian Zhao, Xinfa Zhu, Jiahao Pan, Liumeng Xue, Pengcheng Zhu, Yunlin Chen, Zhifei Li, Xie Chen, Lei Xie, Yike Guo, Wei Xue

    Abstract: Recent advancements in large language models (LLMs) have driven significant progress in zero-shot text-to-speech (TTS) synthesis. However, existing foundation models rely on multi-stage processing or complex architectures for predicting multiple codebooks, limiting efficiency and integration flexibility. To overcome these challenges, we introduce Spark-TTS, a novel system powered by BiCodec, a sin… ▽ More

    Submitted 3 March, 2025; originally announced March 2025.

    Comments: Submitted to ACL 2025

  2. arXiv:2502.17879  [pdf

    cs.CV

    Dual Classification Head Self-training Network for Cross-scene Hyperspectral Image Classification

    Authors: Rong Liu, Junye Liang, Jiaqi Yang, Jiang He, Peng Zhu

    Abstract: Due to the difficulty of obtaining labeled data for hyperspectral images (HSIs), cross-scene classification has emerged as a widely adopted approach in the remote sensing community. It involves training a model using labeled data from a source domain (SD) and unlabeled data from a target domain (TD), followed by inferencing on the TD. However, variations in the reflectance spectrum of the same obj… ▽ More

    Submitted 25 February, 2025; originally announced February 2025.

  3. arXiv:2502.17141  [pdf, other

    astro-ph.GA

    The Starburst Acceleration of High-Velocity Clouds in the Galactic Center

    Authors: Mengfei Zhang, Miao Li, Peixin Zhu

    Abstract: High-velocity clouds (HVCs) in the Galactic center have garnered significant attention due to their mysterious formation, potentially linked to starburst events or supermassive black hole activity in the region. However, it remains challenging to explain the observed column density and velocity distribution of HVCs. The discovery of high-velocity molecular clouds (HVMCs), which are denser and more… ▽ More

    Submitted 24 February, 2025; originally announced February 2025.

    Comments: 15 pages, 6 figures, accepted by ApJ

  4. arXiv:2502.16236  [pdf, ps, other

    physics.chem-ph

    Imaging the photochemical dynamics of cyclobutanone with MeV ultrafast electron diffraction

    Authors: Tianyu Wang, Hui Jiang, Cheng Jin, Xiao Zou, Pengfei Zhu, Tao Jiang, Feng He, Dao Xiang

    Abstract: We study the photoinduced chemical dynamics of cyclobutanone upon excitation at 200 nm to the 3s Rydberg state using MeV ultrafast electron diffraction (UED). We observe both the elastic scattering signal, which contains information about the structural dynamics, and the inelastic scattering signal, which encodes information about the electronic state. Our results suggest a sub-picosecond timescal… ▽ More

    Submitted 22 February, 2025; originally announced February 2025.

    Journal ref: J. Chem. Phys. 162, 184201 (2025)

  5. arXiv:2502.14506  [pdf, other

    physics.plasm-ph

    Enhanced dynamo drive for the sawtooth relaxation process due to non-uniform resistivity distribution in a reversed field pinch

    Authors: Wentan Yan, Ping Zhu, Hong Li, Wandong Liu, Bing Luo, Haolong Li

    Abstract: In this work, we use the three-dimensional resistive MHD code NIMROD to investigate the impact of resistivity inhomogeneity on the sawtooth process of an reversed field pinch (RFP) plasma. The simulation employs a non-uniform resistivity profile similar to experiments, which monotonically increases from the core to the edge as the temperature decreases. The resistivity inhomogeneity introduces an… ▽ More

    Submitted 20 February, 2025; originally announced February 2025.

  6. arXiv:2502.14332  [pdf, other

    cs.CV cs.IR

    A Collaborative Jade Recognition System for Mobile Devices Based on Lightweight and Large Models

    Authors: Zhenyu Wang, Wenjia Li, Pengyu Zhu

    Abstract: With the widespread adoption and development of mobile devices, vision-based recognition applications have become a hot topic in research. Jade, as an important cultural heritage and artistic item, has significant applications in fields such as jewelry identification and cultural relic preservation. However, existing jade recognition systems still face challenges in mobile implementation, such as… ▽ More

    Submitted 20 February, 2025; originally announced February 2025.

  7. arXiv:2502.13546  [pdf, other

    physics.plasm-ph

    Power dependence of density limit due to plasma-wall interaction in a burning plasma

    Authors: Jiaxing Liu, Ping Zhu, Dominique Franck Escande

    Abstract: The density limit is one of the major obstacles to achieving the desired fusion performance in tokamaks. However, the underlying physics mechanism for its recently observed power dependence in experiments has not been well understood or predicted in theory. In this work, the power dependent scalings of density limit are obtained based on the plasma-wall self-organization model [D.F. Escande 2022 N… ▽ More

    Submitted 19 February, 2025; originally announced February 2025.

    Comments: 17 pages, 7 figures

  8. arXiv:2502.12575  [pdf, other

    cs.CR cs.AI

    DemonAgent: Dynamically Encrypted Multi-Backdoor Implantation Attack on LLM-based Agent

    Authors: Pengyu Zhu, Zhenhong Zhou, Yuanhe Zhang, Shilinlu Yan, Kun Wang, Sen Su

    Abstract: As LLM-based agents become increasingly prevalent, backdoors can be implanted into agents through user queries or environment feedback, raising critical concerns regarding safety vulnerabilities. However, backdoor attacks are typically detectable by safety audits that analyze the reasoning process of agents. To this end, we propose a novel backdoor implantation strategy called \textbf{Dynamically… ▽ More

    Submitted 18 February, 2025; originally announced February 2025.

  9. arXiv:2502.11370  [pdf, other

    cs.RO

    HI-GVF: Shared Control based on Human-Influenced Guiding Vector Fields for Human-multi-robot Cooperation

    Authors: Pengming Zhu, Zongtan Zhou, Weijia Yao, Wei Dai, Zhiwen Zeng, Huimin Lu

    Abstract: Human-multi-robot shared control leverages human decision-making and robotic autonomy to enhance human-robot collaboration. While widely studied, existing systems often adopt a leader-follower model, limiting robot autonomy to some extent. Besides, a human is required to directly participate in the motion control of robots through teleoperation, which significantly burdens the operator. To allevia… ▽ More

    Submitted 16 February, 2025; originally announced February 2025.

  10. arXiv:2502.05170  [pdf, other

    cond-mat.mes-hall cond-mat.dis-nn cond-mat.stat-mech

    Observation of a dynamic magneto-chiral instability in photoexcited tellurium

    Authors: Yijing Huang, Nick Abboud, Yinchuan Lv, Penghao Zhu, Azel Murzabekova, Changjun Lee, Emma A. Pappas, Dominic Petruzzi, Jason Y. Yan, Dipanjan Chauduri, Peter Abbamonte, Daniel P. Shoemaker, Rafael M. Fernandes, Jorge Noronha, Fahad Mahmood

    Abstract: In a system of charged chiral fermions driven out of equilibrium, an electric current parallel to the magnetic field can generate a dynamic instability by which electromagnetic waves become amplified. Whether a similar instability can occur in chiral solid-state systems remains an open question. Using time-domain terahertz (THz) emission spectroscopy, we detect signatures of what we dub a ``dynami… ▽ More

    Submitted 7 February, 2025; originally announced February 2025.

    Comments: Supplementary Information (SI) available as a PDF in the TeX source

  11. arXiv:2502.02690  [pdf, ps, other

    cs.CV cs.AI cs.LG

    Controllable Video Generation with Provable Disentanglement

    Authors: Yifan Shen, Peiyuan Zhu, Zijian Li, Shaoan Xie, Zeyu Tang, Namrata Deka, Zongfang Liu, Guangyi Chen, Kun Zhang

    Abstract: Controllable video generation remains a significant challenge, despite recent advances in generating high-quality and consistent videos. Most existing methods for controlling video generation treat the video as a whole, neglecting intricate fine-grained spatiotemporal relationships, which limits both control precision and efficiency. In this paper, we propose Controllable Video Generative Adversar… ▽ More

    Submitted 24 June, 2025; v1 submitted 4 February, 2025; originally announced February 2025.

  12. arXiv:2501.16767  [pdf, other

    cs.CV

    Target-driven Self-Distillation for Partial Observed Trajectories Forecasting

    Authors: Pengfei Zhu, Peng Shu, Mengshi Qi, Liang Liu, Huadong Ma

    Abstract: Accurate prediction of future trajectories of traffic agents is essential for ensuring safe autonomous driving. However, partially observed trajectories can significantly degrade the performance of even state-of-the-art models. Previous approaches often rely on knowledge distillation to transfer features from fully observed trajectories to partially observed ones. This involves firstly training a… ▽ More

    Submitted 28 January, 2025; originally announced January 2025.

  13. SQuIGG$\vec{L}$E: Observational Evidence of Low Ongoing Star Formation Rates in Gas-Rich Post-Starburst Galaxies

    Authors: Pengpei Zhu, Katherine A. Suess, Mariska Kriek, David J. Setton, Rachel Bezanson, Vincenzo Donofrio, Robert Feldmann, Andy D. Goulding, Jenny E. Greene, Desika Narayanan, Justin Spilker

    Abstract: ALMA observations have shown that candidate "post-starburst" galaxies (PSBs) at z$\sim$0.6 can retain significant molecular gas reservoirs. These results would imply that -- unlike many model predictions -- galaxies can shut down their star formation before their cold gas reservoirs are depleted. However, these studies inferred star formation rates (SFRs) either from [O II] line fluxes or from spe… ▽ More

    Submitted 27 January, 2025; originally announced January 2025.

    Comments: 12 pages, 4 figures, 1 table. Accepted by the Astrophysical Journal (ApJ)

    Journal ref: ApJ 981 60 (2025)

  14. arXiv:2501.15045  [pdf, other

    cs.CV cs.AI

    Towards Robust Unsupervised Attention Prediction in Autonomous Driving

    Authors: Mengshi Qi, Xiaoyang Bi, Pengfei Zhu, Huadong Ma

    Abstract: Robustly predicting attention regions of interest for self-driving systems is crucial for driving safety but presents significant challenges due to the labor-intensive nature of obtaining large-scale attention labels and the domain gap between self-driving scenarios and natural scenes. These challenges are further exacerbated by complex traffic environments, including camera corruption under adver… ▽ More

    Submitted 28 January, 2025; v1 submitted 24 January, 2025; originally announced January 2025.

  15. arXiv:2501.12648  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    Engineering nonlinear Hall effect in bilayer graphene/black phosphorus heterostructures

    Authors: Xing-Guo Ye, Zhen-Tao Zhang, Peng-Fei Zhu, Wen-Zheng Xu, An-Qi Wang, Zhi-Min Liao

    Abstract: Two-dimensional van der Waals materials offer a highly tunable platform for generating emergent quantum phenomena through symmetry breaking. Stacking-induced symmetry breaking at interfaces provides an effective method to modulate their electronic properties for functional devices. Here, we strategically stack bilayer graphene with black phosphorus, a low-symmetry semiconductor, to break the symme… ▽ More

    Submitted 22 January, 2025; originally announced January 2025.

    Journal ref: Phys. Rev. B 111, L041403 (2025)

  16. arXiv:2501.04085  [pdf, other

    astro-ph.GA

    The Cosmic Evolution Early Release Science Survey (CEERS)

    Authors: Steven L. Finkelstein, Micaela B. Bagley, Pablo Arrabal Haro, Mark Dickinson, Henry C. Ferguson, Jeyhan S. Kartaltepe, Dale D. Kocevski, Anton M. Koekemoer, Jennifer M. Lotz, Casey Papovich, Pablo G. Perez-Gonzalez, Nor Pirzkal, Rachel S. Somerville, Jonathan R. Trump, Guang Yang, L. Y. Aaron Yung, Adriano Fontana, Andrea Grazian, Norman A. Grogin, Lisa J. Kewley, Allison Kirkpatrick, Rebecca L. Larson, Laura Pentericci, Swara Ravindranath, Stephen M. Wilkins , et al. (74 additional authors not shown)

    Abstract: We present the Cosmic Evolution Early Release Science (CEERS) Survey, a 77.2 hour Director's Discretionary Early Release Science Program. CEERS demonstrates, tests, and validates efficient extragalactic surveys using coordinated, overlapping parallel observations with the JWST instrument suite, including NIRCam and MIRI imaging, NIRSpec low (R~100) and medium (R~1000) resolution spectroscopy, and… ▽ More

    Submitted 7 January, 2025; originally announced January 2025.

    Comments: 38 pages, 13 figures, 6 tables

  17. Glitches and glitching clusters in rotation-powered pulsars

    Authors: Pei-Xin Zhu, Xiao-Ping Zheng

    Abstract: The study of pulsar glitch phenomena serves as a valuable probe into the dynamic properties of matter under extreme high-density conditions, offering insights into the physics within neutron stars. Providing theoretical explanations for the diverse manifestations observed in different pulsars has proven to be a formidable challenge. By analyzing the distribution of glitch sizes and waiting times,… ▽ More

    Submitted 6 January, 2025; v1 submitted 3 January, 2025; originally announced January 2025.

    Journal ref: The Astrophysical Journal, 978:49 (13pp), 2025 January 01

  18. arXiv:2501.01240  [pdf, other

    cs.CV

    Asymmetric Reinforcing against Multi-modal Representation Bias

    Authors: Xiyuan Gao, Bing Cao, Pengfei Zhu, Nannan Wang, Qinghua Hu

    Abstract: The strength of multimodal learning lies in its ability to integrate information from various sources, providing rich and comprehensive insights. However, in real-world scenarios, multi-modal systems often face the challenge of dynamic modality contributions, the dominance of different modalities may change with the environments, leading to suboptimal performance in multimodal learning. Current me… ▽ More

    Submitted 2 January, 2025; originally announced January 2025.

    Comments: Accepted by AAAI 2025

  19. Revisiting CMSSM with Non-Universal Gaugino Masses under Current Constraints

    Authors: Yabo Dong, Kun Wang, Hailong Yuan, Jingya Zhu, Pengxuan Zhu

    Abstract: To address the longstanding tension between the Constrained Minimal Supersymmetric Standard Model (CMSSM) and recent experimental data, we investigate non-universal gaugino masses within an SU(5) Grand Unified Theory (GUT) framework, focusing on the $\tilde{g}$-SUGRA scenario where $\lvert M_{3} \rvert \gg \lvert M_{1} \rvert, \lvert M_{2} \rvert$. This hierarchy enables a heavier gluino, thereby… ▽ More

    Submitted 31 March, 2025; v1 submitted 27 December, 2024; originally announced December 2024.

    Comments: 27 pages, 7 figures, 2 tables. Accepted by JHEP

    Journal ref: JHEP 03 (2025) 207

  20. arXiv:2412.19743  [pdf, other

    hep-ex hep-ph

    Flavor Physics at CEPC: a General Perspective

    Authors: Xiaocong Ai, Wolfgang Altmannshofer, Peter Athron, Xiaozhi Bai, Lorenzo Calibbi, Lu Cao, Yuzhi Che, Chunhui Chen, Ji-Yuan Chen, Long Chen, Mingshui Chen, Shanzhen Chen, Xuan Chen, Shan Cheng, Cheng-Wei Chiang, Andreas Crivellin, Hanhua Cui, Olivier Deschamps, Sébastien Descotes-Genon, Xiaokang Du, Shuangshi Fang, Yu Gao, Li-Sheng Geng, Pablo Goldenzweig, Jiayin Gu , et al. (116 additional authors not shown)

    Abstract: We discuss the landscape of flavor physics at the Circular Electron-Positron Collider (CEPC), based on the nominal luminosity outlined in its Technical Design Report. The CEPC is designed to operate in multiple modes to address a variety of tasks. At the $Z$ pole, the expected production of 4 Tera $Z$ bosons will provide unique and highly precise measurements of $Z$ boson couplings, while the subs… ▽ More

    Submitted 31 December, 2024; v1 submitted 27 December, 2024; originally announced December 2024.

  21. arXiv:2412.19015  [pdf, other

    cs.CV cs.CR

    Imperceptible Adversarial Attacks on Point Clouds Guided by Point-to-Surface Field

    Authors: Keke Tang, Weiyao Ke, Weilong Peng, Xiaofei Wang, Ziyong Du, Zhize Wu, Peican Zhu, Zhihong Tian

    Abstract: Adversarial attacks on point clouds are crucial for assessing and improving the adversarial robustness of 3D deep learning models. Traditional solutions strictly limit point displacement during attacks, making it challenging to balance imperceptibility with adversarial effectiveness. In this paper, we attribute the inadequate imperceptibility of adversarial attacks on point clouds to deviations fr… ▽ More

    Submitted 25 December, 2024; originally announced December 2024.

    Comments: Accepted by ICASSP 2025

    MSC Class: 68T07

  22. arXiv:2412.18922  [pdf

    physics.plasm-ph

    Numerical solutions of resistive finite-pressure magnetohydrodynamic equilibria for non-axisymmetric toroidal plasmas

    Authors: Jian Zhang, Ping Zhu, Chris C. Hegna

    Abstract: A hybrid spectral/finite-element code is developed to numerically solve the resistive finite-pressure magnetohydrodynamic equilibria without the necessity of postulating nested magnetic flux surfaces in the non-axisymmetric toroidal systems. The adopted approach integrates a hyperbolic parallel damping equation for pressure updating, along with a dynamic resistive relaxation for magnetic field. To… ▽ More

    Submitted 25 December, 2024; originally announced December 2024.

  23. arXiv:2412.18365  [pdf, other

    cs.LG cs.AI

    Hypergraph Attacks via Injecting Homogeneous Nodes into Elite Hyperedges

    Authors: Meixia He, Peican Zhu, Keke Tang, Yangming Guo

    Abstract: Recent studies have shown that Hypergraph Neural Networks (HGNNs) are vulnerable to adversarial attacks. Existing approaches focus on hypergraph modification attacks guided by gradients, overlooking node spanning in the hypergraph and the group identity of hyperedges, thereby resulting in limited attack performance and detectable attacks. In this manuscript, we present a novel framework, i.e., Hyp… ▽ More

    Submitted 24 December, 2024; originally announced December 2024.

    Comments: 9 pages, The 39th Annual AAAI Conference on Artificial Intelligence(2025)

  24. arXiv:2412.18361  [pdf, ps, other

    math.DG math.AP

    On a generalized Monge-Ampère equation on closed almost Kähler surfaces

    Authors: Ken Wang, Zuyi Zhang, Tao Zheng, Peng Zhu

    Abstract: We show the existence and uniqueness of solutions to a generalized Monge-Ampère equation on closed almost Kähler surfaces, where the equation depends only on the underlying almost Kähler structure. As an application, we prove Donaldson's conjecture for tamed almost complex 4-manifolds.

    Submitted 2 May, 2025; v1 submitted 24 December, 2024; originally announced December 2024.

    MSC Class: 53D35; 53C56; 53C65; 32Q60

  25. arXiv:2412.12149   

    cs.LG cs.AI cs.CV

    MHSA: A Multi-scale Hypergraph Network for Mild Cognitive Impairment Detection via Synchronous and Attentive Fusion

    Authors: Manman Yuan, Weiming Jia, Xiong Luo, Jiazhen Ye, Peican Zhu, Junlin Li

    Abstract: The precise detection of mild cognitive impairment (MCI) is of significant importance in preventing the deterioration of patients in a timely manner. Although hypergraphs have enhanced performance by learning and analyzing brain networks, they often only depend on vector distances between features at a single scale to infer interactions. In this paper, we deal with a more arduous challenge, hyperg… ▽ More

    Submitted 11 January, 2025; v1 submitted 10 December, 2024; originally announced December 2024.

    Comments: The submission was made prematurely and will be resubmitted after further development

  26. arXiv:2412.10087  [pdf, other

    cs.RO

    Consensus-Based Dynamic Task Allocation for Multi-Robot System Considering Payloads Consumption

    Authors: Xuekai Qiu, Pengming Zhu, Yiming Hu, Zhiwen Zeng, Huimin Lu

    Abstract: This paper presents a consensus-based payload algorithm (CBPA) to deal with the condition of robots' capability decrease for multi-robot task allocation. During the execution of complex tasks, robots' capabilities could decrease with the consumption of payloads, which causes a problem that the robot coalition would not meet the tasks' requirements in real time. The proposed CBPA is an enhanced ver… ▽ More

    Submitted 13 December, 2024; originally announced December 2024.

  27. arXiv:2412.02938  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    Nonlinear spin and orbital Edelstein effect in WTe2

    Authors: Xing-Guo Ye, Peng-Fei Zhu, Wen-Zheng Xu, Tong-Yang Zhao, Zhi-Min Liao

    Abstract: In materials with spin-momentum locked spin textures, such as Rashba states and topological surface states, the current-induced shift of the Fermi contour in the k space leads to spin polarization, known as the Edelstein effect, which depends linearly on the applied current. However, its nonlinear counterpart has not yet been discovered. Here, we report the observation of the nonlinear Edelstein e… ▽ More

    Submitted 3 December, 2024; originally announced December 2024.

    Comments: 29 pages

    Journal ref: Physical Review B 110, L201407 (2024)

  28. arXiv:2412.02491  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    Facilitating field-free perpendicular magnetization switching with a Berry curvature dipole in a Weyl semimetal

    Authors: Dong Li, Xing-Yu Liu, Xing-Guo Ye, Zhen-Cun Pan, Wen-Zheng Xu, Peng-Fei Zhu, An-Qi Wang, Kenji Watanabe, Takashi Taniguchi, Zhi-Min Liao

    Abstract: We report the synergy between orbital and spin-orbit torques in WTe2/Fe3GeTe2 heterostructures characterized by a Berry curvature dipole. By applying a current along the a axis in WTe2, we detect an out-of-plane magnetization in the system, which we attribute to nonequilibrium orbital magnetization linked to the Berry curvature dipole based on first-principles calculations, manifesting as the orbi… ▽ More

    Submitted 3 December, 2024; originally announced December 2024.

    Comments: 29 pages

    Journal ref: Physical Review B 110, L100409 (2024)

  29. arXiv:2412.00701  [pdf

    cond-mat.supr-con cond-mat.mtrl-sci

    Superconductivity at Pd/Bi$_2$Se$_3$ Interfaces Due to Self-Formed PdBiSe Interlayers

    Authors: Kaixuan Fan, Ze Hua, Siyao Gu, Peng Zhu, Guangtong Liu, Hechen Ren, Ruiwen Shao, Zhiwei Wang, Li Lu, Fan Yang

    Abstract: Understanding the physical and chemical processes at the interface of metals and topological insulators is crucial for developing the next generation of topological quantum devices. Here we report the discovery of robust superconductivity in Pd/Bi$_2$Se$_3$ bilayers fabricated by sputtering Pd on the surface of Bi$_2$Se$_3$. Through transmission electron microscopy measurements, we identify that t… ▽ More

    Submitted 1 December, 2024; originally announced December 2024.

    Journal ref: Materials 2024, 17(22), 5460

  30. arXiv:2411.13056  [pdf, other

    cs.CV

    Efficient Masked AutoEncoder for Video Object Counting and A Large-Scale Benchmark

    Authors: Bing Cao, Quanhao Lu, Jiekang Feng, Qilong Wang, Qinghua Hu, Pengfei Zhu

    Abstract: The dynamic imbalance of the fore-background is a major challenge in video object counting, which is usually caused by the sparsity of target objects. This remains understudied in existing works and often leads to severe under-/over-prediction errors. To tackle this issue in video object counting, we propose a density-embedded Efficient Masked Autoencoder Counting (E-MAC) framework in this paper.… ▽ More

    Submitted 6 March, 2025; v1 submitted 20 November, 2024; originally announced November 2024.

    Comments: ICLR25

  31. arXiv:2411.07174  [pdf, other

    cond-mat.str-el cond-mat.stat-mech quant-ph

    Bilayer construction for mixed state phenomena with strong, weak symmetries and symmetry breakings

    Authors: Shuangyuan Lu, Penghao Zhu, Yuan-Ming Lu

    Abstract: We introduce the bilayer construction, as a specific purification scheme for a general mixed state, where each mixed state has a one-to-one correspondence with a bilayer pure state with two constraints: non-negativity of the bilayer wavefunction; and the presence of an anti-unitary layer-exchange symmetry T. Different from the Choi-Jamiołkowski isomorphism, any mixed state can be realized as the m… ▽ More

    Submitted 3 December, 2024; v1 submitted 11 November, 2024; originally announced November 2024.

    Comments: 18 pages, 1 figure, 1 table, added discussions on the definition of mixed state phases, added details for the examples discussed in section V B

  32. Dynamic Brightness Adaptation for Robust Multi-modal Image Fusion

    Authors: Yiming Sun, Bing Cao, Pengfei Zhu, Qinghua Hu

    Abstract: Infrared and visible image fusion aim to integrate modality strengths for visually enhanced, informative images. Visible imaging in real-world scenarios is susceptible to dynamic environmental brightness fluctuations, leading to texture degradation. Existing fusion methods lack robustness against such brightness perturbations, significantly compromising the visual fidelity of the fused imagery. To… ▽ More

    Submitted 7 November, 2024; originally announced November 2024.

    Comments: Accepted by IJCAI 2024

    ACM Class: I.4.9

    Journal ref: Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence,Main Track,Pages 1317-1325, 2024

  33. arXiv:2411.04103  [pdf, other

    astro-ph.GA

    Theoretical Diagnostics for Narrow Line Regions of Active Galactic Nuclei

    Authors: Peixin Zhu, Lisa J. Kewley, Ralph Sutherland

    Abstract: Gas metallicity, ionization parameter, and gas pressure can affect the observed ratios of specific strong emission lines within galaxies. While the theoretical strong lines diagnostics for gas metallicity, ionization parameters, and gas pressure in star-forming regions are well-established, theoretical diagnostics for active galactic nuclei (AGNs) narrow line regions are still lacking. In Zhu et a… ▽ More

    Submitted 6 November, 2024; originally announced November 2024.

    Comments: 33 pages, 22 figures, 7 tables, Accepted for publication in ApJ

  34. arXiv:2411.01573  [pdf, other

    cs.CV cs.LG eess.IV

    Conditional Controllable Image Fusion

    Authors: Bing Cao, Xingxin Xu, Pengfei Zhu, Qilong Wang, Qinghua Hu

    Abstract: Image fusion aims to integrate complementary information from multiple input images acquired through various sources to synthesize a new fused image. Existing methods usually employ distinct constraint designs tailored to specific scenes, forming fixed fusion paradigms. However, this data-driven fusion approach is challenging to deploy in varying scenarios, especially in rapidly changing environme… ▽ More

    Submitted 3 November, 2024; originally announced November 2024.

    Comments: Accepted by NeurIPS 2024

  35. arXiv:2410.20787  [pdf, ps, other

    physics.plasm-ph

    Impurity radiation seeding of neoclassical tearing mode growth

    Authors: Shiyong Zeng, Ping Zhu, Eric C. Howell

    Abstract: The physics of neoclassical tearing mode (NTM) is of great concern to the tokamak plasma stability and performance, especially in the burning plasma regime. Whereas a great deal about the different seeding mechanisms have been understood, and in many situations the seed event can be clearly identified, the potential seeding process of NTM due to the resistive tearing instability driven by the impu… ▽ More

    Submitted 28 October, 2024; originally announced October 2024.

    Comments: 21 pages, 12 figures

    MSC Class: 76W05 (Primary) ACM Class: J.2

  36. arXiv:2410.20679  [pdf, other

    q-fin.ST cs.LG q-fin.CP

    MCI-GRU: Stock Prediction Model Based on Multi-Head Cross-Attention and Improved GRU

    Authors: Peng Zhu, Yuante Li, Yifan Hu, Sheng Xiang, Qinyuan Liu, Dawei Cheng, Yuqi Liang

    Abstract: As financial markets grow increasingly complex in the big data era, accurate stock prediction has become more critical. Traditional time series models, such as GRUs, have been widely used but often struggle to capture the intricate nonlinear dynamics of markets, particularly in the flexible selection and effective utilization of key historical information. Recently, methods like Graph Neural Netwo… ▽ More

    Submitted 28 March, 2025; v1 submitted 25 September, 2024; originally announced October 2024.

  37. arXiv:2410.20374  [pdf, other

    cs.RO eess.SY

    A CT-guided Control Framework of a Robotic Flexible Endoscope for the Diagnosis of the Maxillary Sinusitis

    Authors: Puchen Zhu, Huayu Zhang, Xin Ma, Xiaoyin Zheng, Xuchen Wang, Kwok Wai Samuel Au

    Abstract: Flexible endoscopes are commonly adopted in narrow and confined anatomical cavities due to their higher reachability and dexterity. However, prolonged and unintuitive manipulation of these endoscopes leads to an increased workload on surgeons and risks of collision. To address these challenges, this paper proposes a CT-guided control framework for the diagnosis of maxillary sinusitis by using a ro… ▽ More

    Submitted 27 October, 2024; originally announced October 2024.

  38. arXiv:2410.16647  [pdf, other

    eess.AS cs.AI cs.LG

    GE2E-KWS: Generalized End-to-End Training and Evaluation for Zero-shot Keyword Spotting

    Authors: Pai Zhu, Jacob W. Bartel, Dhruuv Agarwal, Kurt Partridge, Hyun Jin Park, Quan Wang

    Abstract: We propose GE2E-KWS -- a generalized end-to-end training and evaluation framework for customized keyword spotting. Specifically, enrollment utterances are separated and grouped by keywords from the training batch and their embedding centroids are compared to all other test utterance embeddings to compute the loss. This simulates runtime enrollment and verification stages, and improves convergence… ▽ More

    Submitted 21 October, 2024; originally announced October 2024.

    Comments: 8 pages, 6 figures, 2 tables The paper is accepted in IEEE Spoken Language Technology (SLT) 2024

  39. arXiv:2410.13573  [pdf, other

    cs.RO

    SPF-EMPC Planner: A real-time multi-robot trajectory planner for complex environments with uncertainties

    Authors: Peng Liu, Pengming Zhu, Zhiwen Zeng, Xuekai Qiu, Yu Wang, Huimin Lu

    Abstract: In practical applications, the unpredictable movement of obstacles and the imprecise state observation of robots introduce significant uncertainties for the swarm of robots, especially in cluster environments. However, existing methods are difficult to realize safe navigation, considering uncertainties, complex environmental structures, and robot swarms. This paper introduces an extended state mod… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  40. Giant non-reciprocity and gyration through modulation-induced Hatano-Nelson coupling in integrated photonics

    Authors: Ogulcan E. Orsel, Jiho Noh, Penghao Zhu, Jieun Yim, Taylor L. Hughes, Ronny Thomale, Gaurav Bahl

    Abstract: Asymmetric energy exchange interactions, also known as Hatano-Nelson type couplings, enable the study of non-Hermitian physics and associated phenomena like the non-Hermitian skin effect and exceptional points (EP). Since these interactions are by definition non-reciprocal, there have been very few options for real-space implementations in integrated photonics. In this work, we show that real-spac… ▽ More

    Submitted 13 October, 2024; originally announced October 2024.

    Journal ref: Phys. Rev. Lett. 134, 153801 (2025)

  41. arXiv:2410.05652  [pdf, other

    eess.SP

    Performance Analysis of Local Partial MMSE Precoding Based User-Centric Cell-Free Massive MIMO Systems and Deployment Optimization

    Authors: Peng Jiang, Jiafei Fu, Pengcheng Zhu, Yan Wang, Jiangzhou Wang, Xiaohu You

    Abstract: Cell-free massive multiple-input multiple-output (MIMO) systems, leveraging tight cooperation among wireless access points, exhibit remarkable signal enhancement and interference suppression capabilities, demonstrating significant performance advantages over traditional cellular networks. This paper investigates the performance and deployment optimization of a user-centric scalable cell-free massi… ▽ More

    Submitted 7 October, 2024; originally announced October 2024.

    Comments: 14 pages, 8 figures

  42. arXiv:2410.02510  [pdf, other

    cs.RO cs.MA eess.SY

    SwarmCVT: Centroidal Voronoi Tessellation-Based Path Planning for Very-Large-Scale Robotics

    Authors: James Gao, Jacob Lee, Yuting Zhou, Yunze Hu, Chang Liu, Pingping Zhu

    Abstract: Swarm robotics, or very large-scale robotics (VLSR), has many meaningful applications for complicated tasks. However, the complexity of motion control and energy costs stack up quickly as the number of robots increases. In addressing this problem, our previous studies have formulated various methods employing macroscopic and microscopic approaches. These methods enable microscopic robots to adhere… ▽ More

    Submitted 3 October, 2024; originally announced October 2024.

    Comments: Submitted to American Control Conference (ACC) 2025

  43. arXiv:2409.15782  [pdf, other

    eess.AS cs.SD

    M-Vec: Matryoshka Speaker Embeddings with Flexible Dimensions

    Authors: Shuai Wang, Pengcheng Zhu, Haizhou Li

    Abstract: Fixed-dimensional speaker embeddings have become the dominant approach in speaker modeling, typically spanning hundreds to thousands of dimensions. These dimensions are hyperparameters that are not specifically picked, nor are they hierarchically ordered in terms of importance. In large-scale speaker representation databases, reducing the dimensionality of embeddings can significantly lower storag… ▽ More

    Submitted 24 September, 2024; originally announced September 2024.

    Comments: ICSR 2024, Shenzhen

  44. arXiv:2409.12884  [pdf, other

    cs.CR cs.CV

    Hypersphere Secure Sketch Revisited: Probabilistic Linear Regression Attack on IronMask in Multiple Usage

    Authors: Pengxu Zhu, Lei Wang

    Abstract: Protection of biometric templates is a critical and urgent area of focus. IronMask demonstrates outstanding recognition performance while protecting facial templates against existing known attacks. In high-level, IronMask can be conceptualized as a fuzzy commitment scheme building on the hypersphere directly. We devise an attack on IronMask targeting on the security notion of renewability. Our att… ▽ More

    Submitted 19 September, 2024; originally announced September 2024.

  45. arXiv:2409.09352  [pdf, other

    cs.SD eess.AS

    MacST: Multi-Accent Speech Synthesis via Text Transliteration for Accent Conversion

    Authors: Sho Inoue, Shuai Wang, Wanxing Wang, Pengcheng Zhu, Mengxiao Bi, Haizhou Li

    Abstract: In accented voice conversion or accent conversion, we seek to convert the accent in speech from one another while preserving speaker identity and semantic content. In this study, we formulate a novel method for creating multi-accented speech samples, thus pairs of accented speech samples by the same speaker, through text transliteration for training accent conversion systems. We begin by generatin… ▽ More

    Submitted 10 January, 2025; v1 submitted 14 September, 2024; originally announced September 2024.

    Comments: This is accepted to IEEE ICASSP 2025; Project page with Speech Demo: https://github.com/shinshoji01/MacST-project-page

  46. arXiv:2409.09351  [pdf, other

    eess.AS cs.SD

    E1 TTS: Simple and Fast Non-Autoregressive TTS

    Authors: Zhijun Liu, Shuai Wang, Pengcheng Zhu, Mengxiao Bi, Haizhou Li

    Abstract: This paper introduces Easy One-Step Text-to-Speech (E1 TTS), an efficient non-autoregressive zero-shot text-to-speech system based on denoising diffusion pretraining and distribution matching distillation. The training of E1 TTS is straightforward; it does not require explicit monotonic alignment between the text and audio pairs. The inference of E1 TTS is efficient, requiring only one neural netw… ▽ More

    Submitted 14 September, 2024; originally announced September 2024.

  47. arXiv:2409.08282  [pdf, other

    q-fin.ST cs.CE cs.LG

    LSR-IGRU: Stock Trend Prediction Based on Long Short-Term Relationships and Improved GRU

    Authors: Peng Zhu, Yuante Li, Yifan Hu, Qinyuan Liu, Dawei Cheng, Yuqi Liang

    Abstract: Stock price prediction is a challenging problem in the field of finance and receives widespread attention. In recent years, with the rapid development of technologies such as deep learning and graph neural networks, more research methods have begun to focus on exploring the interrelationships between stocks. However, existing methods mostly focus on the short-term dynamic relationships of stocks a… ▽ More

    Submitted 11 May, 2025; v1 submitted 25 August, 2024; originally announced September 2024.

  48. arXiv:2409.01111  [pdf, other

    eess.SP eess.SY

    A Novel Massive Random Access in Cell-Free Massive MIMO Systems for High-Speed Mobility with OTFS Modulation

    Authors: Yanfeng Hu, Dongming Wang, Xinjiang Xia, Jiamin Li, Pengcheng Zhu, Xiaohu You

    Abstract: In the research of next-generation wireless communication technologies, orthogonal time frequency space (OTFS) modulation is emerging as a promising technique for high-speed mobile environments due to its superior efficiency and robustness in doubly selective channels. Additionally, the cell-free architecture, which eliminates the issues associated with cell boundaries, offers broader coverage for… ▽ More

    Submitted 27 April, 2025; v1 submitted 2 September, 2024; originally announced September 2024.

  49. arXiv:2408.14533  [pdf, other

    cond-mat.str-el cond-mat.mes-hall

    Obstruction to Broken Symmetries in Topological Flat Bands

    Authors: Penghao Zhu, Shi Feng, Yuan-Ming Lu

    Abstract: Motivated by the abundance of symmetry breaking states in magic-angle twisted bilayer graphene and other two-dimensional materials, we study superconducting (SC) and charge orders in two-dimensional topological flat bands in the strong correlation regime. By relating the half-filled 2D topological flat bands to the surface states of 3D topological insulators in symmetry class AIII, we reveal the t… ▽ More

    Submitted 12 September, 2024; v1 submitted 26 August, 2024; originally announced August 2024.

    Comments: 5+9 pages, 2+1 figures; We added a short discussion about materials in v2

  50. arXiv:2408.14165  [pdf, ps, other

    physics.plasm-ph

    Formation of quasi-single helicity state from a paramagnetic pinch in KTX regime

    Authors: Bing Luo, Ping Zhu, Wentan Yan, Hong Li, Wandong Liu

    Abstract: The formation of quasi-single helicity (QSH) state from a paramagnetic pinch in the KTX-RFP regime has been observed in recent NIMROD simulations. The quasi-single helicity state has a dominant helical component of the magnetic field that is known to improve the RFP confinement. For the initial paramagnetic pinch, linear calculations indicate that the tearing mode growth rate decreases with the pl… ▽ More

    Submitted 26 August, 2024; originally announced August 2024.