Skip to main content

Showing 201–250 of 4,528 results for author: Zhang, P

.
  1. arXiv:2502.07406  [pdf, other

    hep-ex

    Search for $e^+e^-\to K_S^0 K_S^0 h_c$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (642 additional authors not shown)

    Abstract: Using $e^+e^-$ collision data at 13 center-of-mass energies ranging from 4.600 to 4.950 GeV collected with the BESIII detector, we search for the unmeasured $e^+e^-\to K_S^0 K_S^0 h_c$ process . No significant signal is observed, and the upper limits of the Born cross sections at each center-of-mass energy are presented.

    Submitted 11 February, 2025; originally announced February 2025.

  2. arXiv:2502.07319  [pdf, ps, other

    cs.LG cs.IT

    Learnable Residual-based Latent Denoising in Semantic Communication

    Authors: Mingkai Xu, Yongpeng Wu, Yuxuan Shi, Xiang-Gen Xia, Wenjun Zhang, Ping Zhang

    Abstract: A latent denoising semantic communication (SemCom) framework is proposed for robust image transmission over noisy channels. By incorporating a learnable latent denoiser into the receiver, the received signals are preprocessed to effectively remove the channel noise and recover the semantic information, thereby enhancing the quality of the decoded images. Specifically, a latent denoising mapping is… ▽ More

    Submitted 29 April, 2025; v1 submitted 11 February, 2025; originally announced February 2025.

    Comments: This paper has been accepted by IEEE Wireless Communications Letters

  3. arXiv:2502.07317  [pdf, other

    physics.ins-det hep-ex

    Position reconstruction and surface background model for the PandaX-4T detector

    Authors: Zhicheng Qian, Linhui Gu, Chen Cheng, Zihao Bo, Wei Chen, Xun Chen, Yunhua Chen, Zhaokan Cheng, Xiangyi Cui, Yingjie Fan, Deqing Fang, Zhixing Gao, Lisheng Geng, Karl Giboni, Xunan Guo, Xuyuan Guo, Zichao Guo, Chencheng Han, Ke Han, Changda He, Jinrong He, Di Huang, Houqi Huang, Junting Huang, Ruquan Hou , et al. (78 additional authors not shown)

    Abstract: We report the position reconstruction methods and surface background model for the PandaX-4T dark matter direct search experiment. This work develops two position reconstruction algorithms: template matching (TM) method and photon acceptance function (PAF) method. Both methods determine the horizontal position of events based on the light pattern of secondary scintillation collected by the light s… ▽ More

    Submitted 11 February, 2025; originally announced February 2025.

    Comments: 22 pages, 15 figures, 2 tables

  4. arXiv:2502.07239  [pdf, other

    cs.CV cs.AI

    Contextual Gesture: Co-Speech Gesture Video Generation through Context-aware Gesture Representation

    Authors: Pinxin Liu, Pengfei Zhang, Hyeongwoo Kim, Pablo Garrido, Ari Sharpio, Kyle Olszewski

    Abstract: Co-speech gesture generation is crucial for creating lifelike avatars and enhancing human-computer interactions by synchronizing gestures with speech. Despite recent advancements, existing methods struggle with accurately identifying the rhythmic or semantic triggers from audio for generating contextualized gesture patterns and achieving pixel-level realism. To address these challenges, we introdu… ▽ More

    Submitted 10 February, 2025; originally announced February 2025.

  5. arXiv:2502.07027  [pdf, other

    cs.LG cs.AI

    Representational Alignment with Chemical Induced Fit for Molecular Relational Learning

    Authors: Peiliang Zhang, Jingling Yuan, Qing Xie, Yongjun Zhu, Lin Li

    Abstract: Molecular Relational Learning (MRL) is widely applied in natural sciences to predict relationships between molecular pairs by extracting structural features. The representational similarity between substructure pairs determines the functional compatibility of molecular binding sites. Nevertheless, aligning substructure representations by attention mechanisms lacks guidance from chemical knowledge,… ▽ More

    Submitted 7 February, 2025; originally announced February 2025.

  6. arXiv:2502.06877  [pdf, other

    cs.LG

    WirelessGPT: A Generative Pre-trained Multi-task Learning Framework for Wireless Communication

    Authors: Tingting Yang, Ping Zhang, Mengfan Zheng, Yuxuan Shi, Liwen Jing, Jianbo Huang, Nan Li

    Abstract: This paper introduces WirelessGPT, a pioneering foundation model specifically designed for multi-task learning in wireless communication and sensing. Specifically, WirelessGPT leverages large-scale wireless channel datasets for unsupervised pretraining and extracting universal channel representations, which captures complex spatiotemporal dependencies. In fact,this task-agnostic design adapts Wire… ▽ More

    Submitted 8 February, 2025; originally announced February 2025.

    Comments: 8 pages, 4 figures

  7. arXiv:2502.06155  [pdf, other

    cs.CV

    Efficient-vDiT: Efficient Video Diffusion Transformers With Attention Tile

    Authors: Hangliang Ding, Dacheng Li, Runlong Su, Peiyuan Zhang, Zhijie Deng, Ion Stoica, Hao Zhang

    Abstract: Despite the promise of synthesizing high-fidelity videos, Diffusion Transformers (DiTs) with 3D full attention suffer from expensive inference due to the complexity of attention computation and numerous sampling steps. For example, the popular Open-Sora-Plan model consumes more than 9 minutes for generating a single video of 29 frames. This paper addresses the inefficiency issue from two aspects:… ▽ More

    Submitted 17 February, 2025; v1 submitted 10 February, 2025; originally announced February 2025.

  8. arXiv:2502.06145  [pdf, other

    cs.CV

    Animate Anyone 2: High-Fidelity Character Image Animation with Environment Affordance

    Authors: Li Hu, Guangyuan Wang, Zhen Shen, Xin Gao, Dechao Meng, Lian Zhuo, Peng Zhang, Bang Zhang, Liefeng Bo

    Abstract: Recent character image animation methods based on diffusion models, such as Animate Anyone, have made significant progress in generating consistent and generalizable character animations. However, these approaches fail to produce reasonable associations between characters and their environments. To address this limitation, we introduce Animate Anyone 2, aiming to animate characters with environmen… ▽ More

    Submitted 9 February, 2025; originally announced February 2025.

    Comments: Project Page: https://humanaigc.github.io/animate-anyone-2/

  9. arXiv:2502.06090   

    physics.optics

    Tip-Enhanced Raman Spectroscopy of Cell Wall Heterogeneity for Aspergillus Fumigatus

    Authors: Zhenfei Jiang, Jizhou Wang, Zhe He, Peng Zhang, Zhenhuan Yi, Alexei V. Sokolov, Marlan O. Scully

    Abstract: Tip-enhanced Raman spectroscopy (TERS) enables nanoscale chemical mapping of biological structures, providing high-resolution, high-signal-to-noise ratio imaging into molecular distribution and interactions beyond the capabilities of conventional Raman imaging. However, challenges such as the deformation of fragile biological cells and the complexity of signal interpretation would increase the dif… ▽ More

    Submitted 7 May, 2025; v1 submitted 9 February, 2025; originally announced February 2025.

    Comments: need a major revision

  10. arXiv:2502.05783  [pdf, other

    cs.HC cs.AI cs.LG

    WatchGuardian: Enabling User-Defined Personalized Just-in-Time Intervention on Smartwatch

    Authors: Ying Lei, Yancheng Cao, Will Wang, Yuanzhe Dong, Changchang Yin, Weidan Cao, Ping Zhang, Jingzhen Yang, Bingsheng Yao, Yifan Peng, Chunhua Weng, Randy Auerbach, Lena Mamykina, Dakuo Wang, Yuntao Wang, Xuhai Xu

    Abstract: While just-in-time interventions (JITIs) have effectively targeted common health behaviors, individuals often have unique needs to intervene in personal undesirable actions that can negatively affect physical, mental, and social well-being. We present WatchGuardian, a smartwatch-based JITI system that empowers users to define custom interventions for these personal actions with a small number of s… ▽ More

    Submitted 9 February, 2025; originally announced February 2025.

    Comments: Under submission

    MSC Class: 68U35 ACM Class: H.5.2; I.2.1

  11. arXiv:2502.05657  [pdf, other

    astro-ph.IM astro-ph.HE

    Ideas and Requirements for the Global Cosmic-Ray Observatory (GCOS)

    Authors: Markus Ahlers, Ingo Allekotte, Jaime Alvarez-Muniz, Gioacchino Alex Anastasi, Luis Anchordoqui, Rita de Cassia Dos Anjos, Hari Haran Balakrishnan, Rafael Alves Batista, Jose Bellido, Mario Bertaina, Sonali Bhatnagar, Pierre Billoir, Kathrin Bismark, Teresa Bister, Martina Bohacova, Carla Bonifazi, Fraser Bradfield, Antonella Castellina, Lorenzo Cazon, Kevin Almeida Cheminant, Alan Coleman, Fabio Convenga, Darko Veberič, Paramita Dasgupta, Kai Daumiller , et al. (114 additional authors not shown)

    Abstract: After a successful kick-off meeting in 2021. two workshops in 2022 and 2023 on the future Global Cosmic-Ray Observatory (GCOS) focused mainly on a straw man design of the detector and science possibilities for astro- and particle physics. About 100 participants gathered for in-person and hybrid panel discussions. In this report, we summarize these discussions, present a preliminary straw-man desig… ▽ More

    Submitted 8 February, 2025; originally announced February 2025.

    Comments: 48 pages, 27 figures

  12. arXiv:2502.05422  [pdf

    cond-mat.str-el cond-mat.mtrl-sci

    Magnetic transition in marcasite FeTe$_{2}$ induced by the competition between crystal field splitting and Coulomb repulsion

    Authors: Yue-Fei Hou, Zhibin Shao, Minghu Pan, Shiyang Wu, Fawei Zheng, Zhen-Guo Fu, Ping Zhang

    Abstract: The magnetic ground states in crystalline systems are significant for both fundamental condensed matter physics and practical materials engineering. Marcasite FeTe$_{2}$, characterized as a small-gap semiconductor, exhibits anomalous magnetic behaviors in low-temperature experiments. In this study, first-principles density functional theory calculations combined with scanning tunneling microscopy/… ▽ More

    Submitted 10 February, 2025; v1 submitted 7 February, 2025; originally announced February 2025.

    Comments: 6 figures, 13 pages. Minor revisions were performed. Comments are welcome

  13. arXiv:2502.05173  [pdf, other

    cs.CV

    VideoRoPE: What Makes for Good Video Rotary Position Embedding?

    Authors: Xilin Wei, Xiaoran Liu, Yuhang Zang, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Jian Tong, Haodong Duan, Qipeng Guo, Jiaqi Wang, Xipeng Qiu, Dahua Lin

    Abstract: While Rotary Position Embedding (RoPE) and its variants are widely adopted for their long-context capabilities, the extension of the 1D RoPE to video, with its complex spatio-temporal structure, remains an open challenge. This work first introduces a comprehensive analysis that identifies four key characteristics essential for the effective adaptation of RoPE to video, which have not been fully co… ▽ More

    Submitted 27 April, 2025; v1 submitted 7 February, 2025; originally announced February 2025.

  14. arXiv:2502.04848  [pdf, other

    astro-ph.HE

    Broadband $γ$-ray spectrum of supernova remnant Cassiopeia A

    Authors: Zhen Cao, F. Aharonian, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, W. Bian, A. V. Bukevich, C. M. Cai, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, H. X. Chen, Liang Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. Chen, S. H. Chen, S. Z. Chen , et al. (293 additional authors not shown)

    Abstract: The core-collapse supernova remnant (SNR) Cassiopeia A (Cas A) is one of the brightest galactic radio sources with an angular radius of $\sim$ 2.5 $\arcmin$. Although no extension of this source has been detected in the $γ$-ray band, using more than 1000 days of LHAASO data above $\sim 0.8$ TeV, we find that its spectrum is significantly softer than those obtained with Imaging Air Cherenkov Telesc… ▽ More

    Submitted 7 February, 2025; originally announced February 2025.

  15. arXiv:2502.04681  [pdf, other

    stat.ME

    CALF-SBM: A Covariate-Assisted Latent Factor Stochastic Block Model

    Authors: Sydney Louit, Evan Clark, Alexander Gelbard, Niketna Vivek, Jun Yan, Panpan Zhang

    Abstract: We propose a novel network generative model extended from the standard stochastic block model by concurrently utilizing observed node-level information and accounting for network-enabled nodal heterogeneity. The proposed model is so so-called covariate-assisted latent factor stochastic block model (CALF-SBM). The inference for the proposed model is done in a fully Bayesian framework. The primary a… ▽ More

    Submitted 7 February, 2025; originally announced February 2025.

  16. arXiv:2502.04674  [pdf, other

    cs.CL cs.AI

    AdParaphrase: Paraphrase Dataset for Analyzing Linguistic Features toward Generating Attractive Ad Texts

    Authors: Soichiro Murakami, Peinan Zhang, Hidetaka Kamigaito, Hiroya Takamura, Manabu Okumura

    Abstract: Effective linguistic choices that attract potential customers play crucial roles in advertising success. This study aims to explore the linguistic features of ad texts that influence human preferences. Although the creation of attractive ad texts is an active area of research, progress in understanding the specific linguistic features that affect attractiveness is hindered by several obstacles. Fi… ▽ More

    Submitted 11 February, 2025; v1 submitted 7 February, 2025; originally announced February 2025.

    Comments: Accepted to NAACL2025 Findings

  17. arXiv:2502.04649  [pdf, other

    eess.SY cs.LG math.OC

    End-to-End Learning Framework for Solving Non-Markovian Optimal Control

    Authors: Xiaole Zhang, Peiyu Zhang, Xiongye Xiao, Shixuan Li, Vasileios Tzoumas, Vijay Gupta, Paul Bogdan

    Abstract: Integer-order calculus often falls short in capturing the long-range dependencies and memory effects found in many real-world processes. Fractional calculus addresses these gaps via fractional-order integrals and derivatives, but fractional-order dynamical systems pose substantial challenges in system identification and optimal control due to the lack of standard control methodologies. In this pap… ▽ More

    Submitted 1 May, 2025; v1 submitted 6 February, 2025; originally announced February 2025.

  18. arXiv:2502.04507  [pdf, other

    cs.CV

    Fast Video Generation with Sliding Tile Attention

    Authors: Peiyuan Zhang, Yongqi Chen, Runlong Su, Hangliang Ding, Ion Stoica, Zhenghong Liu, Hao Zhang

    Abstract: Diffusion Transformers (DiTs) with 3D full attention power state-of-the-art video generation, but suffer from prohibitive compute cost -- when generating just a 5-second 720P video, attention alone takes 800 out of 945 seconds of total inference time. This paper introduces sliding tile attention (STA) to address this challenge. STA leverages the observation that attention scores in pretrained vide… ▽ More

    Submitted 6 February, 2025; originally announced February 2025.

  19. arXiv:2502.04288  [pdf

    cs.LG

    Leveraging Geolocation in Clinical Records to Improve Alzheimer's Disease Diagnosis Using DMV Framework

    Authors: Peng Zhang, Divya Chaudhary

    Abstract: Alzheimer's Disease (AD) early detection is critical for enabling timely intervention and improving patient outcomes. This paper presents a DMV framework using Llama3-70B and GPT-4o as embedding models to analyze clinical notes and predict a continuous risk score associated with early AD onset. Framing the task as a regression problem, we model the relationship between linguistic features in clini… ▽ More

    Submitted 6 February, 2025; originally announced February 2025.

  20. arXiv:2502.04268  [pdf, other

    cs.CV cs.AI

    Point2RBox-v2: Rethinking Point-supervised Oriented Object Detection with Spatial Layout Among Instances

    Authors: Yi Yu, Botao Ren, Peiyuan Zhang, Mingxin Liu, Junwei Luo, Shaofeng Zhang, Feipeng Da, Junchi Yan, Xue Yang

    Abstract: With the rapidly increasing demand for oriented object detection (OOD), recent research involving weakly-supervised detectors for learning OOD from point annotations has gained great attention. In this paper, we rethink this challenging task setting with the layout among instances and present Point2RBox-v2. At the core are three principles: 1) Gaussian overlap loss. It learns an upper bound for ea… ▽ More

    Submitted 6 February, 2025; v1 submitted 6 February, 2025; originally announced February 2025.

    Comments: 11 pages, 5 figures, 10 tables

  21. Observation of $D\to \bar{K}_{1}(1270)μ^+ν_μ$ and test of lepton flavor universality with $D\to \bar{K}_1(1270) \ell^{+} ν_{\ell}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (646 additional authors not shown)

    Abstract: By analyzing 7.93 $\rm fb^{-1}$ of $e^+e^-$ collision data collected at the center-of-mass energy of 3.773 GeV with the BESIII detector operated at the BEPCII collider, we report the observation of the semimuonic decays of $D^+\to \bar K_1(1270)^0μ^+ν_μ$ and $D^0\to K_1(1270)^-μ^+ν_μ$ with statistical significances of $12.5σ$ and $6.0σ$, respectively. Their decay branching fractions are determined… ▽ More

    Submitted 18 April, 2025; v1 submitted 6 February, 2025; originally announced February 2025.

    Comments: 11 pages, 5 figures

    Journal ref: Phys. Rev. D 111, L071101(2025)

  22. arXiv:2502.03732  [pdf, other

    cs.HC

    More Modality, More AI: Exploring Design Opportunities of AI-Based Multi-modal Remote Monitoring Technologies for Early Detection of Mental Health Sequelae in Youth Concussion Patients

    Authors: Bingsheng Yao, Menglin Zhao, Yuling Sun, Weidan Cao, Changchang Yin, Stephen Intille, Xuhai Xu, Ping Zhang, Jingzhen Yang, Dakuo Wang

    Abstract: Anxiety, depression, and suicidality are common mental health sequelae following concussion in youth patients, often exacerbating concussion symptoms and prolonging recovery. Despite the critical need for early detection of these mental health symptoms, clinicians often face challenges in accurately collecting patients' mental health data and making clinical decision-making in a timely manner. Tod… ▽ More

    Submitted 3 April, 2025; v1 submitted 5 February, 2025; originally announced February 2025.

  23. arXiv:2502.03017  [pdf, other

    nucl-ex

    Search for Double Beta Decay of $^{136}$Xe to the $0^+_1$ Excited State of $^{136}$Ba with PandaX-4T

    Authors: PandaX Collaboration, Lingyin Luo, Zihao Bo, Wei Chen, Xun Chen, Yunhua Chen, Zhaokan Cheng, Xiangyi Cui, Yingji Fang, Deqing Fang, Zhixing Gao, Lisheng Geng, Karl Giboni, Xunan Guo, Xuyuan Guo, Zichao Guo, Chencheng Han, Ke Han, Changda He, Jinrong He, Di Huang, Houqi Huang, Junting Huang, Ruquan Hou, Yu Hou , et al. (76 additional authors not shown)

    Abstract: We perform a search of double beta decay of $^{136}$Xe to the excited state, $0^+_1$, of $^{136}$Ba (2$νββ$-0$_1^+$), using the dual-phase xenon detector of PandaX-4T with the first 94.9-day commissioning data. The multi-site events are reconstructed up to the MeV energy scale, which helps to improve the background model significantly. The background contribution from the stainless steel platform… ▽ More

    Submitted 7 March, 2025; v1 submitted 5 February, 2025; originally announced February 2025.

  24. arXiv:2502.01326  [pdf, other

    gr-qc hep-th math-ph

    Flyby-induced displacement: analytic solution

    Authors: P. -M. Zhang, Z. K. Silagadze, P. A. Horvathy

    Abstract: We describe the scattering of particles by a sandwich gravitational wave generated during a flyby using an analytical approach. The derivative-of-the-Gaussian profile proposed by Gibbons and Hawking is approximated by the hyperbolic scarf potential, which allows for an exact analytic solution via the Nikiforov-Uvarov method. Our results confirm the prediction of Zel'dovich and Polnarev about certa… ▽ More

    Submitted 13 May, 2025; v1 submitted 3 February, 2025; originally announced February 2025.

    Comments: Affiliation updated

  25. arXiv:2501.18850  [pdf, other

    cs.CE

    Equivariant Hypergraph Diffusion for Crystal Structure Prediction

    Authors: Yang Liu, Chuan Zhou, Shuai Zhang, Peng Zhang, Xixun Lin, Shirui Pan

    Abstract: Crystal Structure Prediction (CSP) remains a fundamental challenge with significant implications for the development of new materials and the advancement of various scientific disciplines. Recent developments have shown that generative models, particularly diffusion models, hold great promise for CSP. However, traditional graph-based representations, where atomic bonds are modeled as pairwise grap… ▽ More

    Submitted 30 January, 2025; originally announced January 2025.

    Comments: 14 pages, 4 figures

  26. arXiv:2501.18801  [pdf, other

    cs.CV cs.AI

    Every Image Listens, Every Image Dances: Music-Driven Image Animation

    Authors: Zhikang Dong, Weituo Hao, Ju-Chiang Wang, Peng Zhang, Pawel Polak

    Abstract: Image animation has become a promising area in multimodal research, with a focus on generating videos from reference images. While prior work has largely emphasized generic video generation guided by text, music-driven dance video generation remains underexplored. In this paper, we introduce MuseDance, an innovative end-to-end model that animates reference images using both music and text inputs.… ▽ More

    Submitted 30 January, 2025; originally announced January 2025.

  27. arXiv:2501.16330  [pdf, other

    cs.CV cs.AI

    RelightVid: Temporal-Consistent Diffusion Model for Video Relighting

    Authors: Ye Fang, Zeyi Sun, Shangzhan Zhang, Tong Wu, Yinghao Xu, Pan Zhang, Jiaqi Wang, Gordon Wetzstein, Dahua Lin

    Abstract: Diffusion models have demonstrated remarkable success in image generation and editing, with recent advancements enabling albedo-preserving image relighting. However, applying these models to video relighting remains challenging due to the lack of paired video relighting datasets and the high demands for output fidelity and temporal consistency, further complicated by the inherent randomness of dif… ▽ More

    Submitted 27 January, 2025; originally announced January 2025.

  28. arXiv:2501.16103  [pdf, ps, other

    cs.DC cs.LG

    Static Batching of Irregular Workloads on GPUs: Framework and Application to Efficient MoE Model Inference

    Authors: Yinghan Li, Yifei Li, Jiejing Zhang, Bujiao Chen, Xiaotong Chen, Lian Duan, Yejun Jin, Zheng Li, Xuanyu Liu, Haoyu Wang, Wente Wang, Yajie Wang, Jiacheng Yang, Peiyang Zhang, Laiwen Zheng, Wenyuan Yu

    Abstract: It has long been a problem to arrange and execute irregular workloads on massively parallel devices. We propose a general framework for statically batching irregular workloads into a single kernel with a runtime task mapping mechanism on GPUs. We further apply this framework to Mixture-of-Experts (MoE) model inference and implement an optimized and efficient CUDA kernel. Our MoE kernel achieves up… ▽ More

    Submitted 27 January, 2025; originally announced January 2025.

    Comments: 11 pages

    ACM Class: D.1.3; I.2.6

  29. arXiv:2501.15907  [pdf, other

    cs.SD cs.CL eess.AS

    Emilia: A Large-Scale, Extensive, Multilingual, and Diverse Dataset for Speech Generation

    Authors: Haorui He, Zengqiang Shang, Chaoren Wang, Xuyuan Li, Yicheng Gu, Hua Hua, Liwei Liu, Chen Yang, Jiaqi Li, Peiyang Shi, Yuancheng Wang, Kai Chen, Pengyuan Zhang, Zhizheng Wu

    Abstract: Recent advancements in speech generation have been driven by the large-scale training datasets. However, current models fall short of capturing the spontaneity and variability inherent in real-world human speech, due to their reliance on audiobook datasets limited to formal read-aloud speech styles. To bridge this gap, we introduce Emilia-Pipe, an open-source preprocessing pipeline to extract high… ▽ More

    Submitted 27 January, 2025; originally announced January 2025.

    Comments: Extended version of arXiv:2407.05361, submitted to TASLP, dataset is available at: https://huggingface.co/datasets/amphion/Emilia-Dataset

  30. arXiv:2501.15898  [pdf, ps, other

    math.RT

    Homotopy categories and fibrant model structures

    Authors: Xue-Song Lu, Pu Zhang

    Abstract: The homotopy category of a model structure on a weakly idempotent complete additive category is proved to be equivalent to the additive quotient of the category of cofibrant-fibrant objects with respect to the subcategory of cofibrant-fibrant-trivial objects. A model structure on pointed category is fibrant, if every object is a fibrant object. Fibrant model structures is explicitly described by t… ▽ More

    Submitted 27 January, 2025; originally announced January 2025.

  31. arXiv:2501.15875  [pdf, other

    cs.CL

    LCTG Bench: LLM Controlled Text Generation Benchmark

    Authors: Kentaro Kurihara, Masato Mita, Peinan Zhang, Shota Sasaki, Ryosuke Ishigami, Naoaki Okazaki

    Abstract: The rise of large language models (LLMs) has led to more diverse and higher-quality machine-generated text. However, their high expressive power makes it difficult to control outputs based on specific business instructions. In response, benchmarks focusing on the controllability of LLMs have been developed, but several issues remain: (1) They primarily cover major languages like English and Chines… ▽ More

    Submitted 27 January, 2025; originally announced January 2025.

    Comments: 15 pages, 11 figures. Project page: this [URL](https://github.com/CyberAgentAILab/LCTG-Bench)

  32. arXiv:2501.15447  [pdf, ps, other

    hep-ex

    Observation of $h_{c}$ radiative decays to multiple light hadrons and the tensor state $f_2(1270)$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai , et al. (666 additional authors not shown)

    Abstract: Using $ψ(3686)\rightarrow π^{0} h_{c}$ decays from a data sample of $(27.12\pm0.14)\times10^{8}$ $ψ(3686)$ events collected by the BESIII detector at the BEPCII collider, $h_c$ radiative decays to $γπ^{+}π^{-},~γπ^{+}π^{-}η,~\gamma2(π^{+}π^{-})$, and $γp\bar{p}$ are observed for the first time, each with a significance greater than $5σ$. The corresponding branching fractions are measured. Furtherm… ▽ More

    Submitted 26 January, 2025; originally announced January 2025.

  33. arXiv:2501.15368  [pdf, other

    cs.CL cs.SD eess.AS

    Baichuan-Omni-1.5 Technical Report

    Authors: Yadong Li, Jun Liu, Tao Zhang, Tao Zhang, Song Chen, Tianpeng Li, Zehuan Li, Lijun Liu, Lingfeng Ming, Guosheng Dong, Da Pan, Chong Li, Yuanbo Fang, Dongdong Kuang, Mingrui Wang, Chenglin Zhu, Youwei Zhang, Hongyu Guo, Fengyu Zhang, Yuran Wang, Bowen Ding, Wei Song, Xu Li, Yuqi Huo, Zheng Liang , et al. (68 additional authors not shown)

    Abstract: We introduce Baichuan-Omni-1.5, an omni-modal model that not only has omni-modal understanding capabilities but also provides end-to-end audio generation capabilities. To achieve fluent and high-quality interaction across modalities without compromising the capabilities of any modality, we prioritized optimizing three key aspects. First, we establish a comprehensive data cleaning and synthesis pip… ▽ More

    Submitted 25 January, 2025; originally announced January 2025.

  34. arXiv:2501.15069   

    physics.optics

    Magnetic Field induced control and Multiple Magnomechanically Induced Transparency in Single Cavity

    Authors: Ghaisud Din, Muqaddar Abbas, Yunlong Wang, Feiran Wang, Pei Zhang

    Abstract: We investigate magnomechanically induced transparency (MMIT) in a microwave 3D copper cavity with two YIG spheres under varying interaction parameters. Numerical simulations show that the steady-state magnon number increases with stronger coupling between cavity photons and magnons, and is sensitive to both bias and drive magnetic fields. Pronounced peaks in the magnon population near resonant fie… ▽ More

    Submitted 6 May, 2025; v1 submitted 24 January, 2025; originally announced January 2025.

    Comments: Withdrawn due to errors in Section 2 and Appendix A. Section 2 omits a key coupling term in the Hamiltonian, affecting predictions. Appendix A contains a flaw in the linearization step used to derive the fluctuations. We are revising the analysis and will resubmit

  35. arXiv:2501.14989  [pdf, ps, other

    math.OC

    Redefining Coherent Risk Measures: From Gauge Optimization to Regularization

    Authors: Ningji Wei, Xian Yu, Peter Zhang

    Abstract: It is well understood that each coherent risk measure can be represented as the expectation with respect to the worst-case reweighted density function, chosen from an abstract risk envelope. This paper introduces an equivalent but more explicit definition of the risk envelope that uses gauge sets (i.e., a type of convex sets widely utilized in convex analysis and gauge optimization) to provide a g… ▽ More

    Submitted 18 April, 2025; v1 submitted 24 January, 2025; originally announced January 2025.

    MSC Class: 90C17; 90C15; 91G70

  36. arXiv:2501.14384  [pdf

    cond-mat.mtrl-sci cond-mat.dis-nn physics.comp-ph

    Efficiently charting the space of mixed vacancy-ordered perovskites by machine-learning encoded atomic-site information

    Authors: Fan Zhang, Li Fu, Weiwei Gao, Peihong Zhang, Jijun Zhao

    Abstract: Vacancy-ordered double perovskites (VODPs) are promising alternatives to three-dimensional lead halide perovskites for optoelectronic and photovoltaic applications. Mixing these materials creates a vast compositional space, allowing for highly tunable electronic and optical properties. However, the extensive chemical landscape poses significant challenges in efficiently screening candidates with t… ▽ More

    Submitted 24 January, 2025; originally announced January 2025.

    Comments: 22 pages, 9 figures

  37. arXiv:2501.14206  [pdf, ps, other

    hep-ex

    Cross section measurement of $e^{+}e^{-} \to f_{1}(1285)π^{+}π^{-}$ at center-of-mass energies between $3.808$ and $4.951\rm GeV$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (639 additional authors not shown)

    Abstract: Using data samples collected by the \mbox{BESIII} detector located at the Beijing Electron Positron Collider, the cross sections of the process $e^+e^-\to f_{1}(1285)π^+π^-$ are measured at forty-five center-of-mass energies from $3.808$ to $4.951 {\rm GeV}$. An investigation on the cross section line shape is performed, and no significant structure is observed.

    Submitted 23 January, 2025; originally announced January 2025.

  38. arXiv:2501.14187  [pdf, ps, other

    math.AP

    Linear enhanced dissipation for the 2D Taylor-Couette flow in the exterior region: A supplementary example for Gearhart-Prüss type lemma

    Authors: Te Li, Ping Zhang, Yibin Zhang

    Abstract: From the perspective of asymptotic stability at high Reynolds numbers, Taylor-Couette flow, as a typical rotating shear flow, exhibits rich decay behaviors. Previously, for the extensively studied Couette flow or the Taylor-Couette flow in bounded annular domains, methods based on resolvent estimates could derive exponential decay asymptotic for the solutions of the linearized system. However, unl… ▽ More

    Submitted 23 January, 2025; originally announced January 2025.

  39. arXiv:2501.13898  [pdf, other

    cs.CV cs.AI

    PointOBB-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object Detection

    Authors: Peiyuan Zhang, Junwei Luo, Xue Yang, Yi Yu, Qingyun Li, Yue Zhou, Xiaosong Jia, Xudong Lu, Jingdong Chen, Xiang Li, Junchi Yan, Yansheng Li

    Abstract: With the growing demand for oriented object detection (OOD), recent studies on point-supervised OOD have attracted significant interest. In this paper, we propose PointOBB-v3, a stronger single point-supervised OOD framework. Compared to existing methods, it generates pseudo rotated boxes without additional priors and incorporates support for the end-to-end paradigm. PointOBB-v3 functions by integ… ▽ More

    Submitted 23 January, 2025; originally announced January 2025.

    Comments: 16 pages, 5 figures, 10 tables

  40. arXiv:2501.13339  [pdf, ps, other

    eess.SP

    Joint Beamforming and Position Optimization for Fluid RIS-aided ISAC Systems

    Authors: Junjie Ye, Peichang Zhang, Xiao-Peng Li, Lei Huang, Yuanwei Liu

    Abstract: A fluid reconfigurable intelligent surface (fRIS)-aided integrated sensing and communications (ISAC) system is proposed to enhance multi-target sensing and multi-user communication. Unlike the conventional RIS, the fRIS incorporates movable elements whose positions can be flexibly adjusted to provide extra spatial degrees of freedom. In this system, a joint optimization problem is formulated to mi… ▽ More

    Submitted 24 January, 2025; v1 submitted 22 January, 2025; originally announced January 2025.

    Comments: 13 pages, 10 figures, has submitted to an IEEE journal for possible publication

  41. arXiv:2501.12948  [pdf, other

    cs.CL cs.AI cs.LG

    DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

    Authors: DeepSeek-AI, Daya Guo, Dejian Yang, Haowei Zhang, Junxiao Song, Ruoyu Zhang, Runxin Xu, Qihao Zhu, Shirong Ma, Peiyi Wang, Xiao Bi, Xiaokang Zhang, Xingkai Yu, Yu Wu, Z. F. Wu, Zhibin Gou, Zhihong Shao, Zhuoshu Li, Ziyi Gao, Aixin Liu, Bing Xue, Bingxuan Wang, Bochao Wu, Bei Feng, Chengda Lu , et al. (175 additional authors not shown)

    Abstract: We introduce our first-generation reasoning models, DeepSeek-R1-Zero and DeepSeek-R1. DeepSeek-R1-Zero, a model trained via large-scale reinforcement learning (RL) without supervised fine-tuning (SFT) as a preliminary step, demonstrates remarkable reasoning capabilities. Through RL, DeepSeek-R1-Zero naturally emerges with numerous powerful and intriguing reasoning behaviors. However, it encounters… ▽ More

    Submitted 22 January, 2025; originally announced January 2025.

  42. arXiv:2501.12696  [pdf, other

    eess.AS cs.SD eess.SP

    SoundSpring: Loss-Resilient Audio Transceiver with Dual-Functional Masked Language Modeling

    Authors: Shengshi Yao, Jincheng Dai, Xiaoqi Qin, Sixian Wang, Siye Wang, Kai Niu, Ping Zhang

    Abstract: In this paper, we propose "SoundSpring", a cutting-edge error-resilient audio transceiver that marries the robustness benefits of joint source-channel coding (JSCC) while also being compatible with current digital communication systems. Unlike recent deep JSCC transceivers, which learn to directly map audio signals to analog channel-input symbols via neural networks, our SoundSpring adopts the lay… ▽ More

    Submitted 22 January, 2025; originally announced January 2025.

    Comments: To appear in IEEE JSAC

  43. arXiv:2501.12614  [pdf, other

    astro-ph.IM hep-ex

    Electric field reconstruction with three polarizations for the radio detection of ultra-high energy particles

    Authors: Kewen Zhang, Tim Huege, Ramesh Koirala, Pengxiong Ma, Matías Tueros, Xin Xu, Chao Zhang, Pengfei Zhang, Yi Zhang

    Abstract: The amplitude, polarization, frequency spectrum and energy fluence carried by the electric field at a given measurement position are the key parameters for retrieving information from radio signals generated by extensive air showers. Accurate reconstruction of the electric field from the signals recorded by the antennas is therefore essential for the radio detection technique. Conventional reconst… ▽ More

    Submitted 24 January, 2025; v1 submitted 21 January, 2025; originally announced January 2025.

  44. arXiv:2501.12368  [pdf, other

    cs.CV cs.CL

    InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model

    Authors: Yuhang Zang, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Ziyu Liu, Shengyuan Ding, Shenxi Wu, Yubo Ma, Haodong Duan, Wenwei Zhang, Kai Chen, Dahua Lin, Jiaqi Wang

    Abstract: Despite the promising performance of Large Vision Language Models (LVLMs) in visual understanding, they occasionally generate incorrect outputs. While reward models (RMs) with reinforcement learning or test-time scaling offer the potential for improving generation quality, a critical gap remains: publicly available multi-modal RMs for LVLMs are scarce, and the implementation details of proprietary… ▽ More

    Submitted 21 January, 2025; originally announced January 2025.

    Comments: Tech Report

  45. arXiv:2501.10705  [pdf, other

    cs.IT eess.SP

    Secure Communication in Dynamic RDARS-Driven Systems

    Authors: Ziqian Pei, Jintao Wang, Pingping Zhang, Zheng Shi, Guanghua Yang, Shaodan Ma

    Abstract: In this letter, we investigate a dynamic reconfigurable distributed antenna and reflection surface (RDARS)-driven secure communication system, where the working mode of the RDARS can be flexibly configured. We aim to maximize the secrecy rate by jointly designing the active beamforming vectors, reflection coefficients, and the channel-aware mode selection matrix. To address the non-convex binary a… ▽ More

    Submitted 18 January, 2025; originally announced January 2025.

    Comments: 5 pages, 5 figures

  46. arXiv:2501.10182  [pdf, other

    cs.CR eess.SP

    Secure Semantic Communication With Homomorphic Encryption

    Authors: Rui Meng, Dayu Fan, Haixiao Gao, Yifan Yuan, Bizhu Wang, Xiaodong Xu, Mengying Sun, Chen Dong, Xiaofeng Tao, Ping Zhang, Dusit Niyato

    Abstract: In recent years, Semantic Communication (SemCom), which aims to achieve efficient and reliable transmission of meaning between agents, has garnered significant attention from both academia and industry. To ensure the security of communication systems, encryption techniques are employed to safeguard confidentiality and integrity. However, traditional cryptography-based encryption algorithms encount… ▽ More

    Submitted 17 January, 2025; originally announced January 2025.

    Comments: 8 pages, 3 figures

  47. arXiv:2501.10130  [pdf, other

    hep-ex

    Study of $η\rightarrowπ^+π^-l^+l^-$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (637 additional authors not shown)

    Abstract: Using a sample of $(10087\pm44)\times10^{6}$ $J/ψ$ events accumulated with the BESIII detector, we analyze the decays $η\rightarrowπ^+π^-l^+l^-$ ($l=e$ or $μ$) via the process $J/ψ\rightarrowγη$. The branching fraction of $η\rightarrowπ^+π^-e^+e^-$ is measured to be $\mathcal{B}(η\rightarrowπ^+π^-e^+e^-)=(3.07\pm0.12_{\rm{stat.}}\pm0.19_{\rm{syst.}}) \times10^{-4}$. No signal events are observed f… ▽ More

    Submitted 17 January, 2025; originally announced January 2025.

  48. arXiv:2501.10119  [pdf, other

    hep-ph hep-th nucl-th

    Gluon skewed generalized parton distributions of proton from a light-front Hamiltonian approach

    Authors: Pengxiang Zhang, Yiping Liu, Siqi Xu, Chandan Mondal, Xingbo Zhao, James P. Vary

    Abstract: We calculate all leading-twist gluon generalized parton distributions (GPDs) inside the proton at nonzero skewness using the basis light-front quantization framework. The proton's light-front wave functions are derived from a light-front quantized Hamiltonian incorporating Quantum Chromodynamics inputs. Our results show that the qualitative behaviors of the GPDs are consistent with those from othe… ▽ More

    Submitted 17 January, 2025; originally announced January 2025.

    Comments: 11 pages, 3 figures, and 1 table

  49. arXiv:2501.09400  [pdf, ps, other

    cs.IT eess.SP

    Joint Antenna Selection and Beamforming Design for Active RIS-aided ISAC Systems

    Authors: Wei Ma, Peichang Zhang, Junjie Ye, Rouyang Guan, Xiao-Peng Li, Lei Huang

    Abstract: Active reconfigurable intelligent surface (A-RIS) aided integrated sensing and communications (ISAC) system has been considered as a promising paradigm to improve spectrum efficiency. However, massive energy-hungry radio frequency (RF) chains hinder its large-scale deployment. To address this issue, an A-RIS-aided ISAC system with antenna selection (AS) is proposed in this work, where a target is… ▽ More

    Submitted 16 January, 2025; originally announced January 2025.

  50. arXiv:2501.09079  [pdf, other

    quant-ph

    Demonstrating quantum error mitigation on logical qubits

    Authors: Aosai Zhang, Haipeng Xie, Yu Gao, Jia-Nan Yang, Zehang Bao, Zitian Zhu, Jiachen Chen, Ning Wang, Chuanyu Zhang, Jiarun Zhong, Shibo Xu, Ke Wang, Yaozu Wu, Feitong Jin, Xuhao Zhu, Yiren Zou, Ziqi Tan, Zhengyi Cui, Fanhao Shen, Tingting Li, Yihang Han, Yiyang He, Gongyu Liu, Jiayuan Shen, Han Wang , et al. (10 additional authors not shown)

    Abstract: A long-standing challenge in quantum computing is developing technologies to overcome the inevitable noise in qubits. To enable meaningful applications in the early stages of fault-tolerant quantum computing, devising methods to suppress post-correction logical failures is becoming increasingly crucial. In this work, we propose and experimentally demonstrate the application of zero-noise extrapola… ▽ More

    Submitted 15 January, 2025; originally announced January 2025.