
Showing 151–200 of 1,651 results for author: Pan, J

  1. arXiv:2412.04448  [pdf, other]

    cs.CV

    MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation

    Authors: Longtao Zheng, Yifan Zhang, Hanzhong Guo, Jiachun Pan, Zhenxiong Tan, Jiahao Lu, Chuanxin Tang, Bo An, Shuicheng Yan

    Abstract: Recent advances in video diffusion models have unlocked new potential for realistic audio-driven talking video generation. However, achieving seamless audio-lip synchronization, maintaining long-term identity consistency, and producing natural, audio-aligned expressions in generated talking videos remain significant challenges. To address these challenges, we propose Memory-guided EMOtion-aware di…

    Submitted 5 December, 2024; originally announced December 2024.

    Comments: Project Page: https://memoavatar.github.io

  2. arXiv:2412.04107  [pdf, other]

    cs.IR cs.AI

    Pre-train, Align, and Disentangle: Empowering Sequential Recommendation with Large Language Models

    Authors: Yuhao Wang, Junwei Pan, Pengyue Jia, Wanyu Wang, Maolin Wang, Zhixiang Feng, Xiaotian Li, Jie Jiang, Xiangyu Zhao

    Abstract: Sequential Recommendation (SR) aims to leverage the sequential patterns in users' historical interactions to accurately track their preferences. However, the primary reliance of existing SR methods on collaborative data results in challenges such as the cold-start problem and sub-optimal performance. Concurrently, despite the proven effectiveness of large language models (LLMs), their integration…

    Submitted 25 April, 2025; v1 submitted 5 December, 2024; originally announced December 2024.

    Comments: accepted to SIGIR 2025

  3. arXiv:2412.02119  [pdf, other]

    cs.CV cs.LG cs.RO

    Understanding Particles From Video: Property Estimation of Granular Materials via Visuo-Haptic Learning

    Authors: Zeqing Zhang, Guangze Zheng, Xuebo Ji, Guanqi Chen, Ruixing Jia, Wentao Chen, Guanhua Chen, Liangjun Zhang, Jia Pan

    Abstract: Granular materials (GMs) are ubiquitous in daily life. Understanding their properties is also important, especially in agriculture and industry. However, existing works require dedicated measurement equipment and also need large human efforts to handle a large number of particles. In this paper, we introduce a method for estimating the relative values of particle size and density from the video of…

    Submitted 2 December, 2024; originally announced December 2024.

    Comments: IEEE Robotics and Automation Letters, with ICRA 2025

  4. arXiv:2412.01427  [pdf, other]

    cs.CV

    FoundIR: Unleashing Million-scale Training Data to Advance Foundation Models for Image Restoration

    Authors: Hao Li, Xiang Chen, Jiangxin Dong, Jinhui Tang, Jinshan Pan

    Abstract: Despite the significant progress made by all-in-one models in universal image restoration, existing methods suffer from a generalization bottleneck in real-world scenarios, as they are mostly trained on small-scale synthetic datasets with limited degradations. Therefore, large-scale high-quality real-world training data is urgently needed to facilitate the emergence of foundational models for imag…

    Submitted 2 December, 2024; originally announced December 2024.

    Comments: Project website: https://www.foundir.net

  5. arXiv:2412.01391  [pdf, other]

    quant-ph

    Transversal Logical Clifford gates on rotated surface codes with reconfigurable neutral atom arrays

    Authors: Zi-Han Chen, Ming-Cheng Chen, Chao-Yang Lu, Jian-Wei Pan

    Abstract: We propose hardware-efficient schemes for implementing logical H and S gates transversally on rotated surface codes with reconfigurable neutral atom arrays. For logical H gates, we develop a simple strategy to rotate code patches efficiently with two sets of 2D-acousto-optic deflectors (2D-AODs). Our protocol for logical S gates utilizes the time-dynamics of the data and ancilla qubits during synd…

    Submitted 2 December, 2024; originally announced December 2024.

  6. arXiv:2411.19285  [pdf, other]

    cs.LG cs.AI q-fin.PM

    BPQP: A Differentiable Convex Optimization Framework for Efficient End-to-End Learning

    Authors: Jianming Pan, Zeqi Ye, Xiao Yang, Xu Yang, Weiqing Liu, Lewen Wang, Jiang Bian

    Abstract: Data-driven decision-making processes increasingly utilize end-to-end learnable deep neural networks to render final decisions. Sometimes, the output of the forward functions in certain layers is determined by the solutions to mathematical optimization problems, leading to the emergence of differentiable optimization layers that permit gradient back-propagation. However, real-world scenarios often…

    Submitted 29 December, 2024; v1 submitted 28 November, 2024; originally announced November 2024.

    Comments: NeurIPS 2024 Spotlight

  7. arXiv:2411.18824  [pdf, other]

    cs.CV

    FaithDiff: Unleashing Diffusion Priors for Faithful Image Super-resolution

    Authors: Junyang Chen, Jinshan Pan, Jiangxin Dong

    Abstract: Faithful image super-resolution (SR) not only needs to recover images that appear realistic, similar to image generation tasks, but also requires that the restored images maintain fidelity and structural consistency with the input. To this end, we propose a simple and effective method, named FaithDiff, to fully harness the impressive power of latent diffusion models (LDMs) for faithful image SR. I…

    Submitted 27 November, 2024; originally announced November 2024.

    Comments: Project page: https://jychen9811.github.io/FaithDiff_page/

  8. arXiv:2411.18560  [pdf, ps, other]

    hep-ph hep-ex

    A tale of Bethe logarithms: leptonic widths of $χ_{cJ}$ and Lamb shift

    Authors: Yu Jia, Jichen Pan

    Abstract: The rare annihilation decays of $P$-wave spin-triplet quarkonia into lepton pair have to proceed via two-photon intermediate state, which are plagued with the infrared divergence symptom. We recognize that the physical root of the IR divergence and its remedy is the same as the Lamb shift in QED. In this work we provide a complete solution to this IR problem by including the effect of the higher F…

    Submitted 27 November, 2024; originally announced November 2024.

    Comments: 10 pages, 2 figures, 3 tables

  9. arXiv:2411.17083  [pdf, other]

    cs.RO physics.flu-dyn physics.geo-ph physics.ins-det

    A Haptic-Based Proximity Sensing System for Buried Object in Granular Material

    Authors: Zeqing Zhang, Ruixing Jia, Youcan Yan, Ruihua Han, Shijie Lin, Qian Jiang, Liangjun Zhang, Jia Pan

    Abstract: The proximity perception of objects in granular materials is significant, especially for applications like minesweeping. However, due to particles' opacity and complex properties, existing proximity sensors suffer from high costs from sophisticated hardware and high user-cost from unintuitive results. In this paper, we propose a simple yet effective proximity sensing system for underground stuff b…

    Submitted 25 November, 2024; originally announced November 2024.

    Comments: The 40th International Symposium of Robotics Research (ISRR). Long Beach, California, USA, December 8-12 2024

  10. arXiv:2411.15443  [pdf, other]

    cond-mat.quant-gas quant-ph

    String breaking mechanism in a lattice Schwinger model simulator

    Authors: Ying Liu, Wei-Yong Zhang, Zi-Hang Zhu, Ming-Gen He, Zhen-Sheng Yuan, Jian-Wei Pan

    Abstract: String breaking is a fundamental concept in gauge theories, describing the decay of a flux string connecting two charges through the production of particle-antiparticle pairs. This phenomenon is particularly important in particle physics, notably in Quantum Chromodynamics, and plays a crucial role in condensed matter physics. However, achieving a theoretical understanding of this non-perturbative…

    Submitted 22 November, 2024; originally announced November 2024.

    Comments: 12 pages, (5+3) figures

  11. arXiv:2411.14507  [pdf, other]

    cs.LG cs.AI cs.CL

    FuseGPT: Learnable Layers Fusion of Generative Pre-trained Transformers

    Authors: Zehua Pei, Hui-Ling Zhen, Xianzhi Yu, Sinno Jialin Pan, Mingxuan Yuan, Bei Yu

    Abstract: Generative Pre-trained Transformers (GPTs) have demonstrated remarkable performance across diverse domains through the extensive scaling of model parameters. Recent works observe the redundancy across the transformer blocks and develop compression methods by structured pruning of the unimportant blocks. However, such straightforward elimination will always provide irreversible performance degradat…

    Submitted 21 November, 2024; originally announced November 2024.

  12. arXiv:2411.13943  [pdf, other]

    quant-ph physics.app-ph physics.optics

    Independent Optical Frequency Combs Powered 546 km Field Test of Twin-Field Quantum Key Distribution

    Authors: Lai Zhou, Jinping Lin, Chengfang Ge, Yuanbin Fan, Zhiliang Yuan, Hao Dong, Yang Liu, Di Ma, Jiu-Peng Chen, Cong Jiang, Xiang-Bin Wang, Li-Xing You, Qiang Zhang, Jian-Wei Pan

    Abstract: Owing to its repeater-like rate-loss scaling, twin-field quantum key distribution (TF-QKD) has repeatedly exhibited in laboratory its superiority for secure communication over record fiber lengths. Field trials pose a new set of challenges however, which must be addressed before the technology's roll-out into real-world. Here, we verify in field the viability of using independent optical frequency…

    Submitted 21 November, 2024; originally announced November 2024.

    Comments: To appear in Physical Review Applied

  13. arXiv:2411.13588  [pdf, other]

    cs.CV cs.AI

    Unveiling Redundancy in Diffusion Transformers (DiTs): A Systematic Study

    Authors: Xibo Sun, Jiarui Fang, Aoyu Li, Jinzhe Pan

    Abstract: The increased model capacity of Diffusion Transformers (DiTs) and the demand for generating higher resolutions of images and videos have led to a significant rise in inference latency, impacting real-time performance adversely. While prior research has highlighted the presence of high similarity in activation values between adjacent diffusion steps (referred to as redundancy) and proposed various…

    Submitted 17 November, 2024; originally announced November 2024.

    Comments: 9 pages including reference

  14. arXiv:2411.12565  [pdf, other]

    cond-mat.quant-gas quant-ph

    Probing false vacuum decay on a cold-atom gauge-theory quantum simulator

    Authors: Zi-Hang Zhu, Ying Liu, Gianluca Lagnese, Federica Maria Surace, Wei-Yong Zhang, Ming-Gen He, Jad C. Halimeh, Marcello Dalmonte, Siddhardh C. Morampudi, Frank Wilczek, Zhen-Sheng Yuan, Jian-Wei Pan

    Abstract: In the context of quantum electrodynamics, the decay of false vacuum leads to the production of electron-positron pair, a phenomenon known as the Schwinger effect. In practical experimental scenarios, producing a pair requires an extremely strong electric field, thus suppressing the production rate and making this process very challenging to observe. Here we report an experimental investigation, i…

    Submitted 19 November, 2024; originally announced November 2024.

  15. arXiv:2411.12441  [pdf, other]

    cs.IR

    Towards Unifying Feature Interaction Models for Click-Through Rate Prediction

    Authors: Yu Kang, Junwei Pan, Jipeng Jin, Shudong Huang, Xiaofeng Gao, Lei Xiao

    Abstract: Modeling feature interactions plays a crucial role in accurately predicting click-through rates (CTR) in advertising systems. To capture the intricate patterns of interaction, many existing models employ matrix-factorization techniques to represent features as lower-dimensional embedding vectors, enabling the modeling of interactions as products between these embeddings. In this paper, we propose…

    Submitted 19 November, 2024; originally announced November 2024.

  16. arXiv:2411.12026  [pdf, other]

    astro-ph.CO

    Modified Gravity Constraints from the Full Shape Modeling of Clustering Measurements from DESI 2024

    Authors: M. Ishak, J. Pan, R. Calderon, K. Lodha, G. Valogiannis, A. Aviles, G. Niz, L. Yi, C. Zheng, C. Garcia-Quintero, A. de Mattia, L. Medina-Varela, J. L. Cervantes-Cota, U. Andrade, D. Huterer, H. E. Noriega, G. Zhao, A. Shafieloo, W. Fang, S. Ahlen, D. Bianchi, D. Brooks, E. Burtin, E. Chaussidon, T. Claybaugh, et al. (45 additional authors not shown)

    Abstract: We present cosmological constraints on deviations from general relativity (GR) from the first-year of clustering observations from the Dark Energy Spectroscopic Instrument (DESI) in combination with other datasets. We first consider the $μ(a,k)$-$Σ(a,k)$ modified gravity (MG) parametrization (as well as $η(a,k)$) in flat $Λ$CDM and $w_0 w_a$CDM backgrounds. Using a functional form for time-only ev…

    Submitted 20 December, 2024; v1 submitted 18 November, 2024; originally announced November 2024.

    Comments: 55 pages, 13 figures. This DESI Collaboration Publication is part of the 2024 publication series using the first year of observations (see https://data.desi.lbl.gov/doc/papers/). Added 3 figures and more discussions

  17. arXiv:2411.12022  [pdf, other]

    astro-ph.CO

    DESI 2024 VII: Cosmological Constraints from the Full-Shape Modeling of Clustering Measurements

    Authors: DESI Collaboration, A. G. Adame, J. Aguilar, S. Ahlen, S. Alam, D. M. Alexander, C. Allende Prieto, M. Alvarez, O. Alves, A. Anand, U. Andrade, E. Armengaud, S. Avila, A. Aviles, H. Awan, B. Bahr-Kalus, S. Bailey, C. Baltay, A. Bault, J. Behera, S. BenZvi, F. Beutler, D. Bianchi, C. Blake, R. Blum, et al. (188 additional authors not shown)

    Abstract: We present cosmological results from the measurement of clustering of galaxy, quasar and Lyman-$α$ forest tracers from the first year of observations with the Dark Energy Spectroscopic Instrument (DESI Data Release 1). We adopt the full-shape (FS) modeling of the power spectrum, including the effects of redshift-space distortions, in an analysis which has been validated in a series of supporting p…

    Submitted 21 November, 2024; v1 submitted 18 November, 2024; originally announced November 2024.

    Comments: This DESI Collaboration Key Publication is part of the 2024 publication series using the first year of observations (see https://data.desi.lbl.gov/doc/papers/). 55 pages, 10 figures

  18. arXiv:2411.12021  [pdf, other]

    astro-ph.CO

    DESI 2024 V: Full-Shape Galaxy Clustering from Galaxies and Quasars

    Authors: DESI Collaboration, A. G. Adame, J. Aguilar, S. Ahlen, S. Alam, D. M. Alexander, M. Alvarez, O. Alves, A. Anand, U. Andrade, E. Armengaud, S. Avila, A. Aviles, H. Awan, S. Bailey, C. Baltay, A. Bault, J. Behera, S. BenZvi, F. Beutler, D. Bianchi, C. Blake, R. Blum, S. Brieden, A. Brodzeller , et al. (174 additional authors not shown)

    Abstract: We present the measurements and cosmological implications of the galaxy two-point clustering using over 4.7 million unique galaxy and quasar redshifts in the range $0.1<z<2.1$ divided into six redshift bins over a $\sim 7,500$ square degree footprint, from the first year of observations with the Dark Energy Spectroscopic Instrument (DESI Data Release 1). By fitting the full power spectrum, we exte…

    Submitted 11 March, 2025; v1 submitted 18 November, 2024; originally announced November 2024.

    Comments: This DESI Collaboration Key Publication is part of the 2024 publication series using the first year of observations (see https://data.desi.lbl.gov/doc/papers/). 81 pages, 24 figures. This version matches the revision after the referee report

  19. arXiv:2411.12020  [pdf, other]

    astro-ph.CO

    DESI 2024 II: Sample Definitions, Characteristics, and Two-point Clustering Statistics

    Authors: DESI Collaboration, A. G. Adame, J. Aguilar, S. Ahlen, S. Alam, D. M. Alexander, M. Alvarez, O. Alves, A. Anand, U. Andrade, E. Armengaud, S. Avila, A. Aviles, H. Awan, S. Bailey, C. Baltay, A. Bault, J. Behera, S. BenZvi, F. Beutler, D. Bianchi, C. Blake, R. Blum, S. Brieden, A. Brodzeller , et al. (178 additional authors not shown)

    Abstract: We present the samples of galaxies and quasars used for DESI 2024 cosmological analyses, drawn from the DESI Data Release 1 (DR1). We describe the construction of large-scale structure (LSS) catalogs from these samples, which include matched sets of synthetic reference `randoms' and weights that account for variations in the observed density of the samples due to experimental design and varying in…

    Submitted 18 November, 2024; originally announced November 2024.

    Comments: This DESI Collaboration Key Publication is part of the 2024 publication series using the first year of observations (see https://data.desi.lbl.gov/doc/papers/)

  20. arXiv:2411.11004  [pdf, other]

    cs.CV cs.RO

    EROAM: Event-based Camera Rotational Odometry and Mapping in Real-time

    Authors: Wanli Xing, Shijie Lin, Linhan Yang, Zeqing Zhang, Yanjun Du, Maolin Lei, Yipeng Pan, Jia Pan

    Abstract: This paper presents EROAM, a novel event-based rotational odometry and mapping system that achieves real-time, accurate camera rotation estimation. Unlike existing approaches that rely on event generation models or contrast maximization, EROAM employs a spherical event representation by projecting events onto a unit sphere and introduces Event Spherical Iterative Closest Point (ES-ICP), a novel ge…

    Submitted 17 November, 2024; originally announced November 2024.

  21. arXiv:2411.08905  [pdf, other]

    cs.CE math.NA

    Synthesis Method for Obtaining Characteristic Modes of Multi-Structure Systems via independent Structure T-Matrix

    Authors: Chenbo Shi, Xin Gu, Shichen Liang, Jin Pan, Le Zuo

    Abstract: This paper presents a novel and efficient method for characteristic mode decomposition in multi-structure systems. By leveraging the translation and rotation matrices of vector spherical wavefunctions, our approach enables the synthesis of a composite system's characteristic modes using independently computed simulations of its constituent structures. The computationally intensive translation proc…

    Submitted 21 March, 2025; v1 submitted 29 October, 2024; originally announced November 2024.

  22. arXiv:2411.08904  [pdf, other]

    eess.SP

    Generalized Scattering Matrix of Antenna: Moment Solution, Compression Storage and Application

    Authors: Chenbo Shi, Jin Pan, Xin Gu, Shichen Liang, Le Zuo

    Abstract: This paper presents a computation method of generalized scattering matrix (GSM) based on integral equations and the method of moments (MoM), specifically designed for antennas excited through waveguide ports. By leveraging two distinct formulations -- magnetic-type and electric-type integral equations -- we establish concise algebraic relations linking the GSM directly to the impedance matrices ob…

    Submitted 23 April, 2025; v1 submitted 29 October, 2024; originally announced November 2024.

  23. arXiv:2411.06667  [pdf, other]

    eess.AS cs.SD

    DCF-DS: Deep Cascade Fusion of Diarization and Separation for Speech Recognition under Realistic Single-Channel Conditions

    Authors: Shu-Tong Niu, Jun Du, Ruo-Yu Wang, Gao-Bin Yang, Tian Gao, Jia Pan, Yu Hu

    Abstract: We propose a single-channel Deep Cascade Fusion of Diarization and Separation (DCF-DS) framework for back-end automatic speech recognition (ASR), combining neural speaker diarization (NSD) and speech separation (SS). First, we sequentially integrate the NSD and SS modules within a joint training framework, enabling the separation module to leverage speaker time boundaries from the diarization modu…

    Submitted 27 December, 2024; v1 submitted 10 November, 2024; originally announced November 2024.

  24. arXiv:2411.06174  [pdf, other]

    cs.LG cs.RO

    State Chrono Representation for Enhancing Generalization in Reinforcement Learning

    Authors: Jianda Chen, Wen Zheng Terence Ng, Zichen Chen, Sinno Jialin Pan, Tianwei Zhang

    Abstract: In reinforcement learning with image-based inputs, it is crucial to establish a robust and generalizable state representation. Recent advancements in metric learning, such as deep bisimulation metric approaches, have shown promising results in learning structured low-dimensional representation space from pixel observations, where the distance between states is measured based on task-relevant featu…

    Submitted 9 November, 2024; originally announced November 2024.

    Journal ref: 38th Conference on Neural Information Processing Systems (NeurIPS 2024)

  25. arXiv:2411.04664  [pdf, other]

    quant-ph

    Processing and Decoding Rydberg Decay Error with MBQC

    Authors: Cheng-Cheng Yu, Zi-Han Chen, Yu-Hao Deng, Ming-Cheng Chen, Chao-Yang Lu, Jian-Wei Pan

    Abstract: Achieving fault-tolerant quantum computing with neutral atom necessitates careful consideration of the errors inherent to this system. One typical error is the leakage from Rydberg states during the implementation of multi-qubit gates, which may propagate to multiple correlated errors and deteriorate the performance of error correction. To address this, researchers have proposed an erasure convers…

    Submitted 9 March, 2025; v1 submitted 7 November, 2024; originally announced November 2024.

    Comments: v2 updated error model to handle coherent error in no-jump evolution. Conclusion about error distance is unchanged -v3 more numerical results, remove one approximation when adding two-qubit pauli error -v4 compatible with detector error model, slightly improved threshold

  26. arXiv:2411.04373  [pdf, other]

    physics.optics physics.ins-det quant-ph

    Differential absorption ozone Lidar with 4H-SiC single-photon detectors

    Authors: Xian-Song Zhao, Chao Yu, Chong Wang, Tianyi Li, Bo Liu, Hai Lu, Rong Zhang, Xiankang Dou, Jun Zhang, Jian-Wei Pan

    Abstract: Differential absorption Lidar (DIAL) in the ultraviolet (UV) region is an effective approach for monitoring tropospheric ozone. 4H-SiC single-photon detectors (SPDs) are emergent devices for UV single-photon detection. Here, we demonstrate a 4H-SiC SPD-based ozone DIAL. We design and fabricate the 4H-SiC single-photon avalanche diode with a beveled mesa structure and optimized layer thickness. An…

    Submitted 6 March, 2025; v1 submitted 6 November, 2024; originally announced November 2024.

    Comments: Published in Applied Physics Letters

    Journal ref: Appl. Phys. Lett. 125, 211103 (2024)

  27. arXiv:2411.03554  [pdf, other]

    cs.CV

    Benchmarking Vision Language Model Unlearning via Fictitious Facial Identity Dataset

    Authors: Yingzi Ma, Jiongxiao Wang, Fei Wang, Siyuan Ma, Jiazhao Li, Jinsheng Pan, Xiujun Li, Furong Huang, Lichao Sun, Bo Li, Yejin Choi, Muhao Chen, Chaowei Xiao

    Abstract: Machine unlearning has emerged as an effective strategy for forgetting specific information in the training data. However, with the increasing integration of visual data, privacy concerns in Vision Language Models (VLMs) remain underexplored. To address this, we introduce Facial Identity Unlearning Benchmark (FIUBench), a novel VLM unlearning benchmark designed to robustly evaluate the effectivene…

    Submitted 7 March, 2025; v1 submitted 5 November, 2024; originally announced November 2024.

  28. arXiv:2411.01738  [pdf, other]

    cs.DC cs.AI

    xDiT: an Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

    Authors: Jiarui Fang, Jinzhe Pan, Xibo Sun, Aoyu Li, Jiannan Wang

    Abstract: Diffusion models are pivotal for generating high-quality images and videos. Inspired by the success of OpenAI's Sora, the backbone of diffusion models is evolving from U-Net to Transformer, known as Diffusion Transformers (DiTs). However, generating high-quality content necessitates longer sequence lengths, exponentially increasing the computation required for the attention mechanism, and escalati…

    Submitted 3 November, 2024; originally announced November 2024.

  29. arXiv:2410.23006  [pdf, other]

    hep-th gr-qc

    Thermodynamics of the Kerr-AdS black hole from an ensemble-averaged theory

    Authors: Peng Cheng, Jindong Pan, Haichen Xu, Si-Jiang Yang

    Abstract: Exploring the universal structure of the gravitational path integral beyond semi-classical saddles and uncovering a compelling statistical interpretation of black hole thermodynamics have long been significant challenges. We investigate the statistical interpretation of the Kerr-AdS black hole thermodynamics through an ensemble-averaged theory. By extending the phase space to include all possible…

    Submitted 22 April, 2025; v1 submitted 30 October, 2024; originally announced October 2024.

    Comments: 33 pages, 9 figures, published version, comments are welcome!

    Journal ref: Eur. Phys. J. C 85, 423 (2025)

  30. arXiv:2410.21492  [pdf, other]

    cs.CR cs.CL

    FATH: Authentication-based Test-time Defense against Indirect Prompt Injection Attacks

    Authors: Jiongxiao Wang, Fangzhou Wu, Wendi Li, Jinsheng Pan, Edward Suh, Z. Morley Mao, Muhao Chen, Chaowei Xiao

    Abstract: Large language models (LLMs) have been widely deployed as the backbone with additional tools and text information for real-world applications. However, integrating external information into LLM-integrated applications raises significant security concerns. Among these, prompt injection attacks are particularly threatening, where malicious instructions injected in the external text information can e…

    Submitted 25 November, 2024; v1 submitted 28 October, 2024; originally announced October 2024.

  31. arXiv:2410.21256  [pdf, other]

    cs.AI cs.CV eess.IV

    Multi-modal AI for comprehensive breast cancer prognostication

    Authors: Jan Witowski, Ken G. Zeng, Joseph Cappadona, Jailan Elayoubi, Khalil Choucair, Elena Diana Chiru, Nancy Chan, Young-Joon Kang, Frederick Howard, Irina Ostrovnaya, Carlos Fernandez-Granda, Freya Schnabel, Zoe Steinsnyder, Ugur Ozerdem, Kangning Liu, Waleed Abdulsattar, Yu Zong, Lina Daoud, Rafic Beydoun, Anas Saad, Nitya Thakore, Mohammad Sadic, Frank Yeung, Elisa Liu, Theodore Hill, et al. (26 additional authors not shown)

    Abstract: Treatment selection in breast cancer is guided by molecular subtypes and clinical characteristics. However, current tools including genomic assays lack the accuracy required for optimal clinical decision-making. We developed a novel artificial intelligence (AI)-based approach that integrates digital pathology images with clinical data, providing a more robust and effective method for predicting th…

    Submitted 2 March, 2025; v1 submitted 28 October, 2024; originally announced October 2024.

  32. arXiv:2410.20519  [pdf, other]

    cs.CV

    Fractal and Turbulent Feature Extraction and NFT Label Generation for Pollock Style Migration Paintings Based on VGG19

    Authors: Yiquan Wang, Xu Wang, Jiazhuo Pan

    Abstract: This paper puts forth an innovative approach that fuses deep learning, fractal analysis, and turbulence feature extraction techniques to create abstract artworks in the style of Pollock. The content and style characteristics of the image are extracted by the MindSpore deep learning framework and a pre-trained VGG19 model. An optimisation process is then employed to The method generates high-qualit…

    Submitted 3 November, 2024; v1 submitted 27 October, 2024; originally announced October 2024.

    Comments: 9 pages, 4 figures

  33. arXiv:2410.19743  [pdf, other]

    cs.SE cs.AI

    AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction

    Authors: Hongru Wang, Rui Wang, Boyang Xue, Heming Xia, Jingtao Cao, Zeming Liu, Jeff Z. Pan, Kam-Fai Wong

    Abstract: Large Language Models (LLMs) can interact with the real world by connecting with versatile external APIs, resulting in better problem-solving and task automation capabilities. Previous research primarily focuses on APIs with limited arguments from a single source or overlooks the complex dependency relationship between different APIs. However, it is essential to utilize multiple APIs collaborative…

    Submitted 10 October, 2024; originally announced October 2024.

  34. arXiv:2410.19211  [pdf]

    cs.LG

    Predicting Liquidity Coverage Ratio with Gated Recurrent Units: A Deep Learning Model for Risk Management

    Authors: Zhen Xu, Jingming Pan, Siyuan Han, Hongju Ouyang, Yuan Chen, Mohan Jiang

    Abstract: With the global economic integration and the high interconnection of financial markets, financial institutions are facing unprecedented challenges, especially liquidity risk. This paper proposes a liquidity coverage ratio (LCR) prediction model based on the gated recurrent unit (GRU) network to help financial institutions manage their liquidity risk more effectively. By utilizing the GRU network i…

    Submitted 24 October, 2024; originally announced October 2024.

  35. arXiv:2410.18343  [pdf, ps, other]

    math.CO

    Hook-valued tableaux uncrowding and tableau switching

    Authors: Jihyeug Jang, Jang Soo Kim, Jianping Pan, Joseph Pappe, Anne Schilling

    Abstract: Refined canonical stable Grothendieck polynomials were introduced by Hwang, Jang, Kim, Song, and Song. There exist two combinatorial models for these polynomials: one using hook-valued tableaux and the other using pairs of a semistandard Young tableau and (what we call) an exquisite tableau. An uncrowding algorithm on hook-valued tableaux was introduced by Pan, Pappe, Poh, and Schilling. In this p…

    Submitted 23 October, 2024; originally announced October 2024.

    Comments: 18 pages

    MSC Class: Primary 05E05; 05A19; Secondary 05E10; 14N10; 14N15

  36. Magnetoresistance oscillations in vertical junctions of 2D antiferromagnetic semiconductor CrPS$_4$

    Authors: Pengyuan Shi, Xiaoyu Wang, Lihao Zhang, Wenqin Song, Kunlin Yang, Shuxi Wang, Ruisheng Zhang, Liangliang Zhang, Takashi Taniguchi, Kenji Watanabe, Sen Yang, Lei Zhang, Lei Wang, Wu Shi, Jie Pan, Zhe Wang

    Abstract: Magnetoresistance (MR) oscillations serve as a hallmark of intrinsic quantum behavior, traditionally observed only in conducting systems. Here we report the discovery of MR oscillations in an insulating system, the vertical junctions of CrPS$_4$ which is a two dimensional (2D) A-type antiferromagnetic semiconductor. Systematic investigations of MR peaks under varying conditions, including electrod…

    Submitted 19 November, 2024; v1 submitted 23 October, 2024; originally announced October 2024.

    Comments: Accepted by Physical Review X

    Journal ref: Phys. Rev. X 14, 041065 (2024)

  37. arXiv:2410.17714  [pdf, other]

    cs.CL cs.AI

    CogSteer: Cognition-Inspired Selective Layer Intervention for Efficiently Steering Large Language Models

    Authors: Xintong Wang, Jingheng Pan, Liang Ding, Longyue Wang, Longqin Jiang, Xingshan Li, Chris Biemann

    Abstract: Large Language Models (LLMs) achieve remarkable performance through pretraining on extensive data. This enables efficient adaptation to diverse downstream tasks. However, the lack of interpretability in their underlying mechanisms limits the ability to effectively steer LLMs for specific applications. In this work, we investigate the intrinsic mechanisms of LLMs from a cognitive perspective using…

    Submitted 18 February, 2025; v1 submitted 23 October, 2024; originally announced October 2024.

  38. arXiv:2410.16708  [pdf, other]

    cs.CL

    Atomic Fact Decomposition Helps Attributed Question Answering

    Authors: Zhichao Yan, Jiapu Wang, Jiaoyan Chen, Xiaoli Li, Ru Li, Jeff Z. Pan

    Abstract: Attributed Question Answering (AQA) aims to provide both a trustworthy answer and a reliable attribution report for a given question. Retrieval is a widely adopted approach, including two general paradigms: Retrieval-Then-Read (RTR) and post-hoc retrieval. Recently, Large Language Models (LLMs) have shown remarkable proficiency, prompting growing interest in AQA among researchers. However, RTR-bas…

    Submitted 22 October, 2024; originally announced October 2024.

  39. arXiv:2410.16565  [pdf, other]

    astro-ph.HE

    Search for gravitational waves emitted from SN 2023ixf

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, I. Abouelfettouh, F. Acernese, K. Ackley, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, D. Agarwal, M. Agathos, M. Aghaei Abchouyeh, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, T. Akutsu, S. Albanesi, R. A. Alfaidi, A. Al-Jodah, C. Alléné, A. Allocca, et al. (1758 additional authors not shown)

    Abstract: We present the results of a search for gravitational-wave transients associated with core-collapse supernova SN 2023ixf, which was observed in the galaxy Messier 101 via optical emission on 2023 May 19th, during the LIGO-Virgo-KAGRA 15th Engineering Run. We define a five-day on-source window during which an accompanying gravitational-wave signal may have occurred. No gravitational waves have been…

    Submitted 11 March, 2025; v1 submitted 21 October, 2024; originally announced October 2024.

    Comments: Main paper: 6 pages, 4 figures and 1 table. Total with appendices: 20 pages, 4 figures, and 1 table

    Report number: LIGO-P2400125

  40. arXiv:2410.16055  [pdf, ps, other]

    math.AT math.GT

    Cohomotopy Sets of $(n-1)$-connected $(2n+2)$-manifolds for small $n$

    Authors: Pengcheng Li, Jianzhong Pan, Jie Wu

    Abstract: Let $M$ be a closed orientable $(n-1)$-connected $(2n+2)$-manifold, $n\geq 2$. In this paper we combine the Postnikov tower of spheres and the homotopy decomposition of the reduced suspension space $ΣM$ to investigate the cohomotopy sets $π^\ast(M)$ for $n=2,3,4$, under the assumption that $M$ has $2$-torsion-free homology. All cohomotopy sets $π^i(M)$ of such manifolds $M$ are characterized excep…

    Submitted 5 May, 2025; v1 submitted 21 October, 2024; originally announced October 2024.

    Comments: revised version, 34 pages

    MSC Class: 55Q55; 57N65; 55P15; 55P40

  41. arXiv:2410.15820  [pdf, other]

    cs.NI cs.AI

    MAC Revivo: Artificial Intelligence Paves the Way

    Authors: Jinzhe Pan, Jingqing Wang, Zelin Yun, Zhiyong Xiao, Yuehui Ouyang, Wenchi Cheng, Wei Zhang

    Abstract: The vast adoption of Wi-Fi and/or Bluetooth capabilities in Internet of Things (IoT) devices, along with the rapid growth of deployed smart devices, has caused significant interference and congestion in the industrial, scientific, and medical (ISM) bands. Traditional Wi-Fi Medium Access Control (MAC) design faces significant challenges in managing increasingly complex wireless environments while e…

    Submitted 21 October, 2024; originally announced October 2024.

  42. arXiv:2410.15488  [pdf, ps, other]

    math.DG

    On the topology of manifolds with nonnegative Ricci curvature and linear volume growth

    Authors: Dimitri Navarro, Jiayin Pan, Xingyu Zhu

    Abstract: Understanding the relationships between geometry and topology is a central theme in Riemannian geometry. We establish two results on the fundamental groups of open (complete and noncompact) $n$-manifolds with nonnegative Ricci curvature and linear volume growth. First, we show that the fundamental group of such a manifold contains a subgroup $\mathbb{Z}^k$ of finite index, where $0\le k\le n-1$. S…

    Submitted 20 October, 2024; originally announced October 2024.

  43. arXiv:2410.14668  [pdf, other]

    cs.CL

    MiCEval: Unveiling Multimodal Chain of Thought's Quality via Image Description and Reasoning Steps

    Authors: Xiongtao Zhou, Jie He, Lanyu Chen, Jingyu Li, Haojing Chen, Víctor Gutiérrez-Basulto, Jeff Z. Pan, Hanjie Chen

    Abstract: Multimodal Chain of Thought (MCoT) is a popular prompting strategy for improving the performance of multimodal large language models (MLLMs) across a range of complex reasoning tasks. Despite its popularity, there is a notable absence of automated methods for evaluating the quality of reasoning steps in MCoT. To address this gap, we propose Multimodal Chain-of-Thought Evaluation (MiCEval), a frame…

    Submitted 28 February, 2025; v1 submitted 18 October, 2024; originally announced October 2024.

    Comments: NAACL 2025

  44. arXiv:2410.13726  [pdf, other]

    cs.CV cs.AI

    DAWN: Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking Head Video Generation

    Authors: Hanbo Cheng, Limin Lin, Chenyu Liu, Pengcheng Xia, Pengfei Hu, Jiefeng Ma, Jun Du, Jia Pan

    Abstract: Talking head generation intends to produce vivid and realistic talking head videos from a single portrait and speech audio clip. Although significant progress has been made in diffusion-based talking head generation, almost all methods rely on autoregressive strategies, which suffer from limited context utilization beyond the current generation step, error accumulation, and slower generation speed…

    Submitted 26 March, 2025; v1 submitted 17 October, 2024; originally announced October 2024.

  45. arXiv:2410.12961  [pdf, other]

    cs.CV

    Super-resolving Real-world Image Illumination Enhancement: A New Dataset and A Conditional Diffusion Model

    Authors: Yang Liu, Yaofang Liu, Jinshan Pan, Yuxiang Hui, Fan Jia, Raymond H. Chan, Tieyong Zeng

    Abstract: Most existing super-resolution methods and datasets have been developed to improve the image quality in well-lighted conditions. However, these methods do not work well in real-world low-light conditions as the images captured in such conditions lose most important information and contain significant unknown noises. To solve this problem, we propose a SRRIIE dataset with an efficient conditional d…

    Submitted 16 October, 2024; originally announced October 2024.

    Comments: Code and dataset at https://github.com/Yaofang-Liu/Super-Resolving

  46. arXiv:2410.12270  [pdf, other]

    cs.CV

    DaDiff: Domain-aware Diffusion Model for Nighttime UAV Tracking

    Authors: Haobo Zuo, Changhong Fu, Guangze Zheng, Liangliang Yao, Kunhan Lu, Jia Pan

    Abstract: Domain adaptation is an inspiring solution to the misalignment issue of day/night image features for nighttime UAV tracking. However, the one-step adaptation paradigm is inadequate in addressing the prevalent difficulties posed by low-resolution (LR) objects when viewed from the UAVs at night, owing to the blurry edge contour and limited detail information. Moreover, these approaches struggle to p…

    Submitted 16 October, 2024; originally announced October 2024.

  47. Improving the Generalization of Unseen Crowd Behaviors for Reinforcement Learning based Local Motion Planners

    Authors: Wen Zheng Terence Ng, Jianda Chen, Sinno Jialin Pan, Tianwei Zhang

    Abstract: Deploying a safe mobile robot policy in scenarios with human pedestrians is challenging due to their unpredictable movements. Current Reinforcement Learning-based motion planners rely on a single policy to simulate pedestrian movements and could suffer from the over-fitting issue. Alternatively, framing the collision avoidance problem as a multi-agent framework, where agents generate dynamic movem…

    Submitted 16 October, 2024; originally announced October 2024.

  48. arXiv:2410.11666  [pdf, other]

    cs.CV

    DORNet: A Degradation Oriented and Regularized Network for Blind Depth Super-Resolution

    Authors: Zhengxue Wang, Zhiqiang Yan, Jinshan Pan, Guangwei Gao, Kai Zhang, Jian Yang

    Abstract: Recent RGB-guided depth super-resolution methods have achieved impressive performance under the assumption of fixed and known degradation (e.g., bicubic downsampling). However, in real-world scenarios, captured depth data often suffer from unconventional and unknown degradation due to sensor limitations and complex imaging environments (e.g., low reflective surfaces, varying illumination). Consequ…

    Submitted 19 March, 2025; v1 submitted 15 October, 2024; originally announced October 2024.

    Comments: CVPR 2025

  49. arXiv:2410.11206  [pdf, other]

    cs.LG

    Towards Understanding Why FixMatch Generalizes Better Than Supervised Learning

    Authors: Jingyang Li, Jiachun Pan, Vincent Y. F. Tan, Kim-Chuan Toh, Pan Zhou

    Abstract: Semi-supervised learning (SSL), exemplified by FixMatch (Sohn et al., 2020), has shown significant generalization advantages over supervised learning (SL), particularly in the context of deep neural networks (DNNs). However, it is still unclear, from a theoretical standpoint, why FixMatch-like SSL algorithms generalize better than SL on DNNs. In this work, we present the first theoretical justific…

    Submitted 9 March, 2025; v1 submitted 14 October, 2024; originally announced October 2024.

  50. arXiv:2410.10664  [pdf]

    quant-ph physics.atom-ph physics.optics physics.pop-ph

    Tunable Einstein-Bohr recoiling-slit gedankenexperiment at the quantum limit

    Authors: Yu-Chen Zhang, Hao-Wen Cheng, Zhao-Qiu Zengxu, Zhan Wu, Rui Lin, Yu-Cheng Duan, Jun Rui, Ming-Cheng Chen, Chao-Yang Lu, Jian-Wei Pan

    Abstract: In 1927, during the fifth Solvay Conference, Einstein and Bohr described a double-slit interferometer with a "movable slit" that can detect the momentum recoil of one photon. Here, we report a faithful realization of the Einstein-Bohr interferometer using a single atom in an optical tweezer, cooled to the motional ground state in three dimensions. The single atom has an intrinsic momentum uncertai…

    Submitted 14 October, 2024; originally announced October 2024.

    Comments: 18 pages, 4 figures