Skip to main content

Showing 101–150 of 1,963 results for author: Jiang, L

.
  1. arXiv:2503.17827  [pdf, other

    cs.CV

    4D-Bench: Benchmarking Multi-modal Large Language Models for 4D Object Understanding

    Authors: Wenxuan Zhu, Bing Li, Cheng Zheng, Jinjie Mai, Jun Chen, Letian Jiang, Abdullah Hamdi, Sara Rojas Martinez, Chia-Wen Lin, Mohamed Elhoseiny, Bernard Ghanem

    Abstract: Multimodal Large Language Models (MLLMs) have demonstrated impressive 2D image/video understanding capabilities. However, there are no publicly standardized benchmarks to assess the abilities of MLLMs in understanding the 4D objects (3D objects with temporal evolution over time). In this paper, we introduce 4D-Bench, the first benchmark to evaluate the capabilities of MLLMs in 4D object understand… ▽ More

    Submitted 22 March, 2025; originally announced March 2025.

  2. arXiv:2503.17741  [pdf, other

    cs.SE

    RustMap: Towards Project-Scale C-to-Rust Migration via Program Analysis and LLM

    Authors: Xuemeng Cai, Jiakun Liu, Xiping Huang, Yijun Yu, Haitao Wu, Chunmiao Li, Bo Wang, Imam Nur Bani Yusuf, Lingxiao Jiang

    Abstract: Migrating existing C programs into Rust is increasingly desired, as Rust offers superior memory safety while maintaining C's high performance. However, vastly different features between C and Rust--e.g., distinct definitions and usages of pointers and references--pose significant challenges beyond mere syntactic translation. Existing automated translation tools, such as C2Rust, may rely too much o… ▽ More

    Submitted 22 March, 2025; originally announced March 2025.

  3. arXiv:2503.17704  [pdf, other

    physics.flu-dyn cs.AI

    PT-PINNs: A Parametric Engineering Turbulence Solver based on Physics-Informed Neural Networks

    Authors: Liang Jiang, Yuzhou Cheng, Kun Luo, Jianren Fan

    Abstract: Physics-informed neural networks (PINNs) demonstrate promising potential in parameterized engineering turbulence optimization problems but face challenges, such as high data requirements and low computational accuracy when applied to engineering turbulence problems. This study proposes a framework that enhances the ability of PINNs to solve parametric turbulence problems without training datasets… ▽ More

    Submitted 22 March, 2025; originally announced March 2025.

  4. arXiv:2503.17362  [pdf, other

    quant-ph

    Criteria for unbiased estimation: applications to noise-agnostic sensing and learnability of quantum channel

    Authors: Hyukgun Kwon, Kento Tsubouchi, Chia-Tung Chu, Liang Jiang

    Abstract: We establish the necessary and sufficient conditions for unbiased estimation in multi-parameter estimation tasks. More specifically, we first consider quantum state estimation, where multiple parameters are encoded in a quantum state, and derive two equivalent necessary and sufficient conditions for an unbiased estimation: one formulated in terms of the quantum Fisher information matrix (QFIM) and… ▽ More

    Submitted 21 March, 2025; originally announced March 2025.

    Comments: 2 figures, 5 pages of main text, 18 pages of supplemental matrerial

  5. arXiv:2503.16418  [pdf, other

    cs.CV cs.LG

    InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity

    Authors: Liming Jiang, Qing Yan, Yumin Jia, Zichuan Liu, Hao Kang, Xin Lu

    Abstract: Achieving flexible and high-fidelity identity-preserved image generation remains formidable, particularly with advanced Diffusion Transformers (DiTs) like FLUX. We introduce InfiniteYou (InfU), one of the earliest robust frameworks leveraging DiTs for this task. InfU addresses significant issues of existing methods, such as insufficient identity similarity, poor text-image alignment, and low gener… ▽ More

    Submitted 20 March, 2025; originally announced March 2025.

    Comments: Project page: https://bytedance.github.io/InfiniteYou/ Code and model: https://github.com/bytedance/InfiniteYou

  6. arXiv:2503.15916  [pdf, other

    cs.CR cs.AR

    ALLMod: Exploring $\underline{\mathbf{A}}$rea-Efficiency of $\underline{\mathbf{L}}$UT-based $\underline{\mathbf{L}}$arge Number $\underline{\mathbf{Mod}}$ular Reduction via Hybrid Workloads

    Authors: Fangxin Liu, Haomin Li, Zongwu Wang, Bo Zhang, Mingzhe Zhang, Shoumeng Yan, Li Jiang, Haibing Guan

    Abstract: Modular arithmetic, particularly modular reduction, is widely used in cryptographic applications such as homomorphic encryption (HE) and zero-knowledge proofs (ZKP). High-bit-width operations are crucial for enhancing security; however, they are computationally intensive due to the large number of modular operations required. The lookup-table-based (LUT-based) approach, a ``space-for-time'' techni… ▽ More

    Submitted 27 May, 2025; v1 submitted 20 March, 2025; originally announced March 2025.

    Comments: Accepted by the 62nd Design Automation Conference ($\bf{DAC\ 2025}$)

  7. arXiv:2503.12689  [pdf, other

    cs.CV

    MagicID: Hybrid Preference Optimization for ID-Consistent and Dynamic-Preserved Video Customization

    Authors: Hengjia Li, Lifan Jiang, Xi Xiao, Tianyang Wang, Hongwei Yi, Boxi Wu, Deng Cai

    Abstract: Video identity customization seeks to produce high-fidelity videos that maintain consistent identity and exhibit significant dynamics based on users' reference images. However, existing approaches face two key challenges: identity degradation over extended video length and reduced dynamics during training, primarily due to their reliance on traditional self-reconstruction training with static imag… ▽ More

    Submitted 16 March, 2025; originally announced March 2025.

  8. arXiv:2503.12614  [pdf, other

    quant-ph

    Virtual purification complements quantum error correction in quantum metrology

    Authors: Hyukgun Kwon, Changhun Oh, Youngrong Lim, Hyunseok Jeong, Seung-Woo Lee, Liang Jiang

    Abstract: A practical realization of quantum metrology, enhancing the sensitivity of parameter estimation beyond the classical limit, is significantly hindered by the effect of noise. To tackle this challenge, quantum error correction (QEC) has been considered, however, indistinguishable noise from the signal and the bias induced by unknown noise prevents it from recovering the enhanced precision in practic… ▽ More

    Submitted 16 March, 2025; originally announced March 2025.

    Comments: 5 pages, 3 figures

  9. arXiv:2503.12374  [pdf, other

    cs.SE cs.AI

    Unveiling Pitfalls: Understanding Why AI-driven Code Agents Fail at GitHub Issue Resolution

    Authors: Zhi Chen, Wei Ma, Lingxiao Jiang

    Abstract: AI-driven software development has rapidly advanced with the emergence of software development agents that leverage large language models (LLMs) to tackle complex, repository-level software engineering tasks. These agents go beyond just generation of final code; they engage in multi-step reasoning, utilize various tools for code modification and debugging, and interact with execution environments… ▽ More

    Submitted 19 March, 2025; v1 submitted 16 March, 2025; originally announced March 2025.

  10. arXiv:2503.10623  [pdf, other

    quant-ph

    Fast Sideband Control of a Weakly Coupled Multimode Bosonic Memory

    Authors: Jordan Huang, Thomas J. DiNapoli, Gavin Rockwood, Ming Yuan, Prathyankara Narasimhan, Eesh Gupta, Mustafa Bal, Francesco Crisa, Sabrina Garattoni, Yao Lu, Liang Jiang, Srivatsan Chakram

    Abstract: Circuit quantum electrodynamics (cQED) with superconducting cavities coupled to nonlinear circuits like transmons offers a promising platform for hardware-efficient quantum information processing. We address critical challenges in realizing this architecture by weakening the dispersive coupling while also demonstrating fast, high-fidelity multimode control by dynamically amplifying gate speeds thr… ▽ More

    Submitted 13 March, 2025; originally announced March 2025.

    Comments: 10 pages of main text with 5 figures. 21 pages of supplementary information with 18 figures

  11. arXiv:2503.10592  [pdf, other

    cs.CV

    CameraCtrl II: Dynamic Scene Exploration via Camera-controlled Video Diffusion Models

    Authors: Hao He, Ceyuan Yang, Shanchuan Lin, Yinghao Xu, Meng Wei, Liangke Gui, Qi Zhao, Gordon Wetzstein, Lu Jiang, Hongsheng Li

    Abstract: This paper introduces CameraCtrl II, a framework that enables large-scale dynamic scene exploration through a camera-controlled video diffusion model. Previous camera-conditioned video generative models suffer from diminished video dynamics and limited range of viewpoints when generating videos with large camera movement. We take an approach that progressively expands the generation of dynamic sce… ▽ More

    Submitted 13 March, 2025; originally announced March 2025.

    Comments: Project page: https://hehao13.github.io/Projects-CameraCtrl-II/

  12. arXiv:2503.10589  [pdf, other

    cs.CV

    Long Context Tuning for Video Generation

    Authors: Yuwei Guo, Ceyuan Yang, Ziyan Yang, Zhibei Ma, Zhijie Lin, Zhenheng Yang, Dahua Lin, Lu Jiang

    Abstract: Recent advances in video generation can produce realistic, minute-long single-shot videos with scalable diffusion transformers. However, real-world narrative videos require multi-shot scenes with visual and dynamic consistency across shots. In this work, we introduce Long Context Tuning (LCT), a training paradigm that expands the context window of pre-trained single-shot video diffusion models to… ▽ More

    Submitted 13 March, 2025; originally announced March 2025.

    Comments: Project Page: https://guoyww.github.io/projects/long-context-video/

  13. arXiv:2503.10065  [pdf, other

    cs.LG cs.CV

    Do We Always Need the Simplicity Bias? Looking for Optimal Inductive Biases in the Wild

    Authors: Damien Teney, Liangze Jiang, Florin Gogianu, Ehsan Abbasnejad

    Abstract: Neural architectures tend to fit their data with relatively simple functions. This "simplicity bias" is widely regarded as key to their success. This paper explores the limits of this principle. Building on recent findings that the simplicity bias stems from ReLU activations [96], we introduce a method to meta-learn new activation functions and inductive biases better suited to specific tasks. F… ▽ More

    Submitted 13 March, 2025; originally announced March 2025.

    Comments: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

  14. arXiv:2503.09949  [pdf, other

    cs.CV

    UVE: Are MLLMs Unified Evaluators for AI-Generated Videos?

    Authors: Yuanxin Liu, Rui Zhu, Shuhuai Ren, Jiacong Wang, Haoyuan Guo, Xu Sun, Lu Jiang

    Abstract: With the rapid growth of video generative models (VGMs), it is essential to develop reliable and comprehensive automatic metrics for AI-generated videos (AIGVs). Existing methods either use off-the-shelf models optimized for other tasks or rely on human assessment data to train specialized evaluators. These approaches are constrained to specific evaluation aspects and are difficult to scale with t… ▽ More

    Submitted 21 March, 2025; v1 submitted 12 March, 2025; originally announced March 2025.

  15. arXiv:2503.08664  [pdf, other

    cs.CV cs.AI

    MEAT: Multiview Diffusion Model for Human Generation on Megapixels with Mesh Attention

    Authors: Yuhan Wang, Fangzhou Hong, Shuai Yang, Liming Jiang, Wayne Wu, Chen Change Loy

    Abstract: Multiview diffusion models have shown considerable success in image-to-3D generation for general objects. However, when applied to human data, existing methods have yet to deliver promising results, largely due to the challenges of scaling multiview attention to higher resolutions. In this paper, we explore human multiview diffusion models at the megapixel level and introduce a solution called mes… ▽ More

    Submitted 11 March, 2025; originally announced March 2025.

    Comments: CVPR 2025. Code https://github.com/johannwyh/MEAT Project Page https://johann.wang/MEAT/

  16. arXiv:2503.08422  [pdf, other

    cs.CV

    JiSAM: Alleviate Labeling Burden and Corner Case Problems in Autonomous Driving via Minimal Real-World Data

    Authors: Runjian Chen, Wenqi Shao, Bo Zhang, Shaoshuai Shi, Li Jiang, Ping Luo

    Abstract: Deep-learning-based autonomous driving (AD) perception introduces a promising picture for safe and environment-friendly transportation. However, the over-reliance on real labeled data in LiDAR perception limits the scale of on-road attempts. 3D real world data is notoriously time-and-energy-consuming to annotate and lacks corner cases like rare traffic participants. On the contrary, in simulators… ▽ More

    Submitted 13 March, 2025; v1 submitted 11 March, 2025; originally announced March 2025.

  17. arXiv:2503.07601  [pdf, other

    cs.CV cs.LG

    Balanced Image Stylization with Style Matching Score

    Authors: Yuxin Jiang, Liming Jiang, Shuai Yang, Jia-Wei Liu, Ivor Tsang, Mike Zheng Shou

    Abstract: We present Style Matching Score (SMS), a novel optimization method for image stylization with diffusion models. Balancing effective style transfer with content preservation is a long-standing challenge. Unlike existing efforts, our method reframes image stylization as a style distribution matching problem. The target style distribution is estimated from off-the-shelf style-dependent LoRAs via care… ▽ More

    Submitted 10 March, 2025; originally announced March 2025.

    Comments: Project page: https://yuxinn-j.github.io/projects/SMS.html

  18. arXiv:2503.07306  [pdf, other

    cs.CL

    Benchmarking Chinese Medical LLMs: A Medbench-based Analysis of Performance Gaps and Hierarchical Optimization Strategies

    Authors: Luyi Jiang, Jiayuan Chen, Lu Lu, Xinwei Peng, Lihao Liu, Junjun He, Jie Xu

    Abstract: The evaluation and improvement of medical large language models (LLMs) are critical for their real-world deployment, particularly in ensuring accuracy, safety, and ethical alignment. Existing frameworks inadequately dissect domain-specific error patterns or address cross-modal challenges. This study introduces a granular error taxonomy through systematic analysis of top 10 models on MedBench, cate… ▽ More

    Submitted 10 March, 2025; originally announced March 2025.

  19. arXiv:2503.05728  [pdf, ps, other

    cs.CY cs.AI

    Political Neutrality in AI Is Impossible- But Here Is How to Approximate It

    Authors: Jillian Fisher, Ruth E. Appel, Chan Young Park, Yujin Potter, Liwei Jiang, Taylor Sorensen, Shangbin Feng, Yulia Tsvetkov, Margaret E. Roberts, Jennifer Pan, Dawn Song, Yejin Choi

    Abstract: AI systems often exhibit political bias, influencing users' opinions and decisions. While political neutrality-defined as the absence of bias-is often seen as an ideal solution for fairness and safety, this position paper argues that true political neutrality is neither feasible nor universally desirable due to its subjective nature and the biases inherent in AI training data, algorithms, and user… ▽ More

    Submitted 3 June, 2025; v1 submitted 18 February, 2025; originally announced March 2025.

    Comments: Code: https://github.com/jfisher52/Approximation_Political_Neutrality

  20. arXiv:2503.04295  [pdf, other

    quant-ph

    Implementation of a quantum addressable router using superconducting qubits

    Authors: Connie Miao, Sébastien Léger, Ziqian Li, Gideon Lee, Liang Jiang, David I. Schuster

    Abstract: The implementation of a quantum router capable of performing both quantum signal routing and quantum addressing (a Q2-router) represents a key step toward building quantum networks and quantum random access memories. We realize a Q2-router that uses fixed-frequency transmon qubits to implement a routing protocol based on two native controlled-iSWAP gates. These gates leverage a large ZZ interactio… ▽ More

    Submitted 3 April, 2025; v1 submitted 6 March, 2025; originally announced March 2025.

    Comments: 21 pages, 11 figures

  21. arXiv:2503.04173  [pdf, other

    physics.optics physics.ins-det quant-ph

    Compact and fully functional high-frequency sine wave gating InGaAs/InP single-photon detector module

    Authors: Qi Xu, Chao Yu, Dajian Cui, Xuan-Yi Zhang, Wei Chen, Yu-Qiang Fang, Lianjun Jiang, Qixia Tong, Jianglin Zhao, Jun Zhang

    Abstract: High-frequency sine wave gating (SWG) InGaAs/InP single-photon detectors (SPDs) are widely used for synchronous near-infrared single-photon detection. For practical use, the size of SPD is one of the most concerning features for system integration. Here we present, to the best of our knowledge, the most compact and fully functional high-frequency SWG InGaAs/InP SPD. We develop a sine wave gating i… ▽ More

    Submitted 6 March, 2025; originally announced March 2025.

    Comments: Published by IEEE Journal of Selected Topics in Quantum Electronics

    Journal ref: IEEE Journal of Selected Topics in Quantum Electronics 31(5), 3801007 (2025)

  22. arXiv:2503.04155  [pdf, other

    cs.CL

    BPQA Dataset: Evaluating How Well Language Models Leverage Blood Pressures to Answer Biomedical Questions

    Authors: Chi Hang, Ruiqi Deng, Lavender Yao Jiang, Zihao Yang, Anton Alyakin, Daniel Alber, Eric Karl Oermann

    Abstract: Clinical measurements such as blood pressures and respiration rates are critical in diagnosing and monitoring patient outcomes. It is an important component of biomedical data, which can be used to train transformer-based language models (LMs) for improving healthcare delivery. It is, however, unclear whether LMs can effectively interpret and use clinical measurements. We investigate two questions… ▽ More

    Submitted 6 March, 2025; originally announced March 2025.

    Comments: 9 pages

  23. arXiv:2503.03898  [pdf, other

    quant-ph physics.optics

    Acoustic phonon phase gates with number-resolving phonon detection

    Authors: Hong Qiao, Zhaoyou Wang, Gustav Andersson, Alexander Anferov, Christopher R. Conner, Yash J. Joshi, Shiheng Li, Jacob M. Miller, Xuntao Wu, Haoxiong Yan, Liang Jiang, Andrew N. Cleland

    Abstract: Linear optical quantum computing (LOQC) provides a compelling approach to quantum information processing, with a short list of physical requirements; however, experimental implementations have faced significant challenges. Itinerant phonons in quantum acoustics, combined with superconducting qubits, offer a compelling alternative to the quantum optics approach. Here we demonstrate key advances in… ▽ More

    Submitted 5 March, 2025; originally announced March 2025.

  24. arXiv:2503.03829  [pdf, other

    astro-ph.GA

    Medium-band Astrophysics with the Grism of NIRCam In Frontier fields (MAGNIF): Spectroscopic Census of H$α$ Luminosity Functions and Cosmic Star Formation at $z\sim 4.5$ and 6.3

    Authors: Shuqi Fu, Fengwu Sun, Linhua Jiang, Xiaojing Lin, Jose M. Diego, Lukas J. Furtak, Mathilde Jauzac, Anton M. Koekemoer, Mingyu Li, Masamune Oguri, Nency R. Patel, Christopher N. A. Willmer, Rogier A. Windhorst, Adi Zitrin, Franz E. Bauer, Chian-Chou Chen, Wenlei Chen, Cheng Cheng, Christopher J. Conselice, Daniel J. Eisenstein, Eiichi Egami, Daniel Espada, Xiaohui Fan, Seiji Fujimoto, Tiger Yu-Yang Hsiao , et al. (13 additional authors not shown)

    Abstract: We measure H$α$ luminosity functions (LFs) at redshifts $z \sim 4.5$ and 6.3 using the JWST MAGNIF (Medium-band Astrophysics with the Grism of NIRCam In Frontier fields) survey. MAGNIF obtained NIRCam grism spectra with the F360M and F480M filters in four Frontier Fields. We identify 248 H$α$ emitters based on the grism spectra and photometric redshifts from combined HST and JWST imaging data. The… ▽ More

    Submitted 5 March, 2025; originally announced March 2025.

    Comments: 25 pages, 14 figures, submitted to ApJ

  25. arXiv:2503.00255  [pdf, other

    quant-ph

    Efficient quantum tomography of a polynomial subspace

    Authors: Yat Wong, Ming Yuan, Kevin He, Srivatsan Chakram, Alireza Seif, David I. Schuster, Liang Jiang

    Abstract: Quantum tomography is crucial for characterizing the quantum states of multipartite systems, but its practicality is often limited by the exponentially large dimension of the Hilbert space. Most existing approaches, such as compressed sensing and tensor network-based tomography, impose structural constraints on the state to enable more resource-efficient characterization. However, not all physical… ▽ More

    Submitted 28 February, 2025; originally announced March 2025.

  26. arXiv:2502.21017  [pdf, other

    cs.CL

    PersuasiveToM: A Benchmark for Evaluating Machine Theory of Mind in Persuasive Dialogues

    Authors: Fangxu Yu, Lai Jiang, Shenyi Huang, Zhen Wu, Xinyu Dai

    Abstract: The ability to understand and predict the mental states of oneself and others, known as the Theory of Mind (ToM), is crucial for effective social scenarios. Although recent studies have evaluated ToM in Large Language Models (LLMs), existing benchmarks focus on simplified settings (e.g., Sally-Anne-style tasks) and overlook the complexity of real-world social interactions. To mitigate this gap, we… ▽ More

    Submitted 25 May, 2025; v1 submitted 28 February, 2025; originally announced February 2025.

  27. arXiv:2502.20807  [pdf, other

    cs.LG

    Digital Player: Evaluating Large Language Models based Human-like Agent in Games

    Authors: Jiawei Wang, Kai Wang, Shaojie Lin, Runze Wu, Bihan Xu, Lingeng Jiang, Shiwei Zhao, Renyu Zhu, Haoyu Liu, Zhipeng Hu, Zhong Fan, Le Li, Tangjie Lyu, Changjie Fan

    Abstract: With the rapid advancement of Large Language Models (LLMs), LLM-based autonomous agents have shown the potential to function as digital employees, such as digital analysts, teachers, and programmers. In this paper, we develop an application-level testbed based on the open-source strategy game "Unciv", which has millions of active players, to enable researchers to build a "data flywheel" for studyi… ▽ More

    Submitted 28 February, 2025; originally announced February 2025.

    Comments: neurips datasets and benchmarks 2024, not accepted

  28. arXiv:2502.20556  [pdf, other

    astro-ph.GA astro-ph.HE

    On the Extremely X-ray Variable Active Galactic Nuclei in the XMM-LSS Field

    Authors: Zijian Zhang, Bin Luo, Linhua Jiang, W. N. Brandt, Jian Huang, Qingling Ni

    Abstract: We present a systematic investigation of extremely X-ray variable active galactic nuclei (AGNs) in the $\approx 5.3~{\rm deg}^2$ XMM-SERVS XMM-LSS region. Eight variable AGNs are identified with rest-frame 2 keV flux density variability amplitudes around 6-12. We comprehensively analyze the X-ray and multiwavelength data to probe the origin of their extreme X-ray variability. It is found that thei… ▽ More

    Submitted 27 February, 2025; originally announced February 2025.

    Comments: 32 pages, 13 figures, 5 tables, accepted for publication in the Astrophysical Journal

  29. arXiv:2502.19751  [pdf, other

    cs.CV

    Lightweight Contrastive Distilled Hashing for Online Cross-modal Retrieval

    Authors: Jiaxing Li, Lin Jiang, Zeqi Ma, Kaihang Jiang, Xiaozhao Fang, Jie Wen

    Abstract: Deep online cross-modal hashing has gained much attention from researchers recently, as its promising applications with low storage requirement, fast retrieval efficiency and cross modality adaptive, etc. However, there still exists some technical hurdles that hinder its applications, e.g., 1) how to extract the coexistent semantic relevance of cross-modal data, 2) how to achieve competitive perfo… ▽ More

    Submitted 27 February, 2025; v1 submitted 26 February, 2025; originally announced February 2025.

    Comments: Accepted by AAAI 2025

  30. arXiv:2502.16799  [pdf, other

    cs.CV

    Hierarchical Semantic Compression for Consistent Image Semantic Restoration

    Authors: Shengxi Li, Zifu Zhang, Mai Xu, Lai Jiang, Yufan Liu, Ce Zhu

    Abstract: The emerging semantic compression has been receiving increasing research efforts most recently, capable of achieving high fidelity restoration during compression, even at extremely low bitrates. However, existing semantic compression methods typically combine standard pipelines with either pre-defined or high-dimensional semantics, thus suffering from deficiency in compression. To address this iss… ▽ More

    Submitted 23 February, 2025; originally announced February 2025.

    Comments: 13 pages, 12 figures

  31. arXiv:2502.16782  [pdf, other

    cs.LG cs.CR

    CipherPrune: Efficient and Scalable Private Transformer Inference

    Authors: Yancheng Zhang, Jiaqi Xue, Mengxin Zheng, Mimi Xie, Mingzhe Zhang, Lei Jiang, Qian Lou

    Abstract: Private Transformer inference using cryptographic protocols offers promising solutions for privacy-preserving machine learning; however, it still faces significant runtime overhead (efficiency issues) and challenges in handling long-token inputs (scalability issues). We observe that the Transformer's operational complexity scales quadratically with the number of input tokens, making it essential t… ▽ More

    Submitted 5 March, 2025; v1 submitted 23 February, 2025; originally announced February 2025.

    Comments: Accepted by ICLR 2025

  32. arXiv:2502.16600  [pdf, other

    cs.CL

    Diagnosing Moral Reasoning Acquisition in Language Models: Pragmatics and Generalization

    Authors: Guangliang Liu, Lei Jiang, Xitong Zhang, Kristen Marie Johnson

    Abstract: Ensuring that Large Language Models (LLMs) return just responses which adhere to societal values is crucial for their broader application. Prior research has shown that LLMs often fail to perform satisfactorily on tasks requiring moral cognizance, such as ethics-based judgments. While current approaches have focused on fine-tuning LLMs with curated datasets to improve their capabilities on such ta… ▽ More

    Submitted 6 March, 2025; v1 submitted 23 February, 2025; originally announced February 2025.

  33. arXiv:2502.14736  [pdf, other

    astro-ph.EP

    H$α$ Variability of AB Aur b with the Hubble Space Telescope: Probing the Nature of a Protoplanet Candidate with Accretion Light Echoes

    Authors: Brendan P. Bowler, Yifan Zhou, Lauren I. Biddle, Lillian Yushu Jiang, Jaehan Bae, Laird M. Close, Katherine B. Follette, Kyle Franson, Adam L. Kraus, Aniket Sanghi, Quang Tran, Kimberly Ward-Duong, Ya-Lin Wu, Zhaohuan Zhu

    Abstract: Giant planets generate accretion luminosity as they form. Much of this energy is radiated in strong H$α$ line emission, which has motivated direct imaging surveys at optical wavelengths to search for accreting protoplanets. However, compact disk structures can mimic accreting planets by scattering emission from the host star. This can complicate the interpretation of H$α$ point sources, especially… ▽ More

    Submitted 20 February, 2025; originally announced February 2025.

    Comments: Accepted to AJ

  34. arXiv:2502.13388  [pdf, other

    cs.AI

    Reflection of Episodes: Learning to Play Game from Expert and Self Experiences

    Authors: Xiaojie Xu, Zongyuan Li, Chang Lu, Runnan Qi, Yanan Ni, Lumin Jiang, Xiangbei Liu, Xuebo Zhang, Yongchun Fang, Kuihua Huang, Xian Guo, Zhanghua Wu, Zhenya Li

    Abstract: StarCraft II is a complex and dynamic real-time strategy (RTS) game environment, which is very suitable for artificial intelligence and reinforcement learning research. To address the problem of Large Language Model(LLM) learning in complex environments through self-reflection, we propose a Reflection of Episodes(ROE) framework based on expert experience and self-experience. This framework first o… ▽ More

    Submitted 18 February, 2025; originally announced February 2025.

  35. arXiv:2502.11122  [pdf, other

    cs.AI

    Hierarchical Expert Prompt for Large-Language-Model: An Approach Defeat Elite AI in TextStarCraft II for the First Time

    Authors: Zongyuan Li, Chang Lu, Xiaojie Xu, Runnan Qi, Yanan Ni, Lumin Jiang, Xiangbei Liu, Xuebo Zhang, Yongchun Fang, Kuihua Huang, Xian Guo

    Abstract: Since the emergence of the Large Language Model (LLM), LLM has been widely used in fields such as writing, translating, and searching. However, there is still great potential for LLM-based methods in handling complex tasks such as decision-making in the StarCraft II environment. To address problems such as lack of relevant knowledge and poor control over subtasks of varying importance, we propose… ▽ More

    Submitted 16 February, 2025; originally announced February 2025.

  36. arXiv:2502.10121  [pdf, other

    quant-ph

    Nonreciprocal routing induced by chirality in an atom-dimer waveguide-QED system

    Authors: Shi-Yu Liu, Lin-Lin Jiang, Hai Zhu, Jie-Qiao Liao, Jin-Feng Huang

    Abstract: The implementation of quantum routers is an important and desired task in quantum information science, since quantum routers are important components of quantum networks. Here, we propose a scheme for implementing single-photon routers in a waveguide-QED system, which consists of two coupled two-level atoms coupled to two waveguides to form a four-port quantum device. We obtain the exact analytica… ▽ More

    Submitted 14 February, 2025; originally announced February 2025.

    Comments: 11 pages, 7 figures

  37. arXiv:2502.10047  [pdf, other

    cs.DC cs.AI

    Janus: Collaborative Vision Transformer Under Dynamic Network Environment

    Authors: Linyi Jiang, Silvery D. Fu, Yifei Zhu, Bo Li

    Abstract: Vision Transformers (ViTs) have outperformed traditional Convolutional Neural Network architectures and achieved state-of-the-art results in various computer vision tasks. Since ViTs are computationally expensive, the models either have to be pruned to run on resource-limited edge devices only or have to be executed on remote cloud servers after receiving the raw data transmitted over fluctuating… ▽ More

    Submitted 14 February, 2025; originally announced February 2025.

    Comments: Accepted for publication in IEEE INFOCOM 2025

  38. arXiv:2502.09542  [pdf, other

    quant-ph

    Constant-Overhead Fault-Tolerant Bell-Pair Distillation using High-Rate Codes

    Authors: J. Pablo Bonilla Ataides, Hengyun Zhou, Qian Xu, Gefen Baranes, Bikun Li, Mikhail D. Lukin, Liang Jiang

    Abstract: We present a fault-tolerant Bell-pair distillation scheme achieving constant overhead through high-rate quantum low-density parity-check (qLDPC) codes. Our approach maintains a constant distillation rate equal to the code rate - as high as $1/3$ in our implementations - while requiring no additional overhead beyond the physical qubits of the code. Full circuit-level analysis demonstrates fault-tol… ▽ More

    Submitted 13 February, 2025; originally announced February 2025.

    Comments: 6 pages, 4 figures

  39. arXiv:2502.09125  [pdf, other

    cs.CV cs.AI

    Automatic Pruning via Structured Lasso with Class-wise Information

    Authors: Xiang Liu, Mingchen Li, Xia Li, Leigang Qu, Zifan Peng, Yijun Song, Zemin Liu, Linshan Jiang, Jialin Li

    Abstract: Most pruning methods concentrate on unimportant filters of neural networks. However, they face the loss of statistical information due to a lack of consideration for class-wise data. In this paper, from the perspective of leveraging precise class-wise information for model pruning, we utilize structured lasso with guidance from Information Bottleneck theory. Our approach ensures that statistical i… ▽ More

    Submitted 13 February, 2025; originally announced February 2025.

    Comments: 11 pages, 2 figures

  40. arXiv:2502.09104  [pdf, ps, other

    cs.LG cs.AI

    One-shot Federated Learning Methods: A Practical Guide

    Authors: Xiang Liu, Zhenheng Tang, Xia Li, Yijun Song, Sijie Ji, Zemin Liu, Bo Han, Linshan Jiang, Jialin Li

    Abstract: One-shot Federated Learning (OFL) is a distributed machine learning paradigm that constrains client-server communication to a single round, addressing privacy and communication overhead issues associated with multiple rounds of data exchange in traditional Federated Learning (FL). OFL demonstrates the practical potential for integration with future approaches that require collaborative training mo… ▽ More

    Submitted 13 February, 2025; originally announced February 2025.

    Comments: 10 pages, 1 figure

  41. arXiv:2502.08831  [pdf, other

    quant-ph

    Quantum communication over bandwidth-and-time-limited channels

    Authors: Aditya Gandotra, Zhaoyou Wang, Aashish A. Clerk, Liang Jiang

    Abstract: Standard communication systems have transmission spectra that characterize their ability to perform frequency multiplexing over a finite bandwidth. Realistic quantum signals in quantum communication systems like transducers are inherently limited in time due to intrinsic decoherence and finite latency, which hinders the direct implementation of frequency-multiplexed encoding. We investigate quantu… ▽ More

    Submitted 12 February, 2025; originally announced February 2025.

  42. arXiv:2502.07770  [pdf, other

    quant-ph

    Quantum learning advantage on a scalable photonic platform

    Authors: Zheng-Hao Liu, Romain Brunel, Emil E. B. Østergaard, Oscar Cordero, Senrui Chen, Yat Wong, Jens A. H. Nielsen, Axel B. Bregnsbo, Sisi Zhou, Hsin-Yuan Huang, Changhun Oh, Liang Jiang, John Preskill, Jonas S. Neergaard-Nielsen, Ulrik L. Andersen

    Abstract: Recent advancements in quantum technologies have opened new horizons for exploring the physical world in ways once deemed impossible. Central to these breakthroughs is the concept of quantum advantage, where quantum systems outperform their classical counterparts in solving specific tasks. While much attention has been devoted to computational speedups, quantum advantage in learning physical syste… ▽ More

    Submitted 16 February, 2025; v1 submitted 11 February, 2025; originally announced February 2025.

    Comments: 8+23 pages, 3+10 figures

  43. arXiv:2502.05305  [pdf, other

    stat.ML cs.LG math.OC

    Online Covariance Estimation in Nonsmooth Stochastic Approximation

    Authors: Liwei Jiang, Abhishek Roy, Krishna Balasubramanian, Damek Davis, Dmitriy Drusvyatskiy, Sen Na

    Abstract: We consider applying stochastic approximation (SA) methods to solve nonsmooth variational inclusion problems. Existing studies have shown that the averaged iterates of SA methods exhibit asymptotic normality, with an optimal limiting covariance matrix in the local minimax sense of Hájek and Le Cam. However, no methods have been proposed to estimate this covariance matrix in a nonsmooth and potenti… ▽ More

    Submitted 7 February, 2025; originally announced February 2025.

    Comments: 46 pages, 1 figure

  44. arXiv:2502.04790  [pdf, other

    cs.CL cs.AI

    S$^2$-MAD: Breaking the Token Barrier to Enhance Multi-Agent Debate Efficiency

    Authors: Yuting Zeng, Weizhe Huang, Lei Jiang, Tongxuan Liu, Xitai Jin, Chen Tianying Tiana, Jing Li, Xiaohua Xu

    Abstract: Large language models (LLMs) have demonstrated remarkable capabilities across various natural language processing (NLP) scenarios, but they still face challenges when handling complex arithmetic and logical reasoning tasks. While Chain-Of-Thought (CoT) reasoning, self-consistency (SC) and self-correction strategies have attempted to guide models in sequential, multi-step reasoning, Multi-agent Deb… ▽ More

    Submitted 9 April, 2025; v1 submitted 7 February, 2025; originally announced February 2025.

    Comments: Accepted to NAACL 2025 Main

  45. arXiv:2502.03992  [pdf, other

    cs.CL cs.AI

    Ontology-Guided, Hybrid Prompt Learning for Generalization in Knowledge Graph Question Answering

    Authors: Longquan Jiang, Junbo Huang, Cedric Möller, Ricardo Usbeck

    Abstract: Most existing Knowledge Graph Question Answering (KGQA) approaches are designed for a specific KG, such as Wikidata, DBpedia or Freebase. Due to the heterogeneity of the underlying graph schema, topology and assertions, most KGQA systems cannot be transferred to unseen Knowledge Graphs (KGs) without resource-intensive training data. We present OntoSCPrompt, a novel Large Language Model (LLM)-based… ▽ More

    Submitted 6 February, 2025; originally announced February 2025.

    Comments: Accepted By ICSC 2025

  46. arXiv:2502.03684  [pdf, other

    astro-ph.GA

    Iron-corrected Single-epoch Black Hole Masses of DESI Quasars at low redshift

    Authors: Zhiwei Pan, Linhua Jiang, Wei-Jian Guo, Shengxiu Sun, Małgorzata Siudek, Jessica Nicole Aguilar, Steven Ahlen, David Brooks, Todd Claybaugh, Axel de la Macorra, Peter Doel, Enrique Gaztañaga, Satya Gontcho A Gontcho, Stephanie Juneau, Theodore Kisner, Andrew Lambert, Martin Landriau, Laurent Le Guillou, Marc Manera, Paul Martini, Aaron Meisner, Ramon Miquel, John Moustakas, Adam Myers, Claire Poppett , et al. (9 additional authors not shown)

    Abstract: We present a study on the possible overestimation of single-epoch supermassive black hole (SMBH) masses in previous works, based on more than 55,000 type 1 quasars at $0.25 < z < 0.8$ from the Dark Energy Spectroscopic Instrument (DESI). We confirm that iron emission strength serves as a good tracer of the Eddington ratio, and estimate SMBH masses using an iron-corrected $R$-$L$ relation for H$β$,… ▽ More

    Submitted 5 February, 2025; originally announced February 2025.

    Comments: 20pages, 13 figures; resubmitted to ApJ

  47. arXiv:2502.03683  [pdf, other

    astro-ph.GA

    Ruling out AGNs as the dominant source of cosmic reionization with JWST

    Authors: Danyang Jiang, Linhua Jiang, Shengxiu Sun, Weiyang Liu, Shuqi Fu

    Abstract: Cosmic reionization represents the latest phase transition of the intergalactic medium (IGM) in the Universe. It has long been debated whether galaxies or active galactic nuclei (AGNs) are the major source of Lyman continuum (LyC) photons responsible for reionization. Previous observations slightly favored galaxies as the major ionizing source. However, the James Webb Space Telescope (JWST) recent… ▽ More

    Submitted 5 February, 2025; originally announced February 2025.

    Comments: 15 pages, 4 figures, submitted

  48. arXiv:2502.01690  [pdf, other

    cs.CV

    HuViDPO:Enhancing Video Generation through Direct Preference Optimization for Human-Centric Alignment

    Authors: Lifan Jiang, Boxi Wu, Jiahui Zhang, Xiaotong Guan, Shuang Chen

    Abstract: With the rapid development of AIGC technology, significant progress has been made in diffusion model-based technologies for text-to-image (T2I) and text-to-video (T2V). In recent years, a few studies have introduced the strategy of Direct Preference Optimization (DPO) into T2I tasks, significantly enhancing human preferences in generated images. However, existing T2V generation methods lack a well… ▽ More

    Submitted 2 February, 2025; originally announced February 2025.

  49. arXiv:2502.01140  [pdf, other

    math.CA

    Assouad dimension of the Takagi function

    Authors: Lai Jiang

    Abstract: For any integer $b\geq2$ and real series $\{c_n\}$ such that $\sum_{n=0}^\infty|c_n|<\infty$, the generalized Takagi function $f_{{\mathbf c},b}(x)$ is defined by $$ f_{{\mathbf c},b}(x):=\sum_{n=0}^\infty c_nφ(b^n x), \quad x\in [0,1], $$ where $φ(x)=dist(x,\mathbb{Z})$ is the distance from $x$ to the nearest integer. The collection of functions with the form are called the Takagi class. In thi… ▽ More

    Submitted 14 March, 2025; v1 submitted 3 February, 2025; originally announced February 2025.

    MSC Class: 28A80

  50. arXiv:2502.01101  [pdf, other

    cs.CV cs.AI

    VidSketch: Hand-drawn Sketch-Driven Video Generation with Diffusion Control

    Authors: Lifan Jiang, Shuang Chen, Boxi Wu, Xiaotong Guan, Jiahui Zhang

    Abstract: With the advancement of generative artificial intelligence, previous studies have achieved the task of generating aesthetic images from hand-drawn sketches, fulfilling the public's needs for drawing. However, these methods are limited to static images and lack the ability to control video animation generation using hand-drawn sketches. To address this gap, we propose VidSketch, the first method ca… ▽ More

    Submitted 17 February, 2025; v1 submitted 3 February, 2025; originally announced February 2025.

    Comments: 17pages, 15 figures