Skip to main content

Showing 201–250 of 857 results for author: Cheng, M

.
  1. arXiv:2402.15751  [pdf, other

    cs.LG cs.AI cs.CL

    Sparse MeZO: Less Parameters for Better Performance in Zeroth-Order LLM Fine-Tuning

    Authors: Yong Liu, Zirui Zhu, Chaoyu Gong, Minhao Cheng, Cho-Jui Hsieh, Yang You

    Abstract: While fine-tuning large language models (LLMs) for specific tasks often yields impressive results, it comes at the cost of memory inefficiency due to back-propagation in gradient-based training. Memory-efficient Zeroth-order (MeZO) optimizers, recently proposed to address this issue, only require forward passes during training, making them more memory-friendly. However, the quality of gradient est… ▽ More

    Submitted 24 February, 2024; originally announced February 2024.

  2. arXiv:2402.15410  [pdf, other

    hep-ex hep-ph nucl-ex

    Detailed Report on the Measurement of the Positive Muon Anomalous Magnetic Moment to 0.20 ppm

    Authors: D. P. Aguillard, T. Albahri, D. Allspach, A. Anisenkov, K. Badgley, S. Baeßler, I. Bailey, L. Bailey, V. A. Baranov, E. Barlas-Yucel, T. Barrett, E. Barzi, F. Bedeschi, M. Berz, M. Bhattacharya, H. P. Binney, P. Bloom, J. Bono, E. Bottalico, T. Bowcock, S. Braun, M. Bressler, G. Cantatore, R. M. Carey, B. C. K. Casey , et al. (168 additional authors not shown)

    Abstract: We present details on a new measurement of the muon magnetic anomaly, $a_μ= (g_μ-2)/2$. The result is based on positive muon data taken at Fermilab's Muon Campus during the 2019 and 2020 accelerator runs. The measurement uses $3.1$ GeV$/c$ polarized muons stored in a $7.1$-m-radius storage ring with a $1.45$ T uniform magnetic field. The value of $ a_μ$ is determined from the measured difference b… ▽ More

    Submitted 22 May, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: 48 pages, 29 figures; 4 pages of Supplement Material; version accepted for publication in Physical Review D

    Report number: FERMILAB-PUB-24-0084-AD-CSAID-PPD

  3. arXiv:2402.12928  [pdf, other

    cs.DL cs.AI cs.CV

    A Literature Review of Literature Reviews in Pattern Analysis and Machine Intelligence

    Authors: Penghai Zhao, Xin Zhang, Jiayue Cao, Ming-Ming Cheng, Jian Yang, Xiang Li

    Abstract: The rapid advancements in Pattern Analysis and Machine Intelligence (PAMI) have led to an overwhelming expansion of scientific knowledge, spawning numerous literature reviews aimed at collecting and synthesizing fragmented information. This paper presents a thorough analysis of these literature reviews within the PAMI field, and tries to address three core research questions: (1) What are the prev… ▽ More

    Submitted 14 December, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: V2, V3, and V4 with incremental quality improvements. V5 introduces major updates, featuring 27 pages, 16 figures, and 12 tables

  4. arXiv:2402.12741  [pdf, other

    cs.CV

    MuLan: Multimodal-LLM Agent for Progressive and Interactive Multi-Object Diffusion

    Authors: Sen Li, Ruochen Wang, Cho-Jui Hsieh, Minhao Cheng, Tianyi Zhou

    Abstract: Existing text-to-image models still struggle to generate images of multiple objects, especially in handling their spatial positions, relative sizes, overlapping, and attribute bindings. To efficiently address these challenges, we develop a training-free Multimodal-LLM agent (MuLan), as a human painter, that can progressively generate multi-object with intricate planning and feedback control. MuLan… ▽ More

    Submitted 24 May, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Comments: Added the application to human-agent interaction; added discussion with concurrent work

  5. arXiv:2402.11241  [pdf, other

    cs.CV cs.AI

    DiffPoint: Single and Multi-view Point Cloud Reconstruction with ViT Based Diffusion Model

    Authors: Yu Feng, Xing Shi, Mengli Cheng, Yun Xiong

    Abstract: As the task of 2D-to-3D reconstruction has gained significant attention in various real-world scenarios, it becomes crucial to be able to generate high-quality point clouds. Despite the recent success of deep learning models in generating point clouds, there are still challenges in producing high-fidelity results due to the disparities between images and point clouds. While vision transformers (Vi… ▽ More

    Submitted 17 February, 2024; originally announced February 2024.

  6. arXiv:2402.11129  [pdf, other

    cs.CL

    BlendFilter: Advancing Retrieval-Augmented Large Language Models via Query Generation Blending and Knowledge Filtering

    Authors: Haoyu Wang, Ruirui Li, Haoming Jiang, Jinjin Tian, Zhengyang Wang, Chen Luo, Xianfeng Tang, Monica Cheng, Tuo Zhao, Jing Gao

    Abstract: Retrieval-augmented Large Language Models (LLMs) offer substantial benefits in enhancing performance across knowledge-intensive scenarios. However, these methods often face challenges with complex inputs and encounter difficulties due to noisy knowledge retrieval, notably hindering model effectiveness. To address this issue, we introduce BlendFilter, a novel approach that elevates retrieval-augmen… ▽ More

    Submitted 15 October, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: EMNLP 2024 main

  7. arXiv:2402.02056  [pdf, other

    cs.CL cs.AI cs.CY

    AnthroScore: A Computational Linguistic Measure of Anthropomorphism

    Authors: Myra Cheng, Kristina Gligoric, Tiziano Piccardi, Dan Jurafsky

    Abstract: Anthropomorphism, or the attribution of human-like characteristics to non-human entities, has shaped conversations about the impacts and possibilities of technology. We present AnthroScore, an automatic metric of implicit anthropomorphism in language. We use a masked language model to quantify how non-human entities are implicitly framed as human by the surrounding context. We show that AnthroScor… ▽ More

    Submitted 3 February, 2024; originally announced February 2024.

    Comments: EACL 2024 Main Conference

  8. arXiv:2401.17357  [pdf, other

    cond-mat.str-el cond-mat.stat-mech hep-th quant-ph

    Mixed-state quantum anomaly and multipartite entanglement

    Authors: Leonardo A. Lessa, Meng Cheng, Chong Wang

    Abstract: Quantum entanglement measures of many-body states have been increasingly useful to characterize phases of matter. Here we explore a surprising connection between mixed state entanglement and 't Hooft anomaly. More specifically, we consider lattice systems in $d$ space dimensions with anomalous symmetry $G$ where the anomaly is characterized by an invariant in the group cohomology… ▽ More

    Submitted 29 November, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

    Comments: 27 pages, 9 figures; New results on strong-weak mixed anomaly and other revisions

    Journal ref: Phys. Rev. X 15, 011069 (2025)

  9. arXiv:2401.17172  [pdf, other

    physics.comp-ph cs.LG math.NA

    Learning Domain-Independent Green's Function For Elliptic Partial Differential Equations

    Authors: Pawan Negi, Maggie Cheng, Mahesh Krishnamurthy, Wenjun Ying, Shuwang Li

    Abstract: Green's function characterizes a partial differential equation (PDE) and maps its solution in the entire domain as integrals. Finding the analytical form of Green's function is a non-trivial exercise, especially for a PDE defined on a complex domain or a PDE with variable coefficients. In this paper, we propose a novel boundary integral network to learn the domain-independent Green's function, ref… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

  10. arXiv:2401.16181  [pdf, other

    cs.IT

    On Decentralized Linearly Separable Computation With the Minimum Computation Cost

    Authors: Haoning Chen, Minquan Cheng, Zhenhao Huang, Youlong Wu

    Abstract: The distributed linearly separable computation problem finds extensive applications across domains such as distributed gradient coding, distributed linear transform, real-time rendering, etc. In this paper, we investigate this problem in a fully decentralized scenario, where $\mathsf{N}$ workers collaboratively perform the computation task without a central master. Each worker aims to compute a li… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  11. arXiv:2401.09548  [pdf, other

    cond-mat.str-el cond-mat.stat-mech hep-th quant-ph

    Universal contributions to charge fluctuations in spin chains at finite temperature

    Authors: Kang-Le Cai, Meng Cheng

    Abstract: At finite temperature, conserved charges undergo thermal fluctuations in a quantum many-body system in the grand canonical ensemble. The full structure of the fluctuations of the total U(1) charge $Q$ can be succinctly captured by the generating function $G(θ)=\left\langle e^{i θQ}\right\rangle$. For a 1D translation-invariant spin chain, in the thermodynamic limit the magnitude $|G(θ)|$ scales wi… ▽ More

    Submitted 30 April, 2025; v1 submitted 17 January, 2024; originally announced January 2024.

    Comments: 21 pages, 5 figures, published version

    Journal ref: Phys. Rev. B 111, 205104 (2025)

  12. arXiv:2401.08052  [pdf, other

    eess.AS

    Multi-Input Multi-Output Target-Speaker Voice Activity Detection For Unified, Flexible, and Robust Audio-Visual Speaker Diarization

    Authors: Ming Cheng, Ming Li

    Abstract: Audio-visual learning has demonstrated promising results in many classical speech tasks (e.g., speech separation, automatic speech recognition, wake-word spotting). We believe that introducing visual modality will also benefit speaker diarization. To date, Target-Speaker Voice Activity Detection (TS-VAD) plays an important role in highly accurate speaker diarization. However, previous TS-VAD model… ▽ More

    Submitted 29 February, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

    Comments: Under review of IEEE/ACM Transactions on Audio, Speech, and Language Processing

  13. arXiv:2401.00330  [pdf, other

    cs.LG cs.AI

    Two-Step Offline Preference-Based Reinforcement Learning with Constrained Actions

    Authors: Yinglun Xu, Tarun Suresh, Rohan Gumaste, David Zhu, Ruirui Li, Zhengyang Wang, Haoming Jiang, Xianfeng Tang, Qingyu Yin, Monica Xiao Cheng, Qi Zeng, Chao Zhang, Gagandeep Singh

    Abstract: Preference-based reinforcement learning (PBRL) in the offline setting has succeeded greatly in industrial applications such as chatbots. A two-step learning framework where one applies a reinforcement learning step after a reward modeling step has been widely adopted for the problem. However, such a method faces challenges from the risk of reward hacking and the complexity of reinforcement learnin… ▽ More

    Submitted 25 October, 2024; v1 submitted 30 December, 2023; originally announced January 2024.

  14. arXiv:2312.15661  [pdf, other

    cs.IR cs.AI

    Unlocking the Potential of Large Language Models for Explainable Recommendations

    Authors: Yucong Luo, Mingyue Cheng, Hao Zhang, Junyu Lu, Qi Liu, Enhong Chen

    Abstract: Generating user-friendly explanations regarding why an item is recommended has become increasingly common, largely due to advances in language generation technology, which can enhance user trust and facilitate more informed decision-making when using online services. However, existing explainable recommendation systems focus on using small-size language models. It remains uncertain what impact rep… ▽ More

    Submitted 3 January, 2024; v1 submitted 25 December, 2023; originally announced December 2023.

  15. arXiv:2312.15190  [pdf, other

    cs.SD cs.AI cs.CR eess.AS

    SAIC: Integration of Speech Anonymization and Identity Classification

    Authors: Ming Cheng, Xingjian Diao, Shitong Cheng, Wenjun Liu

    Abstract: Speech anonymization and de-identification have garnered significant attention recently, especially in the healthcare area including telehealth consultations, patient voiceprint matching, and patient real-time monitoring. Speaker identity classification tasks, which involve recognizing specific speakers from audio to learn identity features, are crucial for de-identification. Since rare studies ha… ▽ More

    Submitted 23 December, 2023; originally announced December 2023.

  16. Extracting subleading corrections in entanglement entropy at quantum phase transitions

    Authors: Menghan Song, Jiarui Zhao, Zi Yang Meng, Cenke Xu, Meng Cheng

    Abstract: We systematically investigate the finite size scaling behavior of the Rényi entanglement entropy (EE) of several representative 2d quantum many-body systems between a subregion and its complement, with smooth boundaries as well as boundaries with corners. In order to reveal the subleading correction, we investigate the quantity ``subtracted EE" $S^s(l) = S(2l) - 2S(l)$ for each model, which is des… ▽ More

    Submitted 16 July, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

    Journal ref: SciPost Phys. 17, 010 (2024)

  17. arXiv:2312.13311  [pdf, other

    cs.LG eess.IV

    Unlocking Deep Learning: A BP-Free Approach for Parallel Block-Wise Training of Neural Networks

    Authors: Anzhe Cheng, Zhenkun Wang, Chenzhong Yin, Mingxi Cheng, Heng Ping, Xiongye Xiao, Shahin Nazarian, Paul Bogdan

    Abstract: Backpropagation (BP) has been a successful optimization technique for deep learning models. However, its limitations, such as backward- and update-locking, and its biological implausibility, hinder the concurrent updating of layers and do not mimic the local learning processes observed in the human brain. To address these issues, recent research has suggested using local error signals to asynchron… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: The paper has been accepted by ICASSP2024

  18. arXiv:2312.12722  [pdf, other

    cs.CV

    Fine-Grained Knowledge Selection and Restoration for Non-Exemplar Class Incremental Learning

    Authors: Jiang-Tian Zhai, Xialei Liu, Lu Yu, Ming-Ming Cheng

    Abstract: Non-exemplar class incremental learning aims to learn both the new and old tasks without accessing any training data from the past. This strict restriction enlarges the difficulty of alleviating catastrophic forgetting since all techniques can only be applied to current task data. Considering this challenge, we propose a novel framework of fine-grained knowledge selection and restoration. The conv… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: to appear at AAAI 2024

  19. arXiv:2312.12667  [pdf, other

    cs.CR cs.AI cs.LG

    Discovering Malicious Signatures in Software from Structural Interactions

    Authors: Chenzhong Yin, Hantang Zhang, Mingxi Cheng, Xiongye Xiao, Xinghe Chen, Xin Ren, Paul Bogdan

    Abstract: Malware represents a significant security concern in today's digital landscape, as it can destroy or disable operating systems, steal sensitive user information, and occupy valuable disk space. However, current malware detection methods, such as static-based and dynamic-based approaches, struggle to identify newly developed (``zero-day") malware and are limited by customized virtual machine (VM) e… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: ICASSP 2024, Accepted

  20. arXiv:2312.09608  [pdf, other

    cs.CV

    Faster Diffusion: Rethinking the Role of the Encoder for Diffusion Model Inference

    Authors: Senmao Li, Taihang Hu, Joost van de Weijer, Fahad Shahbaz Khan, Tao Liu, Linxuan Li, Shiqi Yang, Yaxing Wang, Ming-Ming Cheng, Jian Yang

    Abstract: One of the main drawback of diffusion models is the slow inference time for image generation. Among the most successful approaches to addressing this problem are distillation methods. However, these methods require considerable computational resources. In this paper, we take another approach to diffusion model acceleration. We conduct a comprehensive study of the UNet encoder and empirically analy… ▽ More

    Submitted 15 October, 2024; v1 submitted 15 December, 2023; originally announced December 2023.

    Comments: NeurIPS 2024

  21. arXiv:2312.08912  [pdf, other

    cs.CV

    Dataset Distillation via Adversarial Prediction Matching

    Authors: Mingyang Chen, Bo Huang, Junda Lu, Bing Li, Yi Wang, Minhao Cheng, Wei Wang

    Abstract: Dataset distillation is the technique of synthesizing smaller condensed datasets from large original datasets while retaining necessary information to persist the effect. In this paper, we approach the dataset distillation problem from a novel perspective: we regard minimizing the prediction discrepancy on the real data distribution between models, which are respectively trained on the large origi… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

  22. arXiv:2312.06947  [pdf, other

    cs.CV

    MaTe3D: Mask-guided Text-based 3D-aware Portrait Editing

    Authors: Kangneng Zhou, Daiheng Gao, Xuan Wang, Jie Zhang, Peng Zhang, Xusen Sun, Longhao Zhang, Shiqi Yang, Bang Zhang, Liefeng Bo, Yaxing Wang, Ming-Ming Cheng

    Abstract: 3D-aware portrait editing has a wide range of applications in multiple fields. However, current approaches are limited due that they can only perform mask-guided or text-based editing. Even by fusing the two procedures into a model, the editing quality and stability cannot be ensured. To address this limitation, we propose \textbf{MaTe3D}: mask-guided text-based 3D-aware portrait editing. In this… ▽ More

    Submitted 5 July, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

    Comments: 16 pages, 13 figures

  23. arXiv:2312.05830  [pdf, other

    cs.CV

    A Decoupled Spatio-Temporal Framework for Skeleton-based Action Segmentation

    Authors: Yunheng Li, Zhongyu Li, Shanghua Gao, Qilong Wang, Qibin Hou, Ming-Ming Cheng

    Abstract: Effectively modeling discriminative spatio-temporal information is essential for segmenting activities in long action sequences. However, we observe that existing methods are limited in weak spatio-temporal modeling capability due to two forms of decoupled modeling: (i) cascaded interaction couples spatial and temporal modeling, which over-smooths motion modeling over the long sequence, and (ii) j… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

  24. arXiv:2312.05801  [pdf, other

    cond-mat.mtrl-sci

    Stability and Character of Zero Field Skyrmionic States in Hybrid Magnetic Multilayer Nanodots

    Authors: Alexander Kang-Jun Toh, McCoy W. Lim, T. S. Suraj, Xiaoye Chen, Hang Khume Tan, Royston Lim, Xuan Min Cheng, Nelson Lim, Sherry Yap, Durgesh Kumar, S. N. Piramanayagam, Pin Ho, Anjan Soumyanarayanan

    Abstract: Ambient magnetic skyrmions stabilized in multilayer nanostructures are of immense interest due to their relevance to magnetic tunnel junction (MTJ) devices for memory and unconventional computing applications. However, existing skyrmionic nanostructures built using conventional metallic or oxide multilayer nanodots are unable to concurrently fulfill the requirements of nanoscale skyrmion stability… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

  25. arXiv:2312.05430  [pdf, other

    cs.CV

    FT2TF: First-Person Statement Text-To-Talking Face Generation

    Authors: Xingjian Diao, Ming Cheng, Wayner Barrios, SouYoung Jin

    Abstract: Talking face generation has gained immense popularity in the computer vision community, with various applications including AR, VR, teleconferencing, digital assistants, and avatars. Traditional methods are mainly audio-driven, which have to deal with the inevitable resource-intensive nature of audio storage and processing. To address such a challenge, we propose FT2TF - First-Person Statement Tex… ▽ More

    Submitted 19 November, 2024; v1 submitted 8 December, 2023; originally announced December 2023.

    Comments: Accepted at WACV 2025

  26. arXiv:2312.04461  [pdf, other

    cs.CV cs.AI cs.LG cs.MM

    PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding

    Authors: Zhen Li, Mingdeng Cao, Xintao Wang, Zhongang Qi, Ming-Ming Cheng, Ying Shan

    Abstract: Recent advances in text-to-image generation have made remarkable progress in synthesizing realistic human photos conditioned on given text prompts. However, existing personalized generation methods cannot simultaneously satisfy the requirements of high efficiency, promising identity (ID) fidelity, and flexible text controllability. In this work, we introduce PhotoMaker, an efficient personalized t… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: Tech report; Project page: https://photo-maker.github.io/

  27. arXiv:2312.04248  [pdf, other

    cs.CV

    TeMO: Towards Text-Driven 3D Stylization for Multi-Object Meshes

    Authors: Xuying Zhang, Bo-Wen Yin, Yuming Chen, Zheng Lin, Yunheng Li, Qibin Hou, Ming-Ming Cheng

    Abstract: Recent progress in the text-driven 3D stylization of a single object has been considerably promoted by CLIP-based methods. However, the stylization of multi-object 3D scenes is still impeded in that the image-text pairs used for pre-training CLIP mostly consist of an object. Meanwhile, the local details of multiple objects may be susceptible to omission due to the existing supervision manner prima… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

  28. arXiv:2311.18282  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall

    Pressure-Modulated Structural and Magnetic Phase Transitions in Two-Dimensional FeTe: Tetragonal and Hexagonal Polymorphs

    Authors: Wuxiao Han, Jiajia Feng, Hongliang Dong, Mo Cheng, Liu Yang, Yunfei Yu, Guoshuai Du, Jiayin Li, Yubing Du, Tiansong Zhang, Zhiwei Wang, Bin Chen, Jianping Shi, Yabin Chen

    Abstract: Two-dimensional (2D) Fe-chalcogenides with rich structures, magnetisms and superconductivities are highly desirable to reveal the torturous transition mechanism and explore their potential applications in spintronics and nanoelectronics. Hydrostatic pressure can effectively stimulate novel phase transitions between various ordered states and to plot the seductive phase diagram. Herein, the structu… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

    Comments: 22 Pages, 5 Figures

  29. arXiv:2311.00388  [pdf, other

    cs.IR

    AutoSAM: Towards Automatic Sampling of User Behaviors for Sequential Recommender Systems

    Authors: Hao Zhang, Mingyue Cheng, Qi Liu, Zhiding Liu, Junzhe Jiang, Enhong Chen

    Abstract: Sequential recommender systems (SRS) have gained widespread popularity in recommendation due to their ability to effectively capture dynamic user preferences. One default setting in the current SRS is to uniformly consider each historical behavior as a positive interaction. Actually, this setting has the potential to yield sub-optimal performance, as each item makes a distinct contribution to the… ▽ More

    Submitted 2 January, 2025; v1 submitted 1 November, 2023; originally announced November 2023.

  30. arXiv:2310.20348  [pdf, other

    cs.CV cs.LG

    Class Incremental Learning with Pre-trained Vision-Language Models

    Authors: Xialei Liu, Xusheng Cao, Haori Lu, Jia-wen Xiao, Andrew D. Bagdanov, Ming-Ming Cheng

    Abstract: With the advent of large-scale pre-trained models, interest in adapting and exploiting them for continual learning scenarios has grown. In this paper, we propose an approach to exploiting pre-trained vision-language models (e.g. CLIP) that enables further adaptation instead of only using zero-shot learning of new tasks. We augment a pre-trained CLIP model with additional layers after the Image E… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

  31. arXiv:2310.20239  [pdf, other

    cs.IT

    Coded Caching Schemes for Multiaccess Topologies via Combinatorial Design

    Authors: Minquan Cheng, Kai Wan, Petros Elia, Giuseppe Caire

    Abstract: This paper studies a multiaccess coded caching (MACC) where the connectivity topology between the users and the caches can be described by a class of combinatorial designs. Our model includes as special cases several MACC topologies considered in previous works. The considered MACC network includes a server containing $N$ files, $Γ$ cache nodes and $K$ cacheless users, where each user can access… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

    Comments: 48 pages

  32. arXiv:2310.20167  [pdf

    cond-mat.mtrl-sci cond-mat.mes-hall

    Phase-Modulated Elastic Properties of Two-Dimensional Magnetic FeTe: Hexagonal and Tetragonal Polymorphs

    Authors: Yunfei Yu, Mo Cheng, Zicheng Tao, Wuxiao Han, Guoshuai Du, Yanfeng Guo, Jianping Shi, Yabin Chen

    Abstract: Two-dimensional (2D) layered magnets, such as iron chalcogenides, have emerged these years as a new family of unconventional superconductor and provided the key insights to understand the phonon-electron interaction and pairing mechanism. Their mechanical properties are of strategic importance for the potential applications in spintronics and optoelectronics. However, there is still lack of effici… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

    Comments: 19 pages, 4 figures

  33. arXiv:2310.18439  [pdf, other

    cond-mat.mes-hall cond-mat.mtrl-sci

    Machine learning detecting Majorana Zero Mode from Zero Bias Peak measurements

    Authors: Mouyang Cheng, Ryotaro Okabe, Abhijatmedhi Chotrattanapituk, Mingda Li

    Abstract: Majorana zero modes (MZMs), emerging as exotic quasiparticles that carry non-Abelian statistics, hold great promise for achieving fault-tolerant topological quantum computation. A key signature of the presence of MZMs is the zero-bias peaks (ZBPs) from tunneling differential conductance. However, the identification of MZMs from ZBPs has faced tremendous challenges, due to the presence of topologic… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

  34. arXiv:2310.17931  [pdf, other

    cs.IT

    Coded Caching Scheme for Partially Connected Linear Networks Via Multi-antenna Placement Delivery Array

    Authors: Minquan Cheng, Yun Xie, Zhenhao Huang, Mingming Zhang, Youlong Wu

    Abstract: In this paper, we study the coded caching scheme for the $(K,L,M_{\text{T}},M_{\text{U}},N)$ partially connected linear network, where there are $N$ files each of which has an equal size, $K+L-1$ transmitters and $K$ users; each user and transmitter caches at most $M_{\text{U}}$ and $M_{\text{T}}$ files respectively; each user cyclically communicates with $L$ transmitters. The goal is to design ca… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: 13 pages

  35. arXiv:2310.16878  [pdf, other

    cond-mat.str-el hep-th math-ph

    Topological holography, quantum criticality, and boundary states

    Authors: Sheng-Jie Huang, Meng Cheng

    Abstract: Topological holography is a holographic principle that describes the generalized global symmetry of a local quantum system in terms of a topological order in one higher dimension. This framework separates the topological data from the local dynamics of a theory and provides a unified description of the symmetry and duality in gapped and gapless phases of matter. In this work, we develop the topolo… ▽ More

    Submitted 1 April, 2025; v1 submitted 25 October, 2023; originally announced October 2023.

    Comments: 43 pages, 10 figures, 3 tables. v2: references added. v3: Added a conclusion section and minor revision

  36. arXiv:2310.15371  [pdf, other

    eess.IV cs.AI cs.CV cs.LG physics.med-ph

    Vicinal Feature Statistics Augmentation for Federated 3D Medical Volume Segmentation

    Authors: Yongsong Huang, Wanqing Xie, Mingzhen Li, Mingmei Cheng, Jinzhou Wu, Weixiao Wang, Jane You, Xiaofeng Liu

    Abstract: Federated learning (FL) enables multiple client medical institutes collaboratively train a deep learning (DL) model with privacy protection. However, the performance of FL can be constrained by the limited availability of labeled data in small institutes and the heterogeneous (i.e., non-i.i.d.) data distribution across institutes. Though data augmentation has been a proven technique to boost the g… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: 28th biennial international conference on Information Processing in Medical Imaging (IPMI 2023): Oral Paper

    Journal ref: In: Frangi, A., de Bruijne, M., Wassermann, D., Navab, N. (eds) Information Processing in Medical Imaging. IPMI 2023. Lecture Notes in Computer Science, vol 13939. Springer, Cham

  37. arXiv:2310.13215  [pdf, other

    cs.CV

    Zone Evaluation: Revealing Spatial Bias in Object Detection

    Authors: Zhaohui Zheng, Yuming Chen, Qibin Hou, Xiang Li, Ping Wang, Ming-Ming Cheng

    Abstract: A fundamental limitation of object detectors is that they suffer from "spatial bias", and in particular perform less satisfactorily when detecting objects near image borders. For a long time, there has been a lack of effective ways to measure and identify spatial bias, and little is known about where it comes from and what degree it is. To this end, we present a new zone evaluation protocol, exten… ▽ More

    Submitted 1 June, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: Accepted by IEEE TPAMI

  38. The Braids on your Blanket

    Authors: Michelle Cheng, Robert Laugwitz

    Abstract: In this expositional essay, we introduce some elements of the study of groups by analysing the braid pattern on a knitted blanket. We determine that the blanket features pure braids with a minimal number of crossings. Moreover, we determine polynomial invariants associated to the links obtained by closing the braid patterns of the blanket.

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: Expositional article for a general readership. 32 pages, several figures

    MSC Class: 00A66 (Primary) 00-01; 20F36; 57K10 (Secondary)

    Journal ref: Journal of Humanistic Mathematics, Volume 14 Issue 2 (July 2024), pages 286-337. Available at: https://scholarship.claremont.edu/jhm/vol14/iss2/10

  39. arXiv:2310.11762  [pdf, other

    cs.LG

    A Quasi-Wasserstein Loss for Learning Graph Neural Networks

    Authors: Minjie Cheng, Hongteng Xu

    Abstract: When learning graph neural networks (GNNs) in node-level prediction tasks, most existing loss functions are applied for each node independently, even if node embeddings and their labels are non-i.i.d. because of their graph structures. To eliminate such inconsistency, in this study we propose a novel Quasi-Wasserstein (QW) loss with the help of the optimal transport defined on graphs, leading to n… ▽ More

    Submitted 13 March, 2024; v1 submitted 18 October, 2023; originally announced October 2023.

  40. arXiv:2310.11501  [pdf, other

    cs.CL cs.AI cs.CY

    CoMPosT: Characterizing and Evaluating Caricature in LLM Simulations

    Authors: Myra Cheng, Tiziano Piccardi, Diyi Yang

    Abstract: Recent work has aimed to capture nuances of human behavior by using LLMs to simulate responses from particular demographics in settings like social science experiments and public opinion surveys. However, there are currently no established ways to discuss or evaluate the quality of such LLM simulations. Moreover, there is growing concern that these LLM simulations are flattened caricatures of the… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: To appear at EMNLP 2023 (Main)

  41. arXiv:2310.08210  [pdf, other

    eess.SY

    CLExtract: Recovering Highly Corrupted DVB/GSE Satellite Stream with Contrastive Learning

    Authors: Minghao Lin, Minghao Cheng, Dongsheng Luo, Yueqi Chen

    Abstract: Since satellite systems are playing an increasingly important role in our civilization, their security and privacy weaknesses are more and more concerned. For example, prior work demonstrates that the communication channel between maritime VSAT and ground segment can be eavesdropped on using consumer-grade equipment. The stream decoder GSExtract developed in this prior work performs well for most… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: SpaceSec'23, 11 pages, 14 figures

  42. arXiv:2310.07885  [pdf, other

    cs.LG cs.AI

    Leader-Follower Neural Networks with Local Error Signals Inspired by Complex Collectives

    Authors: Chenzhong Yin, Mingxi Cheng, Xiongye Xiao, Xinghe Chen, Shahin Nazarian, Andrei Irimia, Paul Bogdan

    Abstract: The collective behavior of a network with heterogeneous, resource-limited information processing units (e.g., group of fish, flock of birds, or network of neurons) demonstrates high self-organization and complexity. These emergent properties arise from simple interaction rules where certain individuals can exhibit leadership-like behavior and influence the collective activity of the group. Motivat… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

  43. Ethics of Artificial Intelligence and Robotics in the Architecture, Engineering, and Construction Industry

    Authors: Ci-Jyun Liang, Thai-Hoa Le, Youngjib Ham, Bharadwaj R. K. Mantha, Marvin H. Cheng, Jacob J. Lin

    Abstract: Artificial intelligence (AI) and robotics research and implementation emerged in the architecture, engineering, and construction (AEC) industry to positively impact project efficiency and effectiveness concerns such as safety, productivity, and quality. This shift, however, warrants the need for ethical considerations of AI and robotics adoption due to its potential negative impacts on aspects suc… ▽ More

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: 109 pages, 5 figures, submitted to Automation in Construction

  44. arXiv:2310.05108  [pdf, other

    cs.CV

    Enhancing Representations through Heterogeneous Self-Supervised Learning

    Authors: Zhong-Yu Li, Bo-Wen Yin, Yongxiang Liu, Li Liu, Ming-Ming Cheng

    Abstract: Incorporating heterogeneous representations from different architectures has facilitated various vision tasks, e.g., some hybrid networks combine transformers and convolutions. However, complementarity between such heterogeneous architectures has not been well exploited in self-supervised learning. Thus, we propose Heterogeneous Self-Supervised Learning (HSSL), which enforces a base model to learn… ▽ More

    Submitted 23 April, 2024; v1 submitted 8 October, 2023; originally announced October 2023.

  45. arXiv:2310.05026  [pdf, other

    cs.CV

    Low-Resolution Self-Attention for Semantic Segmentation

    Authors: Yu-Huan Wu, Shi-Chen Zhang, Yun Liu, Le Zhang, Xin Zhan, Daquan Zhou, Jiashi Feng, Ming-Ming Cheng, Liangli Zhen

    Abstract: Semantic segmentation tasks naturally require high-resolution information for pixel-wise segmentation and global context information for class prediction. While existing vision transformers demonstrate promising performance, they often utilize high-resolution context modeling, resulting in a computational bottleneck. In this work, we challenge conventional wisdom and introduce the Low-Resolution S… ▽ More

    Submitted 22 January, 2025; v1 submitted 8 October, 2023; originally announced October 2023.

    Comments: added many experiments. 13 pages, 12 tables, 6 figures

  46. arXiv:2310.01875  [pdf, other

    cs.LG cs.AI cs.CR

    Towards Stable Backdoor Purification through Feature Shift Tuning

    Authors: Rui Min, Zeyu Qin, Li Shen, Minhao Cheng

    Abstract: It has been widely observed that deep neural networks (DNN) are vulnerable to backdoor attacks where attackers could manipulate the model behavior maliciously by tampering with a small set of training samples. Although a line of defense methods is proposed to mitigate this threat, they either require complicated modifications to the training process or heavily rely on the specific model architectu… ▽ More

    Submitted 21 October, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023 paper. The first two authors contributed equally

  47. arXiv:2310.00854  [pdf, other

    eess.SY

    Regulating CPU Temperature With Thermal-Aware Scheduling Using a Reduced Order Learning Thermal Model

    Authors: Anthony Dowling, Lin Jiang, Ming-Cheng Cheng, Yu Liu

    Abstract: Modern real-time systems utilize considerable amounts of power while executing computation-intensive tasks. The execution of these tasks leads to significant power dissipation and heating of the device. It therefore results in severe thermal issues like temperature escalation, high thermal gradients, and excessive hot spot formation, which may result in degrading chip performance, accelerating dev… ▽ More

    Submitted 6 February, 2024; v1 submitted 1 October, 2023; originally announced October 2023.

    Comments: This version includes revisions to the previous version to improve the clarity and presentation of the work

  48. arXiv:2309.15877   

    cs.LG cs.AI

    Neuro-Inspired Hierarchical Multimodal Learning

    Authors: Xiongye Xiao, Gengshuo Liu, Gaurav Gupta, Defu Cao, Shixuan Li, Yaxing Li, Tianqing Fang, Mingxi Cheng, Paul Bogdan

    Abstract: Integrating and processing information from various sources or modalities are critical for obtaining a comprehensive and accurate perception of the real world. Drawing inspiration from neuroscience, we develop the Information-Theoretic Hierarchical Perception (ITHP) model, which utilizes the concept of information bottleneck. Distinct from most traditional fusion models that aim to incorporate all… ▽ More

    Submitted 23 April, 2024; v1 submitted 27 September, 2023; originally announced September 2023.

    Comments: I am requesting the withdrawal of this submission due to an inadvertent duplication. The paper was submitted twice under different IDs, which was not intentional. The other submission (arXiv:2404.09403) contains the most updated and comprehensive version of the paper, and I would like to retain that as the sole version on the platform

  49. arXiv:2309.15084  [pdf, other

    cs.CV cs.CY

    The Surveillance AI Pipeline

    Authors: Pratyusha Ria Kalluri, William Agnew, Myra Cheng, Kentrell Owens, Luca Soldaini, Abeba Birhane

    Abstract: A rapidly growing number of voices argue that AI research, and computer vision in particular, is powering mass surveillance. Yet the direct path from computer vision research to surveillance has remained obscured and difficult to assess. Here, we reveal the Surveillance AI pipeline by analyzing three decades of computer vision research papers and downstream patents, more than 40,000 documents. We… ▽ More

    Submitted 17 October, 2023; v1 submitted 26 September, 2023; originally announced September 2023.

  50. arXiv:2309.14415  [pdf, other

    astro-ph.GA

    Initial mass function variability from the integrated light of diverse stellar systems

    Authors: Chloe M. Cheng, Alexa Villaume, Michael L. Balogh, Jean P. Brodie, Ignacio Martín-Navarro, Aaron J. Romanowsky, Pieter G. van Dokkum

    Abstract: We present a uniform analysis of the stellar initial mass function (IMF) from integrated light spectroscopy of 15 compact stellar systems (11 globular clusters in M31 and 4 ultra compact dwarfs in the Virgo cluster, UCDs) and two brightest Coma cluster galaxies (BCGs), covering a wide range of metallicities ($-$1.7 $<$ [Fe/H] $<$ 0.01) and velocity dispersions (7.4 km~s$^{-1}$ $< σ<$ 275 km~s… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

    Comments: Accepted for publication in MNRAS

    Report number: MN-23-2545-MJ