Skip to main content

Showing 1–50 of 90 results for author: Gong, W

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.16735  [pdf, other

    cs.CV eess.IV

    3DeepRep: 3D Deep Low-rank Tensor Representation for Hyperspectral Image Inpainting

    Authors: Yunshan Li, Wenwu Gong, Qianqian Wang, Chao Wang, Lili Yang

    Abstract: Recent approaches based on transform-based tensor nuclear norm (TNN) have demonstrated notable effectiveness in hyperspectral image (HSI) inpainting by leveraging low-rank structures in latent representations. Recent developments incorporate deep transforms to improve low-rank tensor representation; however, existing approaches typically restrict the transform to the spectral mode, neglecting low-… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

  2. arXiv:2506.10011  [pdf, other

    cs.MM cs.AI cs.CV eess.SP

    WDMIR: Wavelet-Driven Multimodal Intent Recognition

    Authors: Weiyin Gong, Kai Zhang, Yanghai Zhang, Qi Liu, Xinjie Sun, Junyu Lu, Linbo Zhu

    Abstract: Multimodal intent recognition (MIR) seeks to accurately interpret user intentions by integrating verbal and non-verbal information across video, audio and text modalities. While existing approaches prioritize text analysis, they often overlook the rich semantic content embedded in non-verbal cues. This paper presents a novel Wavelet-Driven Multimodal Intent Recognition(WDMIR) framework that enhanc… ▽ More

    Submitted 26 May, 2025; originally announced June 2025.

    Comments: Accepted at IJCAI 2025, 9pages, 6figures

  3. arXiv:2505.22743  [pdf, ps, other

    quant-ph cs.CC cs.DS cs.LG

    Information-Computation Gaps in Quantum Learning via Low-Degree Likelihood

    Authors: Sitan Chen, Weiyuan Gong, Jonas Haferkamp, Yihui Quek

    Abstract: In a variety of physically relevant settings for learning from quantum data, designing protocols that can computationally efficiently extract information remains largely an art, and there are important cases where we believe this to be impossible, that is, where there is an information-computation gap. While there is a large array of tools in the classical literature for giving evidence for averag… ▽ More

    Submitted 17 June, 2025; v1 submitted 28 May, 2025; originally announced May 2025.

    Comments: 88 pages, 2 figures

  4. arXiv:2504.00432  [pdf, other

    cs.CV

    DecoFuse: Decomposing and Fusing the "What", "Where", and "How" for Brain-Inspired fMRI-to-Video Decoding

    Authors: Chong Li, Jingyang Huo, Weikang Gong, Yanwei Fu, Xiangyang Xue, Jianfeng Feng

    Abstract: Decoding visual experiences from brain activity is a significant challenge. Existing fMRI-to-video methods often focus on semantic content while overlooking spatial and motion information. However, these aspects are all essential and are processed through distinct pathways in the brain. Motivated by this, we propose DecoFuse, a novel brain-inspired framework for decoding videos from fMRI signals.… ▽ More

    Submitted 1 April, 2025; originally announced April 2025.

  5. arXiv:2503.23016  [pdf, other

    cs.LG cs.AI

    Towards Understanding the Optimization Mechanisms in Deep Learning

    Authors: Binchuan Qi, Wei Gong, Li Li

    Abstract: In this paper, we adopt a probability distribution estimation perspective to explore the optimization mechanisms of supervised classification using deep neural networks. We demonstrate that, when employing the Fenchel-Young loss, despite the non-convex nature of the fitting error with respect to the model's parameters, global optimal solutions can be approximated by simultaneously minimizing both… ▽ More

    Submitted 29 March, 2025; originally announced March 2025.

  6. arXiv:2503.22224  [pdf, ps, other

    cs.NE

    Composite Indicator-Guided Infilling Sampling for Expensive Multi-Objective Optimization

    Authors: Huixiang Zhen, Xiaotong Li, Wenyin Gong, Xiangyun Hu

    Abstract: In expensive multi-objective optimization, where the evaluation budget is strictly limited, selecting promising candidate solutions for expensive fitness evaluations is critical for accelerating convergence and improving algorithmic performance. However, designing an optimization strategy that effectively balances convergence, diversity, and distribution remains a challenge. To tackle this issue,… ▽ More

    Submitted 12 June, 2025; v1 submitted 28 March, 2025; originally announced March 2025.

  7. arXiv:2503.20827  [pdf, other

    cs.CV

    Multimodal Image Matching based on Frequency-domain Information of Local Energy Response

    Authors: Meng Yang, Jun Chen, Wenping Gong, Longsheng Wei, Xin Tian

    Abstract: Complicated nonlinear intensity differences, nonlinear local geometric distortions, noises and rotation transformation are main challenges in multimodal image matching. In order to solve these problems, we propose a method based on Frequency-domain Information of Local Energy Response called FILER. The core of FILER is the local energy response model based on frequency-domain information, which ca… ▽ More

    Submitted 25 March, 2025; originally announced March 2025.

    Comments: 34 pages, 11 figures

    MSC Class: 68U10 (Primary); 68T45; 68Wxx (Secondary) ACM Class: I.4.7; I.5.1

  8. arXiv:2503.08006  [pdf, other

    cs.LG cs.AI

    Injecting Imbalance Sensitivity for Multi-Task Learning

    Authors: Zhipeng Zhou, Liu Liu, Peilin Zhao, Wei Gong

    Abstract: Multi-task learning (MTL) has emerged as a promising approach for deploying deep learning models in real-life applications. Recent studies have proposed optimization-based learning paradigms to establish task-shared representations in MTL. However, our paper empirically argues that these studies, specifically gradient-based ones, primarily emphasize the conflict issue while neglecting the potentia… ▽ More

    Submitted 10 March, 2025; originally announced March 2025.

    Comments: 9 pages, 6 figures, 4 tables

  9. arXiv:2502.11900  [pdf, ps, other

    quant-ph cs.IT cs.LG

    Ansatz-free Hamiltonian learning with Heisenberg-limited scaling

    Authors: Hong-Ye Hu, Muzhou Ma, Weiyuan Gong, Qi Ye, Yu Tong, Steven T. Flammia, Susanne F. Yelin

    Abstract: Learning the unknown interactions that govern a quantum system is crucial for quantum information processing, device benchmarking, and quantum sensing. The problem, known as Hamiltonian learning, is well understood under the assumption that interactions are local, but this assumption may not hold for arbitrary Hamiltonians. Previous methods all require high-order inverse polynomial dependency with… ▽ More

    Submitted 30 June, 2025; v1 submitted 17 February, 2025; originally announced February 2025.

    Comments: Updated version with expanded explanations, added pseudocode, and new numerical demonstrations. 10 pages, 4 figures. HYH and MM contributed equally

  10. UniGO: A Unified Graph Neural Network for Modeling Opinion Dynamics on Graphs

    Authors: Hao Li, Hao Jiang, Yuke Zheng, Hao Sun, Wenying Gong

    Abstract: Polarization and fragmentation in social media amplify user biases, making it increasingly important to understand the evolution of opinions. Opinion dynamics provide interpretability for studying opinion evolution, yet incorporating these insights into predictive models remains challenging. This challenge arises due to the inherent complexity of the diversity of opinion fusion rules and the diffi… ▽ More

    Submitted 17 February, 2025; originally announced February 2025.

    Comments: WWW2025

  11. arXiv:2502.09657  [pdf

    cs.CV

    Integrating Spatiotemporal Vision Transformer into Digital Twins for High-Resolution Heat Stress Forecasting in Campus Environments

    Authors: Wenjing Gong, Xinyue Ye, Keshu Wu, Suphanut Jamonnak, Wenyu Zhang, Yifan Yang, Xiao Huang

    Abstract: Extreme heat events exacerbated by climate change pose significant challenges to urban resilience and planning. This study introduces a climate-responsive digital twin framework integrating the Spatiotemporal Vision Transformer (ST-ViT) model to enhance heat stress forecasting and decision-making. Using a Texas campus as a testbed, we synthesized high-resolution physical model simulations with spa… ▽ More

    Submitted 12 February, 2025; originally announced February 2025.

  12. arXiv:2502.07752  [pdf, other

    cs.LG cs.AI stat.ML

    Towards Efficient Optimizer Design for LLM via Structured Fisher Approximation with a Low-Rank Extension

    Authors: Wenbo Gong, Meyer Scetbon, Chao Ma, Edward Meeds

    Abstract: Designing efficient optimizers for large language models (LLMs) with low-memory requirements and fast convergence is an important and challenging problem. This paper makes a step towards the systematic design of such optimizers through the lens of structured Fisher information matrix (FIM) approximation. We show that many state-of-the-art efficient optimizers can be viewed as solutions to FIM appr… ▽ More

    Submitted 20 February, 2025; v1 submitted 11 February, 2025; originally announced February 2025.

  13. arXiv:2502.06742  [pdf, other

    cs.LG cs.AI

    Gradient Multi-Normalization for Stateless and Scalable LLM Training

    Authors: Meyer Scetbon, Chao Ma, Wenbo Gong, Edward Meeds

    Abstract: Training large language models (LLMs) typically relies on adaptive optimizers like Adam (Kingma & Ba, 2015) which store additional state information to accelerate convergence but incur significant memory overhead. Recent efforts, such as SWAN (Ma et al., 2024) address this by eliminating the need for optimizer states while achieving performance comparable to Adam via a multi-step preprocessing pro… ▽ More

    Submitted 10 February, 2025; originally announced February 2025.

  14. arXiv:2412.19022  [pdf, ps, other

    quant-ph cs.IT cs.LG

    Adaptivity can help exponentially for shadow tomography

    Authors: Sitan Chen, Weiyuan Gong, Zhihan Zhang

    Abstract: In recent years there has been significant interest in understanding the statistical complexity of learning from quantum data under the constraint that one can only make unentangled measurements. While a key challenge in establishing tight lower bounds in this setting is to deal with the fact that the measurements can be chosen in an adaptive fashion, a recurring theme has been that adaptivity off… ▽ More

    Submitted 25 December, 2024; originally announced December 2024.

    Comments: 6 pages

  15. arXiv:2412.13148  [pdf, other

    cs.LG cs.AI

    SWAN: SGD with Normalization and Whitening Enables Stateless LLM Training

    Authors: Chao Ma, Wenbo Gong, Meyer Scetbon, Edward Meeds

    Abstract: Adaptive optimizers such as Adam (Kingma & Ba, 2015) have been central to the success of large language models. However, they often require to maintain optimizer states throughout training, which can result in memory requirements several times greater than the model footprint. This overhead imposes constraints on scalability and computational efficiency. Stochastic Gradient Descent (SGD), in contr… ▽ More

    Submitted 21 February, 2025; v1 submitted 17 December, 2024; originally announced December 2024.

    Comments: In v2 we have revised the related work, added more comprehensive citations, and clarified our key contributions

  16. arXiv:2412.13028  [pdf, other

    q-bio.NC cs.CE

    Identification of Epileptic Spasms (ESES) Phases Using EEG Signals: A Vision Transformer Approach

    Authors: Wei Gong, Yaru Li

    Abstract: This work introduces a new approach to the Epileptic Spasms (ESES) detection based on the EEG signals using Vision Transformers (ViT). Classic ESES detection approaches have usually been performed with manual processing or conventional algorithms, suffering from poor sample sizes, single-channel-based analyses, and low generalization abilities. In contrast, the proposed ViT model overcomes these l… ▽ More

    Submitted 17 December, 2024; originally announced December 2024.

  17. arXiv:2412.04488  [pdf, other

    cs.CY cs.AI

    Optimizing Student Ability Assessment: A Hierarchy Constraint-Aware Cognitive Diagnosis Framework for Educational Contexts

    Authors: Xinjie Sun, Qi Liu, Kai Zhang, Shuanghong Shen, Fei Wang, Yan Zhuang, Zheng Zhang, Weiyin Gong, Shijin Wang, Lina Yang, Xingying Huo

    Abstract: Cognitive diagnosis (CD) aims to reveal students' proficiency in specific knowledge concepts. With the increasing adoption of intelligent education applications, accurately assessing students' knowledge mastery has become an urgent challenge. Although existing cognitive diagnosis frameworks enhance diagnostic accuracy by analyzing students' explicit response records, they primarily focus on indivi… ▽ More

    Submitted 21 November, 2024; originally announced December 2024.

    Comments: Cognitive Diagnosis

  18. arXiv:2411.18174  [pdf, other

    cs.RO

    ORB-SLAM3AB: Augmenting ORB-SLAM3 to Counteract Bumps with Optical Flow Inter-frame Matching

    Authors: Yangrui Dong, Weisheng Gong, Qingyong Li, Kaijie Su, Chen He, Z. Jane Wang

    Abstract: This paper proposes an enhancement to the ORB-SLAM3 algorithm, tailored for applications on rugged road surfaces. Our improved algorithm adeptly combines feature point matching with optical flow methods, capitalizing on the high robustness of optical flow in complex terrains and the high precision of feature points on smooth surfaces. By refining the inter-frame matching logic of ORB-SLAM3, we hav… ▽ More

    Submitted 27 November, 2024; originally announced November 2024.

  19. arXiv:2411.16483  [pdf, other

    cond-mat.mtrl-sci cs.LG

    Graph Transformer Networks for Accurate Band Structure Prediction: An End-to-End Approach

    Authors: Weiyi Gong, Tao Sun, Hexin Bai, Jeng-Yuan Tsai, Haibin Ling, Qimin Yan

    Abstract: Predicting electronic band structures from crystal structures is crucial for understanding structure-property correlations in materials science. First-principles approaches are accurate but computationally intensive. Recent years, machine learning (ML) has been extensively applied to this field, while existing ML models predominantly focus on band gap predictions or indirect band structure estimat… ▽ More

    Submitted 25 November, 2024; originally announced November 2024.

    Comments: 8 pages, 3 figures

  20. arXiv:2411.06098  [pdf, other

    cs.CV cs.AI

    An Architectural Approach to Enhance Deep Long-Tailed Learning

    Authors: Yuhan Pan, Yanan Sun, Wei Gong

    Abstract: Deep long-tailed recognition has been widely studied to address the issue of imbalanced data distributions in real-world scenarios. However, there has been insufficient focus on the design of neural architectures, despite empirical evidence suggesting that architecture can significantly impact performance. In this paper, we attempt to mitigate long-tailed issues through architectural improvements.… ▽ More

    Submitted 2 December, 2024; v1 submitted 9 November, 2024; originally announced November 2024.

  21. arXiv:2410.12712  [pdf, other

    quant-ph cs.DS cs.IT cs.LG

    On the sample complexity of purity and inner product estimation

    Authors: Weiyuan Gong, Jonas Haferkamp, Qi Ye, Zhihan Zhang

    Abstract: We study the sample complexity of the prototypical tasks quantum purity estimation and quantum inner product estimation. In purity estimation, we are to estimate $tr(ρ^2)$ of an unknown quantum state $ρ$ to additive error $ε$. Meanwhile, for quantum inner product estimation, Alice and Bob are to estimate $tr(ρσ)$ to additive error $ε$ given copies of unknown quantum state $ρ$ and $σ$ using classic… ▽ More

    Submitted 16 October, 2024; originally announced October 2024.

    Comments: 33 pages, 1 figure

  22. arXiv:2410.07525  [pdf, other

    cs.LG cs.AI

    Offline Inverse Constrained Reinforcement Learning for Safe-Critical Decision Making in Healthcare

    Authors: Nan Fang, Guiliang Liu, Wei Gong

    Abstract: Reinforcement Learning (RL) applied in healthcare can lead to unsafe medical decisions and treatment, such as excessive dosages or abrupt changes, often due to agents overlooking common-sense constraints. Consequently, Constrained Reinforcement Learning (CRL) is a natural choice for safe decisions. However, specifying the exact cost function is inherently difficult in healthcare. Recent Inverse Co… ▽ More

    Submitted 14 October, 2024; v1 submitted 9 October, 2024; originally announced October 2024.

  23. arXiv:2410.05807  [pdf, other

    cs.LG cs.DS math.OC

    Extended convexity and smoothness and their applications in deep learning

    Authors: Binchuan Qi, Wei Gong, Li Li

    Abstract: Classical assumptions like strong convexity and Lipschitz smoothness often fail to capture the nature of deep learning optimization problems, which are typically non-convex and non-smooth, making traditional analyses less applicable. This study aims to elucidate the mechanisms of non-convex optimization in deep learning by extending the conventional notions of strong convexity and Lipschitz smooth… ▽ More

    Submitted 30 April, 2025; v1 submitted 8 October, 2024; originally announced October 2024.

  24. arXiv:2409.18524  [pdf

    cs.NE eess.SY

    Adaptive Knowledge-based Multi-Objective Evolutionary Algorithm for Hybrid Flow Shop Scheduling Problems with Multiple Parallel Batch Processing Stages

    Authors: Feige Liu, Xin Li, Chao Lu, Wenying Gong

    Abstract: Parallel batch processing machines have extensive applications in the semiconductor manufacturing process. However, the problem models in previous studies regard parallel batch processing as a fixed processing stage in the machining process. This study generalizes the problem model, in which users can arbitrarily set certain stages as parallel batch processing stages according to their needs. A Hy… ▽ More

    Submitted 27 September, 2024; originally announced September 2024.

    Comments: 12 pages

  25. ESP-PCT: Enhanced VR Semantic Performance through Efficient Compression of Temporal and Spatial Redundancies in Point Cloud Transformers

    Authors: Luoyu Mei, Shuai Wang, Yun Cheng, Ruofeng Liu, Zhimeng Yin, Wenchao Jiang, Shuai Wang, Wei Gong

    Abstract: Semantic recognition is pivotal in virtual reality (VR) applications, enabling immersive and interactive experiences. A promising approach is utilizing millimeter-wave (mmWave) signals to generate point clouds. However, the high computational and memory demands of current mmWave point cloud models hinder their efficiency and reliability. To address this limitation, our paper introduces ESP-PCT, a… ▽ More

    Submitted 2 September, 2024; originally announced September 2024.

    Journal ref: Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence, IJCAI 2024

  26. arXiv:2408.16280  [pdf, other

    cs.NI eess.SP

    Double-decker: Productive Backscatter Communication Using a Single Commodity Receiver

    Authors: Qiwei Wang, Wei Gong

    Abstract: Backscatter communication has attracted significant attention for Internet-of-Things applications due to its ultra-low-power consumption. The state-of-the-art backscatter systems no longer require dedicated carrier generators and leverage ambient signals as carriers. However, there is an emerging challenge: most prior systems need dual receivers to capture the original and backscattered signals at… ▽ More

    Submitted 29 August, 2024; originally announced August 2024.

  27. arXiv:2408.06967  [pdf, ps, other

    quant-ph cs.CC cs.DS cs.LG

    Stabilizer bootstrapping: A recipe for efficient agnostic tomography and magic estimation

    Authors: Sitan Chen, Weiyuan Gong, Qi Ye, Zhihan Zhang

    Abstract: We study the task of agnostic tomography: given copies of an unknown $n$-qubit state $ρ$ which has fidelity $τ$ with some state in a given class $C$, find a state which has fidelity $\ge τ- ε$ with $ρ$. We give a new framework, stabilizer bootstrapping, for designing computationally efficient protocols for this task, and use this to get new agnostic tomography protocols for the following classes:… ▽ More

    Submitted 4 December, 2024; v1 submitted 13 August, 2024; originally announced August 2024.

    Comments: 68 pages

  28. arXiv:2407.17533  [pdf, other

    cs.LG cs.AI cs.DC

    SFPrompt: Communication-Efficient Split Federated Fine-Tuning for Large Pre-Trained Models over Resource-Limited Devices

    Authors: Linxiao Cao, Yifei Zhu, Wei Gong

    Abstract: Large pre-trained models have exhibited remarkable achievements across various domains. The substantial training costs associated with these models have led to wide studies of fine-tuning for effectively harnessing their capabilities in solving downstream tasks. Yet, conventional fine-tuning approaches become infeasible when the model lacks access to downstream data due to privacy concerns. Naivel… ▽ More

    Submitted 24 July, 2024; originally announced July 2024.

  29. arXiv:2407.04697  [pdf, other

    cs.CV cs.MM

    VCoME: Verbal Video Composition with Multimodal Editing Effects

    Authors: Weibo Gong, Xiaojie Jin, Xin Li, Dongliang He, Xinglong Wu

    Abstract: Verbal videos, featuring voice-overs or text overlays, provide valuable content but present significant challenges in composition, especially when incorporating editing effects to enhance clarity and visual appeal. In this paper, we introduce the novel task of verbal video composition with editing effects. This task aims to generate coherent and visually appealing verbal videos by integrating mult… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  30. arXiv:2406.05666  [pdf, other

    cs.LG cs.IR stat.ML

    Probability Distribution Learning and Its Application in Deep Learning

    Authors: Binchuan Qi, Wei Gong, Li Li

    Abstract: This paper aims to elucidate the theoretical mechanisms underlying deep learning from a probability distribution estimation perspective, with Fenchel-Young Loss serving as the loss function. In our approach, the learning error , which measures the discrepancy between the model's predicted distribution and the posterior expectation of the true unknown distribution given sampling, is formulated as t… ▽ More

    Submitted 23 April, 2025; v1 submitted 9 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: text overlap with arXiv:2105.04026 by other authors. arXiv admin note: text overlap with arXiv:2105.04026 by other authors

  31. arXiv:2406.00734  [pdf, other

    cs.LG

    GLADformer: A Mixed Perspective for Graph-level Anomaly Detection

    Authors: Fan Xu, Nan Wang, Hao Wu, Xuezhi Wen, Dalin Zhang, Siyang Lu, Binyong Li, Wei Gong, Hai Wan, Xibin Zhao

    Abstract: Graph-Level Anomaly Detection (GLAD) aims to distinguish anomalous graphs within a graph dataset. However, current methods are constrained by their receptive fields, struggling to learn global features within the graphs. Moreover, most contemporary methods are based on spatial domain and lack exploration of spectral characteristics. In this paper, we propose a multi-perspective hybrid graph-level… ▽ More

    Submitted 3 July, 2024; v1 submitted 2 June, 2024; originally announced June 2024.

  32. arXiv:2405.00770  [pdf, other

    quant-ph cs.CC cs.LG

    Quantum-Classical Separations in Shallow-Circuit-Based Learning with and without Noises

    Authors: Zhihan Zhang, Weiyuan Gong, Weikang Li, Dong-Ling Deng

    Abstract: We study quantum-classical separations between classical and quantum supervised learning models based on constant depth (i.e., shallow) circuits, in scenarios with and without noises. We construct a classification problem defined by a noiseless shallow quantum circuit and rigorously prove that any classical neural network with bounded connectivity requires logarithmic depth to output correctly wit… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 14 pages, 3 figures

  33. arXiv:2404.19105  [pdf, other

    quant-ph cs.IT

    Optimal tradeoffs for estimating Pauli observables

    Authors: Sitan Chen, Weiyuan Gong, Qi Ye

    Abstract: We revisit the problem of Pauli shadow tomography: given copies of an unknown $n$-qubit quantum state $ρ$, estimate $\text{tr}(Pρ)$ for some set of Pauli operators $P$ to within additive error $ε$. This has been a popular testbed for exploring the advantage of protocols with quantum memory over those without: with enough memory to measure two copies at a time, one can use Bell sampling to estimate… ▽ More

    Submitted 14 November, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

    Comments: 59 pages, 1 figure

  34. arXiv:2404.12529  [pdf, other

    cs.NI cs.HC

    A Survey of Bluetooth Indoor Localization

    Authors: Taolei Shi, Wei Gong

    Abstract: Nowadays, indoor localization has received extensive research interest due to more and more applications' needs for location information to provide a more precise and effective service [1], [2]. There are various wireless techniques and mechanisms that have been proposed; some of them have been studied in depth and come into use, such as Wi-Fi, RFID, and sensor networks. In comparison, the develop… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 8 pages, 2 figures

  35. arXiv:2404.09622  [pdf, other

    cs.RO cs.AI

    DIDLM: A SLAM Dataset for Difficult Scenarios Featuring Infrared, Depth Cameras, LIDAR, 4D Radar, and Others under Adverse Weather, Low Light Conditions, and Rough Roads

    Authors: Weisheng Gong, Kaijie Su, Qingyong Li, Chen He, Tong Wu, Z. Jane Wang

    Abstract: Adverse weather conditions, low-light environments, and bumpy road surfaces pose significant challenges to SLAM in robotic navigation and autonomous driving. Existing datasets in this field predominantly rely on single sensors or combinations of LiDAR, cameras, and IMUs. However, 4D millimeter-wave radar demonstrates robustness in adverse weather, infrared cameras excel in capturing details under… ▽ More

    Submitted 14 January, 2025; v1 submitted 15 April, 2024; originally announced April 2024.

  36. arXiv:2403.13869  [pdf, other

    cs.LG cs.AI

    Accurately Predicting Probabilities of Safety-Critical Rare Events for Intelligent Systems

    Authors: Ruoxuan Bai, Jingxuan Yang, Weiduo Gong, Yi Zhang, Qiujing Lu, Shuo Feng

    Abstract: Intelligent systems are increasingly integral to our daily lives, yet rare safety-critical events present significant latent threats to their practical deployment. Addressing this challenge hinges on accurately predicting the probability of safety-critical events occurring within a given time step from the current state, a metric we define as 'criticality'. The complexity of predicting criticality… ▽ More

    Submitted 5 April, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  37. arXiv:2403.01736  [pdf, other

    cs.CV

    Lightweight Object Detection: A Study Based on YOLOv7 Integrated with ShuffleNetv2 and Vision Transformer

    Authors: Wenkai Gong

    Abstract: As mobile computing technology rapidly evolves, deploying efficient object detection algorithms on mobile devices emerges as a pivotal research area in computer vision. This study zeroes in on optimizing the YOLOv7 algorithm to boost its operational efficiency and speed on mobile platforms while ensuring high accuracy. Leveraging a synergy of advanced techniques such as Group Convolution, ShuffleN… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  38. arXiv:2402.12381  [pdf, other

    cs.AI cs.NE

    Constrained Multi-objective Optimization with Deep Reinforcement Learning Assisted Operator Selection

    Authors: Fei Ming, Wenyin Gong, Ling Wang, Yaochu Jin

    Abstract: Solving constrained multi-objective optimization problems with evolutionary algorithms has attracted considerable attention. Various constrained multi-objective optimization evolutionary algorithms (CMOEAs) have been developed with the use of different algorithmic strategies, evolutionary operators, and constraint-handling techniques. The performance of CMOEAs may be heavily dependent on the opera… ▽ More

    Submitted 15 January, 2024; originally announced February 2024.

  39. arXiv:2402.06665  [pdf, other

    cs.AI cs.CL cs.LG cs.RO

    The Essential Role of Causality in Foundation World Models for Embodied AI

    Authors: Tarun Gupta, Wenbo Gong, Chao Ma, Nick Pawlowski, Agrin Hilmkil, Meyer Scetbon, Marc Rigter, Ade Famoti, Ashley Juan Llorens, Jianfeng Gao, Stefan Bauer, Danica Kragic, Bernhard Schölkopf, Cheng Zhang

    Abstract: Recent advances in foundation models, especially in large multi-modal models and conversational agents, have ignited interest in the potential of generally capable embodied agents. Such agents will require the ability to perform new tasks in many different real-world environments. However, current foundation models fail to accurately model physical interactions and are therefore insufficient for E… ▽ More

    Submitted 29 April, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  40. arXiv:2402.00763  [pdf, other

    cs.CV cs.GR

    360-GS: Layout-guided Panoramic Gaussian Splatting For Indoor Roaming

    Authors: Jiayang Bai, Letian Huang, Jie Guo, Wen Gong, Yuanqi Li, Yanwen Guo

    Abstract: 3D Gaussian Splatting (3D-GS) has recently attracted great attention with real-time and photo-realistic renderings. This technique typically takes perspective images as input and optimizes a set of 3D elliptical Gaussians by splatting them onto the image planes, resulting in 2D Gaussians. However, applying 3D-GS to panoramic inputs presents challenges in effectively modeling the projection onto th… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: 11 pages, 10 figures

  41. RecDCL: Dual Contrastive Learning for Recommendation

    Authors: Dan Zhang, Yangliao Geng, Wenwen Gong, Zhongang Qi, Zhiyu Chen, Xing Tang, Ying Shan, Yuxiao Dong, Jie Tang

    Abstract: Self-supervised learning (SSL) has recently achieved great success in mining the user-item interactions for collaborative filtering. As a major paradigm, contrastive learning (CL) based SSL helps address data sparsity in Web platforms by contrasting the embeddings between raw and augmented data. However, existing CL-based methods mostly focus on contrasting in a batch-wise way, failing to exploit… ▽ More

    Submitted 18 February, 2024; v1 submitted 28 January, 2024; originally announced January 2024.

    Comments: Accepted to WWW 2024

    Journal ref: Proceedings of TheWebConf 2024 (WWW '24), May 13--17, 2024, Singapore

  42. Exploring consumers response to text-based chatbots in e-commerce: The moderating role of task complexity and chatbot disclosure

    Authors: Xusen Cheng, Ying Bao, Alex Zarifis, Wankun Gong, Jian Mou

    Abstract: Artificial intelligence based chatbots have brought unprecedented business potential. This study aims to explore consumers trust and response to a text-based chatbot in ecommerce, involving the moderating effects of task complexity and chatbot identity disclosure. A survey method with 299 useable responses was conducted in this research. This study adopted the ordinary least squares regression to… ▽ More

    Submitted 20 January, 2024; originally announced January 2024.

    Comments: Internet Research (2021)

  43. Complexity of Digital Quantum Simulation in the Low-Energy Subspace: Applications and a Lower Bound

    Authors: Weiyuan Gong, Shuo Zhou, Tongyang Li

    Abstract: Digital quantum simulation has broad applications in approximating unitary evolution of Hamiltonians. In practice, many simulation tasks for quantum systems focus on quantum states in the low-energy subspace instead of the entire Hilbert space. In this paper, we systematically investigate the complexity of digital quantum simulation based on product formulas in the low-energy subspace. We show tha… ▽ More

    Submitted 11 July, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: 34 pages, 4 figures, github repo: https://github.com/Qubit-Fernand/Digital-Quantum-Simulation

    Journal ref: Quantum 8, 1409 (2024)

  44. arXiv:2312.08134  [pdf, other

    cs.NE

    MToP: A MATLAB Optimization Platform for Evolutionary Multitasking

    Authors: Yanchi Li, Wenyin Gong, Fei Ming, Tingyu Zhang, Shuijia Li, Qiong Gu

    Abstract: Evolutionary multitasking (EMT) has emerged as a popular topic of evolutionary computation over the past decade. It aims to concurrently address multiple optimization tasks within limited computing resources, leveraging inter-task knowledge transfer techniques. Despite the abundance of multitask evolutionary algorithms (MTEAs) proposed for multitask optimization (MTO), there remains a comprehensiv… ▽ More

    Submitted 23 February, 2025; v1 submitted 13 December, 2023; originally announced December 2023.

  45. Bridge the Present and Future: A Cross-Layer Matching Game in Dynamic Cloud-Aided Mobile Edge Networks

    Authors: Houyi Qi, Minghui Liwang, Xianbin Wang, Li Li, Wei Gong, Jian Jin, Zhenzhen Jiao

    Abstract: Cloud-aided mobile edge networks (CAMENs) allow edge servers (ESs) to purchase resources from remote cloud servers (CSs), while overcoming resource shortage when handling computation-intensive tasks of mobile users (MUs). Conventional trading mechanisms (e.g., onsite trading) confront many challenges, including decision-making overhead (e.g., latency) and potential trading failures. This paper inv… ▽ More

    Submitted 8 June, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

    Journal ref: IEEE Transactions on Mobile Computing,2024

  46. arXiv:2311.03309  [pdf, other

    cs.LG cs.AI stat.ML

    Neural Structure Learning with Stochastic Differential Equations

    Authors: Benjie Wang, Joel Jennings, Wenbo Gong

    Abstract: Discovering the underlying relationships among variables from temporal observations has been a longstanding challenge in numerous scientific disciplines, including biology, finance, and climate science. The dynamics of such systems are often best described using continuous-time stochastic processes. Unfortunately, most existing structure learning approaches assume that the underlying process evolv… ▽ More

    Submitted 5 May, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

    Comments: ICLR 2024

  47. arXiv:2309.14326  [pdf, other

    quant-ph cs.CC cs.IT cs.LG math.ST

    Efficient Pauli channel estimation with logarithmic quantum memory

    Authors: Sitan Chen, Weiyuan Gong

    Abstract: Here we revisit one of the prototypical tasks for characterizing the structure of noise in quantum devices: estimating every eigenvalue of an $n$-qubit Pauli noise channel to error $ε$. Prior work [14] proved no-go theorems for this task in the practical regime where one has a limited amount of quantum memory, e.g. any protocol with $\le 0.99n$ ancilla qubits of quantum memory must make exponentia… ▽ More

    Submitted 24 May, 2025; v1 submitted 25 September, 2023; originally announced September 2023.

    Comments: 57 pages, 1 figure

  48. arXiv:2308.00531  [pdf, ps, other

    cs.NI

    Adaptive Bitrate Video Semantic Communication over Wireless Networks

    Authors: Wentao Gong, Haonan Tong, Sihua Wang, Zhaohui Yang, Xinxin He, Changchuan Yin

    Abstract: This paper investigates the adaptive bitrate (ABR) video semantic communication over wireless networks. In the considered model, video sensing devices must transmit video semantic information to an edge server, to facilitate ubiquitous video sensing services such as road environment monitoring at the edge server in autonomous driving scenario. However, due to the varying wireless network condition… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

  49. arXiv:2307.13917  [pdf, other

    cs.LG stat.ME

    BayesDAG: Gradient-Based Posterior Inference for Causal Discovery

    Authors: Yashas Annadani, Nick Pawlowski, Joel Jennings, Stefan Bauer, Cheng Zhang, Wenbo Gong

    Abstract: Bayesian causal discovery aims to infer the posterior distribution over causal models from observed data, quantifying epistemic uncertainty and benefiting downstream tasks. However, computational challenges arise due to joint inference over combinatorial space of Directed Acyclic Graphs (DAGs) and nonlinear functions. Despite recent progress towards efficient posterior inference over DAGs, existin… ▽ More

    Submitted 8 December, 2023; v1 submitted 25 July, 2023; originally announced July 2023.

    Comments: NeurIPS 2023

  50. arXiv:2307.13028  [pdf, other

    quant-ph cs.DS

    Improved Digital Quantum Simulation by Non-Unitary Channels

    Authors: W. Gong, Yaroslav Kharkov, Minh C. Tran, Przemyslaw Bienias, Alexey V. Gorshkov

    Abstract: Simulating quantum systems is one of the most promising avenues to harness the computational power of quantum computers. However, hardware errors in noisy near-term devices remain a major obstacle for applications. Ideas based on the randomization of Suzuki-Trotter product formulas have been shown to be a powerful approach to reducing the errors of quantum simulation and lowering the gate count. I… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: 24 pages, 9 figures