Skip to main content

Showing 1–10 of 10 results for author: Mai, W

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.18962  [pdf, ps, other

    cs.HC

    UniMind: Unleashing the Power of LLMs for Unified Multi-Task Brain Decoding

    Authors: Weiheng Lu, Chunfeng Song, Jiamin Wu, Pengyu Zhu, Yuchen Zhou, Weijian Mai, Qihao Zheng, Wanli Ouyang

    Abstract: Decoding human brain activity from electroencephalography (EEG) signals is a central challenge at the intersection of neuroscience and artificial intelligence, enabling diverse applications in mental state assessment, clinical monitoring, and human-machine interaction. Recent efforts have extensively explored EEG-based brain foundation models for generalized brain decoding, employing large-scale t… ▽ More

    Submitted 23 June, 2025; originally announced June 2025.

    Comments: 19pages,4 figures

  2. arXiv:2506.17672  [pdf, ps, other

    cs.LG cs.ET

    Learning Personalized Utility Functions for Drivers in Ride-hailing Systems Using Ensemble Hypernetworks

    Authors: Weiming Mai, Jie Gao, Oded Cats

    Abstract: In ride-hailing systems, drivers decide whether to accept or reject ride requests based on factors such as order characteristics, traffic conditions, and personal preferences. Accurately predicting these decisions is essential for improving the efficiency and reliability of these systems. Traditional models, such as the Random Utility Maximization (RUM) approach, typically predict drivers' decisio… ▽ More

    Submitted 21 June, 2025; originally announced June 2025.

  3. arXiv:2502.05034  [pdf, other

    cs.CV

    MindAligner: Explicit Brain Functional Alignment for Cross-Subject Visual Decoding from Limited fMRI Data

    Authors: Yuqin Dai, Zhouheng Yao, Chunfeng Song, Qihao Zheng, Weijian Mai, Kunyu Peng, Shuai Lu, Wanli Ouyang, Jian Yang, Jiamin Wu

    Abstract: Brain decoding aims to reconstruct visual perception of human subject from fMRI signals, which is crucial for understanding brain's perception mechanisms. Existing methods are confined to the single-subject paradigm due to substantial brain variability, which leads to weak generalization across individuals and incurs high training costs, exacerbated by limited availability of fMRI data. To address… ▽ More

    Submitted 7 February, 2025; originally announced February 2025.

  4. arXiv:2501.12210  [pdf, other

    cs.CR

    You Can't Eat Your Cake and Have It Too: The Performance Degradation of LLMs with Jailbreak Defense

    Authors: Wuyuao Mai, Geng Hong, Pei Chen, Xudong Pan, Baojun Liu, Yuan Zhang, Haixin Duan, Min Yang

    Abstract: With the rise of generative large language models (LLMs) like LLaMA and ChatGPT, these models have significantly transformed daily life and work by providing advanced insights. However, as jailbreak attacks continue to circumvent built-in safety mechanisms, exploiting carefully crafted scenarios or tokens, the safety risks of LLMs have come into focus. While numerous defense strategies--such as pr… ▽ More

    Submitted 21 January, 2025; originally announced January 2025.

  5. arXiv:2411.12248  [pdf, other

    cs.CV

    Neuro-3D: Towards 3D Visual Decoding from EEG Signals

    Authors: Zhanqiang Guo, Jiamin Wu, Yonghao Song, Jiahui Bu, Weijian Mai, Qihao Zheng, Wanli Ouyang, Chunfeng Song

    Abstract: Human's perception of the visual world is shaped by the stereo processing of 3D information. Understanding how the brain perceives and processes 3D visual stimuli in the real world has been a longstanding endeavor in neuroscience. Towards this goal, we introduce a new neuroscience task: decoding 3D visual perception from EEG signals, a neuroimaging technique that enables real-time monitoring of ne… ▽ More

    Submitted 21 November, 2024; v1 submitted 19 November, 2024; originally announced November 2024.

  6. arXiv:2409.07255  [pdf, ps, other

    cs.CV

    EMOdiffhead: Continuously Emotional Control in Talking Head Generation via Diffusion

    Authors: Jian Zhang, Weijian Mai, Zhijun Zhang

    Abstract: The task of audio-driven portrait animation involves generating a talking head video using an identity image and an audio track of speech. While many existing approaches focus on lip synchronization and video quality, few tackle the challenge of generating emotion-driven talking head videos. The ability to control and edit emotions is essential for producing expressive and realistic animations. In… ▽ More

    Submitted 11 September, 2024; originally announced September 2024.

    Comments: 12 pages, 7 figures

  7. arXiv:2401.00430  [pdf, other

    cs.AI

    Brain-Conditional Multimodal Synthesis: A Survey and Taxonomy

    Authors: Weijian Mai, Jian Zhang, Pengfei Fang, Zhijun Zhang

    Abstract: In the era of Artificial Intelligence Generated Content (AIGC), conditional multimodal synthesis technologies (e.g., text-to-image, text-to-video, text-to-audio, etc) are gradually reshaping the natural content in the real world. The key to multimodal synthesis technology is to establish the mapping relationship between different modalities. Brain signals, serving as potential reflections of how t… ▽ More

    Submitted 3 January, 2024; v1 submitted 31 December, 2023; originally announced January 2024.

  8. arXiv:2308.07428  [pdf, other

    cs.CV cs.AI

    UniBrain: Unify Image Reconstruction and Captioning All in One Diffusion Model from Human Brain Activity

    Authors: Weijian Mai, Zhijun Zhang

    Abstract: Image reconstruction and captioning from brain activity evoked by visual stimuli allow researchers to further understand the connection between the human brain and the visual perception system. While deep generative models have recently been employed in this field, reconstructing realistic captions and images with both low-level details and high semantic fidelity is still a challenging problem. In… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

  9. arXiv:2207.14339   

    cs.CE

    Contact tracing Inspired Efficient Computation by Energy Tracing

    Authors: Wending Mai, Ronald P. Jenkins, Yifan Chen, Douglas H. Werner

    Abstract: Inspired by the epidemic contact tracing technique, we propose a method to efficiently solve electromagnetics by tracing the energy distribution. The computational domain is adaptively decomposed, and the available computational resources are focused on those energy-active (infections) and their adjacent (exposed) domains, while avoiding the unnecessary computation of energy-null (unexposed) domai… ▽ More

    Submitted 8 August, 2022; v1 submitted 9 July, 2022; originally announced July 2022.

    Comments: This article has been withdrawn due to an unresolvable internal author dispute

  10. KinD-LCE Curve Estimation And Retinex Fusion On Low-Light Image

    Authors: Xiaochun Lei, Weiliang Mai, Junlin Xie, He Liu, Zetao Jiang, Zhaoting Gong, Chang Lu, Linjun Lu

    Abstract: Low-light images often suffer from noise and color distortion. Object detection, semantic segmentation, instance segmentation, and other tasks are challenging when working with low-light images because of image noise and chromatic aberration. We also found that the conventional Retinex theory loses information in adjusting the image for low-light tasks. In response to the aforementioned problem, t… ▽ More

    Submitted 23 October, 2023; v1 submitted 19 July, 2022; originally announced July 2022.

    Comments: Accepted by Signal, Image and Video Processing