Showing 1–6 of 6 results for author: Piao, Z

Searching in archive eess.

  1. arXiv:2505.21928

    eess.IV cs.AI cs.CV cs.LG

    Subspecialty-Specific Foundation Model for Intelligent Gastrointestinal Pathology

    Authors: Lianghui Zhu, Xitong Ling, Minxi Ouyang, Xiaoping Liu, Tian Guan, Mingxi Fu, Zhiqiang Cheng, Fanglei Fu, Maomao Zeng, Liming Liu, Song Duan, Qiang Huang, Ying Xiao, Jianming Li, Shanming Lu, Zhenghua Piao, Mingxi Zhu, Yibo Jin, Shan Xu, Qiming He, Yizhi Wang, Junru Cheng, Xuanyu Wang, Luxi Xie, Houqiang Li, et al. (2 additional authors not shown)

    Abstract: Gastrointestinal (GI) diseases represent a clinically significant burden, necessitating precise diagnostic approaches to optimize patient outcomes. Conventional histopathological diagnosis suffers from limited reproducibility and diagnostic variability. To overcome these limitations, we develop Digepath, a specialized foundation model for GI pathology. Our framework introduces a dual-phase iterati…

    Submitted 6 June, 2025; v1 submitted 27 May, 2025; originally announced May 2025.

  2. arXiv:2405.10502

    cs.HC cs.SD eess.AS

    Enhancing DMI Interactions by Integrating Haptic Feedback for Intricate Vibrato Technique

    Authors: Ziyue Piao, Christian Frisson, Bavo Van Kerrebroeck, Marcelo M. Wanderley

    Abstract: This paper investigates the integration of force feedback in Digital Musical Instruments (DMI), specifically evaluating the reproduction of intricate vibrato techniques using haptic feedback controllers. We introduce our system for vibrato modulation using force feedback, composed of Bend-aid (a web-based sequencer platform using pre-designed haptic feedback models) and TorqueTuner (an open-source…

    Submitted 16 May, 2024; originally announced May 2024.

  3. arXiv:2312.13603

    eess.AS cs.SD

    Style Modeling for Multi-Speaker Articulation-to-Speech

    Authors: Miseul Kim, Zhenyu Piao, Jihyun Lee, Hong-Goo Kang

    Abstract: In this paper, we propose a neural articulation-to-speech (ATS) framework that synthesizes high-quality speech from articulatory signals in a multi-speaker setting. Most conventional ATS approaches focus only on modeling contextual information of speech from a single speaker's articulatory features. To explicitly represent each speaker's speaking style as well as the contextual information, our p…

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: 5 pages, Accepted to ICASSP 2023

  4. arXiv:2312.13600

    eess.AS cs.SD

    BrainTalker: Low-Resource Brain-to-Speech Synthesis with Transfer Learning using Wav2Vec 2.0

    Authors: Miseul Kim, Zhenyu Piao, Jihyun Lee, Hong-Goo Kang

    Abstract: Decoding spoken speech from neural activity in the brain is a fast-emerging research topic, as it could enable communication for people who have difficulties with producing audible speech. For this task, electrocorticography (ECoG) is a common method for recording brain activity with high temporal resolution and high spatial precision. However, due to the risky surgical procedure required for obta…

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: 5 pages. Accepted to BHI 2023

  5. arXiv:2305.06806

    cs.SD eess.AS

    HappyQuokka System for ICASSP 2023 Auditory EEG Challenge

    Authors: Zhenyu Piao, Miseul Kim, Hyungchan Yoon, Hong-Goo Kang

    Abstract: This report describes our submission to Task 2 of the Auditory EEG Decoding Challenge at ICASSP 2023 Signal Processing Grand Challenge (SPGC). Task 2 is a regression problem that focuses on reconstructing a speech envelope from an EEG signal. For the task, we propose a pre-layer normalized feed-forward transformer (FFT) architecture. For within-subjects generation, we additionally utilize an auxil…

    Submitted 3 May, 2023; originally announced May 2023.

    Comments: First place in Task 2 of the Auditory EEG Decoding Challenge, which is part of the ICASSP Signal Processing Grand Challenge (SPGC) 2023

  6. arXiv:1909.12224

    cs.CV cs.LG eess.IV

    Liquid Warping GAN: A Unified Framework for Human Motion Imitation, Appearance Transfer and Novel View Synthesis

    Authors: Wen Liu, Zhixin Piao, Jie Min, Wenhan Luo, Lin Ma, Shenghua Gao

    Abstract: We tackle human motion imitation, appearance transfer, and novel view synthesis within a unified framework, which means that the model, once trained, can be used to handle all these tasks. The existing task-specific methods mainly use 2D keypoints (pose) to estimate the human body structure. However, they only express the position information with no abilities to characterize the persona…

    Submitted 1 October, 2019; v1 submitted 26 September, 2019; originally announced September 2019.

    Comments: Accepted to ICCV 2019