Skip to main content

Showing 1–9 of 9 results for author: Piao, Z

.
  1. arXiv:2505.21928  [pdf

    eess.IV cs.AI cs.CV cs.LG

    Subspecialty-Specific Foundation Model for Intelligent Gastrointestinal Pathology

    Authors: Lianghui Zhu, Xitong Ling, Minxi Ouyang, Xiaoping Liu, Tian Guan, Mingxi Fu, Zhiqiang Cheng, Fanglei Fu, Maomao Zeng, Liming Liu, Song Duan, Qiang Huang, Ying Xiao, Jianming Li, Shanming Lu, Zhenghua Piao, Mingxi Zhu, Yibo Jin, Shan Xu, Qiming He, Yizhi Wang, Junru Cheng, Xuanyu Wang, Luxi Xie, Houqiang Li , et al. (2 additional authors not shown)

    Abstract: Gastrointestinal (GI) diseases represent a clinically significant burden, necessitating precise diagnostic approaches to optimize patient outcomes. Conventional histopathological diagnosis suffers from limited reproducibility and diagnostic variability. To overcome these limitations, we develop Digepath, a specialized foundation model for GI pathology. Our framework introduces a dual-phase iterati… ▽ More

    Submitted 6 June, 2025; v1 submitted 27 May, 2025; originally announced May 2025.

  2. arXiv:2503.04164  [pdf, other

    q-fin.CP

    CoFinDiff: Controllable Financial Diffusion Model for Time Series Generation

    Authors: Yuki Tanaka, Ryuji Hashimoto, Takehiro Takayanagi, Zhe Piao, Yuri Murayama, Kiyoshi Izumi

    Abstract: The generation of synthetic financial data is a critical technology in the financial domain, addressing challenges posed by limited data availability. Traditionally, statistical models have been employed to generate synthetic data. However, these models fail to capture the stylized facts commonly observed in financial data, limiting their practical applicability. Recently, machine learning models… ▽ More

    Submitted 6 March, 2025; originally announced March 2025.

    Comments: 11 pages, 3 figures

  3. arXiv:2405.10502  [pdf, other

    cs.HC cs.SD eess.AS

    Enhancing DMI Interactions by Integrating Haptic Feedback for Intricate Vibrato Technique

    Authors: Ziyue Piao, Christian Frisson, Bavo Van Kerrebroeck, Marcelo M. Wanderley

    Abstract: This paper investigates the integration of force feedback in Digital Musical Instruments (DMI), specifically evaluating the reproduction of intricate vibrato techniques using haptic feedback controllers. We introduce our system for vibrato modulation using force feedback, composed of Bend-aid (a web-based sequencer platform using pre-designed haptic feedback models) and TorqueTuner (an open-source… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  4. arXiv:2312.13603  [pdf, other

    eess.AS cs.SD

    Style Modeling for Multi-Speaker Articulation-to-Speech

    Authors: Miseul Kim, Zhenyu Piao, Jihyun Lee, Hong-Goo Kang

    Abstract: In this paper, we propose a neural articulation-to-speech (ATS) framework that synthesizes high-quality speech from articulatory signal in a multi-speaker situation. Most conventional ATS approaches only focus on modeling contextual information of speech from a single speaker's articulatory features. To explicitly represent each speaker's speaking style as well as the contextual information, our p… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: 5 pages, Accepted to ICASSP 2023

  5. arXiv:2312.13600  [pdf, other

    eess.AS cs.SD

    BrainTalker: Low-Resource Brain-to-Speech Synthesis with Transfer Learning using Wav2Vec 2.0

    Authors: Miseul Kim, Zhenyu Piao, Jihyun Lee, Hong-Goo Kang

    Abstract: Decoding spoken speech from neural activity in the brain is a fast-emerging research topic, as it could enable communication for people who have difficulties with producing audible speech. For this task, electrocorticography (ECoG) is a common method for recording brain activity with high temporal resolution and high spatial precision. However, due to the risky surgical procedure required for obta… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: 5 pages. Accepted to BHI 2023

  6. arXiv:2305.06806  [pdf, other

    cs.SD eess.AS

    HappyQuokka System for ICASSP 2023 Auditory EEG Challenge

    Authors: Zhenyu Piao, Miseul Kim, Hyungchan Yoon, Hong-Goo Kang

    Abstract: This report describes our submission to Task 2 of the Auditory EEG Decoding Challenge at ICASSP 2023 Signal Processing Grand Challenge (SPGC). Task 2 is a regression problem that focuses on reconstructing a speech envelope from an EEG signal. For the task, we propose a pre-layer normalized feed-forward transformer (FFT) architecture. For within-subjects generation, we additionally utilize an auxil… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

    Comments: First Place in Task 2 of Auditory EEG decoding Challenge, which is part of ICASSP Signal Processing Grand Challenge (SPGC) 2023

  7. Sensing the Breath: A Multimodal Singing Tutoring Interface with Breath Guidance

    Authors: Ziyue Piao, Gus Xia

    Abstract: Breath is a significant component in singing performance, which is still underresearched in most singing-related music interfaces. In this paper, we present a multimodal system that detects the learner's singing pitch and breathing states and provides real-time visual tutoring feedback. Specifically, the breath detector is a wearable belt with pressure sensors and flexible fabric. It monitors real… ▽ More

    Submitted 5 July, 2022; v1 submitted 3 February, 2022; originally announced February 2022.

    Comments: New Interfaces for Musical Expression 2022 Demo Paper

  8. arXiv:2011.09055  [pdf, other

    cs.CV

    Liquid Warping GAN with Attention: A Unified Framework for Human Image Synthesis

    Authors: Wen Liu, Zhixin Piao, Zhi Tu, Wenhan Luo, Lin Ma, Shenghua Gao

    Abstract: We tackle human image synthesis, including human motion imitation, appearance transfer, and novel view synthesis, within a unified framework. It means that the model, once being trained, can be used to handle all these tasks. The existing task-specific methods mainly use 2D keypoints to estimate the human body structure. However, they only express the position information with no abilities to char… ▽ More

    Submitted 22 November, 2020; v1 submitted 17 November, 2020; originally announced November 2020.

    Comments: Under review of IEEE Transactions on Pattern Analysis and Machine Intelligence. arXiv admin note: text overlap with arXiv:1909.12224

  9. arXiv:1909.12224  [pdf, other

    cs.CV cs.LG eess.IV

    Liquid Warping GAN: A Unified Framework for Human Motion Imitation, Appearance Transfer and Novel View Synthesis

    Authors: Wen Liu, Zhixin Piao, Jie Min, Wenhan Luo, Lin Ma, Shenghua Gao

    Abstract: We tackle the human motion imitation, appearance transfer, and novel view synthesis within a unified framework, which means that the model once being trained can be used to handle all these tasks. The existing task-specific methods mainly use 2D keypoints (pose) to estimate the human body structure. However, they only expresses the position information with no abilities to characterize the persona… ▽ More

    Submitted 1 October, 2019; v1 submitted 26 September, 2019; originally announced September 2019.

    Comments: accepted by ICCV2019