
Showing 1–3 of 3 results for author: Kimura, A

Searching in archive eess.
  1. arXiv:2503.00389  [pdf, other]

    cs.CV cs.AI cs.SD eess.AS

    BGM2Pose: Active 3D Human Pose Estimation with Non-Stationary Sounds

    Authors: Yuto Shibata, Yusuke Oumi, Go Irie, Akisato Kimura, Yoshimitsu Aoki, Mariko Isogawa

    Abstract: We propose BGM2Pose, a non-invasive 3D human pose estimation method using arbitrary music (e.g., background music) as active sensing signals. Unlike existing approaches that significantly limit practicality by employing intrusive chirp signals within the audible range, our method utilizes natural music that causes minimal discomfort to humans. Estimating human poses from standard music presents si…

    Submitted 1 March, 2025; originally announced March 2025.

  2. arXiv:2409.03336  [pdf, other]

    cs.SD cs.CV cs.MM eess.AS

    Estimating Indoor Scene Depth Maps from Ultrasonic Echoes

    Authors: Junpei Honma, Akisato Kimura, Go Irie

    Abstract: Measuring 3D geometric structures of indoor scenes requires dedicated depth sensors, which are not always available. Echo-based depth estimation has recently been studied as a promising alternative solution. All previous studies have assumed the use of echoes in the audible range. However, one major problem is that audible echoes cannot be used in quiet spaces or other situations where producing a…

    Submitted 8 September, 2024; v1 submitted 5 September, 2024; originally announced September 2024.

    Comments: ICIP 2024

  3. arXiv:2207.11964  [pdf, other]

    eess.AS cs.LG cs.MM cs.SD

    ConceptBeam: Concept Driven Target Speech Extraction

    Authors: Yasunori Ohishi, Marc Delcroix, Tsubasa Ochiai, Shoko Araki, Daiki Takeuchi, Daisuke Niizumi, Akisato Kimura, Noboru Harada, Kunio Kashino

    Abstract: We propose a novel framework for target speech extraction based on semantic information, called ConceptBeam. Target speech extraction means extracting the speech of a target speaker in a mixture. Typical approaches have been exploiting properties of audio signals, such as harmonic structure and direction of arrival. In contrast, ConceptBeam tackles the problem with semantic clues. Specifically, we…

    Submitted 25 July, 2022; originally announced July 2022.

    Comments: Accepted to ACM Multimedia 2022