Skip to main content

Showing 1–7 of 7 results for author: Aoki, Y

Searching in archive eess. Search in all archives.
.
  1. arXiv:2504.18447  [pdf, other

    cs.CV cs.AI eess.IV

    Iterative Event-based Motion Segmentation by Variational Contrast Maximization

    Authors: Ryo Yamaki, Shintaro Shiba, Guillermo Gallego, Yoshimitsu Aoki

    Abstract: Event cameras provide rich signals that are suitable for motion estimation since they respond to changes in the scene. As any visual changes in the scene produce event data, it is paramount to classify the data into different motions (i.e., motion segmentation), which is useful for various tasks such as object detection and visual servoing. We propose an iterative motion segmentation method, by cl… ▽ More

    Submitted 25 April, 2025; originally announced April 2025.

    Comments: 11 pages, 9 figures, 3 tables, CVPR Workshop 2025

  2. arXiv:2504.04029  [pdf, other

    cs.CV cs.AI eess.IV

    Simultaneous Motion And Noise Estimation with Event Cameras

    Authors: Shintaro Shiba, Yoshimitsu Aoki, Guillermo Gallego

    Abstract: Event cameras are emerging vision sensors, whose noise is challenging to characterize. Existing denoising methods for event cameras consider other tasks such as motion estimation separately (i.e., sequentially after denoising). However, motion is an intrinsic part of event data, since scene edges cannot be sensed without motion. This work proposes, to the best of our knowledge, the first method th… ▽ More

    Submitted 4 April, 2025; originally announced April 2025.

    Comments: 13 pages, 13 figures, 6 tables

  3. arXiv:2503.00389  [pdf, other

    cs.CV cs.AI cs.SD eess.AS

    BGM2Pose: Active 3D Human Pose Estimation with Non-Stationary Sounds

    Authors: Yuto Shibata, Yusuke Oumi, Go Irie, Akisato Kimura, Yoshimitsu Aoki, Mariko Isogawa

    Abstract: We propose BGM2Pose, a non-invasive 3D human pose estimation method using arbitrary music (e.g., background music) as active sensing signals. Unlike existing approaches that significantly limit practicality by employing intrusive chirp signals within the audible range, our method utilizes natural music that causes minimal discomfort to humans. Estimating human poses from standard music presents si… ▽ More

    Submitted 1 March, 2025; originally announced March 2025.

  4. arXiv:2410.00511  [pdf, other

    eess.AS cs.AI cs.CV

    Pre-training with Synthetic Patterns for Audio

    Authors: Yuchi Ishikawa, Tatsuya Komatsu, Yoshimitsu Aoki

    Abstract: In this paper, we propose to pre-train audio encoders using synthetic patterns instead of real audio data. Our proposed framework consists of two key elements. The first one is Masked Autoencoder (MAE), a self-supervised learning framework that learns from reconstructing data from randomly masked counterparts. MAEs tend to focus on low-level information such as visual patterns and regularities wit… ▽ More

    Submitted 1 October, 2024; originally announced October 2024.

    Comments: Submitted to ICASSP'25

  5. Event-based Background-Oriented Schlieren

    Authors: Shintaro Shiba, Friedhelm Hamann, Yoshimitsu Aoki, Guillermo Gallego

    Abstract: Schlieren imaging is an optical technique to observe the flow of transparent media, such as air or water, without any particle seeding. However, conventional frame-based techniques require both high spatial and temporal resolution cameras, which impose bright illumination and expensive computation limitations. Event cameras offer potential advantages (high dynamic range, high temporal resolution,… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: Accepted at IEEE T-PAMI

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence, Oct. 2023

  6. arXiv:2212.12218  [pdf, other

    cs.CV cs.RO eess.SP

    Fast Event-based Optical Flow Estimation by Triplet Matching

    Authors: Shintaro Shiba, Yoshimitsu Aoki, Guillermo Gallego

    Abstract: Event cameras are novel bio-inspired sensors that offer advantages over traditional cameras (low latency, high dynamic range, low power, etc.). Optical flow estimation methods that work on packets of events trade off speed for accuracy, while event-by-event (incremental) methods have strong assumptions and have not been tested on common benchmarks that quantify progress in the field. Towards appli… ▽ More

    Submitted 23 December, 2022; originally announced December 2022.

    Comments: 5 pages, 4 figures, 2 tables

    Journal ref: IEEE Signal Processing Letters, Vol. 29, pp. 2712-2716, 2022

  7. arXiv:2212.09880  [pdf, other

    eess.SY

    Experimental Low-speed Positioning System with VecTwin Rudder for Automatic Docking (Berthing)

    Authors: Dimas M. Rachman, Yusuke Aoki, Yoshiki Miyauchi, Naoya Umeda, Atsuo Maki

    Abstract: A VecTwin rudder system comprises twin fishtail rudders with reaction fins to increase its performance. With a constant propeller revolution number, the vessel can execute special low-speed maneuvers like hover, crabbing, reverse, and rotation. Such low-speed maneuvers are termed dynamic positioning (DP), and a DP vessel should be fully/overly actuated with several thrusters. This article introduc… ▽ More

    Submitted 19 December, 2022; originally announced December 2022.