Showing 1–3 of 3 results for author: Tsutsui, S

Searching in archive eess.
  1. arXiv:2504.09899

    cs.CV eess.IV

    Digital Staining with Knowledge Distillation: A Unified Framework for Unpaired and Paired-But-Misaligned Data

    Authors: Ziwang Xu, Lanqing Guo, Satoshi Tsutsui, Shuyan Zhang, Alex C. Kot, Bihan Wen

    Abstract: Staining is essential in cell imaging and medical diagnostics but poses significant challenges, including high cost, time consumption, labor intensity, and irreversible tissue alterations. Recent advances in deep learning have enabled digital staining through supervised model training. However, collecting large-scale, perfectly aligned pairs of stained and unstained images remains difficult. In th…

    Submitted 14 April, 2025; originally announced April 2025.

    Comments: Accepted to IEEE Transactions on Medical Imaging

  2. arXiv:2303.01777

    eess.IV cs.CV

    Benchmarking White Blood Cell Classification Under Domain Shift

    Authors: Satoshi Tsutsui, Zhengyang Su, Bihan Wen

    Abstract: Recognizing the types of white blood cells (WBCs) in microscopic images of human blood smears is a fundamental task in the fields of pathology and hematology. Although previous studies have made significant contributions to the development of methods and datasets, few papers have investigated benchmarks or baselines that others can easily refer to. For instance, we observed notable variations in t…

    Submitted 19 May, 2023; v1 submitted 3 March, 2023; originally announced March 2023.

    Comments: Accepted to the International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2023. More datasets are cited

  3. arXiv:2111.14448

    cs.CV cs.MM eess.AS

    AVA-AVD: Audio-Visual Speaker Diarization in the Wild

    Authors: Eric Zhongcong Xu, Zeyang Song, Satoshi Tsutsui, Chao Feng, Mang Ye, Mike Zheng Shou

    Abstract: Audio-visual speaker diarization aims at detecting "who spoke when" using both auditory and visual signals. Existing audio-visual diarization datasets are mainly focused on indoor environments like meeting rooms or news studios, which are quite different from in-the-wild videos in many scenarios such as movies, documentaries, and audience sitcoms. To develop diarization methods for these challengi…

    Submitted 16 July, 2022; v1 submitted 29 November, 2021; originally announced November 2021.

    Comments: ACMMM 2022