Skip to main content

Showing 1–4 of 4 results for author: Otake, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.20867  [pdf, ps, other

    cs.CV

    Enhancing Ambiguous Dynamic Facial Expression Recognition with Soft Label-based Data Augmentation

    Authors: Ryosuke Kawamura, Hideaki Hayashi, Shunsuke Otake, Noriko Takemura, Hajime Nagahara

    Abstract: Dynamic facial expression recognition (DFER) is a task that estimates emotions from facial expression video sequences. For practical applications, accurately recognizing ambiguous facial expressions -- frequently encountered in in-the-wild data -- is essential. In this study, we propose MIDAS, a data augmentation method designed to enhance DFER performance for ambiguous facial expression data usin… ▽ More

    Submitted 25 June, 2025; originally announced June 2025.

  2. arXiv:2407.21066  [pdf, other

    cs.CL cs.AI cs.LG cs.SD eess.AS

    ELP-Adapters: Parameter Efficient Adapter Tuning for Various Speech Processing Tasks

    Authors: Nakamasa Inoue, Shinta Otake, Takumi Hirose, Masanari Ohi, Rei Kawakami

    Abstract: Self-supervised learning has emerged as a key approach for learning generic representations from speech data. Despite promising results in downstream tasks such as speech recognition, speaker verification, and emotion recognition, a significant number of parameters is required, which makes fine-tuning for each task memory-inefficient. To address this limitation, we introduce ELP-adapter tuning, a… ▽ More

    Submitted 28 July, 2024; originally announced July 2024.

  3. arXiv:2309.10551  [pdf, other

    cs.LG cs.AI cs.CL cs.CR

    A Neighbourhood-Aware Differential Privacy Mechanism for Static Word Embeddings

    Authors: Danushka Bollegala, Shuichi Otake, Tomoya Machide, Ken-ichi Kawarabayashi

    Abstract: We propose a Neighbourhood-Aware Differential Privacy (NADP) mechanism considering the neighbourhood of a word in a pretrained static word embedding space to determine the minimal amount of noise required to guarantee a specified privacy level. We first construct a nearest neighbour graph over the words using their embeddings, and factorise it into a set of connected components (i.e. neighbourhood… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

    Comments: Accepted to IJCNLP-AACL 2023

  4. arXiv:2212.02780  [pdf, ps, other

    cs.MM cs.SD eess.AS

    Parameter Efficient Transfer Learning for Various Speech Processing Tasks

    Authors: Shinta Otake, Rei Kawakami, Nakamasa Inoue

    Abstract: Fine-tuning of self-supervised models is a powerful transfer learning method in a variety of fields, including speech processing, since it can utilize generic feature representations obtained from large amounts of unlabeled data. Fine-tuning, however, requires a new parameter set for each downstream task, which is parameter inefficient. Adapter architecture is proposed to partially solve this issu… ▽ More

    Submitted 6 December, 2022; originally announced December 2022.