Skip to main content

Showing 1–3 of 3 results for author: Tsouvalas, V

Searching in archive eess. Search in all archives.
.
  1. arXiv:2305.03058  [pdf, other

    eess.AS cs.LG cs.SD

    Plug-and-Play Multilingual Few-shot Spoken Words Recognition

    Authors: Aaqib Saeed, Vasileios Tsouvalas

    Abstract: As technology advances and digital devices become prevalent, seamless human-machine communication is increasingly gaining significance. The growing adoption of mobile, wearable, and other Internet of Things (IoT) devices has changed how we interact with these smart devices, making accurate spoken words recognition a crucial component for effective interaction. However, building robust spoken words… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

    Comments: Code: https://github.com/FewshotML/plix

  2. arXiv:2202.02611  [pdf, other

    cs.LG cs.AI cs.SD eess.AS

    Privacy-preserving Speech Emotion Recognition through Semi-Supervised Federated Learning

    Authors: Vasileios Tsouvalas, Tanir Ozcelebi, Nirvana Meratnia

    Abstract: Speech Emotion Recognition (SER) refers to the recognition of human emotions from natural speech. If done accurately, it can offer a number of benefits in building human-centered context-aware intelligent systems. Existing SER approaches are largely centralized, without considering users' privacy. Federated Learning (FL) is a distributed machine learning paradigm dealing with decentralization of p… ▽ More

    Submitted 5 February, 2022; originally announced February 2022.

    Comments: arXiv admin note: text overlap with arXiv:2107.06877

  3. arXiv:2107.06877  [pdf, other

    cs.LG cs.DC cs.SD eess.AS

    Federated Self-Training for Semi-Supervised Audio Recognition

    Authors: Vasileios Tsouvalas, Aaqib Saeed, Tanir Ozcelebi

    Abstract: Federated Learning is a distributed machine learning paradigm dealing with decentralized and personal datasets. Since data reside on devices like smartphones and virtual assistants, labeling is entrusted to the clients, or labels are extracted in an automated way. Specifically, in the case of audio data, acquiring semantic annotations can be prohibitively expensive and time-consuming. As a result,… ▽ More

    Submitted 25 February, 2022; v1 submitted 14 July, 2021; originally announced July 2021.