Showing 1–1 of 1 results for author: Polle, R

Search v0.5.6 released 2020-02-24

arXiv:2505.23378 [pdf, ps, other]

cs.LG

Meta-Learning Approaches for Speaker-Dependent Voice Fatigue Models

Authors: Roseline Polle, Agnes Norbury, Alexandra Livia Georgescu, Nicholas Cummins, Stefano Goria

Abstract: Speaker-dependent modelling can substantially improve performance in speech-based health monitoring applications. While mixed-effect models are commonly used for such speaker adaptation, they require computationally expensive retraining for each new observation, making them impractical in a production environment. We reformulate this task as a meta-learning problem and explore three approaches of… ▽ More Speaker-dependent modelling can substantially improve performance in speech-based health monitoring applications. While mixed-effect models are commonly used for such speaker adaptation, they require computationally expensive retraining for each new observation, making them impractical in a production environment. We reformulate this task as a meta-learning problem and explore three approaches of increasing complexity: ensemble-based distance models, prototypical networks, and transformer-based sequence models. Using pre-trained speech embeddings, we evaluate these methods on a large longitudinal dataset of shift workers (N=1,185, 10,286 recordings), predicting time since sleep from speech as a function of fatigue, a symptom commonly associated with ill-health. Our results demonstrate that all meta-learning approaches tested outperformed both cross-sectional and conventional mixed-effects models, with a transformer-based method achieving the strongest performance. △ Less

Submitted 2 June, 2025; v1 submitted 29 May, 2025; originally announced May 2025.

Comments: 5 pages, 3 figures. To appear at Interspeech 2025

Search v0.5.6 released 2020-02-24