Showing 1–4 of 4 results for author: Marín-Morales, J

Searching in archive cs.
  1. arXiv:2403.02167  [pdf, other]

    eess.AS cs.AI cs.CL cs.SD

    EMOVOME: A Dataset for Emotion Recognition in Spontaneous Real-Life Speech

    Authors: Lucía Gómez-Zaragozá, Rocío del Amor, María José Castro-Bleda, Valery Naranjo, Mariano Alcañiz Raya, Javier Marín-Morales

    Abstract: Spontaneous datasets for Speech Emotion Recognition (SER) are scarce and frequently derived from laboratory environments or staged scenarios, such as TV shows, limiting their application in real-world contexts. We developed and publicly released the Emotional Voice Messages (EMOVOME) dataset, including 999 voice messages from real conversations of 100 Spanish speakers on a messaging app, labeled i…

    Submitted 3 December, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: This article is a merged version of the description of the EMOVOME database in arXiv:2402.17496v1 and the speech emotion recognition models in arXiv:2403.02167v1. This work has been submitted to the IEEE for possible publication

    ACM Class: I.5.1; I.5.4

  2. arXiv:2402.17496   

    cs.SD cs.AI cs.CL eess.AS

    Emotional Voice Messages (EMOVOME) database: emotion recognition in spontaneous voice messages

    Authors: Lucía Gómez Zaragozá, Rocío del Amor, Elena Parra Vargas, Valery Naranjo, Mariano Alcañiz Raya, Javier Marín-Morales

    Abstract: Emotional Voice Messages (EMOVOME) is a spontaneous speech dataset containing 999 audio messages from real conversations on a messaging app from 100 Spanish speakers, gender balanced. Voice messages were produced in-the-wild conditions before participants were recruited, avoiding any conscious bias due to laboratory environment. Audios were labeled in valence and arousal dimensions by three non-ex…

    Submitted 13 June, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: This paper has been superseded by arXiv:2403.02167 (merged from the description of the EMOVOME database in arXiv:2402.17496v1 and the speech emotion recognition models in arXiv:2403.02167v1)

    ACM Class: I.5.1; I.5.4; I.2.7

  3. arXiv:2311.14533  [pdf, other]

    cs.LG

    Introducing 3DCNN ResNets for ASD full-body kinematic assessment: a comparison with hand-crafted features

    Authors: Alberto Altozano, Maria Eleonora Minissi, Mariano Alcañiz, Javier Marín-Morales

    Abstract: Autism Spectrum Disorder (ASD) is characterized by challenges in social communication and restricted patterns, with motor abnormalities gaining traction for early detection. However, kinematic analysis in ASD is limited, often lacking robust validation and relying on hand-crafted features for single tasks, leading to inconsistencies across studies. End-to-end models have emerged as promising metho…

    Submitted 26 June, 2024; v1 submitted 24 November, 2023; originally announced November 2023.

    Comments: This work has been submitted to Expert Systems with Applications for possible publication

  4. arXiv:2306.03443  [pdf]

    cs.CL cs.SD eess.AS eess.SP

    Alzheimer Disease Classification through ASR-based Transcriptions: Exploring the Impact of Punctuation and Pauses

    Authors: Lucía Gómez-Zaragozá, Simone Wills, Cristian Tejedor-Garcia, Javier Marín-Morales, Mariano Alcañiz, Helmer Strik

    Abstract: Alzheimer's Disease (AD) is the world's leading neurodegenerative disease, which often results in communication difficulties. Analysing speech can serve as a diagnostic tool for identifying the condition. The recent ADReSS challenge provided a dataset for AD classification and highlighted the utility of manual transcriptions. In this study, we used the new state-of-the-art Automatic Speech Recogni…

    Submitted 6 June, 2023; originally announced June 2023.

    Journal ref: Proc. INTERSPEECH 2023, pp. 2403-2407, Dublin, Ireland, 20-24 August 2023