Skip to main content

Showing 1–4 of 4 results for author: Mena, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2111.07793  [pdf, ps, other

    cs.CL

    Analysis of Data Augmentation Methods for Low-Resource Maltese ASR

    Authors: Andrea DeMarco, Carlos Mena, Albert Gatt, Claudia Borg, Aiden Williams, Lonneke van der Plas

    Abstract: Recent years have seen an increased interest in the computational speech processing of Maltese, but resources remain sparse. In this paper, we consider data augmentation techniques for improving speech recognition for low-resource languages, focusing on Maltese as a test case. We consider three different types of data augmentation: unsupervised training, multilingual training and the use of synthe… ▽ More

    Submitted 20 January, 2023; v1 submitted 15 November, 2021; originally announced November 2021.

    Comments: 12 pages

  2. arXiv:2102.12564  [pdf, other

    cs.SD cs.AI eess.AS

    Triplet loss based embeddings for forensic speaker identification in Spanish

    Authors: Emmanuel Maqueda, Javier Alvarez-Jimenez, Carlos Mena, Ivan Meza

    Abstract: With the advent of digital technology, it is more common that committed crimes or legal disputes involve some form of speech recording where the identity of a speaker is questioned [1]. In face of this situation, the field of forensic speaker identification has been looking to shed light on the problem by quantifying how much a speech recording belongs to a particular person in relation to a popul… ▽ More

    Submitted 13 September, 2021; v1 submitted 24 February, 2021; originally announced February 2021.

    Comments: Long Paper: Neural Computing and Applications, Special Issue on LatinX in AI Research (2021). 11 pages, 5 figures

  3. arXiv:2008.05760  [pdf, other

    cs.CL cs.LG

    MASRI-HEADSET: A Maltese Corpus for Speech Recognition

    Authors: Carlos Mena, Albert Gatt, Andrea DeMarco, Claudia Borg, Lonneke van der Plas, Amanda Muscat, Ian Padovani

    Abstract: Maltese, the national language of Malta, is spoken by approximately 500,000 people. Speech processing for Maltese is still in its early stages of development. In this paper, we present the first spoken Maltese corpus designed purposely for Automatic Speech Recognition (ASR). The MASRI-HEADSET corpus was developed by the MASRI project at the University of Malta. It consists of 8 hours of speech pai… ▽ More

    Submitted 13 August, 2020; originally announced August 2020.

    Comments: 8 pages, 2 figures, 4 tables, 1 appendix. Appears in Proceedings of the 12th edition of the Language Resources and Evaluation Conference (LREC'20)

  4. arXiv:1909.11114  [pdf, other

    stat.AP cs.LG stat.ML

    Churn Prediction with Sequential Data and Deep Neural Networks. A Comparative Analysis

    Authors: C. Gary Mena, Arno De Caigny, Kristof Coussement, Koen W. De Bock, Stefan Lessmann

    Abstract: Off-the-shelf machine learning algorithms for prediction such as regularized logistic regression cannot exploit the information of time-varying features without previously using an aggregation procedure of such sequential data. However, recurrent neural networks provide an alternative approach by which time-varying features can be readily used for modeling. This paper assesses the performance of n… ▽ More

    Submitted 24 September, 2019; originally announced September 2019.