Skip to main content

Showing 1–18 of 18 results for author: Illina, I

.
  1. arXiv:2505.20006  [pdf, ps, other

    cs.CL

    Mixture of LoRA Experts for Low-Resourced Multi-Accent Automatic Speech Recognition

    Authors: Raphaël Bagat, Irina Illina, Emmanuel Vincent

    Abstract: We aim to improve the robustness of Automatic Speech Recognition (ASR) systems against non-native speech, particularly in low-resourced multi-accent settings. We introduce Mixture of Accent-Specific LoRAs (MAS-LoRA), a fine-tuning method that leverages a mixture of Low-Rank Adaptation (LoRA) experts, each specialized in a specific accent. This method can be used when the accent is known or unknown… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: Submitted to Interspeech 2025

  2. arXiv:2412.16719  [pdf, other

    cs.LG cs.AI

    Lillama: Large Language Models Compression via Low-Rank Feature Distillation

    Authors: Yaya Sy, Christophe Cerisara, Irina Illina

    Abstract: Current LLM structured pruning methods typically involve two steps: (1) compression with calibration data and (2) costly continued pretraining on billions of tokens to recover lost performance. This second step is necessary as the first significantly impacts model accuracy. Prior research suggests pretrained Transformer weights aren't inherently low-rank, unlike their activations, which may explai… ▽ More

    Submitted 28 December, 2024; v1 submitted 21 December, 2024; originally announced December 2024.

    Comments: 20 pages, 8 figures

  3. arXiv:2307.16582  [pdf, other

    eess.AS cs.SD

    SAMbA: Speech enhancement with Asynchronous ad-hoc Microphone Arrays

    Authors: Nicolas Furnon, Romain Serizel, Slim Essid, Irina Illina

    Abstract: Speech enhancement in ad-hoc microphone arrays is often hindered by the asynchronization of the devices composing the microphone array. Asynchronization comes from sampling time offset and sampling rate offset which inevitably occur when the microphones are embedded in different hardware components. In this paper, we propose a deep neural network (DNN)-based speech enhancement solution that is sui… ▽ More

    Submitted 31 July, 2023; originally announced July 2023.

    Comments: Submitted to INTERSPEECH 2022

  4. arXiv:2210.09340  [pdf, other

    cs.CL

    Transferring Knowledge via Neighborhood-Aware Optimal Transport for Low-Resource Hate Speech Detection

    Authors: Tulika Bose, Irina Illina, Dominique Fohr

    Abstract: The concerning rise of hateful content on online platforms has increased the attention towards automatic hate speech detection, commonly formulated as a supervised classification task. State-of-the-art deep learning-based approaches usually require a substantial amount of labeled resources for training. However, annotating hate speech resources is expensive, time-consuming, and often harmful to th… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: AACL-IJCNLP 2022 preprint

  5. arXiv:2209.08681  [pdf, other

    cs.CL

    Domain Classification-based Source-specific Term Penalization for Domain Adaptation in Hate-speech Detection

    Authors: Tulika Bose, Nikolaos Aletras, Irina Illina, Dominique Fohr

    Abstract: State-of-the-art approaches for hate-speech detection usually exhibit poor performance in out-of-domain settings. This occurs, typically, due to classifiers overemphasizing source-specific information that negatively impacts its domain invariance. Prior work has attempted to penalize terms related to hate-speech from manually curated lists using feature attribution methods, which quantify the impo… ▽ More

    Submitted 18 September, 2022; originally announced September 2022.

    Comments: COLING 2022 pre-print

  6. arXiv:2204.13400  [pdf, other

    cs.CL

    Placing M-Phasis on the Plurality of Hate: A Feature-Based Corpus of Hate Online

    Authors: Dana Ruiter, Liane Reiners, Ashwin Geet D'Sa, Thomas Kleinbauer, Dominique Fohr, Irina Illina, Dietrich Klakow, Christian Schemer, Angeliki Monnier

    Abstract: Even though hate speech (HS) online has been an important object of research in the last decade, most HS-related corpora over-simplify the phenomenon of hate by attempting to label user comments as "hate" or "neutral". This ignores the complex and subjective nature of HS, which limits the real-life applicability of classifiers trained on these corpora. In this study, we present the M-Phasis corpus… ▽ More

    Submitted 28 April, 2022; originally announced April 2022.

    Comments: 14 pages, 4 figures, accepted at LREC 2022 (Full Paper)

  7. arXiv:2203.12536  [pdf, other

    cs.CL

    Dynamically Refined Regularization for Improving Cross-corpora Hate Speech Detection

    Authors: Tulika Bose, Nikolaos Aletras, Irina Illina, Dominique Fohr

    Abstract: Hate speech classifiers exhibit substantial performance degradation when evaluated on datasets different from the source. This is due to learning spurious correlations between words that are not necessarily relevant to hateful language, and hate speech labels from the training corpus. Previous work has attempted to mitigate this problem by regularizing specific terms from pre-defined static dictio… ▽ More

    Submitted 23 March, 2022; originally announced March 2022.

    Comments: Findings of ACL 2022 preprint

  8. arXiv:2106.07939  [pdf, other

    eess.SP cs.SD eess.AS

    Attention-based distributed speech enhancement for unconstrained microphone arrays with varying number of nodes

    Authors: Nicolas Furnon, Romain Serizel, Slim Essid, Irina Illina

    Abstract: Speech enhancement promises higher efficiency in ad-hoc microphone arrays than in constrained microphone arrays thanks to the wide spatial coverage of the devices in the acoustic scene. However, speech enhancement in ad-hoc microphone arrays still raises many challenges. In particular, the algorithms should be able to handle a variable number of microphones, as some devices in the array might appe… ▽ More

    Submitted 15 June, 2021; originally announced June 2021.

    Journal ref: European Signal Processing Conference (EUSIPCO), IEEE, Aug 2021, Dublin, Ireland

  9. arXiv:2106.00237  [pdf, other

    cs.CL

    Improving Automatic Hate Speech Detection with Multiword Expression Features

    Authors: Nicolas Zampieri, Irina Illina, Dominique Fohr

    Abstract: The task of automatically detecting hate speech in social media is gaining more and more attention. Given the enormous volume of content posted daily, human monitoring of hate speech is unfeasible. In this work, we propose new word-level features for automatic hate speech detection (HSD): multiword expressions (MWEs). MWEs are lexical units greater than a word that have idiomatic and compositional… ▽ More

    Submitted 1 June, 2021; originally announced June 2021.

    Comments: In Proceedings of NLDB 2021

  10. arXiv:2011.01714  [pdf, other

    eess.SP

    DNN-based mask estimation for distributed speech enhancement in spatially unconstrained microphone arrays

    Authors: Nicolas Furnon, Romain Serizel, Irina Illina, Slim Essid

    Abstract: Deep neural network (DNN)-based speech enhancement algorithms in microphone arrays have now proven to be efficient solutions to speech understanding and speech recognition in noisy environments. However, in the context of ad-hoc microphone arrays, many challenges remain and raise the need for distributed processing. In this paper, we propose to extend a previously introduced distributed DNN-based… ▽ More

    Submitted 3 November, 2020; originally announced November 2020.

    Comments: Submitted to TASLP

  11. arXiv:2011.00982  [pdf, other

    eess.SP

    Distributed speech separation in spatially unconstrained microphone arrays

    Authors: Nicolas Furnon, Romain Serizel, Irina Illina, Slim Essid

    Abstract: Speech separation with several speakers is a challenging task because of the non-stationarity of the speech and the strong signal similarity between interferent sources. Current state-of-the-art solutions can separate well the different sources using sophisticated deep neural networks which are very tedious to train. When several microphones are available, spatial information can be exploited to d… ▽ More

    Submitted 8 February, 2021; v1 submitted 2 November, 2020; originally announced November 2020.

    Journal ref: ICASSP 2021 - 46th International Conference on Acoustics, Speech, and Signal Processing, Jun 2021, Toronto, Canada

  12. arXiv:2011.00975  [pdf

    cs.CL

    DNN-Based Semantic Model for Rescoring N-best Speech Recognition List

    Authors: Dominique Fohr, Irina Illina

    Abstract: The word error rate (WER) of an automatic speech recognition (ASR) system increases when a mismatch occurs between the training and the testing conditions due to the noise, etc. In this case, the acoustic information can be less reliable. This work aims to improve ASR by modeling long-term semantic relations to compensate for distorted acoustic features. We propose to perform this through rescorin… ▽ More

    Submitted 2 November, 2020; originally announced November 2020.

  13. arXiv:2002.06016  [pdf, other

    cs.SD cs.AI eess.AS

    DNN-Based Distributed Multichannel Mask Estimation for Speech Enhancement in Microphone Arrays

    Authors: Nicolas Furnon, Romain Serizel, Irina Illina, Slim Essid

    Abstract: Multichannel processing is widely used for speech enhancement but several limitations appear when trying to deploy these solutions to the real-world. Distributed sensor arrays that consider several devices with a few microphones is a viable alternative that allows for exploiting the multiple devices equipped with microphones that we are using in our everyday life. In this context, we propose to ex… ▽ More

    Submitted 16 March, 2020; v1 submitted 13 February, 2020; originally announced February 2020.

    Comments: Submitted to ICASSP2020

    Journal ref: International Conference on Audio, Signal and Speech Processing (ICASSP), May 2020, Barcelone, Spain

  14. arXiv:1911.08395  [pdf

    cs.LG stat.ML

    Towards non-toxic landscapes: Automatic toxic comment detection using DNN

    Authors: Ashwin Geet D'Sa, Irina Illina, Dominique Fohr

    Abstract: The spectacular expansion of the Internet has led to the development of a new research problem in the field of natural language processing: automatic toxic comment detection, since many countries prohibit hate speech in public media. There is no clear and formal definition of hate, offensive, toxic and abusive speeches. In this article, we put all these terms under the umbrella of "toxic" speech.… ▽ More

    Submitted 16 September, 2020; v1 submitted 19 November, 2019; originally announced November 2019.

    Journal ref: In Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying 2020 May (pp. 21-25)

  15. arXiv:1511.05389  [pdf, other

    cs.CL

    Learning to retrieve out-of-vocabulary words in speech recognition

    Authors: Imran Sheikh, Irina Illina, Dominique Fohr, Georges Linarès

    Abstract: Many Proper Names (PNs) are Out-Of-Vocabulary (OOV) words for speech recognition systems used to process diachronic audio data. To help recovery of the PNs missed by the system, relevant OOV PNs can be retrieved out of the many OOVs by exploiting semantic context of the spoken content. In this paper, we propose two neural network models targeted to retrieve OOV PNs relevant to an audio document: (… ▽ More

    Submitted 1 March, 2016; v1 submitted 17 November, 2015; originally announced November 2015.

    Comments: Updated references, added appendix discussing more results; added more discussion, replaced simple phone search results with KWS results; added KWS results for both training phase, probably last update

  16. arXiv:0711.1038  [pdf, ps, other

    cs.CL

    Amélioration des Performances des Systèmes Automatiques de Reconnaissance de la Parole pour la Parole Non Native

    Authors: Ghazi Bouselmi, Dominique Fohr, Irina Illina, Jean-Paul Haton

    Abstract: In this article, we present an approach for non native automatic speech recognition (ASR). We propose two methods to adapt existing ASR systems to the non-native accents. The first method is based on the modification of acoustic models through integration of acoustic models from the mother tong. The phonemes of the target language are pronounced in a similar manner to the native language of spea… ▽ More

    Submitted 7 November, 2007; originally announced November 2007.

    Journal ref: Dans TAIMA'07, Traitement et Analyse de l'Information : Méthodes et Applications (2007)

  17. arXiv:0711.0811  [pdf, ps, other

    cs.CL

    Combined Acoustic and Pronunciation Modelling for Non-Native Speech Recognition

    Authors: Ghazi Bouselmi, Dominique Fohr, Irina Illina

    Abstract: In this paper, we present several adaptation methods for non-native speech recognition. We have tested pronunciation modelling, MLLR and MAP non-native pronunciation adaptation and HMM models retraining on the HIWIRE foreign accented English speech database. The ``phonetic confusion'' scheme we have developed consists in associating to each spoken phone several sequences of confused phones. In o… ▽ More

    Submitted 6 November, 2007; originally announced November 2007.

    Journal ref: Dans InterSpeech 2007 (2007)

  18. arXiv:0711.0666  [pdf, ps, other

    cs.CL

    Discriminative Phoneme Sequences Extraction for Non-Native Speaker's Origin Classification

    Authors: Ghazi Bouselmi, Dominique Fohr, Irina Illina, Jean-Paul Haton

    Abstract: In this paper we present an automated method for the classification of the origin of non-native speakers. The origin of non-native speakers could be identified by a human listener based on the detection of typical pronunciations for each nationality. Thus we suppose the existence of several phoneme sequences that might allow the classification of the origin of non-native speakers. Our new method… ▽ More

    Submitted 5 November, 2007; originally announced November 2007.

    Journal ref: Dans ISSPA, International Symposium on Signal Processing and its Applications (2007)