Skip to main content

Showing 1–5 of 5 results for author: Serafini, L

Searching in archive eess. Search in all archives.
.
  1. arXiv:2307.16809  [pdf, other

    eess.AS

    An enhanced system for the detection and active cancellation of snoring signals

    Authors: Valeria Bruschi, Michela Cantarini, Luca Serafini, Stefano Nobili, Stefania Cecchi, Stefano Squartini

    Abstract: Snoring is a common disorder that affects people's social and marital lives. The annoyance caused by snoring can be partially solved with active noise control systems. In this context, the present work aims at introducing an enhanced system based on the use of a convolutional recurrent neural network for snoring activity detection and a delayless subband approach for active snoring cancellation. T… ▽ More

    Submitted 31 July, 2023; originally announced July 2023.

  2. arXiv:2307.15611  [pdf, other

    eess.AS

    A Time-Frequency Generative Adversarial based method for Audio Packet Loss Concealment

    Authors: Carlo Aironi, Samuele Cornell, Luca Serafini, Stefano Squartini

    Abstract: Packet loss is a major cause of voice quality degradation in VoIP transmissions with serious impact on intelligibility and user experience. This paper describes a system based on a generative adversarial approach, which aims to repair the lost fragments during the transmission of audio streams. Inspired by the powerful image-to-image translation capability of Generative Adversarial Networks (GANs)… ▽ More

    Submitted 28 July, 2023; originally announced July 2023.

    Comments: Accepted at EUSIPCO - 31st European Signal Processing Conference, 2023

  3. arXiv:2305.18074  [pdf, other

    eess.AS cs.SD eess.SP

    An Experimental Review of Speaker Diarization methods with application to Two-Speaker Conversational Telephone Speech recordings

    Authors: Luca Serafini, Samuele Cornell, Giovanni Morrone, Enrico Zovato, Alessio Brutti, Stefano Squartini

    Abstract: We performed an experimental review of current diarization systems for the conversational telephone speech (CTS) domain. In detail, we considered a total of eight different algorithms belonging to clustering-based, end-to-end neural diarization (EEND), and speech separation guided diarization (SSGD) paradigms. We studied the inference-time computational requirements and diarization accuracy on fou… ▽ More

    Submitted 29 May, 2023; originally announced May 2023.

    Comments: 52 pages, 10 figures

  4. End-to-End Integration of Speech Separation and Voice Activity Detection for Low-Latency Diarization of Telephone Conversations

    Authors: Giovanni Morrone, Samuele Cornell, Luca Serafini, Enrico Zovato, Alessio Brutti, Stefano Squartini

    Abstract: Recent works show that speech separation guided diarization (SSGD) is an increasingly promising direction, mainly thanks to the recent progress in speech separation. It performs diarization by first separating the speakers and then applying voice activity detection (VAD) on each separated stream. In this work we conduct an in-depth study of SSGD in the conversational telephone speech (CTS) domain,… ▽ More

    Submitted 22 May, 2024; v1 submitted 21 March, 2023; originally announced March 2023.

    Comments: 16 pages, 7 figures

    Journal ref: Speech Communication 161 (2024) 103081

  5. arXiv:2204.02306  [pdf, other

    eess.AS

    Low-Latency Speech Separation Guided Diarization for Telephone Conversations

    Authors: Giovanni Morrone, Samuele Cornell, Desh Raj, Luca Serafini, Enrico Zovato, Alessio Brutti, Stefano Squartini

    Abstract: In this paper, we carry out an analysis on the use of speech separation guided diarization (SSGD) in telephone conversations. SSGD performs diarization by separating the speakers signals and then applying voice activity detection on each estimated speaker signal. In particular, we compare two low-latency speech separation models. Moreover, we show a post-processing algorithm that significantly red… ▽ More

    Submitted 27 October, 2022; v1 submitted 5 April, 2022; originally announced April 2022.

    Comments: Accepted for Presentation at IEEE Spoken Language Technology Workshop (SLT) 2022