Skip to main content

Showing 1–9 of 9 results for author: Veselý, K

Searching in archive eess. Search in all archives.
.
  1. arXiv:2310.11921  [pdf, other

    cs.SD eess.AS

    BUT CHiME-7 system description

    Authors: Martin Karafiát, Karel Veselý, Igor Szöke, Ladislav Mošner, Karel Beneš, Marcin Witkowski, Germán Barchi, Leonardo Pepino

    Abstract: This paper describes the joint effort of Brno University of Technology (BUT), AGH University of Krakow and University of Buenos Aires on the development of Automatic Speech Recognition systems for the CHiME-7 Challenge. We train and evaluate various end-to-end models with several toolkits. We heavily relied on Guided Source Separation (GSS) to convert multi-channel audio to single channel. The ASR… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: 6 pages, Chime-7 challenge 2023

  2. arXiv:2212.07164  [pdf, other

    cs.CL cs.AI cs.LG eess.AS

    Speech and Natural Language Processing Technologies for Pseudo-Pilot Simulator

    Authors: Amrutha Prasad, Juan Zuluaga-Gomez, Petr Motlicek, Saeed Sarfjoo, Iuliia Nigmatulina, Karel Vesely

    Abstract: This paper describes a simple yet efficient repetition-based modular system for speeding up air-traffic controllers (ATCos) training. E.g., a human pilot is still required in EUROCONTROL's ESCAPE lite simulator (see https://www.eurocontrol.int/simulator/escape) during ATCo training. However, this need can be substituted by an automatic system that could act as a pilot. In this paper, we aim to dev… ▽ More

    Submitted 14 December, 2022; originally announced December 2022.

    Comments: Presented at Sesar Innovation Days 2022. https://www.sesarju.eu/sesarinnovationdays

  3. arXiv:2211.04054  [pdf, other

    cs.CL cs.AI cs.SD eess.AS

    ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications

    Authors: Juan Zuluaga-Gomez, Karel Veselý, Igor Szöke, Alexander Blatt, Petr Motlicek, Martin Kocour, Mickael Rigault, Khalid Choukri, Amrutha Prasad, Seyyed Saeed Sarfjoo, Iuliia Nigmatulina, Claudia Cevenini, Pavel Kolčárek, Allan Tart, Jan Černocký, Dietrich Klakow

    Abstract: Personal assistants, automatic speech recognizers and dialogue understanding systems are becoming more critical in our interconnected digital world. A clear example is air traffic control (ATC) communications. ATC aims at guiding aircraft and controlling the airspace in a safe and optimal manner. These voice-based dialogues are carried between an air traffic controller (ATCO) and pilots via very-h… ▽ More

    Submitted 15 June, 2023; v1 submitted 8 November, 2022; originally announced November 2022.

    Comments: Manuscript under review; The code is available at: https://github.com/idiap/atco2-corpus

  4. arXiv:2204.06309  [pdf, other

    cs.CL cs.SD eess.AS

    Call-sign recognition and understanding for noisy air-traffic transcripts using surveillance information

    Authors: Alexander Blatt, Martin Kocour, Karel Veselý, Igor Szöke, Dietrich Klakow

    Abstract: Air traffic control (ATC) relies on communication via speech between pilot and air-traffic controller (ATCO). The call-sign, as unique identifier for each flight, is used to address a specific pilot by the ATCO. Extracting the call-sign from the communication is a challenge because of the noisy ATC voice channel and the additional noise introduced by the receiver. A low signal-to-noise ratio (SNR)… ▽ More

    Submitted 13 April, 2022; originally announced April 2022.

    Comments: Accepted by ICASSP 2022

  5. arXiv:2104.03643  [pdf, other

    cs.CL cs.CV cs.LG eess.AS

    Contextual Semi-Supervised Learning: An Approach To Leverage Air-Surveillance and Untranscribed ATC Data in ASR Systems

    Authors: Juan Zuluaga-Gomez, Iuliia Nigmatulina, Amrutha Prasad, Petr Motlicek, Karel Veselý, Martin Kocour, Igor Szöke

    Abstract: Air traffic management and specifically air-traffic control (ATC) rely mostly on voice communications between Air Traffic Controllers (ATCos) and pilots. In most cases, these voice communications follow a well-defined grammar that could be leveraged in Automatic Speech Recognition (ASR) technologies. The callsign used to address an airplane is an essential part of all ATCo-pilot communications. We… ▽ More

    Submitted 27 August, 2021; v1 submitted 8 April, 2021; originally announced April 2021.

    Comments: Presented at: Interspeech conference 2021 (Brno, Czechia, August 30 - September 3)

  6. arXiv:2104.02332  [pdf, other

    eess.AS

    Detecting English Speech in the Air Traffic Control Voice Communication

    Authors: Igor Szoke, Santosh Kesiraju, Ondrej Novotny, Martin Kocour, Karel Vesely, Jan "Honza" Cernocky

    Abstract: We launched a community platform for collecting the ATC speech world-wide in the ATCO2 project. Filtering out unseen non-English speech is one of the main components in the data processing pipeline. The proposed English Language Detection (ELD) system is based on the embeddings from Bayesian subspace multinomial model. It is trained on the word confusion network from an ASR system. It is robust, e… ▽ More

    Submitted 6 April, 2021; originally announced April 2021.

  7. arXiv:2101.12729  [pdf, other

    eess.AS cs.CL

    BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge

    Authors: Martin Kocour, Guillermo Cámbara, Jordi Luque, David Bonet, Mireia Farrús, Martin Karafiát, Karel Veselý, Jan ''Honza'' Ĉernocký

    Abstract: This paper describes joint effort of BUT and Telefónica Research on development of Automatic Speech Recognition systems for Albayzin 2020 Challenge. We compare approaches based on either hybrid or end-to-end models. In hybrid modelling, we explore the impact of SpecAugment layer on performance. For end-to-end modelling, we used a convolutional neural network with gated linear units (GLUs). The per… ▽ More

    Submitted 29 January, 2021; originally announced January 2021.

    Comments: fusion, end-to-end model, hybrid model, semisupervised, automatic speech recognition, convolutional neural network

  8. arXiv:2006.10304  [pdf, ps, other

    cs.CL cs.CV cs.LG cs.SD eess.AS

    Automatic Speech Recognition Benchmark for Air-Traffic Communications

    Authors: Juan Zuluaga-Gomez, Petr Motlicek, Qingran Zhan, Karel Vesely, Rudolf Braun

    Abstract: Advances in Automatic Speech Recognition (ASR) over the last decade opened new areas of speech-based automation such as in Air-Traffic Control (ATC) environment. Currently, voice communication and data links communications are the only way of contact between pilots and Air-Traffic Controllers (ATCo), where the former is the most widely used and the latter is a non-spoken method mandatory for ocean… ▽ More

    Submitted 13 August, 2020; v1 submitted 18 June, 2020; originally announced June 2020.

    Comments: Accepted to: 21st INTERSPEECH conference (Shanghai, October 25-29)

  9. arXiv:2001.11360  [pdf, ps, other

    eess.AS cs.LG cs.SD

    BUT Opensat 2019 Speech Recognition System

    Authors: Martin Karafiát, Murali Karthick Baskar, Igor Szöke, Hari Krishna Vydana, Karel Veselý, Jan "Honza'' Černocký

    Abstract: The paper describes the BUT Automatic Speech Recognition (ASR) systems submitted for OpenSAT evaluations under two domain categories such as low resourced languages and public safety communications. The first was challenging due to lack of training data, therefore various architectures and multilingual approaches were employed. The combination led to superior performance. The second domain was cha… ▽ More

    Submitted 30 January, 2020; originally announced January 2020.

    Comments: REJECTED in ICASSP 2020