Skip to main content

Showing 1–50 of 51 results for author: Turchi, M

.
  1. arXiv:2502.03111  [pdf, other

    cs.CL cs.AI cs.LG

    Policies and Evaluation for Online Meeting Summarization

    Authors: Felix Schneider, Marco Turchi, Alex Waibel

    Abstract: With more and more meetings moving to a digital domain, meeting summarization has recently gained interest in both academic and commercial research. However, prior academic research focuses on meeting summarization as an offline task, performed after the meeting concludes. In this paper, we perform the first systematic study of online meeting summarization. For this purpose, we propose several pol… ▽ More

    Submitted 5 February, 2025; originally announced February 2025.

    Comments: 8 pages, 1 figure

  2. arXiv:2411.05088  [pdf

    cs.CL

    Findings of the IWSLT 2024 Evaluation Campaign

    Authors: Ibrahim Said Ahmad, Antonios Anastasopoulos, Ondřej Bojar, Claudia Borg, Marine Carpuat, Roldano Cattoni, Mauro Cettolo, William Chen, Qianqian Dong, Marcello Federico, Barry Haddow, Dávid Javorský, Mateusz Krubiński, Tsz Kin Lam, Xutai Ma, Prashant Mathur, Evgeny Matusov, Chandresh Maurya, John McCrae, Kenton Murray, Satoshi Nakamura, Matteo Negri, Jan Niehues, Xing Niu, Atul Kr. Ojha , et al. (20 additional authors not shown)

    Abstract: This paper reports on the shared tasks organized by the 21st IWSLT Conference. The shared tasks address 7 scientific challenges in spoken language translation: simultaneous and offline translation, automatic subtitling and dubbing, speech-to-speech translation, dialect and low-resource speech translation, and Indic languages. The shared tasks attracted 18 teams whose submissions are documented in… ▽ More

    Submitted 7 November, 2024; originally announced November 2024.

    Comments: IWSLT 2024; 59 pages

  3. arXiv:2406.03881  [pdf, other

    cs.CL

    Evaluating the IWSLT2023 Speech Translation Tasks: Human Annotations, Automatic Metrics, and Segmentation

    Authors: Matthias Sperber, Ondřej Bojar, Barry Haddow, Dávid Javorský, Xutai Ma, Matteo Negri, Jan Niehues, Peter Polák, Elizabeth Salesky, Katsuhito Sudoh, Marco Turchi

    Abstract: Human evaluation is a critical component in machine translation system development and has received much attention in text translation research. However, little prior work exists on the topic of human evaluation for speech translation, which adds additional challenges such as noisy data and segmentation mismatches. We take first steps to fill this gap by conducting a comprehensive human evaluation… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: LREC-COLING2024 publication (with corrections for Table 3)

    Journal ref: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

  4. arXiv:2305.11408  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    AlignAtt: Using Attention-based Audio-Translation Alignments as a Guide for Simultaneous Speech Translation

    Authors: Sara Papi, Marco Turchi, Matteo Negri

    Abstract: Attention is the core mechanism of today's most used architectures for natural language processing and has been analyzed from many perspectives, including its effectiveness for machine translation-related tasks. Among these studies, attention resulted to be a useful source of information to get insights about word alignment also when the input text is substituted with audio segments, as in the cas… ▽ More

    Submitted 19 July, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: Accepted at Interspeech 2023

    Journal ref: Proceedings of INTERSPEECH 2023

  5. Attention as a Guide for Simultaneous Speech Translation

    Authors: Sara Papi, Matteo Negri, Marco Turchi

    Abstract: The study of the attention mechanism has sparked interest in many fields, such as language modeling and machine translation. Although its patterns have been exploited to perform different tasks, from neural network understanding to textual alignment, no previous work has analysed the encoder-decoder attention behavior in speech translation (ST) nor used it to improve ST on a specific task. In this… ▽ More

    Submitted 11 May, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

    Comments: Accepted to ACL 2023

    Journal ref: Proceedings of ACL 2023

  6. Joint Speech Translation and Named Entity Recognition

    Authors: Marco Gaido, Sara Papi, Matteo Negri, Marco Turchi

    Abstract: Modern automatic translation systems aim at place the human at the center by providing contextual support and knowledge. In this context, a critical task is enriching the output with information regarding the mentioned entities, which is currently achieved processing the generated translation with named entity recognition (NER) and entity linking systems. In light of the recent promising results s… ▽ More

    Submitted 20 May, 2023; v1 submitted 21 October, 2022; originally announced October 2022.

    Comments: Accepted at INTERSPEECH 2023

  7. arXiv:2209.13192  [pdf, other

    cs.CL

    Direct Speech Translation for Automatic Subtitling

    Authors: Sara Papi, Marco Gaido, Alina Karakanta, Mauro Cettolo, Matteo Negri, Marco Turchi

    Abstract: Automatic subtitling is the task of automatically translating the speech of audiovisual content into short pieces of timed text, i.e. subtitles and their corresponding timestamps. The generated subtitles need to conform to space and time requirements, while being synchronised with the speech and segmented in a way that facilitates comprehension. Given its considerable complexity, the task has so f… ▽ More

    Submitted 25 July, 2023; v1 submitted 27 September, 2022; originally announced September 2022.

    Comments: Accepted at TACL

  8. arXiv:2209.10608  [pdf, other

    cs.CL

    Dodging the Data Bottleneck: Automatic Subtitling with Automatically Segmented ST Corpora

    Authors: Sara Papi, Alina Karakanta, Matteo Negri, Marco Turchi

    Abstract: Speech translation for subtitling (SubST) is the task of automatically translating speech data into well-formed subtitles by inserting subtitle breaks compliant to specific displaying guidelines. Similar to speech translation (ST), model training requires parallel data comprising audio inputs paired with their textual translations. In SubST, however, the text has to be also annotated with subtitle… ▽ More

    Submitted 16 November, 2022; v1 submitted 21 September, 2022; originally announced September 2022.

    Journal ref: AACL 2022

  9. Over-Generation Cannot Be Rewarded: Length-Adaptive Average Lagging for Simultaneous Speech Translation

    Authors: Sara Papi, Marco Gaido, Matteo Negri, Marco Turchi

    Abstract: Simultaneous speech translation (SimulST) systems aim at generating their output with the lowest possible latency, which is normally computed in terms of Average Lagging (AL). In this paper we highlight that, despite its widespread adoption, AL provides underestimated scores for systems that generate longer predictions compared to the corresponding references. We also show that this problem has pr… ▽ More

    Submitted 20 June, 2022; v1 submitted 12 June, 2022; originally announced June 2022.

    Comments: AutoSimTrans Workshop @ NAACL2022

    Journal ref: Proceedings of the Third Workshop on Automatic Simultaneous Translation (AutoSimTrans 2022)

  10. arXiv:2205.06755  [pdf, other

    cs.CL

    Who Are We Talking About? Handling Person Names in Speech Translation

    Authors: Marco Gaido, Matteo Negri, Marco Turchi

    Abstract: Recent work has shown that systems for speech translation (ST) -- similarly to automatic speech recognition (ASR) -- poorly handle person names. This shortcoming does not only lead to errors that can seriously distort the meaning of the input, but also hinders the adoption of such systems in application scenarios (like computer-assisted interpreting) where the translation of named entities, like p… ▽ More

    Submitted 13 May, 2022; originally announced May 2022.

    Comments: Accepted at IWSLT2022

  11. Efficient yet Competitive Speech Translation: FBK@IWSLT2022

    Authors: Marco Gaido, Sara Papi, Dennis Fucci, Giuseppe Fiameni, Matteo Negri, Marco Turchi

    Abstract: The primary goal of this FBK's systems submission to the IWSLT 2022 offline and simultaneous speech translation tasks is to reduce model training costs without sacrificing translation quality. As such, we first question the need of ASR pre-training, showing that it is not essential to achieve competitive results. Second, we focus on data filtering, showing that a simple method that looks at the ra… ▽ More

    Submitted 5 May, 2022; originally announced May 2022.

    Comments: IWSLT 2022 System Description

    Journal ref: Proceedings of the 19th International Conference on Spoken Language Translation (IWSLT 2022)

  12. Does Simultaneous Speech Translation need Simultaneous Models?

    Authors: Sara Papi, Marco Gaido, Matteo Negri, Marco Turchi

    Abstract: In simultaneous speech translation (SimulST), finding the best trade-off between high translation quality and low latency is a challenging task. To meet the latency constraints posed by the different application scenarios, multiple dedicated SimulST models are usually trained and maintained, generating high computational costs. In this paper, motivated by the increased social and environmental imp… ▽ More

    Submitted 16 November, 2022; v1 submitted 7 April, 2022; originally announced April 2022.

    Comments: Findings of EMNLP 2022

    Journal ref: Findings of the Association for Computational Linguistics: EMNLP 2022

  13. arXiv:2203.09866  [pdf, other

    cs.CL

    Under the Morphosyntactic Lens: A Multifaceted Evaluation of Gender Bias in Speech Translation

    Authors: Beatrice Savoldi, Marco Gaido, Luisa Bentivogli, Matteo Negri, Marco Turchi

    Abstract: Gender bias is largely recognized as a problematic phenomenon affecting language technologies, with recent studies underscoring that it might surface differently across languages. However, most of current evaluation practices adopt a word-level focus on a narrow set of occupational nouns under synthetic conditions. Such protocols overlook key features of grammatical gender languages, which are cha… ▽ More

    Submitted 18 March, 2022; originally announced March 2022.

    Comments: Accepted at ACL 2022

  14. arXiv:2111.00514  [pdf, ps, other

    cs.CL

    Visualization: the missing factor in Simultaneous Speech Translation

    Authors: Sara Papi, Matteo Negri, Marco Turchi

    Abstract: Simultaneous speech translation (SimulST) is the task in which output generation has to be performed on partial, incremental speech input. In recent years, SimulST has become popular due to the spread of cross-lingual application scenarios, like international live conferences and streaming lectures, in which on-the-fly speech translation can facilitate users' access to audio-visual content. In thi… ▽ More

    Submitted 8 November, 2021; v1 submitted 31 October, 2021; originally announced November 2021.

    Comments: Accepted at CLIC-it 2021

    Journal ref: Italian Conference on Computational Linguistics 2021

  15. arXiv:2109.07439  [pdf, other

    cs.CL

    Is "moby dick" a Whale or a Bird? Named Entities and Terminology in Speech Translation

    Authors: Marco Gaido, Susana Rodríguez, Matteo Negri, Luisa Bentivogli, Marco Turchi

    Abstract: Automatic translation systems are known to struggle with rare words. Among these, named entities (NEs) and domain-specific terms are crucial, since errors in their translation can lead to severe meaning distortions. Despite their importance, previous speech translation (ST) studies have neglected them, also due to the dearth of publicly available resources tailored to their specific evaluation. To… ▽ More

    Submitted 15 September, 2021; originally announced September 2021.

    Comments: Accepted at EMNLP2021

  16. Speechformer: Reducing Information Loss in Direct Speech Translation

    Authors: Sara Papi, Marco Gaido, Matteo Negri, Marco Turchi

    Abstract: Transformer-based models have gained increasing popularity achieving state-of-the-art performance in many research fields including speech translation. However, Transformer's quadratic complexity with respect to the input sequence length prevents its adoption as is with audio signals, which are typically represented by long sequences. Current solutions resort to an initial sub-optimal compression… ▽ More

    Submitted 9 September, 2021; originally announced September 2021.

    Comments: Accepted to EMNLP 2021 Main Conference

    Journal ref: Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing

  17. arXiv:2107.08807  [pdf, other

    cs.CL

    Simultaneous Speech Translation for Live Subtitling: from Delay to Display

    Authors: Alina Karakanta, Sara Papi, Matteo Negri, Marco Turchi

    Abstract: With the increased audiovisualisation of communication, the need for live subtitles in multilingual events is more relevant than ever. In an attempt to automatise the process, we aim at exploring the feasibility of simultaneous speech translation (SimulST) for live subtitling. However, the word-for-word rate of generation of SimulST systems is not optimal for displaying the subtitles in a comprehe… ▽ More

    Submitted 20 July, 2021; v1 submitted 19 July, 2021; originally announced July 2021.

    Journal ref: Proceedings of the 1st Workshop on Automatic Spoken Language Translation in Real-World Settings (ASLTRW 2021)

  18. arXiv:2107.06246  [pdf, ps, other

    cs.CL

    Between Flexibility and Consistency: Joint Generation of Captions and Subtitles

    Authors: Alina Karakanta, Marco Gaido, Matteo Negri, Marco Turchi

    Abstract: Speech translation (ST) has lately received growing interest for the generation of subtitles without the need for an intermediate source language transcription and timing (i.e. captions). However, the joint generation of source captions and target subtitles does not only bring potential output quality advantages when the two decoding processes inform each other, but it is also often required in mu… ▽ More

    Submitted 13 July, 2021; originally announced July 2021.

    Comments: Accepted at IWSLT 2021

  19. arXiv:2106.12607  [pdf, other

    cs.CL cs.SD eess.AS

    Dealing with training and test segmentation mismatch: FBK@IWSLT2021

    Authors: Sara Papi, Marco Gaido, Matteo Negri, Marco Turchi

    Abstract: This paper describes FBK's system submission to the IWSLT 2021 Offline Speech Translation task. We participated with a direct model, which is a Transformer-based architecture trained to translate English speech audio data into German texts. The training pipeline is characterized by knowledge distillation and a two-step fine-tuning procedure. Both knowledge distillation and the first fine-tuning st… ▽ More

    Submitted 28 June, 2021; v1 submitted 23 June, 2021; originally announced June 2021.

    Comments: Accepted at IWSLT2021

    Journal ref: Proceedings of the 18th International Conference on Spoken Language Translation (IWSLT 2021)

  20. arXiv:2106.01045  [pdf, other

    cs.CL

    Cascade versus Direct Speech Translation: Do the Differences Still Make a Difference?

    Authors: Luisa Bentivogli, Mauro Cettolo, Marco Gaido, Alina Karakanta, Alberto Martinelli, Matteo Negri, Marco Turchi

    Abstract: Five years after the first published proofs of concept, direct approaches to speech translation (ST) are now competing with traditional cascade solutions. In light of this steady progress, can we claim that the performance gap between the two is closed? Starting from this question, we present a systematic comparison between state-of-the-art systems representative of the two paradigms. Focusing on… ▽ More

    Submitted 2 June, 2021; originally announced June 2021.

    Comments: Accepted at ACL2021

  21. arXiv:2105.13782  [pdf, other

    cs.CL

    How to Split: the Effect of Word Segmentation on Gender Bias in Speech Translation

    Authors: Marco Gaido, Beatrice Savoldi, Luisa Bentivogli, Matteo Negri, Marco Turchi

    Abstract: Having recognized gender bias as a major issue affecting current translation technologies, researchers have primarily attempted to mitigate it by working on the data front. However, whether algorithmic aspects concur to exacerbate unwanted outputs remains so far under-investigated. In this work, we bring the analysis on gender bias in automatic translation onto a seemingly neutral yet critical com… ▽ More

    Submitted 28 May, 2021; originally announced May 2021.

    Comments: Accepted in Findings of ACL 2021

  22. arXiv:2104.11710  [pdf, other

    cs.SD cs.CL eess.AS

    Beyond Voice Activity Detection: Hybrid Audio Segmentation for Direct Speech Translation

    Authors: Marco Gaido, Matteo Negri, Mauro Cettolo, Marco Turchi

    Abstract: The audio segmentation mismatch between training data and those seen at run-time is a major problem in direct speech translation. Indeed, while systems are usually trained on manually segmented corpora, in real use cases they are often presented with continuous audio requiring automatic (and sub-optimal) segmentation. After comparing existing techniques (VAD-based, fixed-length and hybrid segmenta… ▽ More

    Submitted 14 October, 2021; v1 submitted 23 April, 2021; originally announced April 2021.

    Comments: Accepted to ICNLSP 2021

  23. arXiv:2104.06001  [pdf, other

    cs.CL

    Gender Bias in Machine Translation

    Authors: Beatrice Savoldi, Marco Gaido, Luisa Bentivogli, Matteo Negri, Marco Turchi

    Abstract: Machine translation (MT) technology has facilitated our daily tasks by providing accessible shortcuts for gathering, elaborating and communicating information. However, it can suffer from biases that harm users and society at large. As a relatively new field of inquiry, gender bias in MT still lacks internal cohesion, which advocates for a unified framework to ease future research. To this end, we… ▽ More

    Submitted 7 May, 2021; v1 submitted 13 April, 2021; originally announced April 2021.

    Comments: Accepted for publication in Transaction of the Association for Computational Linguistics (TACL), 2021

  24. arXiv:2103.05951  [pdf, other

    cs.CL

    Self-Learning for Zero Shot Neural Machine Translation

    Authors: Surafel M. Lakew, Matteo Negri, Marco Turchi

    Abstract: Neural Machine Translation (NMT) approaches employing monolingual data are showing steady improvements in resource rich conditions. However, evaluations using real-world low-resource languages still result in unsatisfactory performance. This work proposes a novel zero-shot NMT modeling approach that learns without the now-standard assumption of a pivot language sharing parallel data with the zero-… ▽ More

    Submitted 10 March, 2021; originally announced March 2021.

  25. arXiv:2102.01757  [pdf, other

    cs.CL

    The Multilingual TEDx Corpus for Speech Recognition and Translation

    Authors: Elizabeth Salesky, Matthew Wiesner, Jacob Bremerman, Roldano Cattoni, Matteo Negri, Marco Turchi, Douglas W. Oard, Matt Post

    Abstract: We present the Multilingual TEDx corpus, built to support speech recognition (ASR) and speech translation (ST) research across many non-English source languages. The corpus is a collection of audio recordings from TEDx talks in 8 source languages. We segment transcripts into sentences and align them to the source-language audio and target-language translations. The corpus is released along with op… ▽ More

    Submitted 14 June, 2021; v1 submitted 2 February, 2021; originally announced February 2021.

    Comments: Accepted to Interspeech 2021

  26. arXiv:2102.01578  [pdf, other

    cs.CL

    CTC-based Compression for Direct Speech Translation

    Authors: Marco Gaido, Mauro Cettolo, Matteo Negri, Marco Turchi

    Abstract: Previous studies demonstrated that a dynamic phone-informed compression of the input audio is beneficial for speech translation (ST). However, they required a dedicated model for phone recognition and did not test this solution for direct ST, in which a single model translates the input audio into the target language without intermediate representations. In this work, we propose the first method a… ▽ More

    Submitted 2 February, 2021; originally announced February 2021.

    Comments: Accepted at EACL2021

    Journal ref: Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume (2021), 690-696

  27. arXiv:2012.04964  [pdf, ps, other

    cs.CL

    On Knowledge Distillation for Direct Speech Translation

    Authors: Marco Gaido, Mattia A. Di Gangi, Matteo Negri, Marco Turchi

    Abstract: Direct speech translation (ST) has shown to be a complex task requiring knowledge transfer from its sub-tasks: automatic speech recognition (ASR) and machine translation (MT). For MT, one of the most promising techniques to transfer knowledge is knowledge distillation. In this paper, we compare the different solutions to distill knowledge in a sequence-to-sequence task like ST. Moreover, we analyz… ▽ More

    Submitted 9 December, 2020; originally announced December 2020.

    Comments: Accepted at CLiC-IT 2020

  28. arXiv:2012.04955  [pdf, ps, other

    cs.CL

    Breeding Gender-aware Direct Speech Translation Systems

    Authors: Marco Gaido, Beatrice Savoldi, Luisa Bentivogli, Matteo Negri, Marco Turchi

    Abstract: In automatic speech translation (ST), traditional cascade approaches involving separate transcription and translation steps are giving ground to increasingly competitive and more robust direct solutions. In particular, by translating speech audio data without intermediate transcription, direct ST models are able to leverage and preserve essential information present in the input (e.g. speaker's vo… ▽ More

    Submitted 9 December, 2020; originally announced December 2020.

    Comments: Outstanding paper at COLING 2020

    Journal ref: In Proceedings of the 28th International Conference on Computational Linguistics, Dec 2020, 3951-3964. Online

  29. arXiv:2009.04707  [pdf, other

    cs.CL

    On Target Segmentation for Direct Speech Translation

    Authors: Mattia Antonino Di Gangi, Marco Gaido, Matteo Negri, Marco Turchi

    Abstract: Recent studies on direct speech translation show continuous improvements by means of data augmentation techniques and bigger deep learning models. While these methods are helping to close the gap between this new approach and the more traditional cascaded one, there are many incongruities among different studies that make it difficult to assess the state of the art. Surprisingly, one point of disc… ▽ More

    Submitted 10 September, 2020; originally announced September 2020.

    Comments: 14 pages single column, 4 figures, accepted for presentation at the AMTA2020 research track

  30. arXiv:2008.02270  [pdf, other

    cs.CL

    Contextualized Translation of Automatically Segmented Speech

    Authors: Marco Gaido, Mattia Antonino Di Gangi, Matteo Negri, Mauro Cettolo, Marco Turchi

    Abstract: Direct speech-to-text translation (ST) models are usually trained on corpora segmented at sentence level, but at inference time they are commonly fed with audio split by a voice activity detector (VAD). Since VAD segmentation is not syntax-informed, the resulting segments do not necessarily correspond to well-formed sentences uttered by the speaker but, most likely, to fragments of one or more sen… ▽ More

    Submitted 5 August, 2020; originally announced August 2020.

    Comments: Interspeech 2020

  31. arXiv:2006.05754  [pdf, ps, other

    cs.CL cs.AI cs.SD eess.AS

    Gender in Danger? Evaluating Speech Translation Technology on the MuST-SHE Corpus

    Authors: Luisa Bentivogli, Beatrice Savoldi, Matteo Negri, Mattia Antonino Di Gangi, Roldano Cattoni, Marco Turchi

    Abstract: Translating from languages without productive grammatical gender like English into gender-marked languages is a well-known difficulty for machines. This difficulty is also due to the fact that the training data on which models are built typically reflect the asymmetries of natural languages, gender bias included. Exclusively fed with textual data, machine translation is intrinsically constrained b… ▽ More

    Submitted 10 June, 2020; originally announced June 2020.

    Comments: 9 pages of content, accepted at ACL 2020

  32. arXiv:2006.02965  [pdf, other

    cs.CL cs.SD eess.AS

    End-to-End Speech-Translation with Knowledge Distillation: FBK@IWSLT2020

    Authors: Marco Gaido, Mattia Antonino Di Gangi, Matteo Negri, Marco Turchi

    Abstract: This paper describes FBK's participation in the IWSLT 2020 offline speech translation (ST) task. The task evaluates systems' ability to translate English TED talks audio into German texts. The test talks are provided in two versions: one contains the data already segmented with automatic tools and the other is the raw data without any segmentation. Participants can decide whether to work on custom… ▽ More

    Submitted 4 June, 2020; originally announced June 2020.

    Comments: Accepted at IWSLT2020

  33. arXiv:2006.01080  [pdf, other

    cs.CL

    Is 42 the Answer to Everything in Subtitling-oriented Speech Translation?

    Authors: Alina Karakanta, Matteo Negri, Marco Turchi

    Abstract: Subtitling is becoming increasingly important for disseminating information, given the enormous amounts of audiovisual content becoming available daily. Although Neural Machine Translation (NMT) can speed up the process of translating audiovisual content, large manual effort is still required for transcribing the source language, and for spotting and segmenting the text into proper subtitles. Crea… ▽ More

    Submitted 1 June, 2020; originally announced June 2020.

    Comments: Accepted at IWSLT 2020

  34. arXiv:2003.14402  [pdf, other

    cs.CL

    Low Resource Neural Machine Translation: A Benchmark for Five African Languages

    Authors: Surafel M. Lakew, Matteo Negri, Marco Turchi

    Abstract: Recent advents in Neural Machine Translation (NMT) have shown improvements in low-resource language (LRL) translation tasks. In this work, we benchmark NMT between English and five African LRL pairs (Swahili, Amharic, Tigrigna, Oromo, Somali [SATOS]). We collected the available resources on the SATOS languages to evaluate the current state of NMT for LRLs. Our evaluation, comparing a baseline sing… ▽ More

    Submitted 31 March, 2020; originally announced March 2020.

    Comments: Accepted for AfricaNLP workshop at ICLR 2020

  35. arXiv:2002.10829  [pdf, other

    cs.CL

    MuST-Cinema: a Speech-to-Subtitles corpus

    Authors: Alina Karakanta, Matteo Negri, Marco Turchi

    Abstract: Growing needs in localising audiovisual content in multiple languages through subtitles call for the development of automatic solutions for human subtitling. Neural Machine Translation (NMT) can contribute to the automatisation of subtitling, facilitating the work of human subtitlers and reducing turn-around times and related costs. NMT requires high-quality, large, task-specific training data. Th… ▽ More

    Submitted 25 February, 2020; originally announced February 2020.

    Comments: Accepted at LREC 2020

  36. arXiv:1910.13998  [pdf, other

    cs.CL

    Adapting Multilingual Neural Machine Translation to Unseen Languages

    Authors: Surafel M. Lakew, Alina Karakanta, Marcello Federico, Matteo Negri, Marco Turchi

    Abstract: Multilingual Neural Machine Translation (MNMT) for low-resource languages (LRL) can be enhanced by the presence of related high-resource languages (HRL), but the relatedness of HRL usually relies on predefined linguistic assumptions about language similarity. Recently, adapting MNMT to a LRL has shown to greatly improve performance. In this work, we explore the problem of adapting an MNMT model to… ▽ More

    Submitted 30 October, 2019; originally announced October 2019.

    Comments: Accepted at the 16th International Workshop on Spoken Language Translation (IWSLT), November, 2019

  37. arXiv:1910.10663  [pdf, ps, other

    cs.CL eess.AS

    Instance-Based Model Adaptation For Direct Speech Translation

    Authors: Mattia Antonino Di Gangi, Viet-Nhat Nguyen, Matteo Negri, Marco Turchi

    Abstract: Despite recent technology advancements, the effectiveness of neural approaches to end-to-end speech-to-text translation is still limited by the paucity of publicly available training corpora. We tackle this limitation with a method to improve data exploitation and boost the system's performance at inference time. Our approach allows us to customize "on the fly" an existing model to each incoming t… ▽ More

    Submitted 23 October, 2019; originally announced October 2019.

    Comments: 6 pages, under review at ICASSP 2020

  38. arXiv:1910.03320  [pdf, other

    cs.CL eess.AS

    One-To-Many Multilingual End-to-end Speech Translation

    Authors: Mattia Antonino Di Gangi, Matteo Negri, Marco Turchi

    Abstract: Nowadays, training end-to-end neural models for spoken language translation (SLT) still has to confront with extreme data scarcity conditions. The existing SLT parallel corpora are indeed orders of magnitude smaller than those available for the closely related tasks of automatic speech recognition (ASR) and machine translation (MT), which usually comprise tens of millions of instances. To cope wit… ▽ More

    Submitted 8 October, 2019; originally announced October 2019.

    Comments: 8 pages, one figure, version accepted at ASRU 2019

  39. arXiv:1910.00478  [pdf, other

    cs.CL

    Machine Translation for Machines: the Sentiment Classification Use Case

    Authors: Amirhossein Tebbifakhr, Luisa Bentivogli, Matteo Negri, Marco Turchi

    Abstract: We propose a neural machine translation (NMT) approach that, instead of pursuing adequacy and fluency ("human-oriented" quality criteria), aims to generate translations that are best suited as input to a natural language processing component designed for a specific downstream task (a "machine-oriented" criterion). Towards this objective, we present a reinforcement learning technique based on a new… ▽ More

    Submitted 1 October, 2019; originally announced October 2019.

  40. arXiv:1909.07342  [pdf, other

    cs.CL

    Multilingual Neural Machine Translation for Zero-Resource Languages

    Authors: Surafel M. Lakew, Marcello Federico, Matteo Negri, Marco Turchi

    Abstract: In recent years, Neural Machine Translation (NMT) has been shown to be more effective than phrase-based statistical methods, thus quickly becoming the state of the art in machine translation (MT). However, NMT systems are limited in translating low-resourced languages, due to the significant amount of parallel data that is required to learn useful mappings between languages. In this work, we show… ▽ More

    Submitted 16 September, 2019; originally announced September 2019.

    Comments: 15 pages, Published on Italian Journal of Computational Linguistics (IJCoL) -- Multilingual Neural Machine Translation for Low-Resource Languages, June 2018

  41. arXiv:1811.01389  [pdf, other

    cs.CL

    Improving Zero-Shot Translation of Low-Resource Languages

    Authors: Surafel M. Lakew, Quintino F. Lotito, Matteo Negri, Marco Turchi, Marcello Federico

    Abstract: Recent work on multilingual neural machine translation reported competitive performance with respect to bilingual models and surprisingly good performance even on (zeroshot) translation directions not observed at training time. We investigate here a zero-shot translation in a particularly lowresource multilingual setting. We propose a simple iterative training procedure that leverages a duality of… ▽ More

    Submitted 4 November, 2018; originally announced November 2018.

    Comments: Published at the International Workshop on Spoken Language Translation (IWSLT), Tokyo, Japan, December 2017

  42. arXiv:1811.01137  [pdf, other

    cs.CL

    Transfer Learning in Multilingual Neural Machine Translation with Dynamic Vocabulary

    Authors: Surafel M. Lakew, Aliia Erofeeva, Matteo Negri, Marcello Federico, Marco Turchi

    Abstract: We propose a method to transfer knowledge across neural machine translation (NMT) models by means of a shared dynamic vocabulary. Our approach allows to extend an initial model for a given language pair to cover new languages by adapting its vocabulary as long as new data become available (i.e., introducing new vocabulary items if they are not included in the initial model). The parameter transfer… ▽ More

    Submitted 2 November, 2018; originally announced November 2018.

    Comments: Published at the International Workshop on Spoken Language Translation (IWSLT), 2018

  43. arXiv:1810.07652  [pdf, other

    eess.AS cs.CL cs.LG cs.SD stat.ML

    Fine-tuning on Clean Data for End-to-End Speech Translation: FBK @ IWSLT 2018

    Authors: Mattia Antonino Di Gangi, Roberto Dessì, Roldano Cattoni, Matteo Negri, Marco Turchi

    Abstract: This paper describes FBK's submission to the end-to-end English-German speech translation task at IWSLT 2018. Our system relies on a state-of-the-art model based on LSTMs and CNNs, where the CNNs are used to reduce the temporal dimension of the audio input, which is in general much higher than machine translation input. Our model was trained only on the audio-to-text parallel data released for the… ▽ More

    Submitted 16 October, 2018; originally announced October 2018.

    Comments: 6 pages, 2 figures, system description at the 15th International Workshop on Spoken Language Translation (IWSLT) 2018

  44. arXiv:1803.07274  [pdf, ps, other

    cs.CL

    eSCAPE: a Large-scale Synthetic Corpus for Automatic Post-Editing

    Authors: Matteo Negri, Marco Turchi, Rajen Chatterjee, Nicola Bertoldi

    Abstract: Training models for the automatic correction of machine-translated text usually relies on data consisting of (source, MT, human post- edit) triplets providing, for each source sentence, examples of translation errors with the corresponding corrections made by a human post-editor. Ideally, a large amount of data of this kind should allow the model to learn reliable correction patterns and effective… ▽ More

    Submitted 20 March, 2018; originally announced March 2018.

    Comments: Accepted at LREC 2018

  45. arXiv:1707.09879  [pdf, ps, other

    cs.CL

    Linguistically Motivated Vocabulary Reduction for Neural Machine Translation from Turkish to English

    Authors: Duygu Ataman, Matteo Negri, Marco Turchi, Marcello Federico

    Abstract: The necessity of using a fixed-size word vocabulary in order to control the model complexity in state-of-the-art neural machine translation (NMT) systems is an important bottleneck on performance, especially for morphologically rich languages. Conventional methods that aim to overcome this problem by using sub-word or character-level representations solely rely on statistics and disregard the ling… ▽ More

    Submitted 31 July, 2017; originally announced July 2017.

    Comments: The 20th Annual Conference of the European Association for Machine Translation (EAMT), Research Paper, 12 pages

    Journal ref: The Prague Bulletin of Mathematical Linguistics. No. 108, 2017, pp. 331-342

  46. Automatic Quality Estimation for ASR System Combination

    Authors: Shahab Jalalvand, Matteo Negri, Daniele Falavigna, Marco Matassoni, Marco Turchi

    Abstract: Recognizer Output Voting Error Reduction (ROVER) has been widely used for system combination in automatic speech recognition (ASR). In order to select the most appropriate words to insert at each position in the output transcriptions, some ROVER extensions rely on critical information such as confidence scores and other ASR decoder features. This information, which is not always available, highly… ▽ More

    Submitted 22 June, 2017; originally announced June 2017.

  47. arXiv:1702.01714  [pdf, ps, other

    cs.CL

    DNN adaptation by automatic quality estimation of ASR hypotheses

    Authors: Daniele Falavigna, Marco Matassoni, Shahab Jalalvand, Matteo Negri, Marco Turchi

    Abstract: In this paper we propose to exploit the automatic Quality Estimation (QE) of ASR hypotheses to perform the unsupervised adaptation of a deep neural network modeling acoustic probabilities. Our hypothesis is that significant improvements can be achieved by: i)automatically transcribing the evaluation data we are currently trying to recognise, and ii) selecting from it a subset of "good quality" ins… ▽ More

    Submitted 6 February, 2017; originally announced February 2017.

    Comments: Computer Speech & Language December 2016

  48. SentiWords: Deriving a High Precision and High Coverage Lexicon for Sentiment Analysis

    Authors: Lorenzo Gatti, Marco Guerini, Marco Turchi

    Abstract: Deriving prior polarity lexica for sentiment analysis - where positive or negative scores are associated with words out of context - is a challenging task. Usually, a trade-off between precision and coverage is hard to find, and it depends on the methodology used to build the lexicon. Manually annotated lexica provide a high precision but lack in coverage, whereas automatic derivation from pre-exi… ▽ More

    Submitted 30 October, 2015; originally announced October 2015.

    Comments: in Affective Computing, IEEE Transactions on (2015)

  49. arXiv:1401.2943  [pdf, ps, other

    cs.CL

    ONTS: "Optima" News Translation System

    Authors: Marco Turchi, Martin Atkinson, Alastair Wilcox, Brett Crawley, Stefano Bucci, Ralf Steinberger, Erik Van der Goot

    Abstract: We propose a real-time machine translation system that allows users to select a news category and to translate the related live news articles from Arabic, Czech, Danish, Farsi, French, German, Italian, Polish, Portuguese, Spanish and Turkish into English. The Moses-based system was optimised for the news domain and differs from other available systems in four ways: (1) News items are automatically… ▽ More

    Submitted 13 January, 2014; originally announced January 2014.

    ACM Class: I.2.7; H.3.3; H.3.6

    Journal ref: Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, pages 25-30, Avignon, France, April 23 - 27 2012. Association for Computational Linguistics

  50. arXiv:1309.5843  [pdf, ps, other

    cs.CL

    Sentiment Analysis: How to Derive Prior Polarities from SentiWordNet

    Authors: Marco Guerini, Lorenzo Gatti, Marco Turchi

    Abstract: Assigning a positive or negative score to a word out of context (i.e. a word's prior polarity) is a challenging task for sentiment analysis. In the literature, various approaches based on SentiWordNet have been proposed. In this paper, we compare the most often used techniques together with newly proposed ones and incorporate all of them in a learning framework to see whether blending them can fur… ▽ More

    Submitted 23 September, 2013; originally announced September 2013.

    Comments: To appear in Proceedings of EMNLP 2013