
Showing 1–50 of 68 results for author: Specia, L

  1. arXiv:2412.13421  [pdf, other]

    cs.SD eess.AS

    Detecting Machine-Generated Music with Explainability -- A Challenge and Early Benchmarks

    Authors: Yupei Li, Qiyang Sun, Hanqian Li, Lucia Specia, Björn W. Schuller

    Abstract: Machine-generated music (MGM) has become a groundbreaking innovation with wide-ranging applications, such as music therapy, personalised editing, and creative inspiration within the music industry. However, the unregulated proliferation of MGM presents considerable challenges to the entertainment, education, and arts sectors by potentially undermining the value of high-quality human compositions.…

    Submitted 17 December, 2024; originally announced December 2024.

  2. arXiv:2412.12679  [pdf, other]

    cs.CL

    Detecting Document-level Paraphrased Machine Generated Content: Mimicking Human Writing Style and Involving Discourse Features

    Authors: Yupei Li, Manuel Milling, Lucia Specia, Björn W. Schuller

    Abstract: The availability of high-quality APIs for Large Language Models (LLMs) has facilitated the widespread creation of Machine-Generated Content (MGC), posing challenges such as academic plagiarism and the spread of misinformation. Existing MGC detectors often focus solely on surface-level information, overlooking implicit and structural features. This makes them susceptible to deception by surface-lev…

    Submitted 17 December, 2024; originally announced December 2024.

  3. arXiv:2412.06001  [pdf, other]

    cs.SD cs.MM eess.AS

    M6: Multi-generator, Multi-domain, Multi-lingual and cultural, Multi-genres, Multi-instrument Machine-Generated Music Detection Databases

    Authors: Yupei Li, Hanqian Li, Lucia Specia, Björn W. Schuller

    Abstract: Machine-generated music (MGM) has emerged as a powerful tool with applications in music therapy, personalised editing, and creative inspiration for the music community. However, its unregulated use threatens the entertainment, education, and arts sectors by diminishing the value of high-quality human compositions. Detecting machine-generated music (MGMD) is, therefore, critical to safeguarding the…

    Submitted 8 December, 2024; originally announced December 2024.

  4. arXiv:2412.00571  [pdf, other]

    cs.SD eess.AS

    From Audio Deepfake Detection to AI-Generated Music Detection -- A Pathway and Overview

    Authors: Yupei Li, Manuel Milling, Lucia Specia, Björn W. Schuller

    Abstract: As Artificial Intelligence (AI) technologies continue to evolve, their use in generating realistic, contextually appropriate content has expanded into various domains. Music, an art form and medium for entertainment, deeply rooted into human culture, is seeing an increased involvement of AI into its production. However, despite the effective application of AI music generation (AIGM) tools, the unr…

    Submitted 10 December, 2024; v1 submitted 30 November, 2024; originally announced December 2024.

  5. arXiv:2410.13077  [pdf, other]

    cs.CL cs.AI

    Tuning Language Models by Mixture-of-Depths Ensemble

    Authors: Haoyan Luo, Lucia Specia

    Abstract: Transformer-based Large Language Models (LLMs) traditionally rely on final-layer loss for training and final-layer representations for predictions, potentially overlooking the predictive power embedded in intermediate layers. Surprisingly, we find that focusing training efforts on these intermediate layers can yield training losses comparable to those of final layers, with complementary test-time…

    Submitted 16 October, 2024; originally announced October 2024.

  6. arXiv:2407.11002  [pdf, other]

    cs.CL cs.AI cs.LG

    MoESD: Mixture of Experts Stable Diffusion to Mitigate Gender Bias

    Authors: Guorun Wang, Lucia Specia

    Abstract: Text-to-image models are known to propagate social biases. For example, when prompted to generate images of people in certain professions, these models tend to systematically generate specific genders or ethnicities. In this paper, we show that this bias is already present in the text encoder of the model and introduce a Mixture-of-Experts approach by identifying text-encoded bias in the latent sp…

    Submitted 24 October, 2024; v1 submitted 25 June, 2024; originally announced July 2024.

  7. arXiv:2407.00248  [pdf, other]

    cs.CL

    DiffuseDef: Improved Robustness to Adversarial Attacks via Iterative Denoising

    Authors: Zhenhao Li, Huichi Zhou, Marek Rei, Lucia Specia

    Abstract: Pretrained language models have significantly advanced performance across various natural language processing tasks. However, adversarial attacks continue to pose a critical challenge to systems built using these models, as they can be exploited with carefully crafted adversarial texts. Inspired by the ability of diffusion models to predict and reduce noise in computer vision, we propose a novel a…

    Submitted 16 May, 2025; v1 submitted 28 June, 2024; originally announced July 2024.

    Comments: Accepted to ACL 2025

  8. arXiv:2401.12874  [pdf, other]

    cs.CL cs.AI

    From Understanding to Utilization: A Survey on Explainability for Large Language Models

    Authors: Haoyan Luo, Lucia Specia

    Abstract: Explainability for Large Language Models (LLMs) is a critical yet challenging aspect of natural language processing. As LLMs are increasingly integral to diverse applications, their "black-box" nature sparks significant concerns regarding transparency and ethical use. This survey underscores the imperative for increased explainability in LLMs, delving into both the research on explainability and t…

    Submitted 21 February, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

  9. arXiv:2211.09878  [pdf, other]

    cs.CL

    Reducing Hallucinations in Neural Machine Translation with Feature Attribution

    Authors: Joël Tang, Marina Fomicheva, Lucia Specia

    Abstract: Neural conditional language generation models achieve the state-of-the-art in Neural Machine Translation (NMT) but are highly dependent on the quality of parallel training dataset. When trained on low-quality datasets, these models are prone to various error types, including hallucinations, i.e. outputs that are fluent, but unrelated to the source sentences. These errors are particularly dangerous…

    Submitted 14 June, 2023; v1 submitted 17 November, 2022; originally announced November 2022.

  10. arXiv:2210.10836  [pdf, other]

    cs.CV cs.LG

    Scene Text Recognition with Semantics

    Authors: Joshua Cesare Placidi, Yishu Miao, Zixu Wang, Lucia Specia

    Abstract: Scene Text Recognition (STR) models have achieved high performance in recent years on benchmark datasets where text images are presented with minimal noise. Traditional STR recognition pipelines take a cropped image as sole input and attempt to identify the characters present. This infrastructure can fail in instances where the input image is noisy or the text is partially obscured. This paper pro…

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: 11 pages, 7 figures

  11. arXiv:2210.05039  [pdf, other]

    cs.LG cs.CV

    Contrastive Video-Language Learning with Fine-grained Frame Sampling

    Authors: Zixu Wang, Yujie Zhong, Yishu Miao, Lin Ma, Lucia Specia

    Abstract: Despite recent progress in video and language representation learning, the weak or sparse correspondence between the two modalities remains a bottleneck in the area. Most video-language models are trained via pair-level loss to predict whether a pair of video and text is aligned. However, even in paired video-text segments, only a subset of the frames are semantically relevant to the corresponding…

    Submitted 10 October, 2022; originally announced October 2022.

    Comments: AACL-IJCNLP 2022

  12. arXiv:2206.12469  [pdf, other]

    cs.SD cs.CL eess.AS

    Burst2Vec: An Adversarial Multi-Task Approach for Predicting Emotion, Age, and Origin from Vocal Bursts

    Authors: Atijit Anuchitanukul, Lucia Specia

    Abstract: We present Burst2Vec, our multi-task learning approach to predict emotion, age, and origin (i.e., native country/language) from vocal bursts. Burst2Vec utilises pre-trained speech representations to capture acoustic information from raw waveforms and incorporates the concept of model debiasing via adversarial training. Our models achieve a relative 30 % performance gain over baselines using pre-ex…

    Submitted 18 October, 2022; v1 submitted 24 June, 2022; originally announced June 2022.

  13. arXiv:2205.00047  [pdf, other]

    cs.LG cs.CL cs.CR

    Logically Consistent Adversarial Attacks for Soft Theorem Provers

    Authors: Alexander Gaskell, Yishu Miao, Lucia Specia, Francesca Toni

    Abstract: Recent efforts within the AI community have yielded impressive results towards "soft theorem proving" over natural language sentences using language models. We propose a novel, generative adversarial framework for probing and improving these models' reasoning capabilities. Adversarial attacks in this domain suffer from the logical inconsistency problem, whereby perturbations to the input may alter…

    Submitted 29 April, 2022; originally announced May 2022.

    Comments: IJCAI-ECAI 2022

  14. Supervised Visual Attention for Simultaneous Multimodal Machine Translation

    Authors: Veneta Haralampieva, Ozan Caglayan, Lucia Specia

    Abstract: Recently, there has been a surge in research in multimodal machine translation (MMT), where additional modalities such as images are used to improve translation quality of textual systems. A particular use for such multimodal systems is the task of simultaneous machine translation, where visual context has been shown to complement the partial information provided by the source sentence, especially…

    Submitted 29 June, 2022; v1 submitted 23 January, 2022; originally announced January 2022.

    Comments: Accepted to Journal of Artificial Intelligence Research (JAIR)

    Journal ref: Journal of Artificial Intelligence Research 74 (2022) 1059-1089

  15. arXiv:2111.12447  [pdf, other]

    cs.CL

    Revisiting Contextual Toxicity Detection in Conversations

    Authors: Atijit Anuchitanukul, Julia Ive, Lucia Specia

    Abstract: Understanding toxicity in user conversations is undoubtedly an important problem. Addressing "covert" or implicit cases of toxicity is particularly hard and requires context. Very few previous studies have analysed the influence of conversational context in human perception or in automated detection models. We dive deeper into both these directions. We start by analysing existing contextual datase…

    Submitted 18 October, 2022; v1 submitted 24 November, 2021; originally announced November 2021.

  16. arXiv:2110.08226  [pdf, other]

    cs.LG cs.CL cs.CV

    Guiding Visual Question Generation

    Authors: Nihir Vedd, Zixu Wang, Marek Rei, Yishu Miao, Lucia Specia

    Abstract: In traditional Visual Question Generation (VQG), most images have multiple concepts (e.g. objects and categories) for which a question could be generated, but models are trained to mimic an arbitrary choice of concept as given in their training data. This makes training difficult and also poses issues for evaluation -- multiple valid questions exist for most images but only one or a few are captur…

    Submitted 26 July, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

    Comments: 14 pages including references and Appendix. 3 figures and 4 tables

  17. arXiv:2109.10859  [pdf, other]

    cs.CL cs.AI

    Pushing the Right Buttons: Adversarial Evaluation of Quality Estimation

    Authors: Diptesh Kanojia, Marina Fomicheva, Tharindu Ranasinghe, Frédéric Blain, Constantin Orăsan, Lucia Specia

    Abstract: Current Machine Translation (MT) systems achieve very good results on a growing variety of language pairs and datasets. However, they are known to produce fluent translation outputs that can contain important meaning errors, thus undermining their reliability in practice. Quality Estimation (QE) is the task of automatically assessing the performance of MT systems at test time. Thus, in order to be…

    Submitted 22 September, 2021; originally announced September 2021.

    Comments: Accepted to WMT 2021 Conference co-located with EMNLP 2021. 14 pages with a 4 page appendix

  18. arXiv:2109.08627  [pdf, other]

    cs.CL

    Classification-based Quality Estimation: Small and Efficient Models for Real-world Applications

    Authors: Shuo Sun, Ahmed El-Kishky, Vishrav Chaudhary, James Cross, Francisco Guzmán, Lucia Specia

    Abstract: Sentence-level Quality estimation (QE) of machine translation is traditionally formulated as a regression task, and the performance of QE models is typically measured by Pearson correlation with human labels. Recent QE models have achieved previously-unseen levels of correlation with human judgments, but they rely on large multilingual contextualized language models that are computationally expens…

    Submitted 17 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021

  19. arXiv:2109.08120  [pdf, other]

    cs.CL

    A Survey of Online Hate Speech through the Causal Lens

    Authors: Antigoni-Maria Founta, Lucia Specia

    Abstract: The societal issue of digital hostility has previously attracted a lot of attention. The topic counts an ample body of literature, yet remains prominent and challenging as ever due to its subjective nature. We posit that a better understanding of this problem will require the use of causal inference frameworks. This survey summarises the relevant research that revolves around estimations of causal…

    Submitted 16 September, 2021; originally announced September 2021.

    Comments: Accepted to CI+NLP: First Workshop on Causal Inference and NLP, part of EMNLP 2021

  20. arXiv:2108.12197  [pdf, other]

    cs.CL

    Translation Error Detection as Rationale Extraction

    Authors: Marina Fomicheva, Lucia Specia, Nikolaos Aletras

    Abstract: Recent Quality Estimation (QE) models based on multilingual pre-trained representations have achieved very competitive results when predicting the overall quality of translated sentences. Predicting translation errors, i.e. detecting specifically which words are incorrect, is a more challenging task, especially with limited amounts of training data. We hypothesize that, not unlike humans, successf…

    Submitted 27 August, 2021; originally announced August 2021.

  21. arXiv:2107.00411  [pdf, other]

    cs.CL

    Knowledge Distillation for Quality Estimation

    Authors: Amit Gajbhiye, Marina Fomicheva, Fernando Alva-Manchego, Frédéric Blain, Abiola Obamuyide, Nikolaos Aletras, Lucia Specia

    Abstract: Quality Estimation (QE) is the task of automatically predicting Machine Translation quality in the absence of reference translations, making it applicable in real-time settings, such as translating online social media conversations. Recent success in QE stems from the use of multilingual pre-trained representations, where very large models lead to impressive results. However, the inference time, d…

    Submitted 1 July, 2021; originally announced July 2021.

    Comments: ACL Findings 2021

  22. arXiv:2106.03484  [pdf, other]

    cs.CL

    BERTGEN: Multi-task Generation through BERT

    Authors: Faidon Mitzalis, Ozan Caglayan, Pranava Madhyastha, Lucia Specia

    Abstract: We present BERTGEN, a novel generative, decoder-only model which extends BERT by fusing multimodal and multilingual pretrained models VL-BERT and M-BERT, respectively. BERTGEN is auto-regressively trained for language generation tasks, namely image captioning, machine translation and multimodal machine translation, under a multitask setting. With a comprehensive set of evaluations, we show that BE…

    Submitted 7 June, 2021; originally announced June 2021.

    Comments: Accepted to ACL 2021 Main Conference

  23. arXiv:2105.04780  [pdf, other]

    cs.CV cs.CL

    Cross-Modal Generative Augmentation for Visual Question Answering

    Authors: Zixu Wang, Yishu Miao, Lucia Specia

    Abstract: Data augmentation has been shown to effectively improve the performance of multimodal machine learning models. This paper introduces a generative model for data augmentation by leveraging the correlations among multiple modalities. Different from conventional data augmentation approaches that apply low-level operations with deterministic heuristics, our method learns a generator that generates sam…

    Submitted 22 October, 2021; v1 submitted 11 May, 2021; originally announced May 2021.

    Comments: BMVC 2021

  24. arXiv:2104.07112  [pdf, other]

    cs.CL

    What Makes a Scientific Paper be Accepted for Publication?

    Authors: Panagiotis Fytas, Georgios Rizos, Lucia Specia

    Abstract: Despite peer-reviewing being an essential component of academia since the 1600s, it has repeatedly received criticisms for lack of transparency and consistency. We posit that recent work in machine learning and explainable AI provide tools that enable insights into the decisions from a given peer review process. We start by extracting global explanations in the form of linguistic features that aff…

    Submitted 14 April, 2021; originally announced April 2021.

    MSC Class: 68T50; ACM Class: I.2.7

  25. arXiv:2104.05688  [pdf, other]

    cs.CL cs.HC

    Backtranslation Feedback Improves User Confidence in MT, Not Quality

    Authors: Vilém Zouhar, Michal Novák, Matúš Žilinec, Ondřej Bojar, Mateo Obregón, Robin L. Hill, Frédéric Blain, Marina Fomicheva, Lucia Specia, Lisa Yankovskaya

    Abstract: Translating text into a language unknown to the text's author, dubbed outbound translation, is a modern need for which the user experience has significant room for improvement, beyond the basic machine translation facility. We demonstrate this by showing three ways in which user confidence in the outbound translation, as well as its overall final quality, can be affected: backward translation, qua…

    Submitted 12 April, 2021; originally announced April 2021.

    Comments: 9 pages (excluding references); to appear at NAACL-HWT 2021

  26. Visual Cues and Error Correction for Translation Robustness

    Authors: Zhenhao Li, Marek Rei, Lucia Specia

    Abstract: Neural Machine Translation models are sensitive to noise in the input texts, such as misspelled words and ungrammatical constructions. Existing robustness techniques generally fail when faced with unseen types of noise and their performance degrades on clean texts. In this paper, we focus on three types of realistic noise that are commonly generated by humans and introduce the idea of visual conte…

    Submitted 2 May, 2022; v1 submitted 12 March, 2021; originally announced March 2021.

    Comments: Accepted at Findings of EMNLP 2021; add acknowledgements

  27. arXiv:2103.01910  [pdf, other]

    cs.CL

    MultiSubs: A Large-scale Multimodal and Multilingual Dataset

    Authors: Josiah Wang, Pranava Madhyastha, Josiel Figueiredo, Chiraag Lala, Lucia Specia

    Abstract: This paper introduces a large-scale multimodal and multilingual dataset that aims to facilitate research on grounding words to images in their contextual usage in language. The dataset consists of images selected to unambiguously illustrate concepts expressed in sentences from movie subtitles. The dataset is a valuable resource as (i) the images are aligned to text fragments rather than whole sent…

    Submitted 16 June, 2022; v1 submitted 2 March, 2021; originally announced March 2021.

    Comments: Added an n-gram with back-off baseline model to the lexical translation task (Section 7.2.4). Also synchronised the paper structure to the LREC2022 version of this work. This arxiv version is a longer version of the LREC2022 version including more experiments and an additional lexical translation task

  28. arXiv:2102.11403  [pdf, other]

    cs.CL

    Exploring Supervised and Unsupervised Rewards in Machine Translation

    Authors: Julia Ive, Zixu Wang, Marina Fomicheva, Lucia Specia

    Abstract: Reinforcement Learning (RL) is a powerful framework to address the discrepancy between loss functions used during training and the final evaluation metrics to be used at test time. When applied to neural Machine Translation (MT), it minimises the mismatch between the cross-entropy loss and non-differentiable evaluation metrics like BLEU. However, the suitability of these metrics as reward function…

    Submitted 22 February, 2021; originally announced February 2021.

    Comments: Long paper accepted to EACL 2021, Camera-ready version

  29. arXiv:2102.11387  [pdf, other]

    cs.CL

    Exploiting Multimodal Reinforcement Learning for Simultaneous Machine Translation

    Authors: Julia Ive, Andy Mingren Li, Yishu Miao, Ozan Caglayan, Pranava Madhyastha, Lucia Specia

    Abstract: This paper addresses the problem of simultaneous machine translation (SiMT) by exploring two main concepts: (a) adaptive policies to learn a good trade-off between high translation quality and low latency; and (b) visual information to support this process by providing additional (visual) contextual information which may be available before the textual input is produced. For that, we propose a mul…

    Submitted 22 February, 2021; originally announced February 2021.

    Comments: Long paper accepted to EACL 2021, Camera-ready version

  30. arXiv:2102.04020  [pdf, other]

    cs.CL

    Quality Estimation without Human-labeled Data

    Authors: Yi-Lin Tuan, Ahmed El-Kishky, Adithya Renduchintala, Vishrav Chaudhary, Francisco Guzmán, Lucia Specia

    Abstract: Quality estimation aims to measure the quality of translated content without access to a reference translation. This is crucial for machine translation systems in real-world scenarios where high-quality translation is needed. While many approaches exist for quality estimation, they are based on supervised machine learning requiring costly human labelled data. As an alternative, we propose a techni…

    Submitted 8 February, 2021; originally announced February 2021.

    Comments: Accepted by EACL2021

  31. arXiv:2101.10044  [pdf, other]

    cs.CL cs.CV

    Cross-lingual Visual Pre-training for Multimodal Machine Translation

    Authors: Ozan Caglayan, Menekse Kuyu, Mustafa Sercan Amac, Pranava Madhyastha, Erkut Erdem, Aykut Erdem, Lucia Specia

    Abstract: Pre-trained language models have been shown to improve performance in many natural language tasks substantially. Although the early focus of such models was single language pre-training, recent advances have resulted in cross-lingual and visual pre-training methods. In this paper, we combine these two approaches to learn visually-grounded cross-lingual representations. Specifically, we extend the…

    Submitted 20 April, 2021; v1 submitted 25 January, 2021; originally announced January 2021.

    Comments: Accepted to EACL 2021 (Camera-ready version)

  32. arXiv:2101.06399  [pdf, other]

    cs.CV cs.AI cs.CL

    Latent Variable Models for Visual Question Answering

    Authors: Zixu Wang, Yishu Miao, Lucia Specia

    Abstract: Current work on Visual Question Answering (VQA) explores deterministic approaches conditioned on various types of image and question features. We posit that, in addition to image and question pairs, other modalities are useful for teaching machines to carry out question answering. Hence in this paper, we propose latent variable models for VQA where extra information (e.g. captions and answer categor…

    Submitted 26 September, 2021; v1 submitted 16 January, 2021; originally announced January 2021.

    Comments: ICCV21 CLVL: 4th Workshop on Closing the Loop Between Vision and Language

  33. arXiv:2012.07098  [pdf, other]

    cs.CV

    MSVD-Turkish: A Comprehensive Multimodal Dataset for Integrated Vision and Language Research in Turkish

    Authors: Begum Citamak, Ozan Caglayan, Menekse Kuyu, Erkut Erdem, Aykut Erdem, Pranava Madhyastha, Lucia Specia

    Abstract: Automatic generation of video descriptions in natural language, also called video captioning, aims to understand the visual content of the video and produce a natural language sentence depicting the objects and actions in the scene. This challenging integrated vision and language problem, however, has been predominantly addressed for English. The lack of data and the linguistic properties of other…

    Submitted 13 December, 2020; originally announced December 2020.

  34. arXiv:2011.09634  [pdf, other]

    cs.CV

    Watch and Learn: Mapping Language and Noisy Real-world Videos with Self-supervision

    Authors: Yujie Zhong, Linhai Xie, Sen Wang, Lucia Specia, Yishu Miao

    Abstract: In this paper, we teach machines to understand visuals and natural language by learning the mapping between sentences and noisy video snippets without explicit annotations. Firstly, we define a self-supervised learning framework that captures the cross-modal information. A novel adversarial learning module is then introduced to explicitly handle the noises in the natural videos, where the subtitle…

    Submitted 11 January, 2021; v1 submitted 18 November, 2020; originally announced November 2020.

    Comments: NeurIPS 2020 Self-Supervised Learning Workshop

  35. arXiv:2010.13588  [pdf, ps, other]

    cs.CL

    Curious Case of Language Generation Evaluation Metrics: A Cautionary Tale

    Authors: Ozan Caglayan, Pranava Madhyastha, Lucia Specia

    Abstract: Automatic evaluation of language generation systems is a well-studied problem in Natural Language Processing. While novel metrics are proposed every year, a few popular metrics remain as the de facto metrics to evaluate tasks such as image captioning and machine translation, despite their known limitations. This is partly due to ease of use, and partly because researchers expect to see them and kn…

    Submitted 26 October, 2020; originally announced October 2020.

    Comments: 7 pages, accepted to COLING 2020

  36. arXiv:2010.04987  [pdf, other]

    cs.CL cs.HC cs.LG

    FIND: Human-in-the-Loop Debugging Deep Text Classifiers

    Authors: Piyawat Lertvittayakumjorn, Lucia Specia, Francesca Toni

    Abstract: Since obtaining a perfect training dataset (i.e., a dataset which is considerably large, unbiased, and well-representative of unseen cases) is hardly possible, many real-world text classifiers are trained on the available, yet imperfect, datasets. These classifiers are thus likely to have undesirable properties. For instance, they may have biases against some sub-populations or may not work effect…

    Submitted 10 October, 2020; originally announced October 2020.

    Comments: 17 pages including appendices; To appear at EMNLP 2020

  37. arXiv:2010.04480  [pdf, other]

    cs.CL

    MLQE-PE: A Multilingual Quality Estimation and Post-Editing Dataset

    Authors: Marina Fomicheva, Shuo Sun, Erick Fonseca, Chrysoula Zerva, Frédéric Blain, Vishrav Chaudhary, Francisco Guzmán, Nina Lopatina, Lucia Specia, André F. T. Martins

    Abstract: We present MLQE-PE, a new dataset for Machine Translation (MT) Quality Estimation (QE) and Automatic Post-Editing (APE). The dataset contains eleven language pairs, with human labels for up to 10,000 translations per language pair in the following formats: sentence-level direct assessments and post-editing effort, and word-level good/bad labels. It also contains the post-edited sentences, as well…

    Submitted 11 October, 2021; v1 submitted 9 October, 2020; originally announced October 2020.

  38. arXiv:2009.07310  [pdf, other]

    cs.CL

    Simultaneous Machine Translation with Visual Context

    Authors: Ozan Caglayan, Julia Ive, Veneta Haralampieva, Pranava Madhyastha, Loïc Barrault, Lucia Specia

    Abstract: Simultaneous machine translation (SiMT) aims to translate a continuous input text stream into another language with the lowest latency and highest quality possible. The translation thus has to start with an incomplete source text, which is read progressively, creating the need for anticipation. In this paper, we seek to understand whether the addition of visual information can compensate for the m…

    Submitted 13 October, 2020; v1 submitted 15 September, 2020; originally announced September 2020.

    Comments: Long paper accepted to EMNLP 2020, Camera-ready version

  39. arXiv:2005.10608  [pdf, other]

    cs.CL

    Unsupervised Quality Estimation for Neural Machine Translation

    Authors: Marina Fomicheva, Shuo Sun, Lisa Yankovskaya, Frédéric Blain, Francisco Guzmán, Mark Fishel, Nikolaos Aletras, Vishrav Chaudhary, Lucia Specia

    Abstract: Quality Estimation (QE) is an important component in making Machine Translation (MT) useful in real-world applications, as it is aimed to inform the user on the quality of the MT output at test time. Existing approaches require large amounts of expert annotated data, computation and time for training. As an alternative, we devise an unsupervised approach to QE where no training or access to additi…

    Submitted 20 July, 2020; v1 submitted 21 May, 2020; originally announced May 2020.

    Comments: Accepted for publication in TACL. Authors' final version

  40. arXiv:2005.00481  [pdf, other]

    cs.CL

    ASSET: A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations

    Authors: Fernando Alva-Manchego, Louis Martin, Antoine Bordes, Carolina Scarton, Benoît Sagot, Lucia Specia

    Abstract: In order to simplify a sentence, human editors perform multiple rewriting transformations: they split it into several shorter sentences, paraphrase words (i.e. replacing complex words or phrases by simpler synonyms), reorder components, and/or delete information deemed unnecessary. Despite this varied range of possible text alterations, current models for automatic sentence simplification are eva…

    Submitted 1 May, 2020; originally announced May 2020.

    Comments: Accepted to ACL 2020 (camera-ready version)

  41. arXiv:1911.12798  [pdf, other]

    cs.CL

    Multimodal Machine Translation through Visuals and Speech

    Authors: Umut Sulubacak, Ozan Caglayan, Stig-Arne Grönroos, Aku Rouhe, Desmond Elliott, Lucia Specia, Jörg Tiedemann

    Abstract: Multimodal machine translation involves drawing information from more than one modality, based on the assumption that the additional modalities will contain useful alternative views of the input data. The most prominent tasks in this area are spoken language translation, image-guided translation, and video-guided translation, which exploit audio and visual modalities, respectively. These tasks are…

    Submitted 28 November, 2019; originally announced November 2019.

    Comments: 34 pages, 4 tables, 8 figures. Submitted (Nov 2019) to the Machine Translation journal (Springer)

  42. arXiv:1910.13215  [pdf, other]

    cs.CL

    Transformer-based Cascaded Multimodal Speech Translation

    Authors: Zixiu Wu, Ozan Caglayan, Julia Ive, Josiah Wang, Lucia Specia

    Abstract: This paper describes the cascaded multimodal speech translation systems developed by Imperial College London for the IWSLT 2019 evaluation campaign. The architecture consists of an automatic speech recognition (ASR) system followed by a Transformer-based multimodal machine translation (MMT) system. While the ASR component is identical across the experiments, the MMT model varies in terms of the wa…

    Submitted 8 November, 2019; v1 submitted 29 October, 2019; originally announced October 2019.

    Comments: Accepted to IWSLT 2019

  43. arXiv:1910.07482  [pdf, other]

    cs.CL cs.NE

    Imperial College London Submission to VATEX Video Captioning Task

    Authors: Ozan Caglayan, Zixiu Wu, Pranava Madhyastha, Josiah Wang, Lucia Specia

    Abstract: This paper describes the Imperial College London team's submission to the 2019 VATEX video captioning challenge, where we first explore two sequence-to-sequence models, namely a recurrent (GRU) model and a transformer model, which generate captions from the I3D action features. We then investigate the effect of dropping the encoder and the attention mechanism and instead conditioning the GRU deco…

    Submitted 16 October, 2019; originally announced October 2019.

  44. arXiv:1910.06204  [pdf, other]

    cs.CL

    Estimating post-editing effort: a study on human judgements, task-based and reference-based metrics of MT quality

    Authors: Carolina Scarton, Mikel L. Forcada, Miquel Esplà-Gomis, Lucia Specia

    Abstract: Devising metrics to assess translation quality has always been at the core of machine translation (MT) research. Traditional automatic reference-based metrics, such as BLEU, have shown correlations with human judgements of adequacy and fluency and have been paramount for the advancement of MT system development. Crowd-sourcing has popularised and enabled the scalability of metrics based on human j…

    Submitted 14 October, 2019; originally announced October 2019.

    Comments: IWSLT 2019, Hong Kong, November 2 and 3, 2019

  45. Improving Neural Machine Translation Robustness via Data Augmentation: Beyond Back Translation

    Authors: Zhenhao Li, Lucia Specia

    Abstract: Neural Machine Translation (NMT) models have proved strong when translating clean texts, but they are very sensitive to noise in the input. Improving the robustness of NMT models can be seen as a form of "domain" adaptation to noise. The recently created Machine Translation on Noisy Text task corpus provides noisy-clean parallel data for a few language pairs, but this data is very limited in size and…

    Submitted 14 October, 2019; v1 submitted 7 October, 2019; originally announced October 2019.

    Comments: add missing content & references, fix url line break in footnotes

  46. arXiv:1908.07553  [pdf, other]

    cs.CV cs.CL cs.LG

    Phrase Localization Without Paired Training Examples

    Authors: Josiah Wang, Lucia Specia

    Abstract: Localizing phrases in images is an important part of image understanding and can be useful in many applications that require mappings between textual and visual information. Existing work attempts to learn these mappings from examples of phrase-image region correspondences (strong supervision) or from phrase-image pairs (weak supervision). We postulate that such paired annotations are unnecessary,…

    Submitted 20 August, 2019; originally announced August 2019.

    Comments: Accepted for oral presentation at the IEEE/CVF International Conference on Computer Vision (ICCV) 2019

  47. arXiv:1908.04567  [pdf, other]

    cs.CL

    EASSE: Easier Automatic Sentence Simplification Evaluation

    Authors: Fernando Alva-Manchego, Louis Martin, Carolina Scarton, Lucia Specia

    Abstract: We introduce EASSE, a Python package aiming to facilitate and standardise automatic evaluation and comparison of Sentence Simplification (SS) systems. EASSE provides a single access point to a broad range of evaluation resources: standard automatic metrics for assessing SS outputs (e.g. SARI), word-level accuracy scores for certain simplification transformations, reference-independent quality esti…

    Submitted 13 September, 2019; v1 submitted 13 August, 2019; originally announced August 2019.

    Comments: EMNLP-IJCNLP 2019 Demo (Camera-ready Version)

  48. arXiv:1908.01665  [pdf, other]

    cs.CL

    Predicting Actions to Help Predict Translations

    Authors: Zixiu Wu, Julia Ive, Josiah Wang, Pranava Madhyastha, Lucia Specia

    Abstract: We address the task of text translation on the How2 dataset using a state-of-the-art transformer-based multimodal approach. The question we ask ourselves is whether visual features can support the translation process. In particular, given that this is a dataset extracted from videos, we focus on the translation of actions, which we believe are poorly captured in current static image-text datasets…

    Submitted 18 August, 2019; v1 submitted 5 August, 2019; originally announced August 2019.

    Comments: Accepted to workshop "The How2 Challenge: New Tasks for Vision & Language" of International Conference on Machine Learning 2019

  49. arXiv:1907.09340  [pdf, other]

    cs.CL cs.CV cs.LG

    VIFIDEL: Evaluating the Visual Fidelity of Image Descriptions

    Authors: Pranava Madhyastha, Josiah Wang, Lucia Specia

    Abstract: We address the task of evaluating image description generation systems. We propose a novel image-aware metric for this task: VIFIDEL. It estimates the faithfulness of a generated caption with respect to the content of the actual image, based on the semantic similarity between labels of objects depicted in images and words in the description. The metric is also able to take into account the relativ…

    Submitted 22 July, 2019; originally announced July 2019.

    Comments: Accepted for publication at ACL 2019

  50. arXiv:1907.01055  [pdf, other]

    cs.CL cs.LG

    Is artificial data useful for biomedical Natural Language Processing algorithms?

    Authors: Zixu Wang, Julia Ive, Sumithra Velupillai, Lucia Specia

    Abstract: A major obstacle to the development of Natural Language Processing (NLP) methods in the biomedical domain is data accessibility. This problem can be addressed by generating medical data artificially. Most previous studies have focused on the generation of short clinical text, and evaluation of the data utility has been limited. We propose a generic methodology to guide the generation of clinical t…

    Submitted 7 August, 2019; v1 submitted 1 July, 2019; originally announced July 2019.

    Comments: BioNLP 2019