Skip to main content

Showing 1–6 of 6 results for author: Glushkova, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2305.19144  [pdf, other

    cs.CL

    BLEU Meets COMET: Combining Lexical and Neural Metrics Towards Robust Machine Translation Evaluation

    Authors: Taisiya Glushkova, Chrysoula Zerva, André F. T. Martins

    Abstract: Although neural-based machine translation evaluation metrics, such as COMET or BLEURT, have achieved strong correlations with human judgements, they are sometimes unreliable in detecting certain phenomena that can be considered as critical errors, such as deviations in entities and numbers. In contrast, traditional evaluation metrics, such as BLEU or chrF, which measure lexical or character overla… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: Accepted at EAMT 2023

  2. arXiv:2209.06243  [pdf, other

    cs.CL cs.LG

    CometKiwi: IST-Unbabel 2022 Submission for the Quality Estimation Shared Task

    Authors: Ricardo Rei, Marcos Treviso, Nuno M. Guerreiro, Chrysoula Zerva, Ana C. Farinha, Christine Maroti, José G. C. de Souza, Taisiya Glushkova, Duarte M. Alves, Alon Lavie, Luisa Coheur, André F. T. Martins

    Abstract: We present the joint contribution of IST and Unbabel to the WMT 2022 Shared Task on Quality Estimation (QE). Our team participated on all three subtasks: (i) Sentence and Word-level Quality Prediction; (ii) Explainable QE; and (iii) Critical Error Detection. For all tasks we build on top of the COMET framework, connecting it with the predictor-estimator architecture of OpenKiwi, and equipping it w… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: WMT 2022 Quality Estimation shared task

  3. arXiv:2204.06546  [pdf, other

    cs.CL

    Disentangling Uncertainty in Machine Translation Evaluation

    Authors: Chrysoula Zerva, Taisiya Glushkova, Ricardo Rei, André F. T. Martins

    Abstract: Trainable evaluation metrics for machine translation (MT) exhibit strong correlation with human judgements, but they are often hard to interpret and might produce unreliable scores under noisy or out-of-domain data. Recent work has attempted to mitigate this with simple uncertainty quantification techniques (Monte Carlo dropout and deep ensembles), however these techniques (as we show) are limited… ▽ More

    Submitted 29 November, 2022; v1 submitted 13 April, 2022; originally announced April 2022.

    Comments: accepted at EMNLP 2022

  4. Uncertainty-Aware Machine Translation Evaluation

    Authors: Taisiya Glushkova, Chrysoula Zerva, Ricardo Rei, André F. T. Martins

    Abstract: Several neural-based metrics have been recently proposed to evaluate machine translation quality. However, all of them resort to point estimates, which provide limited information at segment level. This is made worse as they are trained on noisy, biased and scarce human judgements, often resulting in unreliable quality predictions. In this paper, we introduce uncertainty-aware MT evaluation and an… ▽ More

    Submitted 24 March, 2022; v1 submitted 13 September, 2021; originally announced September 2021.

    Comments: Findings of EMNLP 2021 v2: corrected typos (esp. Tab 5)

  5. DaNetQA: a yes/no Question Answering Dataset for the Russian Language

    Authors: Taisia Glushkova, Alexey Machnev, Alena Fenogenova, Tatiana Shavrina, Ekaterina Artemova, Dmitry I. Ignatov

    Abstract: DaNetQA, a new question-answering corpus, follows (Clark et. al, 2019) design: it comprises natural yes/no questions. Each question is paired with a paragraph from Wikipedia and an answer, derived from the paragraph. The task is to take both the question and a paragraph as input and come up with a yes/no answer, i.e. to produce a binary output. In this paper, we present a reproducible approach to… ▽ More

    Submitted 15 October, 2020; v1 submitted 6 October, 2020; originally announced October 2020.

    Comments: Analysis of Images, Social Networks and Texts - 9 th International Conference, AIST 2020, Skolkovo, Russia, October 15-16, 2020, Revised Selected Papers. Lecture Notes in Computer Science (https://dblp.org/db/series/lncs/index.html), Springer 2020

  6. Char-RNN and Active Learning for Hashtag Segmentation

    Authors: Taisiya Glushkova, Ekaterina Artemova

    Abstract: We explore the abilities of character recurrent neural network (char-RNN) for hashtag segmentation. Our approach to the task is the following: we generate synthetic training dataset according to frequent n-grams that satisfy predefined morpho-syntactic patterns to avoid any manual annotation. The active learning strategy limits the training dataset and selects informative training subset. The appr… ▽ More

    Submitted 8 November, 2019; originally announced November 2019.

    Comments: to appear in Cicling2019