Skip to main content

Showing 1–9 of 9 results for author: Eddine, M K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2210.17378  [pdf, other

    cs.CL

    Questioning the Validity of Summarization Datasets and Improving Their Factual Consistency

    Authors: Yanzhu Guo, ChloƩ Clavel, Moussa Kamal Eddine, Michalis Vazirgiannis

    Abstract: The topic of summarization evaluation has recently attracted a surge of attention due to the rapid development of abstractive summarization systems. However, the formulation of the task is rather ambiguous, neither the linguistic nor the natural language processing community has succeeded in giving a mutually agreed-upon definition. Due to this lack of well-defined formulation, a large number of p… ▽ More

    Submitted 31 October, 2022; originally announced October 2022.

    Comments: Accepted to EMNLP 2022

  2. arXiv:2210.06576  [pdf, other

    cs.CL

    DATScore: Evaluating Translation with Data Augmented Translations

    Authors: Moussa Kamal Eddine, Guokan Shang, Michalis Vazirgiannis

    Abstract: The rapid development of large pretrained language models has revolutionized not only the field of Natural Language Generation (NLG) but also its evaluation. Inspired by the recent work of BARTScore: a metric leveraging the BART language model to evaluate the quality of generated text from various aspects, we introduce DATScore. DATScore uses data augmentation techniques to improve the evaluation… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

  3. Word Sense Induction with Hierarchical Clustering and Mutual Information Maximization

    Authors: Hadi Abdine, Moussa Kamal Eddine, Michalis Vazirgiannis, Davide Buscaldi

    Abstract: Word sense induction (WSI) is a difficult problem in natural language processing that involves the unsupervised automatic detection of a word's senses (i.e. meanings). Recent work achieves significant results on the WSI task by pre-training a language model that can exclusively disambiguate word senses, whereas others employ previously pre-trained language models in conjunction with additional str… ▽ More

    Submitted 11 October, 2022; originally announced October 2022.

  4. arXiv:2206.11249  [pdf, other

    cs.CL cs.AI cs.LG

    GEMv2: Multilingual NLG Benchmarking in a Single Line of Code

    Authors: Sebastian Gehrmann, Abhik Bhattacharjee, Abinaya Mahendiran, Alex Wang, Alexandros Papangelis, Aman Madaan, Angelina McMillan-Major, Anna Shvets, Ashish Upadhyay, Bingsheng Yao, Bryan Wilie, Chandra Bhagavatula, Chaobin You, Craig Thomson, Cristina Garbacea, Dakuo Wang, Daniel Deutsch, Deyi Xiong, Di Jin, Dimitra Gkatzia, Dragomir Radev, Elizabeth Clark, Esin Durmus, Faisal Ladhak, Filip Ginter , et al. (52 additional authors not shown)

    Abstract: Evaluation in machine learning is usually informed by past choices, for example which datasets or metrics to use. This standardization enables the comparison on equal footing using leaderboards, but the evaluation choices become sub-optimal as better alternatives arise. This problem is especially pertinent in natural language generation which requires ever-improving suites of datasets, metrics, an… ▽ More

    Submitted 24 June, 2022; v1 submitted 22 June, 2022; originally announced June 2022.

  5. arXiv:2203.10945  [pdf, other

    cs.CL

    AraBART: a Pretrained Arabic Sequence-to-Sequence Model for Abstractive Summarization

    Authors: Moussa Kamal Eddine, Nadi Tomeh, Nizar Habash, Joseph Le Roux, Michalis Vazirgiannis

    Abstract: Like most natural language understanding and generation tasks, state-of-the-art models for summarization are transformer-based sequence-to-sequence architectures that are pretrained on large corpora. While most existing models focused on English, Arabic remained understudied. In this paper we propose AraBART, the first Arabic model in which the encoder and the decoder are pretrained end-to-end, ba… ▽ More

    Submitted 21 March, 2022; originally announced March 2022.

  6. arXiv:2112.00566  [pdf, ps, other

    cs.CL

    NLP Research and Resources at DaSciM, Ecole Polytechnique

    Authors: Hadi Abdine, Yanzhu Guo, Moussa Kamal Eddine, Giannis Nikolentzos, Stamatis Outsios, Guokan Shang, Christos Xypolopoulos, Michalis Vazirgiannis

    Abstract: DaSciM (Data Science and Mining) part of LIX at Ecole Polytechnique, established in 2013 and since then producing research results in the area of large scale data analysis via methods of machine and deep learning. The group has been specifically active in the area of NLP and text mining with interesting results at methodological and resources level. Here follow our different contributions of inter… ▽ More

    Submitted 1 December, 2021; originally announced December 2021.

  7. arXiv:2110.08559  [pdf, other

    cs.CL

    FrugalScore: Learning Cheaper, Lighter and Faster Evaluation Metricsfor Automatic Text Generation

    Authors: Moussa Kamal Eddine, Guokan Shang, Antoine J. -P. Tixier, Michalis Vazirgiannis

    Abstract: Fast and reliable evaluation metrics are key to R&D progress. While traditional natural language generation metrics are fast, they are not very reliable. Conversely, new metrics based on large pretrained language models are much more reliable, but require significant computational resources. In this paper, we propose FrugalScore, an approach to learn a fixed, low cost version of any expensive NLG… ▽ More

    Submitted 16 October, 2021; originally announced October 2021.

  8. arXiv:2105.01990  [pdf, other

    cs.CL

    Evaluation Of Word Embeddings From Large-Scale French Web Content

    Authors: Hadi Abdine, Christos Xypolopoulos, Moussa Kamal Eddine, Michalis Vazirgiannis

    Abstract: Distributed word representations are popularly used in many tasks in natural language processing. Adding that pretrained word vectors on huge text corpus achieved high performance in many different NLP tasks. This paper introduces multiple high-quality word vectors for the French language where two of them are trained on massive crawled French data during this study and the others are trained on a… ▽ More

    Submitted 10 March, 2022; v1 submitted 5 May, 2021; originally announced May 2021.

  9. arXiv:2010.12321  [pdf, ps, other

    cs.CL

    BARThez: a Skilled Pretrained French Sequence-to-Sequence Model

    Authors: Moussa Kamal Eddine, Antoine J. -P. Tixier, Michalis Vazirgiannis

    Abstract: Inductive transfer learning has taken the entire NLP field by storm, with models such as BERT and BART setting new state of the art on countless NLU tasks. However, most of the available models and research have been conducted for English. In this work, we introduce BARThez, the first large-scale pretrained seq2seq model for French. Being based on BART, BARThez is particularly well-suited for gene… ▽ More

    Submitted 9 February, 2021; v1 submitted 23 October, 2020; originally announced October 2020.

    Comments: More experiments and results, human evaluation, reorganization of paper