Skip to main content

Showing 1–2 of 2 results for author: Baushenko, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2309.10931  [pdf, ps, other

    cs.CL

    A Family of Pretrained Transformer Language Models for Russian

    Authors: Dmitry Zmitrovich, Alexander Abramov, Andrey Kalmykov, Maria Tikhonova, Ekaterina Taktasheva, Danil Astafurov, Mark Baushenko, Artem Snegirev, Vitalii Kadulin, Sergey Markov, Tatiana Shavrina, Vladislav Mikhailov, Alena Fenogenova

    Abstract: Transformer language models (LMs) are fundamental to NLP research methodologies and applications in various languages. However, developing such models specifically for the Russian language has received little attention. This paper introduces a collection of 13 Russian Transformer LMs, which spans encoder (ruBERT, ruRoBERTa, ruELECTRA), decoder (ruGPT-3), and encoder-decoder (ruT5, FRED-T5) archite… ▽ More

    Submitted 2 August, 2024; v1 submitted 19 September, 2023; originally announced September 2023.

    Comments: LREC-COLING-2024

    Journal ref: https://aclanthology.org/2024.lrec-main.45/

  2. arXiv:2308.09435  [pdf, other

    cs.CL

    A Methodology for Generative Spelling Correction via Natural Spelling Errors Emulation across Multiple Domains and Languages

    Authors: Nikita Martynov, Mark Baushenko, Anastasia Kozlova, Katerina Kolomeytseva, Aleksandr Abramov, Alena Fenogenova

    Abstract: Modern large language models demonstrate impressive capabilities in text generation and generalization. However, they often struggle with solving text editing tasks, particularly when it comes to correcting spelling errors and mistypings. In this paper, we present a methodology for generative spelling correction (SC), which was tested on English and Russian languages and potentially can be extende… ▽ More

    Submitted 13 September, 2023; v1 submitted 18 August, 2023; originally announced August 2023.

    Comments: to appear in EACL 2024