Skip to main content

Showing 1–1 of 1 results for author: Briedienė, M

.
  1. arXiv:2201.13242  [pdf, other

    cs.CL cs.IR cs.LG stat.ML

    Correcting diacritics and typos with a ByT5 transformer model

    Authors: Lukas Stankevičius, Mantas Lukoševičius, Jurgita Kapočiūtė-Dzikienė, Monika Briedienė, Tomas Krilavičius

    Abstract: Due to the fast pace of life and online communications and the prevalence of English and the QWERTY keyboard, people tend to forgo using diacritics, make typographical errors (typos) when typing in other languages. Restoring diacritics and correcting spelling is important for proper language use and the disambiguation of texts for both humans and downstream algorithms. However, both of these probl… ▽ More

    Submitted 18 March, 2022; v1 submitted 31 January, 2022; originally announced January 2022.

    MSC Class: 68T07; 68T50; 68T05 ACM Class: I.2.6; I.2.7

    Journal ref: Appl. Sci. 2022, 12(5), 2636