Skip to main content

Showing 1–4 of 4 results for author: Voloshina, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2211.05100  [pdf, other

    cs.CL

    BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

    Authors: BigScience Workshop, :, Teven Le Scao, Angela Fan, Christopher Akiki, Ellie Pavlick, Suzana Ilić, Daniel Hesslow, Roman Castagné, Alexandra Sasha Luccioni, François Yvon, Matthias Gallé, Jonathan Tow, Alexander M. Rush, Stella Biderman, Albert Webson, Pawan Sasanka Ammanamanchi, Thomas Wang, Benoît Sagot, Niklas Muennighoff, Albert Villanova del Moral, Olatunji Ruwase, Rachel Bawden, Stas Bekman, Angelina McMillan-Major , et al. (369 additional authors not shown)

    Abstract: Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access… ▽ More

    Submitted 27 June, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

  2. arXiv:2210.13236  [pdf, other

    cs.CL cs.AI

    Universal and Independent: Multilingual Probing Framework for Exhaustive Model Interpretation and Evaluation

    Authors: Oleg Serikov, Vitaly Protasov, Ekaterina Voloshina, Viktoria Knyazkova, Tatiana Shavrina

    Abstract: Linguistic analysis of language models is one of the ways to explain and describe their reasoning, weaknesses, and limitations. In the probing part of the model interpretability research, studies concern individual languages as well as individual linguistic structures. The question arises: are the detected regularities linguistically coherent, or on the contrary, do they dissonate at the typologic… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: Accepted to BlackBoxNLP, EMNLP 2022

    MSC Class: 68-04; 68-06; 68T50 ACM Class: G.3; I.2.7

  3. Is neural language acquisition similar to natural? A chronological probing study

    Authors: Ekaterina Voloshina, Oleg Serikov, Tatiana Shavrina

    Abstract: The probing methodology allows one to obtain a partial representation of linguistic phenomena stored in the inner layers of the neural network, using external classifiers and statistical analysis. Pre-trained transformer-based language models are widely used both for natural language understanding (NLU) and natural language generation (NLG) tasks making them most commonly used for downstream appli… ▽ More

    Submitted 1 July, 2022; originally announced July 2022.

    Comments: Published in proceedings of Dialogue-2022 "Computational Linguistics and Intellectual Technologies"

  4. arXiv:2201.09997  [pdf, other

    cs.CL

    Razmecheno: Named Entity Recognition from Digital Archive of Diaries "Prozhito"

    Authors: Timofey Atnashev, Veronika Ganeeva, Roman Kazakov, Daria Matyash, Michael Sonkin, Ekaterina Voloshina, Oleg Serikov, Ekaterina Artemova

    Abstract: The vast majority of existing datasets for Named Entity Recognition (NER) are built primarily on news, research papers and Wikipedia with a few exceptions, created from historical and literary texts. What is more, English is the main source for data for further labelling. This paper aims to fill in multiple gaps by creating a novel dataset "Razmecheno", gathered from the diary texts of the project… ▽ More

    Submitted 24 January, 2022; originally announced January 2022.

    Comments: Submitted to LREC 2022