Skip to main content

Showing 1–11 of 11 results for author: Toneva, M

Searching in archive q-bio. Search in all archives.
.
  1. arXiv:2506.03832  [pdf, ps, other

    cs.CL cs.SD eess.AS q-bio.NC

    Brain-tuned Speech Models Better Reflect Speech Processing Stages in the Brain

    Authors: Omer Moussa, Mariya Toneva

    Abstract: Pretrained self-supervised speech models excel in speech tasks but do not reflect the hierarchy of human speech processing, as they encode rich semantics in middle layers and poor semantics in late layers. Recent work showed that brain-tuning (fine-tuning models using human brain recordings) improves speech models' semantic understanding. Here, we examine how well brain-tuned models further reflec… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: Proceedings of Interspeech 2025

  2. arXiv:2311.04664  [pdf, other

    cs.CL cs.LG eess.AS q-bio.NC

    Speech language models lack important brain-relevant semantics

    Authors: Subba Reddy Oota, Emin Çelik, Fatma Deniz, Mariya Toneva

    Abstract: Despite known differences between reading and listening in the brain, recent work has shown that text-based language models predict both text-evoked and speech-evoked brain activity to an impressive degree. This poses the question of what types of information language models truly predict in the brain. We investigate this question via a direct approach, in which we systematically remove specific l… ▽ More

    Submitted 16 June, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

    Comments: 26 pages, 20 figures, The 62nd Annual Meeting of the Association for Computational Linguistics, Long paper - Main

  3. arXiv:2310.13018  [pdf, other

    q-bio.NC cs.AI cs.LG cs.NE

    Getting aligned on representational alignment

    Authors: Ilia Sucholutsky, Lukas Muttenthaler, Adrian Weller, Andi Peng, Andreea Bobu, Been Kim, Bradley C. Love, Christopher J. Cueva, Erin Grant, Iris Groen, Jascha Achterberg, Joshua B. Tenenbaum, Katherine M. Collins, Katherine L. Hermann, Kerem Oktar, Klaus Greff, Martin N. Hebart, Nathan Cloos, Nikolaus Kriegeskorte, Nori Jacoby, Qiuyi Zhang, Raja Marjieh, Robert Geirhos, Sherol Chen, Simon Kornblith , et al. (8 additional authors not shown)

    Abstract: Biological and artificial information processing systems form representations of the world that they can use to categorize, reason, plan, navigate, and make decisions. How can we measure the similarity between the representations formed by these diverse systems? Do similarities in representations then translate into similar behavior? If so, then how can a system's representations be modified to be… ▽ More

    Submitted 26 November, 2024; v1 submitted 18 October, 2023; originally announced October 2023.

    Comments: 51 pages; Working paper (changes to be made in upcoming revisions)

  4. arXiv:2301.10297  [pdf, other

    cs.CL q-bio.NC

    Large language models can segment narrative events similarly to humans

    Authors: Sebastian Michelmann, Manoj Kumar, Kenneth A. Norman, Mariya Toneva

    Abstract: Humans perceive discrete events such as "restaurant visits" and "train rides" in their continuous experience. One important prerequisite for studying human event perception is the ability of researchers to quantify when one event ends and another begins. Typically, this information is derived by aggregating behavioral annotations from several observers. Here we present an alternative computational… ▽ More

    Submitted 24 January, 2023; originally announced January 2023.

  5. arXiv:2212.10898  [pdf, other

    cs.CL q-bio.NC

    Training language models to summarize narratives improves brain alignment

    Authors: Khai Loong Aw, Mariya Toneva

    Abstract: Building systems that achieve a deeper understanding of language is one of the central goals of natural language processing (NLP). Towards this goal, recent works have begun to train language models on narrative datasets which require extracting the most critical information by integrating across long contexts. However, it is still an open question whether these models are learning a deeper unders… ▽ More

    Submitted 28 February, 2023; v1 submitted 21 December, 2022; originally announced December 2022.

    Comments: ICLR 2023 (notable top 25%)

  6. arXiv:2212.08094  [pdf, other

    cs.CL q-bio.NC

    Joint processing of linguistic properties in brains and language models

    Authors: Subba Reddy Oota, Manish Gupta, Mariya Toneva

    Abstract: Language models have been shown to be very effective in predicting brain recordings of subjects experiencing complex language stimuli. For a deeper understanding of this alignment, it is important to understand the correspondence between the detailed processing of linguistic information by the human brain versus language models. We investigate this correspondence via a direct approach, in which we… ▽ More

    Submitted 8 November, 2023; v1 submitted 15 December, 2022; originally announced December 2022.

    Comments: 22 pages, 12 figures, To be published in the proceedings of the 37th Conference on Neural Information Processing Systems (NeurIPS 2023), New Orleans, USA

  7. arXiv:2212.00596  [pdf, other

    cs.CL q-bio.NC

    Language models and brains align due to more than next-word prediction and word-level information

    Authors: Gabriele Merlin, Mariya Toneva

    Abstract: Pretrained language models have been shown to significantly predict brain recordings of people comprehending language. Recent work suggests that the prediction of the next word is a key mechanism that contributes to this alignment. What is not yet understood is whether prediction of the next word is necessary for this observed alignment or simply sufficient, and whether there are other shared mech… ▽ More

    Submitted 3 October, 2024; v1 submitted 1 December, 2022; originally announced December 2022.

    Comments: Accepted to EMNLP 2024

  8. arXiv:2202.10376  [pdf, other

    q-bio.NC cs.AI cs.LG cs.NE stat.AP

    Same Cause; Different Effects in the Brain

    Authors: Mariya Toneva, Jennifer Williams, Anand Bollu, Christoph Dann, Leila Wehbe

    Abstract: To study information processing in the brain, neuroscientists manipulate experimental stimuli while recording participant brain activity. They can then use encoding models to find out which brain "zone" (e.g. which region of interest, volume pixel or electrophysiology sensor) is predicted from the stimulus properties. Given the assumptions underlying this setup, when stimulus properties are predic… ▽ More

    Submitted 21 February, 2022; originally announced February 2022.

    Comments: Accepted to CLeaR 2022

  9. arXiv:2108.10231  [pdf, other

    q-bio.NC

    A roadmap to reverse engineering real-world generalization by combining naturalistic paradigms, deep sampling, and predictive computational models

    Authors: Peer Herholz, Eddy Fortier, Mariya Toneva, Nicolas Farrugia, Leila Wehbe, Valentina Borghesani

    Abstract: Real-world generalization, e.g., deciding to approach a never-seen-before animal, relies on contextual information as well as previous experiences. Such a seemingly easy behavioral choice requires the interplay of multiple neural mechanisms, from integrative encoding to category-based inference, weighted differently according to the circumstances. Here, we argue that a comprehensive theory of the… ▽ More

    Submitted 14 January, 2022; v1 submitted 23 August, 2021; originally announced August 2021.

  10. arXiv:1911.03268  [pdf, other

    q-bio.NC cs.CL cs.LG

    Inducing brain-relevant bias in natural language processing models

    Authors: Dan Schwartz, Mariya Toneva, Leila Wehbe

    Abstract: Progress in natural language processing (NLP) models that estimate representations of word sequences has recently been leveraged to improve the understanding of language processing in the brain. However, these models have not been specifically designed to capture the way the brain represents language meaning. We hypothesize that fine-tuning these models to predict recordings of brain activity of p… ▽ More

    Submitted 29 October, 2019; originally announced November 2019.

    Comments: To be published in the proceedings of the 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada

  11. arXiv:1905.11833  [pdf, other

    cs.CL cs.AI cs.LG q-bio.NC

    Interpreting and improving natural-language processing (in machines) with natural language-processing (in the brain)

    Authors: Mariya Toneva, Leila Wehbe

    Abstract: Neural networks models for NLP are typically implemented without the explicit encoding of language rules and yet they are able to break one performance record after another. This has generated a lot of research interest in interpreting the representations learned by these networks. We propose here a novel interpretation approach that relies on the only processing system we have that does understan… ▽ More

    Submitted 13 November, 2019; v1 submitted 28 May, 2019; originally announced May 2019.

    Comments: NeurIPS 2019