Skip to main content

Showing 1–5 of 5 results for author: Schouten, S F

.
  1. arXiv:2404.18865  [pdf, other

    cs.CL

    Truth-value judgment in language models: belief directions are context sensitive

    Authors: Stefan F. Schouten, Peter Bloem, Ilia Markov, Piek Vossen

    Abstract: Recent work has demonstrated that the latent spaces of large language models (LLMs) contain directions predictive of the truth of sentences. Multiple methods recover such directions and build probes that are described as getting at a model's "knowledge" or "beliefs". We investigate this phenomenon, looking closely at the impact of context on the probes. Our experiments establish where in the LLM t… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  2. arXiv:2310.14657  [pdf, other

    cs.CL cs.AI

    Reasoning about Ambiguous Definite Descriptions

    Authors: Stefan F. Schouten, Peter Bloem, Ilia Markov, Piek Vossen

    Abstract: Natural language reasoning plays an increasingly important role in improving language models' ability to solve complex language understanding tasks. An interesting use case for reasoning is the resolution of context-dependent ambiguity. But no resources exist to evaluate how well Large Language Models can use explicit reasoning to resolve ambiguity in language. We propose to use ambiguous definite… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 Findings

  3. arXiv:2306.09642  [pdf, ps, other

    cs.CL cs.LG

    Cross-Domain Toxic Spans Detection

    Authors: Stefan F. Schouten, Baran Barbarestani, Wondimagegnhue Tufa, Piek Vossen, Ilia Markov

    Abstract: Given the dynamic nature of toxic language use, automated methods for detecting toxic spans are likely to encounter distributional shift. To explore this phenomenon, we evaluate three approaches for detecting toxic spans under cross-domain conditions: lexicon-based, rationale extraction, and fine-tuned language models. Our findings indicate that a simple method using off-the-shelf lexicons perform… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

    Comments: NLDB 2023

  4. arXiv:2205.04559  [pdf, other

    cs.CL cs.LG

    A Song of (Dis)agreement: Evaluating the Evaluation of Explainable Artificial Intelligence in Natural Language Processing

    Authors: Michael Neely, Stefan F. Schouten, Maurits Bleeker, Ana Lucic

    Abstract: There has been significant debate in the NLP community about whether or not attention weights can be used as an explanation - a mechanism for interpreting how important each input token is for a particular prediction. The validity of "attention as explanation" has so far been evaluated by computing the rank correlation between attention-based explanations and existing feature attribution explanati… ▽ More

    Submitted 9 May, 2022; originally announced May 2022.

  5. arXiv:2105.03287  [pdf, other

    cs.LG cs.CL

    Order in the Court: Explainable AI Methods Prone to Disagreement

    Authors: Michael Neely, Stefan F. Schouten, Maurits J. R. Bleeker, Ana Lucic

    Abstract: By computing the rank correlation between attention weights and feature-additive explanation methods, previous analyses either invalidate or support the role of attention-based explanations as a faithful and plausible measure of salience. To investigate whether this approach is appropriate, we compare LIME, Integrated Gradients, DeepLIFT, Grad-SHAP, Deep-SHAP, and attention-based explanations, app… ▽ More

    Submitted 6 July, 2021; v1 submitted 7 May, 2021; originally announced May 2021.

    Comments: Accepted for presentation at the ICML Workshop on Theoretic Foundation, Criticism, and Application Trend of Explainable AI