Skip to main content

Showing 1–5 of 5 results for author: Kavumba, P

.
  1. arXiv:2503.23899  [pdf, ps, other

    cs.CL

    Rubrik's Cube: Testing a New Rubric for Evaluating Explanations on the CUBE dataset

    Authors: Diana Galvan-Sosa, Gabrielle Gaudeau, Pride Kavumba, Yunmeng Li, Hongyi gu, Zheng Yuan, Keisuke Sakaguchi, Paula Buttery

    Abstract: The performance and usability of Large-Language Models (LLMs) are driving their use in explanation generation tasks. However, despite their widespread adoption, LLM explanations have been found to be unreliable, making it difficult for users to distinguish good from bad explanations. To address this issue, we present Rubrik's CUBE, an education-inspired rubric and a dataset of 26k explanations, wr… ▽ More

    Submitted 4 June, 2025; v1 submitted 31 March, 2025; originally announced March 2025.

    Comments: 10 main pages (24 appendix pages), 9 figures, accepted to ACL 2025

    ACM Class: I.2.7

  2. arXiv:2205.09295  [pdf, other

    cs.CL

    Are Prompt-based Models Clueless?

    Authors: Pride Kavumba, Ryo Takahashi, Yusuke Oda

    Abstract: Finetuning large pre-trained language models with a task-specific head has advanced the state-of-the-art on many natural language understanding benchmarks. However, models with a task-specific head require a lot of training data, making them susceptible to learning and exploiting dataset-specific superficial cues that do not generalize to other datasets. Prompting has reduced the data requirement… ▽ More

    Submitted 19 May, 2022; v1 submitted 18 May, 2022; originally announced May 2022.

  3. arXiv:2201.06777  [pdf, other

    cs.CL

    COPA-SSE: Semi-structured Explanations for Commonsense Reasoning

    Authors: Ana Brassard, Benjamin Heinzerling, Pride Kavumba, Kentaro Inui

    Abstract: We present Semi-Structured Explanations for COPA (COPA-SSE), a new crowdsourced dataset of 9,747 semi-structured, English common sense explanations for Choice of Plausible Alternatives (COPA) questions. The explanations are formatted as a set of triple-like common sense statements with ConceptNet relations but freely written concepts. This semi-structured format strikes a balance between the high… ▽ More

    Submitted 11 May, 2022; v1 submitted 18 January, 2022; originally announced January 2022.

    Comments: 6 pages, 6 figures, LREC 2022. Data available at https://github.com/a-brassard/copa-sse

  4. arXiv:2104.11514  [pdf, other

    cs.CL

    Learning to Learn to be Right for the Right Reasons

    Authors: Pride Kavumba, Benjamin Heinzerling, Ana Brassard, Kentaro Inui

    Abstract: Improving model generalization on held-out data is one of the core objectives in commonsense reasoning. Recent work has shown that models trained on the dataset with superficial cues tend to perform well on the easy test set with superficial cues but perform poorly on the hard test set without superficial cues. Previous approaches have resorted to manual methods of encouraging models not to overfi… ▽ More

    Submitted 23 April, 2021; originally announced April 2021.

  5. arXiv:1911.00225  [pdf, other

    cs.CL

    When Choosing Plausible Alternatives, Clever Hans can be Clever

    Authors: Pride Kavumba, Naoya Inoue, Benjamin Heinzerling, Keshav Singh, Paul Reisert, Kentaro Inui

    Abstract: Pretrained language models, such as BERT and RoBERTa, have shown large improvements in the commonsense reasoning benchmark COPA. However, recent work found that many improvements in benchmarks of natural language understanding are not due to models learning the task, but due to their increasing ability to exploit superficial cues, such as tokens that occur more often in the correct answer than the… ▽ More

    Submitted 1 November, 2019; originally announced November 2019.

    Comments: Accepted to the COmmonsense INference in Natural Language Processing workshop (COIN)