
Showing 1–3 of 3 results for author: Shirakami, H

  1. arXiv:2410.13502  [pdf, other]

    cs.LG cs.AI cs.CL

    MathGAP: Out-of-Distribution Evaluation on Problems with Arbitrarily Complex Proofs

    Authors: Andreas Opedal, Haruki Shirakami, Bernhard Schölkopf, Abulhair Saparov, Mrinmaya Sachan

    Abstract: Large language models (LLMs) can solve arithmetic word problems with high accuracy, but little is known about how well they generalize to more complex problems. This is difficult to study, as (i) much of the available evaluation data has already been seen by the most capable models during training, and (ii) existing benchmarks do not capture how problem proofs may be arbitrarily complex in various…

    Submitted 14 February, 2025; v1 submitted 17 October, 2024; originally announced October 2024.

    Comments: ICLR 2025

  2. arXiv:2401.18070  [pdf, other]

    cs.CL cs.AI cs.LG

    Do Language Models Exhibit the Same Cognitive Biases in Problem Solving as Human Learners?

    Authors: Andreas Opedal, Alessandro Stolfo, Haruki Shirakami, Ying Jiao, Ryan Cotterell, Bernhard Schölkopf, Abulhair Saparov, Mrinmaya Sachan

    Abstract: There is increasing interest in employing large language models (LLMs) as cognitive models. For such purposes, it is central to understand which properties of human cognition are well-modeled by LLMs, and which are not. In this work, we study the biases of LLMs in relation to those known in children when solving arithmetic word problems. Surveying the learning science literature, we posit that the…

    Submitted 17 June, 2024; v1 submitted 31 January, 2024; originally announced January 2024.

    Comments: Accepted at ICML 2024

  3. Knowledge discovery from emergency ambulance dispatch during COVID-19: A case study of Nagoya City, Japan

    Authors: Essam A. Rashed, Sachiko Kodera, Hidenobu Shirakami, Ryotetsu Kawaguchi, Kazuhiro Watanabe, Akimasa Hirata

    Abstract: Accurate forecasting of medical service requirements is an important big data problem that is crucial for resource management in critical times such as natural disasters and pandemics. With the global spread of coronavirus disease 2019 (COVID-19), several concerns have been raised regarding the ability of medical systems to handle sudden changes in the daily routines of healthcare providers. One s…

    Submitted 17 February, 2021; originally announced February 2021.

    Comments: 15 pages, 12 figures, 2 tables

    Journal ref: Journal of Biomedical Informatics, 2021