Skip to main content

Showing 1–1 of 1 results for author: Mackintosh, A

.
  1. arXiv:2506.18710  [pdf, ps, other

    cs.CL cs.AI

    Benchmarking the Pedagogical Knowledge of Large Language Models

    Authors: Maxime Lelièvre, Amy Waldock, Meng Liu, Natalia Valdés Aspillaga, Alasdair Mackintosh, María José Ogando Portela, Jared Lee, Paul Atherton, Robin A. A. Ince, Oliver G. B. Garrod

    Abstract: Benchmarks like Massive Multitask Language Understanding (MMLU) have played a pivotal role in evaluating AI's knowledge and abilities across diverse domains. However, existing benchmarks predominantly focus on content knowledge, leaving a critical gap in assessing models' understanding of pedagogy - the method and practice of teaching. This paper introduces The Pedagogy Benchmark, a novel dataset… ▽ More

    Submitted 1 July, 2025; v1 submitted 23 June, 2025; originally announced June 2025.