Showing 1–1 of 1 results for author: Esua-Mensah, S

  1. arXiv:2505.00001

    cs.CL

    Rosetta-PL: Propositional Logic as a Benchmark for Large Language Model Reasoning

    Authors: Shaun Baek, Shaun Esua-Mensah, Cyrus Tsui, Sejan Vigneswaralingam, Abdullah Alali, Michael Lu, Vasu Sharma, Sean O'Brien, Kevin Zhu

    Abstract: Large Language Models (LLMs) are primarily trained on high-resource natural languages, limiting their effectiveness in low-resource settings and in tasks requiring deep logical reasoning. This research introduces Rosetta-PL, a benchmark designed to evaluate LLMs' logical reasoning and generalization capabilities in a controlled environment. We construct Rosetta-PL by translating a dataset of logic…

    Submitted 2 May, 2025; v1 submitted 25 March, 2025; originally announced May 2025.