Showing 1–15 of 15 results for author: Saparov, A

  1. arXiv:2412.04703 [pdf, other]

    cs.CL cs.AI cs.LG

    Transformers Struggle to Learn to Search

    Authors: Abulhair Saparov, Srushti Pawar, Shreyas Pimpalgaonkar, Nitish Joshi, Richard Yuanzhe Pang, Vishakh Padmakumar, Seyed Mehran Kazemi, Najoung Kim, He He

    Abstract: Search is an ability foundational in many important tasks, and recent studies have shown that large language models (LLMs) struggle to perform search robustly. It is unknown whether this inability is due to a lack of data, insufficient model parameters, or fundamental limitations of the transformer architecture. In this work, we use the foundational graph connectivity problem as a testbed to gener…

    Submitted 16 March, 2025; v1 submitted 5 December, 2024; originally announced December 2024.

    Comments: Published as a conference paper at ICLR 2025

  2. arXiv:2410.13502 [pdf, other]

    cs.LG cs.AI cs.CL

    MathGAP: Out-of-Distribution Evaluation on Problems with Arbitrarily Complex Proofs

    Authors: Andreas Opedal, Haruki Shirakami, Bernhard Schölkopf, Abulhair Saparov, Mrinmaya Sachan

    Abstract: Large language models (LLMs) can solve arithmetic word problems with high accuracy, but little is known about how well they generalize to more complex problems. This is difficult to study, as (i) much of the available evaluation data has already been seen by the most capable models during training, and (ii) existing benchmarks do not capture how problem proofs may be arbitrarily complex in various…

    Submitted 14 February, 2025; v1 submitted 17 October, 2024; originally announced October 2024.

    Comments: ICLR 2025

  3. arXiv:2407.02646 [pdf, other]

    cs.AI cs.CL

    A Practical Review of Mechanistic Interpretability for Transformer-Based Language Models

    Authors: Daking Rai, Yilun Zhou, Shi Feng, Abulhair Saparov, Ziyu Yao

    Abstract: Mechanistic interpretability (MI) is an emerging sub-field of interpretability that seeks to understand a neural network model by reverse-engineering its internal computations. Recently, MI has garnered significant attention for interpreting transformer-based language models (LMs), resulting in many novel insights yet introducing new challenges. However, there has not been work that comprehensivel…

    Submitted 15 March, 2025; v1 submitted 2 July, 2024; originally announced July 2024.

    Comments: 35 pages, 13 figures, Preprint

    ACM Class: I.2.7

  4. arXiv:2406.12158 [pdf, other]

    cs.CL cs.AI

    LLMs Are Prone to Fallacies in Causal Inference

    Authors: Nitish Joshi, Abulhair Saparov, Yixin Wang, He He

    Abstract: Recent work shows that causal facts can be effectively extracted from LLMs through prompting, facilitating the creation of causal graphs for causal inference tasks. However, it is unclear if this success is limited to explicitly-mentioned causal facts in the pretraining data which the model can memorize. Thus, this work investigates: Can LLMs infer causal relations from other relational data in te…

    Submitted 17 June, 2024; originally announced June 2024.

  5. arXiv:2404.09932 [pdf, other]

    cs.LG cs.AI cs.CL cs.CY

    Foundational Challenges in Assuring Alignment and Safety of Large Language Models

    Authors: Usman Anwar, Abulhair Saparov, Javier Rando, Daniel Paleka, Miles Turpin, Peter Hase, Ekdeep Singh Lubana, Erik Jenner, Stephen Casper, Oliver Sourbut, Benjamin L. Edelman, Zhaowei Zhang, Mario Günther, Anton Korinek, Jose Hernandez-Orallo, Lewis Hammond, Eric Bigelow, Alexander Pan, Lauro Langosco, Tomasz Korbak, Heidi Zhang, Ruiqi Zhong, Seán Ó hÉigeartaigh, Gabriel Recchia, Giulio Corsi, et al. (17 additional authors not shown)

    Abstract: This work identifies 18 foundational challenges in assuring the alignment and safety of large language models (LLMs). These challenges are organized into three different categories: scientific understanding of LLMs, development and deployment methods, and sociotechnical challenges. Based on the identified challenges, we pose $200+$ concrete research questions.

    Submitted 5 September, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

  6. arXiv:2401.18070 [pdf, other]

    cs.CL cs.AI cs.LG

    Do Language Models Exhibit the Same Cognitive Biases in Problem Solving as Human Learners?

    Authors: Andreas Opedal, Alessandro Stolfo, Haruki Shirakami, Ying Jiao, Ryan Cotterell, Bernhard Schölkopf, Abulhair Saparov, Mrinmaya Sachan

    Abstract: There is increasing interest in employing large language models (LLMs) as cognitive models. For such purposes, it is central to understand which properties of human cognition are well-modeled by LLMs, and which are not. In this work, we study the biases of LLMs in relation to those known in children when solving arithmetic word problems. Surveying the learning science literature, we posit that the…

    Submitted 17 June, 2024; v1 submitted 31 January, 2024; originally announced January 2024.

    Comments: Accepted at ICML 2024

  7. arXiv:2311.00258 [pdf, other]

    cs.CL cs.LG

    Noisy Exemplars Make Large Language Models More Robust: A Domain-Agnostic Behavioral Analysis

    Authors: Hongyi Zheng, Abulhair Saparov

    Abstract: Recent advances in prompt engineering enable large language models (LLMs) to solve multi-hop logical reasoning problems with impressive accuracy. However, there is little existing work investigating the robustness of LLMs with few-shot prompting techniques. Therefore, we introduce a systematic approach to test the robustness of LLMs in multi-hop reasoning tasks via domain-agnostic perturbations. W…

    Submitted 31 October, 2023; originally announced November 2023.

  8. arXiv:2310.18168 [pdf, other]

    cs.CL cs.AI cs.LG

    Personas as a Way to Model Truthfulness in Language Models

    Authors: Nitish Joshi, Javier Rando, Abulhair Saparov, Najoung Kim, He He

    Abstract: Large language models (LLMs) are trained on vast amounts of text from the internet, which contains both factual and misleading information about the world. While unintuitive from a classic view of LMs, recent work has shown that the truth value of a statement can be elicited from the model's representations. This paper presents an explanation for why LMs appear to know the truth despite not being…

    Submitted 6 February, 2024; v1 submitted 27 October, 2023; originally announced October 2023.

  9. arXiv:2310.14435 [pdf, other]

    cs.CL cs.AI

    Retrieval-Augmented Chain-of-Thought in Semi-structured Domains

    Authors: Vaibhav Mavi, Abulhair Saparov, Chen Zhao

    Abstract: Applying existing question answering (QA) systems to specialized domains like law and finance presents challenges that necessitate domain expertise. Although large language models (LLMs) have shown impressive language comprehension and in-context learning capabilities, their inability to handle very long inputs/contexts is well known. Tasks specific to these domains need significant background kno…

    Submitted 22 October, 2023; originally announced October 2023.

    Comments: to appear in NLLP 2023

  10. arXiv:2306.04347 [pdf, other]

    cs.CL

    World Models for Math Story Problems

    Authors: Andreas Opedal, Niklas Stoehr, Abulhair Saparov, Mrinmaya Sachan

    Abstract: Solving math story problems is a complex task for students and NLP models alike, requiring them to understand the world as described in the story and reason over it to compute an answer. Recent years have seen impressive performance on automatically solving these problems with large pre-trained language models and innovative techniques to prompt them. However, it remains unclear if these models po…

    Submitted 26 February, 2025; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: ACL Findings 2023

  11. arXiv:2305.15269 [pdf, other]

    cs.CL cs.AI

    Testing the General Deductive Reasoning Capacity of Large Language Models Using OOD Examples

    Authors: Abulhair Saparov, Richard Yuanzhe Pang, Vishakh Padmakumar, Nitish Joshi, Seyed Mehran Kazemi, Najoung Kim, He He

    Abstract: Given the intractably large size of the space of proofs, any model that is capable of general deductive reasoning must generalize to proofs of greater complexity. Recent studies have shown that large language models (LLMs) possess some abstract deductive reasoning ability given chain-of-thought prompts. However, they have primarily been tested on proofs using modus ponens or of a specific size, an…

    Submitted 3 November, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: Published as a conference paper at NeurIPS 2023

  12. arXiv:2210.01240 [pdf, other]

    cs.CL

    Language Models Are Greedy Reasoners: A Systematic Formal Analysis of Chain-of-Thought

    Authors: Abulhair Saparov, He He

    Abstract: Large language models (LLMs) have shown remarkable reasoning capabilities given chain-of-thought prompts (examples with intermediate reasoning steps). Existing benchmarks measure reasoning ability indirectly, by evaluating accuracy on downstream tasks such as mathematical reasoning. However, it is unclear how these models obtain the answers and whether they rely on simple heuristics rather than th…

    Submitted 1 March, 2023; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: Published as a conference paper at ICLR 2023

  13. arXiv:2105.02486 [pdf, other]

    cs.CL

    Towards General Natural Language Understanding with Probabilistic Worldbuilding

    Authors: Abulhair Saparov, Tom M. Mitchell

    Abstract: We introduce the Probabilistic Worldbuilding Model (PWM), a new fully-symbolic Bayesian model of semantic parsing and reasoning, as a first step in a research program toward more domain- and task-general NLU and AI. Humans create internal mental models of their observations which greatly aid in their ability to understand and reason about a large variety of problems. In PWM, the meanings of senten…

    Submitted 20 December, 2021; v1 submitted 6 May, 2021; originally announced May 2021.

    Comments: Accepted to TACL; pre-MIT Press publication version

  14. arXiv:2002.06306 [pdf, other]

    cs.LG cs.AI cs.MA stat.ML

    Jelly Bean World: A Testbed for Never-Ending Learning

    Authors: Emmanouil Antonios Platanios, Abulhair Saparov, Tom Mitchell

    Abstract: Machine learning has shown growing success in recent years. However, current machine learning systems are highly specialized, trained for particular problems or domains, and typically on a single narrow dataset. Human learning, on the other hand, is highly general and adaptable. Never-ending learning is a machine learning paradigm that aims to bridge this gap, with the goal of encouraging research…

    Submitted 14 February, 2020; originally announced February 2020.

    Comments: Published as a conference paper at ICLR 2020

    Journal ref: International Conference on Learning Representations 2020

  15. arXiv:1606.06361 [pdf, ps, other]

    cs.CL cs.LG stat.ML

    A Probabilistic Generative Grammar for Semantic Parsing

    Authors: Abulhair Saparov

    Abstract: Domain-general semantic parsing is a long-standing goal in natural language processing, where the semantic parser is capable of robustly parsing sentences from domains outside of which it was trained. Current approaches largely rely on additional supervision from new domains in order to generalize to those domains. We present a generative model of natural language utterances and logical forms and…

    Submitted 9 February, 2022; v1 submitted 20 June, 2016; originally announced June 2016.