Showing 1–4 of 4 results for author: Khalid, I

Searching in archive cs.
  1. arXiv:2503.23487 [pdf, ps, other]

    cs.AI cs.CL cs.LG

    Large Language and Reasoning Models are Shallow Disjunctive Reasoners

    Authors: Irtaza Khalid, Amir Masoud Nourollah, Steven Schockaert

    Abstract: Large Language Models (LLMs) have been found to struggle with systematic reasoning. Even on tasks where they appear to perform well, their performance often depends on shortcuts, rather than on genuine reasoning abilities, leading them to collapse on out-of-distribution (OOD) examples. Post-training strategies based on reinforcement learning and chain-of-thought prompting have recently been hailed…

    Submitted 2 June, 2025; v1 submitted 30 March, 2025; originally announced March 2025.

    Comments: ACL 2025 main conference

  2. arXiv:2503.05371 [pdf, other]

    cs.LG cs.AI cs.CL

    Shifting Perspectives: Steering Vector Ensembles for Robust Bias Mitigation in LLMs

    Authors: Zara Siddique, Irtaza Khalid, Liam D. Turner, Luis Espinosa-Anke

    Abstract: We present a novel approach to bias mitigation in large language models (LLMs) by applying steering vectors to modify model activations in forward passes. We employ Bayesian optimization to systematically identify effective contrastive pair datasets across nine bias axes. When optimized on the BBQ dataset, our individually tuned steering vectors achieve average improvements of 12.2%, 4.7%, and 3.2…

    Submitted 7 March, 2025; originally announced March 2025.

    Comments: Submitted to ACL 2025

  3. arXiv:2407.17396 [pdf, other]

    cs.AI cs.LG

    Systematic Relational Reasoning With Epistemic Graph Neural Networks

    Authors: Irtaza Khalid, Steven Schockaert

    Abstract: Developing models that can learn to reason is a notoriously challenging problem. We focus on reasoning in relational domains, where the use of Graph Neural Networks (GNNs) seems like a natural choice. However, previous work has shown that regular GNNs lack the ability to systematically generalize from training examples on test graphs requiring longer inference chains, which fundamentally limits th…

    Submitted 27 February, 2025; v1 submitted 24 July, 2024; originally announced July 2024.

    Comments: 10+29 pages, 5+13 figures, 4+10 tables. Comments welcome!

    Journal ref: ICLR 2025 main

  4. arXiv:2304.09718 [pdf, other]

    quant-ph cs.AI cs.LG

    Sample-efficient Model-based Reinforcement Learning for Quantum Control

    Authors: Irtaza Khalid, Carrie A. Weidner, Edmond A. Jonckheere, Sophie G. Shermer, Frank C. Langbein

    Abstract: We propose a model-based reinforcement learning (RL) approach for noisy time-dependent gate optimization with improved sample complexity over model-free RL. Sample complexity is the number of controller interactions with the physical system. Leveraging an inductive bias, inspired by recent advances in neural ordinary differential equations (ODEs), we use an auto-differentiable ODE parametrised by…

    Submitted 2 October, 2023; v1 submitted 19 April, 2023; originally announced April 2023.

    Comments: 14+10 pages, 6+6 figures, revised version

    Journal ref: Phys. Rev. Research 5, 043002 (2023)