Skip to main content

Showing 1–6 of 6 results for author: Asher, N

Searching in archive stat. Search in all archives.
.
  1. arXiv:2502.03503  [pdf, ps, other

    stat.ML cs.AI cs.LG

    Analyzing limits for in-context learning

    Authors: Omar Naim, Nicholas Asher

    Abstract: We examine limits of in-context learning (ICL) in transformer models trained from scratch, focusing on function approximation tasks as a controlled setting to uncover fundamental behaviors. While we show empirically that transformer models can generalize, approximating unseen classes of polynomial (non linear) functions, they cannot generalize beyond certain values. We provide both empirical and m… ▽ More

    Submitted 30 May, 2025; v1 submitted 5 February, 2025; originally announced February 2025.

  2. arXiv:2408.03560  [pdf, other

    cs.LG stat.ML

    In2Core: Leveraging Influence Functions for Coreset Selection in Instruction Finetuning of Large Language Models

    Authors: Ayrton San Joaquin, Bin Wang, Zhengyuan Liu, Nicholas Asher, Brian Lim, Philippe Muller, Nancy F. Chen

    Abstract: Despite advancements, fine-tuning Large Language Models (LLMs) remains costly due to the extensive parameter count and substantial data requirements for model generalization. Accessibility to computing resources remains a barrier for the open-source community. To address this challenge, we propose the In2Core algorithm, which selects a coreset by analyzing the correlation between training and eval… ▽ More

    Submitted 2 October, 2024; v1 submitted 7 August, 2024; originally announced August 2024.

    Comments: EMNLP 2024 - Findings

  3. arXiv:2312.06499  [pdf, other

    cs.CL stat.ML

    TaCo: Targeted Concept Erasure Prevents Non-Linear Classifiers From Detecting Protected Attributes

    Authors: Fanny Jourdan, Louis Béthune, Agustin Picard, Laurent Risser, Nicholas Asher

    Abstract: Ensuring fairness in NLP models is crucial, as they often encode sensitive attributes like gender and ethnicity, leading to biased outcomes. Current concept erasure methods attempt to mitigate this by modifying final latent representations to remove sensitive information without retraining the entire model. However, these methods typically rely on linear classifiers, which leave models vulnerable… ▽ More

    Submitted 16 October, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

  4. arXiv:2306.05307  [pdf, other

    cs.CL cs.CY cs.LG stat.ML

    Are fairness metric scores enough to assess discrimination biases in machine learning?

    Authors: Fanny Jourdan, Laurent Risser, Jean-Michel Loubes, Nicholas Asher

    Abstract: This paper presents novel experiments shedding light on the shortcomings of current metrics for assessing biases of gender discrimination made by machine learning algorithms on textual data. We focus on the Bios dataset, and our learning task is to predict the occupation of individuals, based on their biography. Such prediction tasks are common in commercial Natural Language Processing (NLP) appli… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: Accepted for publication at Third Workshop on Trustworthy Natural Language Processing, ACL 2023

  5. arXiv:2305.06754  [pdf, other

    cs.CL stat.ML

    COCKATIEL: COntinuous Concept ranKed ATtribution with Interpretable ELements for explaining neural net classifiers on NLP tasks

    Authors: Fanny Jourdan, Agustin Picard, Thomas Fel, Laurent Risser, Jean Michel Loubes, Nicholas Asher

    Abstract: Transformer architectures are complex and their use in NLP, while it has engendered many successes, makes their interpretability or explainability challenging. Recent debates have shown that attention maps and attribution methods are unreliable (Pruthi et al., 2019; Brunner et al., 2019). In this paper, we present some of their limitations and introduce COCKATIEL, which successfully addresses some… ▽ More

    Submitted 14 May, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

    Comments: Accepted for publication at Findings of ACL 2023

  6. arXiv:2302.14063  [pdf, other

    cs.LG cs.AI cs.CY stat.ML

    How optimal transport can tackle gender biases in multi-class neural-network classifiers for job recommendations?

    Authors: Fanny Jourdan, Titon Tshiongo Kaninku, Nicholas Asher, Jean-Michel Loubes, Laurent Risser

    Abstract: Automatic recommendation systems based on deep neural networks have become extremely popular during the last decade. Some of these systems can however be used for applications which are ranked as High Risk by the European Commission in the A.I. act, as for instance for online job candidate recommendation. When used in the European Union, commercial AI systems for this purpose will then be required… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.