Showing 1–8 of 8 results for author: Minervini, P

Searching in archive stat.
  1. arXiv:2412.16475  [pdf, other]

    cs.LG cs.AI stat.ML

    When Can Proxies Improve the Sample Complexity of Preference Learning?

    Authors: Yuchen Zhu, Daniel Augusto de Souza, Zhengyan Shi, Mengyue Yang, Pasquale Minervini, Alexander D'Amour, Matt J. Kusner

    Abstract: We address the problem of reward hacking, where maximising a proxy reward does not necessarily increase the true reward. This is a key concern for Large Language Models (LLMs), as they are often fine-tuned on human preferences that may not accurately reflect a true objective. Existing work uses various tricks such as regularisation, tweaks to the reward model, and reward hacking detectors, to limi…

    Submitted 20 December, 2024; originally announced December 2024.

  2. arXiv:2404.08458  [pdf, other]

    stat.ML cs.AI cs.LG

    On the Independence Assumption in Neurosymbolic Learning

    Authors: Emile van Krieken, Pasquale Minervini, Edoardo M. Ponti, Antonio Vergari

    Abstract: State-of-the-art neurosymbolic learning systems use probabilistic reasoning to guide neural networks towards predictions that conform to logical constraints over symbols. Many such systems assume that the probabilities of the considered symbols are conditionally independent given the input to simplify learning and reasoning. We study and criticise this assumption, highlighting how it can hinder op…

    Submitted 7 June, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

    Comments: Accepted at ICML 2024

  3. arXiv:2210.15353  [pdf, other]

    cs.LG cs.AI stat.ML

    Learning Discrete Directed Acyclic Graphs via Backpropagation

    Authors: Andrew J. Wren, Pasquale Minervini, Luca Franceschi, Valentina Zantedeschi

    Abstract: Recently continuous relaxations have been proposed in order to learn Directed Acyclic Graphs (DAGs) from data by backpropagation, instead of using combinatorial optimization. However, a number of techniques for fully discrete backpropagation could instead be applied. In this paper, we explore that direction and propose DAG-DB, a framework for learning DAGs by Discrete Backpropagation. Based on the…

    Submitted 27 October, 2022; originally announced October 2022.

    Comments: 15 pages, 2 figures, 7 tables. Accepted for NeurIPS 2022 workshops on: Causal Machine Learning for Real-World Impact; and Neuro Causal and Symbolic AI

  4. arXiv:2004.07790  [pdf, other]

    cs.LG cs.AI cs.CL stat.ML

    Avoiding the Hypothesis-Only Bias in Natural Language Inference via Ensemble Adversarial Training

    Authors: Joe Stacey, Pasquale Minervini, Haim Dubossarsky, Sebastian Riedel, Tim Rocktäschel

    Abstract: Natural Language Inference (NLI) datasets contain annotation artefacts resulting in spurious correlations between the natural language utterances and their respective entailment classes. These artefacts are exploited by neural networks even when only considering the hypothesis and ignoring the premise, leading to unwanted biases. Belinkov et al. (2019b) proposed tackling this problem via adversari…

    Submitted 27 May, 2021; v1 submitted 16 April, 2020; originally announced April 2020.

    Comments: Accepted at EMNLP 2020

  5. arXiv:2003.04808  [pdf, other]

    cs.CL cs.LG stat.ML

    Undersensitivity in Neural Reading Comprehension

    Authors: Johannes Welbl, Pasquale Minervini, Max Bartolo, Pontus Stenetorp, Sebastian Riedel

    Abstract: Current reading comprehension models generalise well to in-distribution test sets, yet perform poorly on adversarially selected inputs. Most prior work on adversarial inputs studies oversensitivity: semantically invariant text perturbations that cause a model's prediction to change when it should not. In this work we focus on the complementary problem: excessive prediction undersensitivity, where…

    Submitted 15 February, 2020; originally announced March 2020.

    Comments: 15 pages, 4 figures

  6. arXiv:1906.04985  [pdf, other]

    cs.LG cs.AI cs.SC stat.ML

    Neural Variational Inference For Estimating Uncertainty in Knowledge Graph Embeddings

    Authors: Alexander I. Cowen-Rivers, Pasquale Minervini, Tim Rocktaschel, Matko Bosnjak, Sebastian Riedel, Jun Wang

    Abstract: Recent advances in Neural Variational Inference allowed for a renaissance in latent variable models in a variety of domains involving high-dimensional data. While traditional variational methods derive an analytical approximation for the intractable distribution over the latent variables, here we construct an inference network conditioned on the symbolic representation of entities and relation typ…

    Submitted 18 August, 2019; v1 submitted 12 June, 2019; originally announced June 2019.

    Comments: Accepted at IJCAI 19 Neural-Symbolic Learning and Reasoning Workshop

  7. arXiv:1808.08609  [pdf, other]

    cs.LG cs.AI cs.CL stat.ML

    Adversarially Regularising Neural NLI Models to Integrate Logical Background Knowledge

    Authors: Pasquale Minervini, Sebastian Riedel

    Abstract: Adversarial examples are inputs to machine learning models designed to cause the model to make a mistake. They are useful for understanding the shortcomings of machine learning models, interpreting their results, and for regularisation. In NLP, however, most example generation strategies produce input text by using known, pre-specified semantic transformations, requiring significant manual effort…

    Submitted 26 August, 2018; originally announced August 2018.

    Comments: Accepted at the SIGNLL Conference on Computational Natural Language Learning (CoNLL 2018)

  8. arXiv:1806.08727  [pdf, ps, other]

    cs.CL cs.LG stat.ML

    Jack the Reader - A Machine Reading Framework

    Authors: Dirk Weissenborn, Pasquale Minervini, Tim Dettmers, Isabelle Augenstein, Johannes Welbl, Tim Rocktäschel, Matko Bošnjak, Jeff Mitchell, Thomas Demeester, Pontus Stenetorp, Sebastian Riedel

    Abstract: Many Machine Reading and Natural Language Understanding tasks require reading supporting text in order to answer questions. For example, in Question Answering, the supporting text can be newswire or Wikipedia articles; in Natural Language Inference, premises can be seen as the supporting text and hypotheses as questions. Providing a set of useful primitives operating in a single framework of relat…

    Submitted 19 June, 2018; originally announced June 2018.

    Comments: Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL 2018), System Demonstrations