Skip to main content

Showing 1–13 of 13 results for author: Gorur, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2409.12180  [pdf, other

    cs.CL cs.LG

    Finetuning Language Models to Emit Linguistic Expressions of Uncertainty

    Authors: Arslan Chaudhry, Sridhar Thiagarajan, Dilan Gorur

    Abstract: Large language models (LLMs) are increasingly employed in information-seeking and decision-making tasks. Despite their broad utility, LLMs tend to generate information that conflicts with real-world facts, and their persuasive style can make these inaccuracies appear confident and convincing. As a result, end-users struggle to consistently align the confidence expressed by LLMs with the accuracy o… ▽ More

    Submitted 18 September, 2024; originally announced September 2024.

  2. arXiv:2407.12687  [pdf, other

    cs.CY cs.AI cs.LG

    Towards Responsible Development of Generative AI for Education: An Evaluation-Driven Approach

    Authors: Irina Jurenka, Markus Kunesch, Kevin R. McKee, Daniel Gillick, Shaojian Zhu, Sara Wiltberger, Shubham Milind Phal, Katherine Hermann, Daniel Kasenberg, Avishkar Bhoopchand, Ankit Anand, Miruna Pîslar, Stephanie Chan, Lisa Wang, Jennifer She, Parsa Mahmoudieh, Aliya Rysbek, Wei-Jen Ko, Andrea Huber, Brett Wiltshire, Gal Elidan, Roni Rabin, Jasmin Rubinovitz, Amit Pitaru, Mac McAllister , et al. (49 additional authors not shown)

    Abstract: A major challenge facing the world is the provision of equitable and universal access to quality education. Recent advances in generative AI (gen AI) have created excitement about the potential of new technologies to offer a personal tutor for every learner and a teaching assistant for every teacher. The full extent of this dream, however, has not yet materialised. We argue that this is primarily… ▽ More

    Submitted 19 July, 2024; v1 submitted 21 May, 2024; originally announced July 2024.

  3. arXiv:2405.10729  [pdf, other

    cs.AI

    Contestable AI needs Computational Argumentation

    Authors: Francesco Leofante, Hamed Ayoobi, Adam Dejl, Gabriel Freedman, Deniz Gorur, Junqi Jiang, Guilherme Paulino-Passos, Antonio Rago, Anna Rapberger, Fabrizio Russo, Xiang Yin, Dekai Zhang, Francesca Toni

    Abstract: AI has become pervasive in recent years, but state-of-the-art approaches predominantly neglect the need for AI systems to be contestable. Instead, contestability is advocated by AI guidelines (e.g. by the OECD) and regulation of automated decision-making (e.g. GDPR). In this position paper we explore how contestability can be achieved computationally in and for AI. We argue that contestable AI req… ▽ More

    Submitted 3 August, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

    Comments: Accepted at KR 2024

  4. arXiv:2405.02079  [pdf, other

    cs.CL cs.AI

    Argumentative Large Language Models for Explainable and Contestable Claim Verification

    Authors: Gabriel Freedman, Adam Dejl, Deniz Gorur, Xiang Yin, Antonio Rago, Francesca Toni

    Abstract: The profusion of knowledge encoded in large language models (LLMs) and their ability to apply this knowledge zero-shot in a range of settings makes them promising candidates for use in decision-making. However, they are currently limited by their inability to provide outputs which can be faithfully explained and effectively contested to correct mistakes. In this paper, we attempt to reconcile thes… ▽ More

    Submitted 18 April, 2025; v1 submitted 3 May, 2024; originally announced May 2024.

    Comments: 18 pages, 18 figures. Accepted as an oral presentation at AAAI 2025

    ACM Class: I.2.7

    Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence, 39(14), 14930-14939. 2025

  5. arXiv:2402.11243  [pdf, other

    cs.CL cs.AI

    Can Large Language Models perform Relation-based Argument Mining?

    Authors: Deniz Gorur, Antonio Rago, Francesca Toni

    Abstract: Argument mining (AM) is the process of automatically extracting arguments, their components and/or relations amongst arguments and components from text. As the number of platforms supporting online debate increases, the need for AM becomes ever more urgent, especially in support of downstream tasks. Relation-based AM (RbAM) is a form of AM focusing on identifying agreement (support) and disagreeme… ▽ More

    Submitted 17 February, 2024; originally announced February 2024.

    Comments: 10 pages, 9 figures, submitted to ACL 2024

    ACM Class: I.2.7

  6. arXiv:2303.08207  [pdf, other

    cs.LG cs.AI

    Is forgetting less a good inductive bias for forward transfer?

    Authors: Jiefeng Chen, Timothy Nguyen, Dilan Gorur, Arslan Chaudhry

    Abstract: One of the main motivations of studying continual learning is that the problem setting allows a model to accrue knowledge from past tasks to learn new tasks more efficiently. However, recent studies suggest that the key metric that continual learning algorithms optimize, reduction in catastrophic forgetting, does not correlate well with the forward transfer of knowledge. We believe that the conclu… ▽ More

    Submitted 14 March, 2023; originally announced March 2023.

    Comments: Published as a conference paper at ICLR 2023

    Journal ref: ICLR 2023

  7. arXiv:2202.00275  [pdf, other

    cs.LG cs.AI

    Architecture Matters in Continual Learning

    Authors: Seyed Iman Mirzadeh, Arslan Chaudhry, Dong Yin, Timothy Nguyen, Razvan Pascanu, Dilan Gorur, Mehrdad Farajtabar

    Abstract: A large body of research in continual learning is devoted to overcoming the catastrophic forgetting of neural networks by designing new algorithms that are robust to the distribution shifts. However, the majority of these works are strictly focused on the "algorithmic" part of continual learning for a "fixed neural network architecture", and the implications of using different architectures are mo… ▽ More

    Submitted 1 February, 2022; originally announced February 2022.

    Comments: preprint

  8. arXiv:2111.01956  [pdf, ps, other

    cs.LG

    One Pass ImageNet

    Authors: Huiyi Hu, Ang Li, Daniele Calandriello, Dilan Gorur

    Abstract: We present the One Pass ImageNet (OPIN) problem, which aims to study the effectiveness of deep learning in a streaming setting. ImageNet is a widely known benchmark dataset that has helped drive and evaluate recent advancements in deep learning. Typically, deep learning methods are trained on static data that the models have random access to, using multiple passes over the dataset with a random sh… ▽ More

    Submitted 2 November, 2021; originally announced November 2021.

    Comments: Accepted to NeurIPS 2021 Workshop on Imagenet: past, present and future

  9. arXiv:2110.11526  [pdf, other

    cs.LG cs.AI cs.CV

    Wide Neural Networks Forget Less Catastrophically

    Authors: Seyed Iman Mirzadeh, Arslan Chaudhry, Dong Yin, Huiyi Hu, Razvan Pascanu, Dilan Gorur, Mehrdad Farajtabar

    Abstract: A primary focus area in continual learning research is alleviating the "catastrophic forgetting" problem in neural networks by designing new algorithms that are more robust to the distribution shifts. While the recent progress in continual learning literature is encouraging, our understanding of what properties of neural networks contribute to catastrophic forgetting is still limited. To address t… ▽ More

    Submitted 14 July, 2022; v1 submitted 21 October, 2021; originally announced October 2021.

    Comments: ICML 2022

  10. arXiv:2010.04495  [pdf, other

    cs.LG cs.AI cs.CV

    Linear Mode Connectivity in Multitask and Continual Learning

    Authors: Seyed Iman Mirzadeh, Mehrdad Farajtabar, Dilan Gorur, Razvan Pascanu, Hassan Ghasemzadeh

    Abstract: Continual (sequential) training and multitask (simultaneous) training are often attempting to solve the same overall objective: to find a solution that performs well on all considered tasks. The main difference is in the training regimes, where continual learning can only have access to one task at a time, which for neural networks typically leads to catastrophic forgetting. That is, the solution… ▽ More

    Submitted 9 October, 2020; originally announced October 2020.

  11. arXiv:2006.12620  [pdf, other

    cs.LG cs.AI

    A maximum-entropy approach to off-policy evaluation in average-reward MDPs

    Authors: Nevena Lazic, Dong Yin, Mehrdad Farajtabar, Nir Levine, Dilan Gorur, Chris Harris, Dale Schuurmans

    Abstract: This work focuses on off-policy evaluation (OPE) with function approximation in infinite-horizon undiscounted Markov decision processes (MDPs). For MDPs that are ergodic and linear (i.e. where rewards and dynamics are linear in some known features), we provide the first finite-sample OPE error bound, extending existing results beyond the episodic and discounted cases. In a more general setting, wh… ▽ More

    Submitted 17 June, 2020; originally announced June 2020.

  12. arXiv:1902.02767  [pdf, other

    cs.LG stat.ML

    Hybrid Models with Deep and Invertible Features

    Authors: Eric Nalisnick, Akihiro Matsukawa, Yee Whye Teh, Dilan Gorur, Balaji Lakshminarayanan

    Abstract: We propose a neural hybrid model consisting of a linear model defined on a set of features computed by a deep, invertible transformation (i.e. a normalizing flow). An attractive property of our model is that both p(features), the density of the features, and p(targets | features), the predictive distribution, can be computed exactly in a single feed-forward pass. We show that our hybrid model, des… ▽ More

    Submitted 29 May, 2019; v1 submitted 7 February, 2019; originally announced February 2019.

    Comments: ICML 2019

  13. arXiv:1810.09136  [pdf, other

    stat.ML cs.LG

    Do Deep Generative Models Know What They Don't Know?

    Authors: Eric Nalisnick, Akihiro Matsukawa, Yee Whye Teh, Dilan Gorur, Balaji Lakshminarayanan

    Abstract: A neural network deployed in the wild may be asked to make predictions for inputs that were drawn from a different distribution than that of the training data. A plethora of work has demonstrated that it is easy to find or synthesize inputs for which a neural network is highly confident yet wrong. Generative models are widely viewed to be robust to such mistaken confidence as modeling the density… ▽ More

    Submitted 24 February, 2019; v1 submitted 22 October, 2018; originally announced October 2018.

    Comments: ICLR 2019