Skip to main content

Showing 1–12 of 12 results for author: Goliński, A

.
  1. arXiv:2505.23996  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Is Your Model Fairly Certain? Uncertainty-Aware Fairness Evaluation for LLMs

    Authors: Yinong Oliver Wang, Nivedha Sivakumar, Falaah Arif Khan, Rin Metcalf Susa, Adam Golinski, Natalie Mackraz, Barry-John Theobald, Luca Zappella, Nicholas Apostoloff

    Abstract: The recent rapid adoption of large language models (LLMs) highlights the critical need for benchmarking their fairness. Conventional fairness metrics, which focus on discrete accuracy-based evaluations (i.e., prediction correctness), fail to capture the implicit impact of model uncertainty (e.g., higher model confidence about one group over another despite similar accuracy). To address this limita… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

    Comments: 9 pages, 8 figures, and 1 table in main paper. Supplementary appendix attached. Accepted at ICML 2025

  2. arXiv:2505.20295  [pdf, ps, other

    cs.CL cs.AI cs.LG stat.ML

    Self-reflective Uncertainties: Do LLMs Know Their Internal Answer Distribution?

    Authors: Michael Kirchhof, Luca Füger, Adam Goliński, Eeshan Gunesh Dhekane, Arno Blaas, Sinead Williamson

    Abstract: To reveal when a large language model (LLM) is uncertain about a response, uncertainty quantification commonly produces percentage numbers along with the output. But is this all we can do? We argue that in the output space of LLMs, the space of strings, exist strings expressive enough to summarize the distribution over output strings the LLM deems possible. We lay a foundation for this new avenue… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

  3. arXiv:2504.13677  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Revisiting Uncertainty Quantification Evaluation in Language Models: Spurious Interactions with Response Length Bias Results

    Authors: Andrea Santilli, Adam Golinski, Michael Kirchhof, Federico Danieli, Arno Blaas, Miao Xiong, Luca Zappella, Sinead Williamson

    Abstract: Uncertainty Quantification (UQ) in Language Models (LMs) is key to improving their safety and reliability. Evaluations often use metrics like AUROC to assess how well UQ methods (e.g., negative sequence probabilities) correlate with task correctness functions (e.g., ROUGE-L). We show that mutual biases--when both UQ methods and correctness functions are biased by the same factors--systematically d… ▽ More

    Submitted 4 June, 2025; v1 submitted 18 April, 2025; originally announced April 2025.

    Comments: Accepted at ACL 2025 (Main)

  4. arXiv:2410.19575  [pdf, other

    stat.ML cs.LG

    Considerations for Distribution Shift Robustness of Diagnostic Models in Healthcare

    Authors: Arno Blaas, Adam Goliński, Andrew Miller, Luca Zappella, Jörn-Henrik Jacobsen, Christina Heinze-Deml

    Abstract: We consider robustness to distribution shifts in the context of diagnostic models in healthcare, where the prediction target $Y$, e.g., the presence of a disease, is causally upstream of the observations $X$, e.g., a biomarker. Distribution shifts may occur, for instance, when the training data is collected in a domain with patients having particular demographic characteristics while the model is… ▽ More

    Submitted 25 October, 2024; originally announced October 2024.

  5. arXiv:2307.10907  [pdf, other

    cs.LG

    The Role of Entropy and Reconstruction in Multi-View Self-Supervised Learning

    Authors: Borja Rodríguez-Gálvez, Arno Blaas, Pau Rodríguez, Adam Goliński, Xavier Suau, Jason Ramapuram, Dan Busbridge, Luca Zappella

    Abstract: The mechanisms behind the success of multi-view self-supervised learning (MVSSL) are not yet fully understood. Contrastive MVSSL methods have been studied through the lens of InfoNCE, a lower bound of the Mutual Information (MI). However, the relation between other MVSSL methods and MI remains unclear. We consider a different lower bound on the MI consisting of an entropy and a reconstruction term… ▽ More

    Submitted 9 December, 2023; v1 submitted 20 July, 2023; originally announced July 2023.

    Comments: 18 pages: 9 of main text, 2 of references, and 7 of supplementary material [Updated typo in page 6 (Section 3.2)]. Appears in the proceedings of ICML 2023

  6. arXiv:2206.14882  [pdf, other

    stat.ML cs.LG

    LIDL: Local Intrinsic Dimension Estimation Using Approximate Likelihood

    Authors: Piotr Tempczyk, Rafał Michaluk, Łukasz Garncarek, Przemysław Spurek, Jacek Tabor, Adam Goliński

    Abstract: Most of the existing methods for estimating the local intrinsic dimension of a data distribution do not scale well to high-dimensional data. Many of them rely on a non-parametric nearest neighbors approach which suffers from the curse of dimensionality. We attempt to address that challenge by proposing a novel approach to the problem: Local Intrinsic Dimension estimation using approximate Likeliho… ▽ More

    Submitted 11 July, 2022; v1 submitted 29 June, 2022; originally announced June 2022.

    Comments: ICML 2022

  7. arXiv:2201.12904  [pdf, other

    cs.LG cs.CV eess.IV stat.ML

    COIN++: Neural Compression Across Modalities

    Authors: Emilien Dupont, Hrushikesh Loya, Milad Alizadeh, Adam Goliński, Yee Whye Teh, Arnaud Doucet

    Abstract: Neural compression algorithms are typically based on autoencoders that require specialized encoder and decoder architectures for different data modalities. In this paper, we propose COIN++, a neural compression framework that seamlessly handles a wide range of data modalities. Our approach is based on converting data to implicit neural representations, i.e. neural functions that map coordinates (s… ▽ More

    Submitted 8 December, 2022; v1 submitted 30 January, 2022; originally announced January 2022.

    Comments: TMLR camera ready

  8. arXiv:2106.04953  [pdf, other

    cs.LG cs.PL

    Expectation Programming: Adapting Probabilistic Programming Systems to Estimate Expectations Efficiently

    Authors: Tim Reichelt, Adam Goliński, Luke Ong, Tom Rainforth

    Abstract: We show that the standard computational pipeline of probabilistic programming systems (PPSs) can be inefficient for estimating expectations and introduce the concept of expectation programming to address this. In expectation programming, the aim of the backend inference engine is to directly estimate expected return values of programs, as opposed to approximating their conditional distributions. T… ▽ More

    Submitted 21 June, 2022; v1 submitted 9 June, 2021; originally announced June 2021.

    Comments: Accepted for the 38th Conference on Uncertainty in Artificial Intelligence (UAI 2022)

  9. arXiv:2103.03123  [pdf, other

    eess.IV cs.CV cs.LG

    COIN: COmpression with Implicit Neural representations

    Authors: Emilien Dupont, Adam Goliński, Milad Alizadeh, Yee Whye Teh, Arnaud Doucet

    Abstract: We propose a new simple approach for image compression: instead of storing the RGB values for each pixel of an image, we store the weights of a neural network overfitted to the image. Specifically, to encode an image, we fit it with an MLP which maps pixel locations to RGB values. We then quantize and store the weights of this MLP as a code for the image. To decode the image, we simply evaluate th… ▽ More

    Submitted 10 April, 2021; v1 submitted 3 March, 2021; originally announced March 2021.

    Comments: Added qualitative comparisons and link to github repo https://github.com/EmilienDupont/coin

  10. arXiv:2004.04342  [pdf, other

    cs.LG cs.CV stat.ML

    Feedback Recurrent Autoencoder for Video Compression

    Authors: Adam Golinski, Reza Pourreza, Yang Yang, Guillaume Sautiere, Taco S Cohen

    Abstract: Recent advances in deep generative modeling have enabled efficient modeling of high dimensional data distributions and opened up a new horizon for solving data compression problems. Specifically, autoencoder based learned image or video compression solutions are emerging as strong competitors to traditional approaches. In this work, We propose a new network architecture, based on common and well s… ▽ More

    Submitted 8 April, 2020; originally announced April 2020.

  11. arXiv:1907.08082  [pdf, other

    stat.ML cs.LG stat.CO

    Amortized Monte Carlo Integration

    Authors: Adam Goliński, Frank Wood, Tom Rainforth

    Abstract: Current approaches to amortizing Bayesian inference focus solely on approximating the posterior distribution. Typically, this approximation is, in turn, used to calculate expectations for one or more target functions - a computational pipeline which is inefficient when the target function(s) are known upfront. In this paper, we address this inefficiency by introducing AMCI, a method for amortizing… ▽ More

    Submitted 18 July, 2019; originally announced July 2019.

    Comments: Awarded Best Paper Honourable Mention at International Conference on Machine Learning (ICML) 2019

  12. arXiv:1712.00287  [pdf, other

    stat.ML cs.LG

    Faithful Inversion of Generative Models for Effective Amortized Inference

    Authors: Stefan Webb, Adam Golinski, Robert Zinkov, N. Siddharth, Tom Rainforth, Yee Whye Teh, Frank Wood

    Abstract: Inference amortization methods share information across multiple posterior-inference problems, allowing each to be carried out more efficiently. Generally, they require the inversion of the dependency structure in the generative model, as the modeller must learn a mapping from observations to distributions approximating the posterior. Previous approaches have involved inverting the dependency stru… ▽ More

    Submitted 29 November, 2018; v1 submitted 1 December, 2017; originally announced December 2017.

    Comments: To appear at the 32nd Conference on Neural Information Processing Systems (NeurIPS 2018), Montreal, Canada