-
Validity and efficiency of the conformal CUSUM procedure
Abstract: In this paper we study the validity and efficiency of a conformal version of the CUSUM procedure for change detection both experimentally and theoretically.
Submitted 4 December, 2024; originally announced December 2024.
Comments: 19 pages, 7 figures
MSC Class: 62G10 (Primary) 68T05; 68Q32; 62L10 (Secondary)
-
Calibrated Large Language Models for Binary Question Answering
Abstract: Quantifying the uncertainty of predictions made by large language models (LLMs) in binary text classification tasks remains a challenge. Calibration, in the context of LLMs, refers to the alignment between the model's predicted probabilities and the actual correctness of its predictions. A well-calibrated model should produce probabilities that accurately reflect the likelihood of its predictions… ▽ More
Submitted 1 July, 2024; originally announced July 2024.
Comments: Accepted to COPA 2024 (13th Symposium on Conformal and Probabilistic Prediction with Applications)
-
Conformal testing: binary case with Markov alternatives
Abstract: We continue study of conformal testing in binary model situations. In this note we consider Markov alternatives to the null hypothesis of exchangeability. We propose two new classes of conformal test martingales; one class is statistically efficient in our experiments, and the other class partially sacrifices statistical efficiency to gain computational efficiency.
Submitted 2 November, 2021; originally announced November 2021.
Comments: 8 pages, 8 figures
MSC Class: 68Q32 (Primary) 62G10; 60G42 (Secondary)
-
Protected probabilistic classification
Abstract: This paper proposes a way of protecting probabilistic prediction models against changes in the data distribution, concentrating on the case of classification and paying particular attention to binary classification. This is important in applications of machine learning, where the quality of a trained prediction algorithm may drop significantly in the process of its exploitation. Our techniques are… ▽ More
Submitted 22 October, 2021; v1 submitted 4 July, 2021; originally announced July 2021.
Comments: 23 pages, 14 figures, and 4 tables
MSC Class: 68Q32 (Primary) 68T05; 60G25; 60G42; 62F03; 62M20 (Secondary)
-
Retrain or not retrain: Conformal test martingales for change-point detection
Abstract: We argue for supplementing the process of training a prediction algorithm by setting up a scheme for detecting the moment when the distribution of the data changes and the algorithm needs to be retrained. Our proposed schemes are based on exchangeability martingales, i.e., processes that are martingales under any exchangeable distribution for the data. Our method, based on conformal prediction, is… ▽ More
Submitted 20 February, 2021; originally announced February 2021.
Comments: 22 pages, 19 figures, 3 tables
MSC Class: 68Q32 (Primary) 62G10; 60G42; 68T05 (Secondary)
-
Computationally efficient versions of conformal predictive distributions
Abstract: Conformal predictive systems are a recent modification of conformal predictors that output, in regression problems, probability distributions for labels of test observations rather than set predictions. The extra information provided by conformal predictive systems may be useful, e.g., in decision making problems. Conformal predictive systems inherit the relative computational inefficiency of conf… ▽ More
Submitted 3 November, 2019; originally announced November 2019.
Comments: 31 pages, 14 figures, 1 table. The conference version published in the Proceedings of COPA 2018, and the journal version is to appear in Neurocomputing
MSC Class: 68T05
-
Conformal calibrators
Abstract: Most existing examples of full conformal predictive systems, split-conformal predictive systems, and cross-conformal predictive systems impose severe restrictions on the adaptation of predictive distributions to the test object at hand. In this paper we develop split-conformal and cross-conformal predictive systems that are fully adaptive. Our method consists in calibrating existing predictive sys… ▽ More
Submitted 18 February, 2019; originally announced February 2019.
Comments: 10 pages, 2 figures
Report number: 23 MSC Class: 68T05 ACM Class: I.2.6
-
Conformal predictive distributions with kernels
Abstract: This paper reviews the checkered history of predictive distributions in statistics and discusses two developments, one from recent literature and the other new. The first development is bringing predictive distributions into machine learning, whose early development was so deeply influenced by two remarkable groups at the Institute of Automation and Remote Control. The second development is combin… ▽ More
Submitted 24 October, 2017; originally announced October 2017.
Comments: 20 pages, 3 figures, prepared for the Proceedings of the Braverman Readings (Boston, 28-30 April 2017)
MSC Class: 68Q32 (Primary) 68T05; 62M20; 60G25; 62J07; 62G08; 62F15 (Secondary)
-
Conformal Predictors for Compound Activity Prediction
Abstract: The paper presents an application of Conformal Predictors to a chemoinformatics problem of identifying activities of chemical compounds. The paper addresses some specific challenges of this domain: a large number of compounds (training examples), high-dimensionality of feature space, sparseness and a strong class imbalance. A variant of conformal predictors called Inductive Mondrian Conformal Pred… ▽ More
Submitted 14 March, 2016; originally announced March 2016.
Comments: 17 pages, 5 figures
-
Criteria of efficiency for conformal prediction
Abstract: We study optimal conformity measures for various criteria of efficiency of classification in an idealised setting. This leads to an important class of criteria of efficiency that we call probabilistic; it turns out that the most standard criteria of efficiency used in literature on conformal prediction are not probabilistic unless the problem of classification is binary. We consider both unconditi… ▽ More
Submitted 14 September, 2016; v1 submitted 14 March, 2016; originally announced March 2016.
Comments: 31 pages
MSC Class: 68T05 ACM Class: I.2.6
-
Regression Conformal Prediction with Nearest Neighbours
Abstract: In this paper we apply Conformal Prediction (CP) to the k-Nearest Neighbours Regression (k-NNR) algorithm and propose ways of extending the typical nonconformity measure used for regression so far. Unlike traditional regression methods which produce point predictions, Conformal Predictors output predictive regions that satisfy a given confidence level. The regions produced by any Conformal Predict… ▽ More
Submitted 16 January, 2014; originally announced January 2014.
Journal ref: Journal Of Artificial Intelligence Research, Volume 40, pages 815-840, 2011
-
Learning by Transduction
Abstract: We describe a method for predicting a classification of an object given classifications of the objects in the training set, assuming that the pairs object/classification are generated by an i.i.d. process from a continuous probability distribution. Our method is a modification of Vapnik's support-vector machine; its main novelty is that it gives not only the prediction itself but also a practicabl… ▽ More
Submitted 30 January, 2013; originally announced January 2013.
Comments: Appears in Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence (UAI1998)
Report number: UAI-P-1998-PG-148-155
-
On-line Prediction with Kernels and the Complexity Approximation Principle
Abstract: The paper describes an application of Aggregating Algorithm to the problem of regression. It generalizes earlier results concerned with plain linear regression to kernel techniques and presents an on-line algorithm which performs nearly as well as any oblivious kernel predictor. The paper contains the derivation of an estimate on the performance of this algorithm. The estimate is then used to deri… ▽ More
Submitted 11 July, 2012; originally announced July 2012.
Comments: Appears in Proceedings of the Twentieth Conference on Uncertainty in Artificial Intelligence (UAI2004)
Report number: UAI-P-2004-PG-170-176
-
Plug-in martingales for testing exchangeability on-line
Abstract: A standard assumption in machine learning is the exchangeability of data, which is equivalent to assuming that the examples are generated from the same probability distribution independently. This paper is devoted to testing the assumption of exchangeability on-line: the examples arrive one by one, and after receiving each example we would like to have a valid measure of the degree to which the as… ▽ More
Submitted 28 June, 2012; v1 submitted 15 April, 2012; originally announced April 2012.
Comments: 8 pages, 7 figures; ICML 2012 Conference Proceedings
Report number: On-line Compression Modelling Project (New Series), Working Paper 04 MSC Class: 62G10 ACM Class: I.2.6
-
arXiv:0904.1579 [pdf, ps, other]
Online prediction of ovarian cancer
Abstract: In this paper we apply computer learning methods to diagnosing ovarian cancer using the level of the standard biomarker CA125 in conjunction with information provided by mass-spectrometry. We are working with a new data set collected over a period of 7 years. Using the level of CA125 and mass-spectrometry peaks, our algorithm gives probability predictions for the disease. To estimate classificat… ▽ More
Submitted 9 April, 2009; originally announced April 2009.
Comments: 11 pages, 4 figures, uses llncs.cls
ACM Class: I.2.1
-
arXiv:cs/0611011 [pdf, ps, other]
Hedging predictions in machine learning
Abstract: Recent advances in machine learning make it possible to design efficient prediction algorithms for data sets with huge numbers of parameters. This paper describes a new technique for "hedging" the predictions output by many such algorithms, including support vector machines, kernel ridge regression, kernel nearest neighbours, and by many other state-of-the-art methods. The hedged predictions for… ▽ More
Submitted 2 November, 2006; originally announced November 2006.
Comments: 24 pages; 9 figures; 2 tables; a version of this paper (with discussion and rejoinder) is to appear in "The Computer Journal"
Report number: On-line Compression Modelling Project (New Series), Working Paper 02
Journal ref: Computer Journal, 50:151-177, 2007
-
arXiv:cs/0505079 [pdf, ps, other]
Application of Kolmogorov complexity and universal codes to identity testing and nonparametric testing of serial independence for time series
Abstract: We show that Kolmogorov complexity and such its estimators as universal codes (or data compression methods) can be applied for hypotheses testing in a framework of classical mathematical statistics. The methods for identity testing and nonparametric testing of serial independence for time series are suggested.
Submitted 29 May, 2005; originally announced May 2005.
Comments: submitted