Skip to main content

Showing 1–17 of 17 results for author: Gammerman, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2412.03464  [pdf, other

    math.ST cs.LG

    Validity and efficiency of the conformal CUSUM procedure

    Authors: Vladimir Vovk, Ilia Nouretdinov, Alex Gammerman

    Abstract: In this paper we study the validity and efficiency of a conformal version of the CUSUM procedure for change detection both experimentally and theoretically.

    Submitted 4 December, 2024; originally announced December 2024.

    Comments: 19 pages, 7 figures

    MSC Class: 62G10 (Primary) 68T05; 68Q32; 62L10 (Secondary)

  2. arXiv:2407.01122  [pdf, other

    cs.CL cs.LG

    Calibrated Large Language Models for Binary Question Answering

    Authors: Patrizio Giovannotti, Alexander Gammerman

    Abstract: Quantifying the uncertainty of predictions made by large language models (LLMs) in binary text classification tasks remains a challenge. Calibration, in the context of LLMs, refers to the alignment between the model's predicted probabilities and the actual correctness of its predictions. A well-calibrated model should produce probabilities that accurately reflect the likelihood of its predictions… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: Accepted to COPA 2024 (13th Symposium on Conformal and Probabilistic Prediction with Applications)

  3. arXiv:2111.01885  [pdf, other

    math.ST cs.LG

    Conformal testing: binary case with Markov alternatives

    Authors: Vladimir Vovk, Ilia Nouretdinov, Alex Gammerman

    Abstract: We continue study of conformal testing in binary model situations. In this note we consider Markov alternatives to the null hypothesis of exchangeability. We propose two new classes of conformal test martingales; one class is statistically efficient in our experiments, and the other class partially sacrifices statistical efficiency to gain computational efficiency.

    Submitted 2 November, 2021; originally announced November 2021.

    Comments: 8 pages, 8 figures

    MSC Class: 68Q32 (Primary) 62G10; 60G42 (Secondary)

  4. arXiv:2107.01726  [pdf, other

    cs.LG

    Protected probabilistic classification

    Authors: Vladimir Vovk, Ivan Petej, Alex Gammerman

    Abstract: This paper proposes a way of protecting probabilistic prediction models against changes in the data distribution, concentrating on the case of classification and paying particular attention to binary classification. This is important in applications of machine learning, where the quality of a trained prediction algorithm may drop significantly in the process of its exploitation. Our techniques are… ▽ More

    Submitted 22 October, 2021; v1 submitted 4 July, 2021; originally announced July 2021.

    Comments: 23 pages, 14 figures, and 4 tables

    MSC Class: 68Q32 (Primary) 68T05; 60G25; 60G42; 62F03; 62M20 (Secondary)

  5. arXiv:2102.10439  [pdf, other

    cs.LG stat.ML

    Retrain or not retrain: Conformal test martingales for change-point detection

    Authors: Vladimir Vovk, Ivan Petej, Ilia Nouretdinov, Ernst Ahlberg, Lars Carlsson, Alex Gammerman

    Abstract: We argue for supplementing the process of training a prediction algorithm by setting up a scheme for detecting the moment when the distribution of the data changes and the algorithm needs to be retrained. Our proposed schemes are based on exchangeability martingales, i.e., processes that are martingales under any exchangeable distribution for the data. Our method, based on conformal prediction, is… ▽ More

    Submitted 20 February, 2021; originally announced February 2021.

    Comments: 22 pages, 19 figures, 3 tables

    MSC Class: 68Q32 (Primary) 62G10; 60G42; 68T05 (Secondary)

  6. arXiv:1911.00941  [pdf, other

    cs.LG stat.ML

    Computationally efficient versions of conformal predictive distributions

    Authors: Vladimir Vovk, Ivan Petej, Ilia Nouretdinov, Valery Manokhin, Alex Gammerman

    Abstract: Conformal predictive systems are a recent modification of conformal predictors that output, in regression problems, probability distributions for labels of test observations rather than set predictions. The extra information provided by conformal predictive systems may be useful, e.g., in decision making problems. Conformal predictive systems inherit the relative computational inefficiency of conf… ▽ More

    Submitted 3 November, 2019; originally announced November 2019.

    Comments: 31 pages, 14 figures, 1 table. The conference version published in the Proceedings of COPA 2018, and the journal version is to appear in Neurocomputing

    MSC Class: 68T05

  7. arXiv:1902.06579  [pdf, other

    cs.LG stat.ML

    Conformal calibrators

    Authors: Vladimir Vovk, Ivan Petej, Paolo Toccaceli, Alex Gammerman

    Abstract: Most existing examples of full conformal predictive systems, split-conformal predictive systems, and cross-conformal predictive systems impose severe restrictions on the adaptation of predictive distributions to the test object at hand. In this paper we develop split-conformal and cross-conformal predictive systems that are fully adaptive. Our method consists in calibrating existing predictive sys… ▽ More

    Submitted 18 February, 2019; originally announced February 2019.

    Comments: 10 pages, 2 figures

    Report number: 23 MSC Class: 68T05 ACM Class: I.2.6

  8. arXiv:1710.08894  [pdf, other

    cs.LG stat.ML

    Conformal predictive distributions with kernels

    Authors: Vladimir Vovk, Ilia Nouretdinov, Valery Manokhin, Alex Gammerman

    Abstract: This paper reviews the checkered history of predictive distributions in statistics and discusses two developments, one from recent literature and the other new. The first development is bringing predictive distributions into machine learning, whose early development was so deeply influenced by two remarkable groups at the Institute of Automation and Remote Control. The second development is combin… ▽ More

    Submitted 24 October, 2017; originally announced October 2017.

    Comments: 20 pages, 3 figures, prepared for the Proceedings of the Braverman Readings (Boston, 28-30 April 2017)

    MSC Class: 68Q32 (Primary) 68T05; 62M20; 60G25; 62J07; 62G08; 62F15 (Secondary)

  9. arXiv:1603.04506  [pdf, other

    cs.LG

    Conformal Predictors for Compound Activity Prediction

    Authors: Paolo Toccacheli, Ilia Nouretdinov, Alexander Gammerman

    Abstract: The paper presents an application of Conformal Predictors to a chemoinformatics problem of identifying activities of chemical compounds. The paper addresses some specific challenges of this domain: a large number of compounds (training examples), high-dimensionality of feature space, sparseness and a strong class imbalance. A variant of conformal predictors called Inductive Mondrian Conformal Pred… ▽ More

    Submitted 14 March, 2016; originally announced March 2016.

    Comments: 17 pages, 5 figures

  10. arXiv:1603.04416  [pdf, other

    cs.LG

    Criteria of efficiency for conformal prediction

    Authors: Vladimir Vovk, Ilia Nouretdinov, Valentina Fedorova, Ivan Petej, Alex Gammerman

    Abstract: We study optimal conformity measures for various criteria of efficiency of classification in an idealised setting. This leads to an important class of criteria of efficiency that we call probabilistic; it turns out that the most standard criteria of efficiency used in literature on conformal prediction are not probabilistic unless the problem of classification is binary. We consider both unconditi… ▽ More

    Submitted 14 September, 2016; v1 submitted 14 March, 2016; originally announced March 2016.

    Comments: 31 pages

    MSC Class: 68T05 ACM Class: I.2.6

  11. Regression Conformal Prediction with Nearest Neighbours

    Authors: Harris Papadopoulos, Vladimir Vovk, Alex Gammerman

    Abstract: In this paper we apply Conformal Prediction (CP) to the k-Nearest Neighbours Regression (k-NNR) algorithm and propose ways of extending the typical nonconformity measure used for regression so far. Unlike traditional regression methods which produce point predictions, Conformal Predictors output predictive regions that satisfy a given confidence level. The regions produced by any Conformal Predict… ▽ More

    Submitted 16 January, 2014; originally announced January 2014.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 40, pages 815-840, 2011

  12. arXiv:1301.7375  [pdf

    cs.LG stat.ML

    Learning by Transduction

    Authors: Alex Gammerman, Volodya Vovk, Vladimir Vapnik

    Abstract: We describe a method for predicting a classification of an object given classifications of the objects in the training set, assuming that the pairs object/classification are generated by an i.i.d. process from a continuous probability distribution. Our method is a modification of Vapnik's support-vector machine; its main novelty is that it gives not only the prediction itself but also a practicabl… ▽ More

    Submitted 30 January, 2013; originally announced January 2013.

    Comments: Appears in Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence (UAI1998)

    Report number: UAI-P-1998-PG-148-155

  13. arXiv:1207.4113  [pdf

    cs.LG stat.ML

    On-line Prediction with Kernels and the Complexity Approximation Principle

    Authors: Alex Gammerman, Yuri Kalnishkan, Vladimir Vovk

    Abstract: The paper describes an application of Aggregating Algorithm to the problem of regression. It generalizes earlier results concerned with plain linear regression to kernel techniques and presents an on-line algorithm which performs nearly as well as any oblivious kernel predictor. The paper contains the derivation of an estimate on the performance of this algorithm. The estimate is then used to deri… ▽ More

    Submitted 11 July, 2012; originally announced July 2012.

    Comments: Appears in Proceedings of the Twentieth Conference on Uncertainty in Artificial Intelligence (UAI2004)

    Report number: UAI-P-2004-PG-170-176

  14. arXiv:1204.3251  [pdf, other

    cs.LG stat.ME

    Plug-in martingales for testing exchangeability on-line

    Authors: Valentina Fedorova, Alex Gammerman, Ilia Nouretdinov, Vladimir Vovk

    Abstract: A standard assumption in machine learning is the exchangeability of data, which is equivalent to assuming that the examples are generated from the same probability distribution independently. This paper is devoted to testing the assumption of exchangeability on-line: the examples arrive one by one, and after receiving each example we would like to have a valid measure of the degree to which the as… ▽ More

    Submitted 28 June, 2012; v1 submitted 15 April, 2012; originally announced April 2012.

    Comments: 8 pages, 7 figures; ICML 2012 Conference Proceedings

    Report number: On-line Compression Modelling Project (New Series), Working Paper 04 MSC Class: 62G10 ACM Class: I.2.6

  15. arXiv:0904.1579  [pdf, ps, other

    cs.AI cs.LG

    Online prediction of ovarian cancer

    Authors: Fedor Zhdanov, Vladimir Vovk, Brian Burford, Dmitry Devetyarov, Ilia Nouretdinov, Alex Gammerman

    Abstract: In this paper we apply computer learning methods to diagnosing ovarian cancer using the level of the standard biomarker CA125 in conjunction with information provided by mass-spectrometry. We are working with a new data set collected over a period of 7 years. Using the level of CA125 and mass-spectrometry peaks, our algorithm gives probability predictions for the disease. To estimate classificat… ▽ More

    Submitted 9 April, 2009; originally announced April 2009.

    Comments: 11 pages, 4 figures, uses llncs.cls

    ACM Class: I.2.1

  16. Hedging predictions in machine learning

    Authors: Alexander Gammerman, Vladimir Vovk

    Abstract: Recent advances in machine learning make it possible to design efficient prediction algorithms for data sets with huge numbers of parameters. This paper describes a new technique for "hedging" the predictions output by many such algorithms, including support vector machines, kernel ridge regression, kernel nearest neighbours, and by many other state-of-the-art methods. The hedged predictions for… ▽ More

    Submitted 2 November, 2006; originally announced November 2006.

    Comments: 24 pages; 9 figures; 2 tables; a version of this paper (with discussion and rejoinder) is to appear in "The Computer Journal"

    Report number: On-line Compression Modelling Project (New Series), Working Paper 02

    Journal ref: Computer Journal, 50:151-177, 2007

  17. arXiv:cs/0505079  [pdf, ps, other

    cs.CC

    Application of Kolmogorov complexity and universal codes to identity testing and nonparametric testing of serial independence for time series

    Authors: Boris Ryabko, Jaakko Astola, Alex Gammerman

    Abstract: We show that Kolmogorov complexity and such its estimators as universal codes (or data compression methods) can be applied for hypotheses testing in a framework of classical mathematical statistics. The methods for identity testing and nonparametric testing of serial independence for time series are suggested.

    Submitted 29 May, 2005; originally announced May 2005.

    Comments: submitted