-
Universal consistency of the $k$-NN rule in metric spaces and Nagata dimension. II
Abstract: We continue to investigate the $k$ nearest neighbour ($k$-NN) learning rule in complete separable metric spaces. Thanks to the results of Cérou and Guyader (2006) and Preiss (1983), this rule is known to be universally consistent in every such metric space that is sigma-finite dimensional in the sense of Nagata. Here we show that the rule is strongly universally consistent in such spaces in the ab… ▽ More
Submitted 20 March, 2024; v1 submitted 26 May, 2023; originally announced May 2023.
Comments: Latex 2e, 27 pages, 1 figure. Minor revisions to conform with the last set of journal page proofs: two typos corrected, the bibliography rearranged in the order of citations (the ESAIM:PS home style), and two articles that were no longer cited removed
MSC Class: 62H30; 54F45
Journal ref: ESAIM Probability & Statistics 28(2024), 132-160
-
arXiv:2005.01886 [pdf, ps, other]
A learning problem whose consistency is equivalent to the non-existence of real-valued measurable cardinals
Abstract: We show that the $k$-nearest neighbour learning rule is universally consistent in a metric space $X$ if and only if it is universally consistent in every separable subspace of $X$ and the density of $X$ is less than every real-measurable cardinal. In particular, the $k$-NN classifier is universally consistent in every metric space whose separable subspaces are sigma-finite dimensional in the sense… ▽ More
Submitted 4 May, 2020; originally announced May 2020.
Comments: 16 pp., journal macros
MSC Class: 62H30; 54F45; 03E55 ACM Class: I.2.6
Journal ref: Addendum was revised and published as a separate paper: On a result of K P. Hart about non-existence of measurable solutions to the discrete expectation maximization problem, Comment. Math. Univ. Carolin. 64 (2023), 353--358
-
Universal consistency of the $k$-NN rule in metric spaces and Nagata dimension
Abstract: The $k$ nearest neighbour learning rule (under the uniform distance tie breaking) is universally consistent in every metric space $X$ that is sigma-finite dimensional in the sense of Nagata. This was pointed out by Cérou and Guyader (2006) as a consequence of the main result by those authors, combined with a theorem in real analysis sketched by D. Preiss (1971) (and elaborated in detail by Assouad… ▽ More
Submitted 14 June, 2020; v1 submitted 28 February, 2020; originally announced March 2020.
Comments: 21 pp., 2 figures, latex with ESAIM: Probability and Statistics macros, a version with the two anonymous referees comments taken into account
MSC Class: 62H30; 54F45
Journal ref: ESAIM: Probability and Statistics 24 (2020), 914--934
-
Elementos da teoria de aprendizagem de máquina supervisionada
Abstract: This is a set of lecture notes for an introductory course (advanced undergaduates or the 1st graduate course) on foundations of supervised machine learning (in Portuguese). The topics include: the geometry of the Hamming cube, concentration of measure, shattering and VC dimension, Glivenko-Cantelli classes, PAC learnability, universal consistency and the k-NN classifier in metric spaces, dimension… ▽ More
Submitted 5 October, 2019; originally announced October 2019.
Comments: 390 pp. + vii, in Portuguese, a preliminary version, to be published by IMPA as a book of lectures of the 23nd Brazilian Math Colloquium (July 28 - Aug 2, 2019), submitted to arXiv upon IMPA permission
MSC Class: 68Q32; 62H30; 68T05; 68T10