-
Weighting vectors for machine learning: numerical harmonic analysis applied to boundary detection
Authors:
Eric Bunch,
Jeffery Kline,
Daniel Dickinson,
Suhaas Bhat,
Glenn Fung
Abstract:
Metric space magnitude, an active field of research in algebraic topology, is a scalar quantity that summarizes the effective number of distinct points that live in a general metric space. The {\em weighting vector} is a closely-related concept that captures, in a nontrivial way, much of the underlying geometry of the original metric space. Recent work has demonstrated that when the metric space i…
▽ More
Metric space magnitude, an active field of research in algebraic topology, is a scalar quantity that summarizes the effective number of distinct points that live in a general metric space. The {\em weighting vector} is a closely-related concept that captures, in a nontrivial way, much of the underlying geometry of the original metric space. Recent work has demonstrated that when the metric space is Euclidean, the weighting vector serves as an effective tool for boundary detection. We recast this result and show the weighting vector may be viewed as a solution to a kernelized SVM. As one consequence, we apply this new insight to the task of outlier detection, and we demonstrate performance that is competitive or exceeds performance of state-of-the-art techniques on benchmark data sets. Under mild assumptions, we show the weighting vector, which has computational cost of matrix inversion, can be efficiently approximated in linear time. We show how nearest neighbor methods can approximate solutions to the minimization problems defined by SVMs.
△ Less
Submitted 1 June, 2021;
originally announced June 2021.
-
Practical applications of metric space magnitude and weighting vectors
Authors:
Eric Bunch,
Daniel Dickinson,
Jeffery Kline,
Glenn Fung
Abstract:
Metric space magnitude, an active subject of research in algebraic topology, originally arose in the context of biology, where it was used to represent the effective number of distinct species in an environment. In a more general setting, the magnitude of a metric space is a real number that aims to quantify the effective number of distinct points in the space. The contribution of each point to a…
▽ More
Metric space magnitude, an active subject of research in algebraic topology, originally arose in the context of biology, where it was used to represent the effective number of distinct species in an environment. In a more general setting, the magnitude of a metric space is a real number that aims to quantify the effective number of distinct points in the space. The contribution of each point to a metric space's global magnitude, which is encoded by the {\em weighting vector}, captures much of the underlying geometry of the original metric space.
Surprisingly, when the metric space is Euclidean, the weighting vector also serves as an effective tool for boundary detection. This allows the weighting vector to serve as the foundation of novel algorithms for classic machine learning tasks such as classification, outlier detection and active learning. We demonstrate, using experiments and comparisons on classic benchmark datasets, the promise of the proposed magnitude and weighting vector-based approaches.
△ Less
Submitted 2 July, 2020; v1 submitted 24 June, 2020;
originally announced June 2020.
-
Approximating the Convex Hull via Metric Space Magnitude
Authors:
Glenn Fung,
Eric Bunch,
Dan Dickinson
Abstract:
Magnitude of a finite metric space and the related notion of magnitude functions on metric spaces is an active area of research in algebraic topology. Magnitude originally arose in the context of biology, where it represents the number of effective species in an environment; when applied to a one-parameter family of metric spaces $tX$ with scale parameter $t$, the magnitude captures much of the un…
▽ More
Magnitude of a finite metric space and the related notion of magnitude functions on metric spaces is an active area of research in algebraic topology. Magnitude originally arose in the context of biology, where it represents the number of effective species in an environment; when applied to a one-parameter family of metric spaces $tX$ with scale parameter $t$, the magnitude captures much of the underlying geometry of the space. Prior work has mostly focussed on properties of magnitude in a global sense; in this paper we restrict the sets to finite subsets of Euclidean space and investigate its individual components. We give an explicit formula for the corrected inclusion-exclusion principle, and define a quantity associated with each point, called the $\textit{moment}$ which gives an intrinsic ordering to the points. We exploit this in order to form an algorithm which approximates the convex hull.
△ Less
Submitted 7 August, 2019;
originally announced August 2019.
-
Diophantine approximation on planar curves and the distribution of rational points
Authors:
Victor Beresnevich,
Detta Dickinson,
Sanju Velani
Abstract:
Let $\cal C$ be a non--degenerate planar curve and for a real, positive decreasing function $ψ$ let $\cal C(ψ)$ denote the set of simultaneously $ψ$--approximable points lying on $\cal C$. We show that $\cal C$ is of Khintchine type for divergence; i.e. if a certain sum diverges then the one-dimensional Lebesgue measure on $\cal C$ of $\cal C(ψ)$ is full. We also obtain the Hausdorff measure ana…
▽ More
Let $\cal C$ be a non--degenerate planar curve and for a real, positive decreasing function $ψ$ let $\cal C(ψ)$ denote the set of simultaneously $ψ$--approximable points lying on $\cal C$. We show that $\cal C$ is of Khintchine type for divergence; i.e. if a certain sum diverges then the one-dimensional Lebesgue measure on $\cal C$ of $\cal C(ψ)$ is full. We also obtain the Hausdorff measure analogue of the divergent Khintchine type result. In the case that $\cal C$ is a rational quadric the convergence counterparts of the divergent results are also obtained. Furthermore, for functions $ψ$ with lower order in a critical range we determine a general, exact formula for the Hausdorff dimension of $\cal C(ψ)$. These results constitute the first precise and general results in the theory of simultaneous Diophantine approximation on manifolds.
△ Less
Submitted 26 April, 2006; v1 submitted 14 January, 2004;
originally announced January 2004.
-
Measure theoretic laws for lim sup sets
Authors:
Victor Beresnevich,
Detta Dickinson,
Sanju Velani
Abstract:
Given a compact metric space (X,d) equipped with a non-atomic, probability measure m and a real, positive decreasing function p we consider a `natural' class of limsup subsets La(p) of X. The classical limsup sets of `well approximable' numbers in the theory of metric Diophantine approximation fall within this class. We show that m(La(p))>0 under a `global ubiquity' hypothesis and the divergence…
▽ More
Given a compact metric space (X,d) equipped with a non-atomic, probability measure m and a real, positive decreasing function p we consider a `natural' class of limsup subsets La(p) of X. The classical limsup sets of `well approximable' numbers in the theory of metric Diophantine approximation fall within this class. We show that m(La(p))>0 under a `global ubiquity' hypothesis and the divergence of a certain m--volume sum. In fact, under a `local ubiquity' hypothesis we show that La(p) has full measure; i.e. m(La(p)) =1 . This is the analogue of the divergent part of the classical Khintchine-Groshev theorem in number theory. Moreover, if the 'local ubiquity' hypothesis is satisfied and a certain f-volume sum diverges then we are able to show that the Hausdorff f--measure of La(p) is infinite. A simple consequence of this is a lower bound for the Hausdorff dimension of La(p) and various results concerning the dimension and measure of related `exact order' sets.
Essentially, the notion of `local ubiquity' unexpectedly unifies `divergent' type results for La(p) with respect to the natural measure m and general Hausdorff measures. Applications of the general framework include those from number theory, Kleinian groups and rational maps. Even for the classical limsup sets of `well approximable' numbers, the framework strengthens the classical Hausdorff measure result of Jarnik and opens up the Duffin-Schaeffer conjecture for Hausdorff measures.
△ Less
Submitted 17 December, 2004; v1 submitted 12 January, 2004;
originally announced January 2004.