A Mutual Contamination Analysis of Mixed Membership and Partial Label Models

Katz-Samuels, Julian; Scott, Clayton

Statistics > Machine Learning

arXiv:1602.06235 (stat)

[Submitted on 19 Feb 2016]

Title:A Mutual Contamination Analysis of Mixed Membership and Partial Label Models

Authors:Julian Katz-Samuels, Clayton Scott

View PDF

Abstract:Many machine learning problems can be characterized by mutual contamination models. In these problems, one observes several random samples from different convex combinations of a set of unknown base distributions. It is of interest to decontaminate mutual contamination models, i.e., to recover the base distributions either exactly or up to a permutation. This paper considers the general setting where the base distributions are defined on arbitrary probability spaces. We examine the decontamination problem in two mutual contamination models that describe popular machine learning tasks: recovering the base distributions up to a permutation in a mixed membership model, and recovering the base distributions exactly in a partial label model for classification. We give necessary and sufficient conditions for identifiability of both mutual contamination models, algorithms for both problems in the infinite and finite sample cases, and introduce novel proof techniques based on affine geometry.

Subjects:	Machine Learning (stat.ML)
Cite as:	arXiv:1602.06235 [stat.ML]
	(or arXiv:1602.06235v1 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1602.06235

Submission history

From: Julian Katz-Samuels [view email]
[v1] Fri, 19 Feb 2016 17:40:58 UTC (117 KB)

Statistics > Machine Learning

Title:A Mutual Contamination Analysis of Mixed Membership and Partial Label Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:A Mutual Contamination Analysis of Mixed Membership and Partial Label Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators