A Graph-Theoretic Framework for Understanding Open-World Semi-Supervised Learning

Sun, Yiyou; Shi, Zhenmei; Li, Yixuan

arXiv:2311.03524 (cs)

[Submitted on 6 Nov 2023]

Abstract: Open-world semi-supervised learning aims at inferring both known and novel classes in unlabeled data, by harnessing prior knowledge from a labeled set with known classes. Despite its importance, there is a lack of theoretical foundations for this problem. This paper bridges the gap by formalizing a graph-theoretic framework tailored for the open-world setting, where the clustering can be theoretically characterized by graph factorization. Our graph-theoretic framework illuminates practical algorithms and provides guarantees. In particular, based on our graph formulation, we apply the algorithm called Spectral Open-world Representation Learning (SORL), and show that minimizing our loss is equivalent to performing spectral decomposition on the graph. Such equivalence allows us to derive a provable error bound on the clustering performance for both known and novel classes, and analyze rigorously when labeled data helps. Empirically, SORL can match or outperform several strong baselines on common benchmark datasets, which is appealing for practical usage while enjoying theoretical guarantees.
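As a rough illustration of the link between graph factorization and spectral decomposition mentioned in the abstract, the sketch below factorizes the normalized adjacency matrix of a toy graph using its top eigenvectors. It is not the SORL loss or the authors' implementation: the toy graph, the edge weights, and the names `spectral_embedding`, `groups`, and `labels` are assumptions made for this example. It relies only on the standard fact that the best rank-k symmetric factorization of a matrix with nonnegative top eigenvalues is given by its top-k eigenpairs.

```python
import numpy as np


def spectral_embedding(adj, k):
    """Return (F, A_norm), where F is n x k and F @ F.T is the best
    rank-k approximation of the normalized adjacency matrix A_norm.

    Hypothetical helper for illustration; not taken from the paper.
    """
    deg = adj.sum(axis=1)
    d_inv_sqrt = 1.0 / np.sqrt(deg)
    # Symmetric normalization: D^{-1/2} A D^{-1/2}
    a_norm = adj * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]
    # eigh returns eigenvalues in ascending order for symmetric matrices
    eigvals, eigvecs = np.linalg.eigh(a_norm)
    top = np.argsort(eigvals)[::-1][:k]
    # Clip tiny negative values caused by floating-point error
    lam = np.clip(eigvals[top], 0.0, None)
    return eigvecs[:, top] * np.sqrt(lam), a_norm


# Toy graph (assumed): two groups of three nodes, strong edges inside
# a group (weight 1.0, including self-loops), weak edges across (0.05).
groups = np.array([0, 0, 0, 1, 1, 1])
adj = np.where(groups[:, None] == groups[None, :], 1.0, 0.05)

emb, a_norm = spectral_embedding(adj, k=2)

# The sign of the second embedding coordinate separates the two groups.
# Which group gets label 0 or 1 is arbitrary (eigenvector sign ambiguity).
labels = (emb[:, 1] > 0).astype(int)
```

Because this toy matrix has rank two, `emb @ emb.T` reproduces `a_norm` up to floating-point error, and the cluster structure can be read directly off the embedding. Real data would generally need a clustering step such as k-means on the rows of the embedding.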

Comments:	Accepted to NeurIPS 2023 (Spotlight)
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2311.03524 [cs.LG]
	(or arXiv:2311.03524v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2311.03524

Submission history

From: Yiyou Sun
[v1] Mon, 6 Nov 2023 21:15:09 UTC (4,342 KB)
