Doubly Stochastic Adaptive Neighbors Clustering via the Marcus Mapping

Yuan, Jinghui; Zeng, Chusheng; Xie, Fangyuan; Cao, Zhe; Chen, Mulin; Wang, Rong; Nie, Feiping; Yuan, Yuan

arXiv:2408.02932 (cs)

[Submitted on 6 Aug 2024 (v1), last revised 12 Aug 2024 (this version, v2)]

Abstract: Clustering is a fundamental task in machine learning and data science, and similarity-graph-based clustering is an important approach within this domain. Doubly stochastic symmetric similarity graphs provide numerous benefits for clustering problems and downstream tasks, yet learning such graphs remains a significant challenge. The Marcus theorem states that a strictly positive symmetric matrix can be transformed into a doubly stochastic symmetric matrix by diagonal matrices. In clustering, however, learning sparse matrices is crucial for computational efficiency. We extend the Marcus theorem by proposing the Marcus mapping, which shows that certain sparse matrices can also be transformed into doubly stochastic symmetric matrices via diagonal matrices. Additionally, we introduce rank constraints into the clustering problem and propose the Doubly Stochastic Adaptive Neighbors Clustering algorithm based on the Marcus Mapping (ANCMM). The rank constraints ensure that the learned graph naturally divides into the desired number of clusters. We validate the effectiveness of our algorithm through extensive comparisons with state-of-the-art algorithms. Finally, we explore the relationship between the Marcus mapping and optimal transport. We prove that the Marcus mapping solves a specific type of optimal transport problem and demonstrate that solving this problem through the Marcus mapping is more efficient than directly applying optimal transport methods.
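
As an illustration of the classical scaling statement the abstract cites (not the paper's Marcus mapping or the ANCMM algorithm), the following is a minimal sketch that searches for a positive vector d such that diag(d) A diag(d) is doubly stochastic for a small strictly positive symmetric matrix A. The function names `symmetric_scaling` and `scale`, the example matrix, and the damped fixed-point update are all assumptions chosen for illustration; the update is a common symmetric variant of Sinkhorn-type scaling and is not taken from the abstract.

```python
import math


def symmetric_scaling(A, tol=1e-12, max_iter=10_000):
    """Return d > 0 such that diag(d) A diag(d) has unit row sums.

    Assumes A is a symmetric list-of-lists with strictly positive entries.
    """
    n = len(A)
    d = [1.0] * n
    for _ in range(max_iter):
        Ad = [sum(A[i][j] * d[j] for j in range(n)) for i in range(n)]
        # Row sums of diag(d) A diag(d) are d_i * (A d)_i; stop once all are ~1.
        if max(abs(d[i] * Ad[i] - 1.0) for i in range(n)) < tol:
            break
        # Damped fixed-point step: geometric mean of d_i and 1 / (A d)_i.
        d = [math.sqrt(d[i] / Ad[i]) for i in range(n)]
    return d


def scale(A, d):
    """Form diag(d) A diag(d) entry by entry."""
    n = len(A)
    return [[d[i] * A[i][j] * d[j] for j in range(n)] for i in range(n)]


# Hypothetical example: a small strictly positive symmetric matrix.
A = [[2.0, 1.0, 0.5],
     [1.0, 3.0, 1.5],
     [0.5, 1.5, 1.0]]

d = symmetric_scaling(A)
S = scale(A, d)
row_sums = [sum(row) for row in S]
print(row_sums)
```

Because the same diagonal factor is applied on both sides, the scaled matrix stays symmetric, so unit row sums imply unit column sums as well. The sparse case treated in the paper needs conditions that this sketch does not check.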

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2408.02932 [cs.LG]
	(or arXiv:2408.02932v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2408.02932

Submission history

From: Fangyuan Xie
[v1] Tue, 6 Aug 2024 03:34:43 UTC (1,631 KB)
[v2] Mon, 12 Aug 2024 09:48:45 UTC (2,669 KB)
