Inconsistency of evaluation metrics in link prediction

Bi, Yilin; Jiao, Xinshan; Lee, Yan-Li; Zhou, Tao

Computer Science > Social and Information Networks

arXiv:2402.08893 (cs)

[Submitted on 14 Feb 2024 (v1), last revised 24 Feb 2024 (this version, v2)]

Title:Inconsistency of evaluation metrics in link prediction

Authors:Yilin Bi, Xinshan Jiao, Yan-Li Lee, Tao Zhou

View PDF HTML (experimental)

Abstract:Link prediction is a paradigmatic and challenging problem in network science, which aims to predict missing links, future links and temporal links based on known topology. Along with the increasing number of link prediction algorithms, a critical yet previously ignored risk is that the evaluation metrics for algorithm performance are usually chosen at will. This paper implements extensive experiments on hundreds of real networks and 25 well-known algorithms, revealing significant inconsistency among evaluation metrics, namely different metrics probably produce remarkably different rankings of algorithms. Therefore, we conclude that any single metric cannot comprehensively or credibly evaluate algorithm performance. Further analysis suggests the usage of at least two metrics: one is the area under the receiver operating characteristic curve (AUC), and the other is one of the following three candidates, say the area under the precision-recall curve (AUPR), the area under the precision curve (AUC-Precision), and the normalized discounted cumulative gain (NDCG). In addition, as we have proved the essential equivalence of threshold-dependent metrics, if in a link prediction task, some specific thresholds are meaningful, we can consider any one threshold-dependent metric with those thresholds. This work completes a missing part in the landscape of link prediction, and provides a starting point toward a well-accepted criterion or standard to select proper evaluation metrics for link prediction.

Comments:	20 pages, 9 figures
Subjects:	Social and Information Networks (cs.SI); Physics and Society (physics.soc-ph)
Cite as:	arXiv:2402.08893 [cs.SI]
	(or arXiv:2402.08893v2 [cs.SI] for this version)
	https://doi.org/10.48550/arXiv.2402.08893

Submission history

From: Yilin Bi [view email]
[v1] Wed, 14 Feb 2024 02:00:28 UTC (5,662 KB)
[v2] Sat, 24 Feb 2024 15:04:30 UTC (5,663 KB)

Computer Science > Social and Information Networks

Title:Inconsistency of evaluation metrics in link prediction

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Social and Information Networks

Title:Inconsistency of evaluation metrics in link prediction

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators