Take a Fresh Look at Recommender Systems from an Evaluation Standpoint

Sun, Aixin

doi:10.1145/3539618.3591931

Computer Science > Information Retrieval

arXiv:2210.04149 (cs)

[Submitted on 9 Oct 2022 (v1), last revised 25 Apr 2023 (this version, v2)]

Title:Take a Fresh Look at Recommender Systems from an Evaluation Standpoint

Authors:Aixin Sun

View PDF

Abstract:Recommendation has become a prominent area of research in the field of Information Retrieval (IR). Evaluation is also a traditional research topic in this community. Motivated by a few counter-intuitive observations reported in recent studies, this perspectives paper takes a fresh look at recommender systems from an evaluation standpoint. Rather than examining metrics like recall, hit rate, or NDCG, or perspectives like novelty and diversity, the key focus here is on how these metrics are calculated when evaluating a recommender algorithm. Specifically, the commonly used train/test data splits and their consequences are re-examined. We begin by examining common data splitting methods, such as random split or leave-one-out, and discuss why the popularity baseline is poorly defined under such splits. We then move on to explore the two implications of neglecting a global timeline during evaluation: data leakage and oversimplification of user preference modeling. Afterwards, we present new perspectives on recommender systems, including techniques for evaluating algorithm performance that more accurately reflect real-world scenarios, and possible approaches to consider decision contexts in user preference modeling.

Comments:	Accepted for SIGIR 2023 (Perspectives Paper Track)
Subjects:	Information Retrieval (cs.IR)
Cite as:	arXiv:2210.04149 [cs.IR]
	(or arXiv:2210.04149v2 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.2210.04149
Journal reference:	SIGIR 2023
Related DOI:	https://doi.org/10.1145/3539618.3591931

Submission history

From: Aixin Sun [view email]
[v1] Sun, 9 Oct 2022 02:56:12 UTC (451 KB)
[v2] Tue, 25 Apr 2023 01:15:20 UTC (1,088 KB)

Computer Science > Information Retrieval

Title:Take a Fresh Look at Recommender Systems from an Evaluation Standpoint

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Information Retrieval

Title:Take a Fresh Look at Recommender Systems from an Evaluation Standpoint

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators