A re-visit of the popularity baseline in recommender systems
Popularity is often included in experimental evaluation to provide a reference performance for a recommendation task. To understand how popularity baseline is defined and evaluated, we sample 12 papers from top-tier conferences including KDD, WWW, SIGIR, and RecSys, and 6 open source toolkits. We no...
Saved in:
Main Authors: | , , , |
---|---|
Other Authors: | |
Format: | Conference or Workshop Item |
Language: | English |
Published: |
2020
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/144423 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-144423 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1444232020-11-05T01:11:07Z A re-visit of the popularity baseline in recommender systems Ji, Yitong Sun, Aixin Zhang, Jie Li, Chenliang School of Computer Science and Engineering 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’20) Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Recommender Systems Popularity Popularity is often included in experimental evaluation to provide a reference performance for a recommendation task. To understand how popularity baseline is defined and evaluated, we sample 12 papers from top-tier conferences including KDD, WWW, SIGIR, and RecSys, and 6 open source toolkits. We note that the widely adopted MostPop baseline simply ranks items based on the number of interactions in the training data.We argue that the current evaluation of popularity (i) does not reflect the popular items at the time when a user interacts with the system, and (ii) may recommend items released after a user’s last interaction with the system. On the widely used MovieLens dataset, we show that the performance of popularity could be significantly improved by 70% or more, if we consider the popular items at the time point when a user interacts with the system. We further show that, on MovieLens dataset, the users having lower tendencies on movies tend to follow the crowd and rate more popular movies. Movie lovers who rate a large number of movies, rate movies based on their own preferences and interests. Through this study, we call for a re-visit of the popularity baseline in recommender system to better reflect its effectiveness. Accepted version 2020-11-05T01:07:00Z 2020-11-05T01:07:00Z 2020 Conference Paper Ji, Y., Sun, A., Zhang, J., & Li, C. (2020). A re-visit of the popularity baseline in recommender systems. Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’20), 1749–1752. doi:10.1145/3397271.3401233 https://hdl.handle.net/10356/144423 10.1145/3397271.3401233 1749 1752 en © 2020 Association for Computing Machinery. All rights reserved. This paper was published in Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’20) and is made available with permission of Association for Computing Machinery. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Recommender Systems Popularity |
spellingShingle |
Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Recommender Systems Popularity Ji, Yitong Sun, Aixin Zhang, Jie Li, Chenliang A re-visit of the popularity baseline in recommender systems |
description |
Popularity is often included in experimental evaluation to provide a reference performance for a recommendation task. To understand how popularity baseline is defined and evaluated, we sample 12 papers from top-tier conferences including KDD, WWW, SIGIR, and RecSys, and 6 open source toolkits. We note that the widely adopted MostPop baseline simply ranks items based on the number of interactions in the training data.We argue that the current evaluation of popularity (i) does not reflect the popular items at the time when a user interacts with the system, and (ii) may recommend items released after a user’s last interaction with the system. On the widely used MovieLens dataset, we show that the performance of popularity could be significantly improved by 70% or more, if we consider the popular items at the time point when a user interacts with the system. We further show that, on MovieLens dataset, the users having lower tendencies on movies tend to follow the crowd and rate more popular movies. Movie lovers who rate a large number of movies, rate movies based on their own preferences and interests. Through this study, we call for a re-visit of the popularity baseline in recommender system to better reflect its effectiveness. |
author2 |
School of Computer Science and Engineering |
author_facet |
School of Computer Science and Engineering Ji, Yitong Sun, Aixin Zhang, Jie Li, Chenliang |
format |
Conference or Workshop Item |
author |
Ji, Yitong Sun, Aixin Zhang, Jie Li, Chenliang |
author_sort |
Ji, Yitong |
title |
A re-visit of the popularity baseline in recommender systems |
title_short |
A re-visit of the popularity baseline in recommender systems |
title_full |
A re-visit of the popularity baseline in recommender systems |
title_fullStr |
A re-visit of the popularity baseline in recommender systems |
title_full_unstemmed |
A re-visit of the popularity baseline in recommender systems |
title_sort |
re-visit of the popularity baseline in recommender systems |
publishDate |
2020 |
url |
https://hdl.handle.net/10356/144423 |
_version_ |
1688665462743760896 |