A re-visit of the popularity baseline in recommender systems

Popularity is often included in experimental evaluation to provide a reference performance for a recommendation task. To understand how popularity baseline is defined and evaluated, we sample 12 papers from top-tier conferences including KDD, WWW, SIGIR, and RecSys, and 6 open source toolkits. We no...

Full description

Saved in:
Bibliographic Details
Main Authors: Ji, Yitong, Sun, Aixin, Zhang, Jie, Li, Chenliang
Other Authors: School of Computer Science and Engineering
Format: Conference or Workshop Item
Language:English
Published: 2020
Subjects:
Online Access:https://hdl.handle.net/10356/144423
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-144423
record_format dspace
spelling sg-ntu-dr.10356-1444232020-11-05T01:11:07Z A re-visit of the popularity baseline in recommender systems Ji, Yitong Sun, Aixin Zhang, Jie Li, Chenliang School of Computer Science and Engineering 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’20) Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Recommender Systems Popularity Popularity is often included in experimental evaluation to provide a reference performance for a recommendation task. To understand how popularity baseline is defined and evaluated, we sample 12 papers from top-tier conferences including KDD, WWW, SIGIR, and RecSys, and 6 open source toolkits. We note that the widely adopted MostPop baseline simply ranks items based on the number of interactions in the training data.We argue that the current evaluation of popularity (i) does not reflect the popular items at the time when a user interacts with the system, and (ii) may recommend items released after a user’s last interaction with the system. On the widely used MovieLens dataset, we show that the performance of popularity could be significantly improved by 70% or more, if we consider the popular items at the time point when a user interacts with the system. We further show that, on MovieLens dataset, the users having lower tendencies on movies tend to follow the crowd and rate more popular movies. Movie lovers who rate a large number of movies, rate movies based on their own preferences and interests. Through this study, we call for a re-visit of the popularity baseline in recommender system to better reflect its effectiveness. Accepted version 2020-11-05T01:07:00Z 2020-11-05T01:07:00Z 2020 Conference Paper Ji, Y., Sun, A., Zhang, J., & Li, C. (2020). A re-visit of the popularity baseline in recommender systems. Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’20), 1749–1752. doi:10.1145/3397271.3401233 https://hdl.handle.net/10356/144423 10.1145/3397271.3401233 1749 1752 en © 2020 Association for Computing Machinery. All rights reserved. This paper was published in Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’20) and is made available with permission of Association for Computing Machinery. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Recommender Systems
Popularity
spellingShingle Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Recommender Systems
Popularity
Ji, Yitong
Sun, Aixin
Zhang, Jie
Li, Chenliang
A re-visit of the popularity baseline in recommender systems
description Popularity is often included in experimental evaluation to provide a reference performance for a recommendation task. To understand how popularity baseline is defined and evaluated, we sample 12 papers from top-tier conferences including KDD, WWW, SIGIR, and RecSys, and 6 open source toolkits. We note that the widely adopted MostPop baseline simply ranks items based on the number of interactions in the training data.We argue that the current evaluation of popularity (i) does not reflect the popular items at the time when a user interacts with the system, and (ii) may recommend items released after a user’s last interaction with the system. On the widely used MovieLens dataset, we show that the performance of popularity could be significantly improved by 70% or more, if we consider the popular items at the time point when a user interacts with the system. We further show that, on MovieLens dataset, the users having lower tendencies on movies tend to follow the crowd and rate more popular movies. Movie lovers who rate a large number of movies, rate movies based on their own preferences and interests. Through this study, we call for a re-visit of the popularity baseline in recommender system to better reflect its effectiveness.
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
Ji, Yitong
Sun, Aixin
Zhang, Jie
Li, Chenliang
format Conference or Workshop Item
author Ji, Yitong
Sun, Aixin
Zhang, Jie
Li, Chenliang
author_sort Ji, Yitong
title A re-visit of the popularity baseline in recommender systems
title_short A re-visit of the popularity baseline in recommender systems
title_full A re-visit of the popularity baseline in recommender systems
title_fullStr A re-visit of the popularity baseline in recommender systems
title_full_unstemmed A re-visit of the popularity baseline in recommender systems
title_sort re-visit of the popularity baseline in recommender systems
publishDate 2020
url https://hdl.handle.net/10356/144423
_version_ 1688665462743760896