The predictions of performance metrics in information retrieval: An experimental study
Information retrieval systems are widely used by people from all walks of life to meet diverse user needs. Hence, the ability of these retrieval systems to return the relevant information in response to user queries has been a matter of concern to the information retrieval research community. To add...
Saved in:
Main Authors: | , , , |
---|---|
Format: | Article |
Published: |
Univ Malaya, Fac Computer Science & Information Tech
2021
|
Subjects: | |
Online Access: | http://eprints.um.edu.my/35384/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Malaya |
id |
my.um.eprints.35384 |
---|---|
record_format |
eprints |
spelling |
my.um.eprints.353842022-10-26T02:32:14Z http://eprints.um.edu.my/35384/ The predictions of performance metrics in information retrieval: An experimental study Muwanei, Sinyinda Ravana, Sri Devi Hoo, Wai Lam Kunda, Douglas QA75 Electronic computers. Computer science Information retrieval systems are widely used by people from all walks of life to meet diverse user needs. Hence, the ability of these retrieval systems to return the relevant information in response to user queries has been a matter of concern to the information retrieval research community. To address this concern, evaluations of these retrieval systems is extremely critical and the most popular way is the approach that employs test collections. This approach has been the popular evaluation approach in information retrieval for several decades. However, one of the limitations of this evaluation approach concerns the costly creation of relevance judgments. In recent research, this limitation was addressed by predicting performance metrics at the high cut-off depths of documents by using performance metrics computed at low cut-off depths. However, the challenge the research community is faced with is how to predict the precision and the non-cumulative gain performance metrics at the high cut-off depths of documents while using other performance metrics computed at the low cut-off depths of at most 30 documents. This study addresses this challenge by investigating the predictability of performance metrics and proposing two approaches that predict the precision and the non-cumulative discounted gain performance metrics. This study has shown that there exist dataset shifts in the performance metrics computed from different test collections. Furthermore, the proposed approaches have demonstrated better results of the ranked correlations of the predictions of performance metrics than existing research. Univ Malaya, Fac Computer Science & Information Tech 2021 Article PeerReviewed Muwanei, Sinyinda and Ravana, Sri Devi and Hoo, Wai Lam and Kunda, Douglas (2021) The predictions of performance metrics in information retrieval: An experimental study. Malaysian Journal of Computer Science (SI). pp. 35-54. ISSN 0127-9084, DOI https://doi.org/10.22452/mjcs.sp2021no2.3 <https://doi.org/10.22452/mjcs.sp2021no2.3>. 10.22452/mjcs.sp2021no2.3 |
institution |
Universiti Malaya |
building |
UM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Malaya |
content_source |
UM Research Repository |
url_provider |
http://eprints.um.edu.my/ |
topic |
QA75 Electronic computers. Computer science |
spellingShingle |
QA75 Electronic computers. Computer science Muwanei, Sinyinda Ravana, Sri Devi Hoo, Wai Lam Kunda, Douglas The predictions of performance metrics in information retrieval: An experimental study |
description |
Information retrieval systems are widely used by people from all walks of life to meet diverse user needs. Hence, the ability of these retrieval systems to return the relevant information in response to user queries has been a matter of concern to the information retrieval research community. To address this concern, evaluations of these retrieval systems is extremely critical and the most popular way is the approach that employs test collections. This approach has been the popular evaluation approach in information retrieval for several decades. However, one of the limitations of this evaluation approach concerns the costly creation of relevance judgments. In recent research, this limitation was addressed by predicting performance metrics at the high cut-off depths of documents by using performance metrics computed at low cut-off depths. However, the challenge the research community is faced with is how to predict the precision and the non-cumulative gain performance metrics at the high cut-off depths of documents while using other performance metrics computed at the low cut-off depths of at most 30 documents. This study addresses this challenge by investigating the predictability of performance metrics and proposing two approaches that predict the precision and the non-cumulative discounted gain performance metrics. This study has shown that there exist dataset shifts in the performance metrics computed from different test collections. Furthermore, the proposed approaches have demonstrated better results of the ranked correlations of the predictions of performance metrics than existing research. |
format |
Article |
author |
Muwanei, Sinyinda Ravana, Sri Devi Hoo, Wai Lam Kunda, Douglas |
author_facet |
Muwanei, Sinyinda Ravana, Sri Devi Hoo, Wai Lam Kunda, Douglas |
author_sort |
Muwanei, Sinyinda |
title |
The predictions of performance metrics in information retrieval: An experimental study |
title_short |
The predictions of performance metrics in information retrieval: An experimental study |
title_full |
The predictions of performance metrics in information retrieval: An experimental study |
title_fullStr |
The predictions of performance metrics in information retrieval: An experimental study |
title_full_unstemmed |
The predictions of performance metrics in information retrieval: An experimental study |
title_sort |
predictions of performance metrics in information retrieval: an experimental study |
publisher |
Univ Malaya, Fac Computer Science & Information Tech |
publishDate |
2021 |
url |
http://eprints.um.edu.my/35384/ |
_version_ |
1748181084425682944 |