Correlation and prediction of high-cost information retrieval evaluation metrics using deep learning

Introduction. To reduce cost of the evaluation of information retrieval systems, this study proposes a method that employs deep learning to predict the precision evaluation metric. It also aims to show why some of existing evaluation metrics correlate with each other while considering the varying di...

Full description

Saved in:

Bibliographic Details
Main Authors:	Muwanei, Sinyinda, Ravana, Sri Devi, Hoo, Wai Lam, Kunda, Douglas, Rajagopal, Prabha, Sodhi, Prabhpreet Singh
Format:	Article
Published:	Univ Sheffield Dept Information Studies 2022
Subjects:	QA Mathematics QA75 Electronic computers. Computer science QA76 Computer software
Online Access:	http://eprints.um.edu.my/42045/ https://doi.org/10.47989/irpaper920
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Universiti Malaya

id	my.um.eprints.42045
record_format	eprints
spelling	my.um.eprints.420452023-10-17T01:37:17Z http://eprints.um.edu.my/42045/ Correlation and prediction of high-cost information retrieval evaluation metrics using deep learning Muwanei, Sinyinda Ravana, Sri Devi Hoo, Wai Lam Kunda, Douglas Rajagopal, Prabha Sodhi, Prabhpreet Singh QA Mathematics QA75 Electronic computers. Computer science QA76 Computer software Introduction. To reduce cost of the evaluation of information retrieval systems, this study proposes a method that employs deep learning to predict the precision evaluation metric. It also aims to show why some of existing evaluation metrics correlate with each other while considering the varying distributions of relevance assessments. It aims to ensure reproducibility of all the presented experiments. Method. Using data from several test collections of the Text REetrieval Conference (TREC) we show why some evaluation metrics correlate with each other, through mathematical intuitions. In addition, regression models were used to investigate how the predictions of the evaluation metrics are affected by queries or topics with variations of relevance assessments. Lastly, the proposed prediction method employs deep learning. Analysis. We use coefficient of determination, Kendall's tau, Spearman and Pearson correlations. Results. This study showed that the proposed method performed better predictions than other recently proposed methods in retrieval research. It also showed why the correlation exists between precision and rank biased precision metrics, and why recall and average precision metrics have reduced correlation when the cut-off depth increases. Conclusions. The proposed method and the justifications for the correlations between some pairs of retrieval metrics will be valuable to researchers for the predictions of the evaluation metrics of information retrieval systems. Univ Sheffield Dept Information Studies 2022-03 Article PeerReviewed Muwanei, Sinyinda and Ravana, Sri Devi and Hoo, Wai Lam and Kunda, Douglas and Rajagopal, Prabha and Sodhi, Prabhpreet Singh (2022) Correlation and prediction of high-cost information retrieval evaluation metrics using deep learning. Information Research-An International Electronic Journal, 27 (1). ISSN 1368-1613, DOI https://doi.org/10.47989/irpaper920 <https://doi.org/10.47989/irpaper920>. https://doi.org/10.47989/irpaper920 10.47989/irpaper920
institution	Universiti Malaya
building	UM Library
collection	Institutional Repository
continent	Asia
country	Malaysia
content_provider	Universiti Malaya
content_source	UM Research Repository
url_provider	http://eprints.um.edu.my/
topic	QA Mathematics QA75 Electronic computers. Computer science QA76 Computer software
spellingShingle	QA Mathematics QA75 Electronic computers. Computer science QA76 Computer software Muwanei, Sinyinda Ravana, Sri Devi Hoo, Wai Lam Kunda, Douglas Rajagopal, Prabha Sodhi, Prabhpreet Singh Correlation and prediction of high-cost information retrieval evaluation metrics using deep learning
description	Introduction. To reduce cost of the evaluation of information retrieval systems, this study proposes a method that employs deep learning to predict the precision evaluation metric. It also aims to show why some of existing evaluation metrics correlate with each other while considering the varying distributions of relevance assessments. It aims to ensure reproducibility of all the presented experiments. Method. Using data from several test collections of the Text REetrieval Conference (TREC) we show why some evaluation metrics correlate with each other, through mathematical intuitions. In addition, regression models were used to investigate how the predictions of the evaluation metrics are affected by queries or topics with variations of relevance assessments. Lastly, the proposed prediction method employs deep learning. Analysis. We use coefficient of determination, Kendall's tau, Spearman and Pearson correlations. Results. This study showed that the proposed method performed better predictions than other recently proposed methods in retrieval research. It also showed why the correlation exists between precision and rank biased precision metrics, and why recall and average precision metrics have reduced correlation when the cut-off depth increases. Conclusions. The proposed method and the justifications for the correlations between some pairs of retrieval metrics will be valuable to researchers for the predictions of the evaluation metrics of information retrieval systems.
format	Article
author	Muwanei, Sinyinda Ravana, Sri Devi Hoo, Wai Lam Kunda, Douglas Rajagopal, Prabha Sodhi, Prabhpreet Singh
author_facet	Muwanei, Sinyinda Ravana, Sri Devi Hoo, Wai Lam Kunda, Douglas Rajagopal, Prabha Sodhi, Prabhpreet Singh
author_sort	Muwanei, Sinyinda
title	Correlation and prediction of high-cost information retrieval evaluation metrics using deep learning
title_short	Correlation and prediction of high-cost information retrieval evaluation metrics using deep learning
title_full	Correlation and prediction of high-cost information retrieval evaluation metrics using deep learning
title_fullStr	Correlation and prediction of high-cost information retrieval evaluation metrics using deep learning
title_full_unstemmed	Correlation and prediction of high-cost information retrieval evaluation metrics using deep learning
title_sort	correlation and prediction of high-cost information retrieval evaluation metrics using deep learning
publisher	Univ Sheffield Dept Information Studies
publishDate	2022
url	http://eprints.um.edu.my/42045/ https://doi.org/10.47989/irpaper920
_version_	1781704587882790912

Correlation and prediction of high-cost information retrieval evaluation metrics using deep learning

Similar Items