Joint ranking for multilingual web search

Ranking for multilingual information retrieval (MLIR) is a task to rank documents of different languages solely based on their relevancy to the query regardless of query’s language. Existing approaches are focused on combining relevance scores of different retrieval settings, but do not learn the ra...

Full description

Saved in:

Bibliographic Details
Main Authors:	GAO, Wei, NIU, Cheng, ZHOU, Ming, WONG, Kam-Fai
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2009
Subjects:	Databases and Information Systems
Online Access:	https://ink.library.smu.edu.sg/sis_research/4599 https://ink.library.smu.edu.sg/context/sis_research/article/5602/viewcontent/Gao2009_Chapter_JointRankingForMultilingualWeb.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-5602
record_format	dspace
spelling	sg-smu-ink.sis_research-56022019-12-26T07:44:59Z Joint ranking for multilingual web search GAO, Wei NIU, Cheng ZHOU, Ming WONG, Kam-Fai Ranking for multilingual information retrieval (MLIR) is a task to rank documents of different languages solely based on their relevancy to the query regardless of query’s language. Existing approaches are focused on combining relevance scores of different retrieval settings, but do not learn the ranking function directly. We approach Web MLIR ranking within the learning-to-rank (L2R) framework. Besides adopting popular L2R algorithms to MLIR, a joint ranking model is created to exploit the correlations among documents, and induce the joint relevance probability for all the documents. Using this method, the relevant documents of one language can be leveraged to improve the relevance estimation for documents of different languages. A probabilistic graphical model is trained for the joint relevance estimation. Especially, a hidden layer of nodes is introduced to represent the salient topics among the retrieved documents, and the ranks of the relevant documents and topics are determined collaboratively while the model approaching to its thermal equilibrium. Furthermore, the model parameters are trained under two settings: (1) optimize the accuracy of identifying relevant documents; (2) directly optimize information retrieval evaluation measures, such as mean average precision. Benchmarks show that our model significantly outperforms the existing approaches for MLIR tasks. 2009-04-09T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/4599 info:doi/10.1007/978-3-642-00958-7_13 https://ink.library.smu.edu.sg/context/sis_research/article/5602/viewcontent/Gao2009_Chapter_JointRankingForMultilingualWeb.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	Databases and Information Systems
spellingShingle	Databases and Information Systems GAO, Wei NIU, Cheng ZHOU, Ming WONG, Kam-Fai Joint ranking for multilingual web search
description	Ranking for multilingual information retrieval (MLIR) is a task to rank documents of different languages solely based on their relevancy to the query regardless of query’s language. Existing approaches are focused on combining relevance scores of different retrieval settings, but do not learn the ranking function directly. We approach Web MLIR ranking within the learning-to-rank (L2R) framework. Besides adopting popular L2R algorithms to MLIR, a joint ranking model is created to exploit the correlations among documents, and induce the joint relevance probability for all the documents. Using this method, the relevant documents of one language can be leveraged to improve the relevance estimation for documents of different languages. A probabilistic graphical model is trained for the joint relevance estimation. Especially, a hidden layer of nodes is introduced to represent the salient topics among the retrieved documents, and the ranks of the relevant documents and topics are determined collaboratively while the model approaching to its thermal equilibrium. Furthermore, the model parameters are trained under two settings: (1) optimize the accuracy of identifying relevant documents; (2) directly optimize information retrieval evaluation measures, such as mean average precision. Benchmarks show that our model significantly outperforms the existing approaches for MLIR tasks.
format	text
author	GAO, Wei NIU, Cheng ZHOU, Ming WONG, Kam-Fai
author_facet	GAO, Wei NIU, Cheng ZHOU, Ming WONG, Kam-Fai
author_sort	GAO, Wei
title	Joint ranking for multilingual web search
title_short	Joint ranking for multilingual web search
title_full	Joint ranking for multilingual web search
title_fullStr	Joint ranking for multilingual web search
title_full_unstemmed	Joint ranking for multilingual web search
title_sort	joint ranking for multilingual web search
publisher	Institutional Knowledge at Singapore Management University
publishDate	2009
url	https://ink.library.smu.edu.sg/sis_research/4599 https://ink.library.smu.edu.sg/context/sis_research/article/5602/viewcontent/Gao2009_Chapter_JointRankingForMultilingualWeb.pdf
_version_	1770574926546206720

Joint ranking for multilingual web search

Similar Items