Issues in evaluating the retrieval performance of multiscript translation of Al-Quran

The main aim of this paper is to present the issues in evaluating the retrieval performance of multi-script indexing of translated texts of al-Quran. Translations of al-Quran have played a major role in the recitation of al-Quran in its original text and in understanding through the translated w...

Bibliographic Details
Main Authors: Othman, Roslina; Abdul Wahid, Fauziah
Format: Conference or Workshop Item
Language: English
Published: 2011
Subjects: Z665 Library Science. Information Science
Online Access: http://irep.iium.edu.my/7481/1/Issues_of_ret_perf.pdf
http://irep.iium.edu.my/7481/
http://kict.iium.edu.my/wcomlis2011/index.php?option=com_content&view=article&id=24&Itemid=471
Institution: Universiti Islam Antarabangsa Malaysia
id my.iium.irep.7481
record_format dspace
spelling my.iium.irep.7481 2011-11-21T13:06:38Z http://irep.iium.edu.my/7481/ Issues in evaluating the retrieval performance of multiscript translation of Al-Quran Othman, Roslina Abdul Wahid, Fauziah Z665 Library Science. Information Science The main aim of this paper is to present the issues in evaluating the retrieval performance of multi-script indexing of translated texts of al-Quran. Translations of al-Quran have played a major role among the public, both in the recitation of al-Quran in its original text and in understanding it through the translated words. Even in querying, non-Arabic speakers find the texts through the translated words in addition to topical search. In Cross-Language Information Retrieval research, transliteration is normally needed when terminology is absent, whereas in this research the transliterated version was meant for those able to read the older script in its own original translation. The Malay Roman script has its own version of the translation. The objectives are to examine the reported retrieval performance of these texts and to evaluate the retrieval performance of the translations available in two different scripts of one language, Malay Rumi and Malay Jawi, built upon the Pimpinan ar-Rahman version, Indri and Jawi software. The measures are recall, precision and overlap. Recall describes performance in retrieving all relevant items, while precision describes performance in rejecting non-relevant items. Overlap shows the retrieval of items common to both sub-collections. Queries are constructed from questions posed by newspaper readers in both scripts, resulting in keywords with semantics, while relevance judgments are made by a panel of experts based on answers to the questions. Findings based on recall, precision and overlap revealed major issues in standardized texts, translation and transliteration, text alignment, query construction, and question-answering relevance vs. topical relevance. Indri's performance is not a major issue, while the Jawi software requires minor improvement. This paper contributes to the issues of handling test collections involving parallel corpora in the area of Cross-Language IR facing the Muslim world. 2011-11-16 Conference or Workshop Item REM application/pdf en http://irep.iium.edu.my/7481/1/Issues_of_ret_perf.pdf Othman, Roslina and Abdul Wahid, Fauziah (2011) Issues in evaluating the retrieval performance of multiscript translation of Al-Quran. In: 6th World Congress of Muslim Librarians and Information Scientists 2011 (WCOMLIS 2011), 16 - 17 November 2011, IIUM. (Unpublished) http://kict.iium.edu.my/wcomlis2011/index.php?option=com_content&view=article&id=24&Itemid=471
institution Universiti Islam Antarabangsa Malaysia
building IIUM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider International Islamic University Malaysia
content_source IIUM Repository (IREP)
url_provider http://irep.iium.edu.my/
language English
topic Z665 Library Science. Information Science
format Conference or Workshop Item
author Othman, Roslina
Abdul Wahid, Fauziah
author_facet Othman, Roslina
Abdul Wahid, Fauziah
author_sort Othman, Roslina
title Issues in evaluating the retrieval performance of multiscript translation of Al-Quran
title_sort issues in evaluating the retrieval performance of multiscript translation of al-quran
publishDate 2011
url http://irep.iium.edu.my/7481/1/Issues_of_ret_perf.pdf
http://irep.iium.edu.my/7481/
http://kict.iium.edu.my/wcomlis2011/index.php?option=com_content&view=article&id=24&Itemid=471
_version_ 1643605950188224512
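
The abstract names three measures: recall, precision and overlap. The following is a minimal sketch of how they could be computed for one query, not the authors' implementation. The function names, the verse identifiers and the relevance set are all hypothetical. The record does not state a formula for overlap, so the sketch assumes the share of items common to both result sets out of all items retrieved by either one.

```python
# Illustrative only: the names and the sample data below are assumptions,
# not taken from the paper.

def precision(retrieved, relevant):
    """Fraction of retrieved items that are relevant."""
    retrieved, relevant = set(retrieved), set(relevant)
    if not retrieved:
        return 0.0
    return len(retrieved & relevant) / len(retrieved)


def recall(retrieved, relevant):
    """Fraction of relevant items that were retrieved."""
    retrieved, relevant = set(retrieved), set(relevant)
    if not relevant:
        return 0.0
    return len(retrieved & relevant) / len(relevant)


def overlap(retrieved_a, retrieved_b):
    """Assumed definition: common items divided by all items retrieved by either run."""
    a, b = set(retrieved_a), set(retrieved_b)
    union = a | b
    if not union:
        return 0.0
    return len(a & b) / len(union)


# Hypothetical expert judgments and results for one query, as verse identifiers.
relevant = {"2:183", "2:184", "2:185", "2:187"}
rumi_hits = ["2:183", "2:184", "2:196", "2:185"]  # Malay Rumi sub-collection
jawi_hits = ["2:183", "2:185", "2:187"]           # Malay Jawi sub-collection

print("Rumi precision:", precision(rumi_hits, relevant))
print("Rumi recall:", recall(rumi_hits, relevant))
print("Jawi precision:", precision(jawi_hits, relevant))
print("Jawi recall:", recall(jawi_hits, relevant))
print("Overlap:", overlap(rumi_hits, jawi_hits))
```

Using sets keeps the sketch independent of ranking. An evaluation over ranked output would need rank-aware measures instead.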