QURAN VERSE SEARCH RELATED TO INDONESIAN DOCUMENT

The Quran is the main source of knowledge for Muslims in practicing their religion. But, research on Quran in Indonesia is still rare, mainly in the use of its Indonesian textual translation. Whereas Muslims in Indonesia have a need to look find correlations between daily activities and the Islamic...

Full description

Saved in:

Bibliographic Details
Main Author:	Ghifari Haznitrama, Faiz
Format:	Final Project
Language:	Indonesia
Online Access:	https://digilib.itb.ac.id/gdl/view/43425
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Institut Teknologi Bandung
Language:	Indonesia

id	id-itb.:43425
spelling	id-itb.:434252019-09-27T08:49:58ZQURAN VERSE SEARCH RELATED TO INDONESIAN DOCUMENT Ghifari Haznitrama, Faiz Indonesia Final Project Alquran, multilabel classification, information retrieval, Indonesian document INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/43425 The Quran is the main source of knowledge for Muslims in practicing their religion. But, research on Quran in Indonesia is still rare, mainly in the use of its Indonesian textual translation. Whereas Muslims in Indonesia have a need to look find correlations between daily activities and the Islamic values contained in the Quran. In this final project, the writer designed a system to search Quran verses that are related to Indonesian documents. There are at least two methods that can be used. First uses information retrieval, the second combines the categorization of text or keyword extraction with information retrieval. In this issue, these two methods are used, and their performance is compared. For the second method, text categorization is done using multilabel classification on Indonesian documents to get the corresponding Quran topics. Multilabel classification is done using the problem transformation approach utilizing the SVM algorithm, Random Forest, Decision Tree, and Naïve-Bayes. Feature extraction techniques used are bag of words and TF-IDF. Then the collection of Quran topic labels is used as a query for the information retrieval module to Quran verses. The information retrieval technique uses vector space model by utilizing the cosine similarity function. Based on experiments conducted, the first method with information retrieval alone gives the best performance with a micro-average precision of 23,557%. This proves that simplifying the text into a collection of keywords does not provide better performance. The use of a knowledge base that is less commonly used also affects the performance of multilabel classification, so that the results from information retrieval module is less appropriate to the query according to evaluators. text
institution	Institut Teknologi Bandung
building	Institut Teknologi Bandung Library
continent	Asia
country	Indonesia Indonesia
content_provider	Institut Teknologi Bandung
collection	Digital ITB
language	Indonesia
description	The Quran is the main source of knowledge for Muslims in practicing their religion. But, research on Quran in Indonesia is still rare, mainly in the use of its Indonesian textual translation. Whereas Muslims in Indonesia have a need to look find correlations between daily activities and the Islamic values contained in the Quran. In this final project, the writer designed a system to search Quran verses that are related to Indonesian documents. There are at least two methods that can be used. First uses information retrieval, the second combines the categorization of text or keyword extraction with information retrieval. In this issue, these two methods are used, and their performance is compared. For the second method, text categorization is done using multilabel classification on Indonesian documents to get the corresponding Quran topics. Multilabel classification is done using the problem transformation approach utilizing the SVM algorithm, Random Forest, Decision Tree, and Naïve-Bayes. Feature extraction techniques used are bag of words and TF-IDF. Then the collection of Quran topic labels is used as a query for the information retrieval module to Quran verses. The information retrieval technique uses vector space model by utilizing the cosine similarity function. Based on experiments conducted, the first method with information retrieval alone gives the best performance with a micro-average precision of 23,557%. This proves that simplifying the text into a collection of keywords does not provide better performance. The use of a knowledge base that is less commonly used also affects the performance of multilabel classification, so that the results from information retrieval module is less appropriate to the query according to evaluators.
format	Final Project
author	Ghifari Haznitrama, Faiz
spellingShingle	Ghifari Haznitrama, Faiz QURAN VERSE SEARCH RELATED TO INDONESIAN DOCUMENT
author_facet	Ghifari Haznitrama, Faiz
author_sort	Ghifari Haznitrama, Faiz
title	QURAN VERSE SEARCH RELATED TO INDONESIAN DOCUMENT
title_short	QURAN VERSE SEARCH RELATED TO INDONESIAN DOCUMENT
title_full	QURAN VERSE SEARCH RELATED TO INDONESIAN DOCUMENT
title_fullStr	QURAN VERSE SEARCH RELATED TO INDONESIAN DOCUMENT
title_full_unstemmed	QURAN VERSE SEARCH RELATED TO INDONESIAN DOCUMENT
title_sort	quran verse search related to indonesian document
url	https://digilib.itb.ac.id/gdl/view/43425
_version_	1822926574746337280

QURAN VERSE SEARCH RELATED TO INDONESIAN DOCUMENT

Similar Items