Malay documents clustering algorithm based on singular value decomposition.

Document categorization is a widely researched area of information retrieval. Research on Malay natural language processing has reached the level of document retrieval but not automatic semantic categorization. Thus, an approach for the clustering of Malay documents bas...
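
The approach named in the title, SVD-based document vectors followed by clustering, can be illustrated with a minimal sketch. This is not the authors' implementation: the record does not state their preprocessing, term weighting, chosen rank, or clustering algorithm. The toy Malay sentences, the raw-count term-document matrix, rank 2, and the small k-means routine with farthest-point initialisation are all assumptions made for illustration.

```python
import numpy as np

# Hypothetical toy corpus (not from the paper): two economy-like and
# two sports-like Malay sentences.
docs = [
    "harga minyak naik pasaran saham",
    "pasaran saham jatuh harga minyak",
    "pasukan bola menang perlawanan liga",
    "perlawanan liga bola pasukan kalah",
]

tokens = [d.split() for d in docs]
vocab = sorted({t for doc in tokens for t in doc})
index = {t: i for i, t in enumerate(vocab)}

# Term-document matrix of raw counts (assumed weighting; rows are terms).
A = np.zeros((len(vocab), len(docs)))
for j, doc in enumerate(tokens):
    for t in doc:
        A[index[t], j] += 1.0

# Singular value decomposition A = U * diag(s) * Vt.
U, s, Vt = np.linalg.svd(A, full_matrices=False)

# Keep the top `rank` singular directions (rank is an assumed parameter).
rank = 2
doc_vectors = (s[:rank, None] * Vt[:rank]).T  # one row per document
doc_vectors = doc_vectors / np.linalg.norm(doc_vectors, axis=1, keepdims=True)


def kmeans(points, k, iters=20):
    """Plain k-means with deterministic farthest-point initialisation."""
    centroids = [points[0]]
    for _ in range(1, k):
        # Distance of every point to its nearest already-chosen centroid.
        d = np.min(
            [np.linalg.norm(points - c, axis=1) for c in centroids], axis=0
        )
        centroids.append(points[np.argmax(d)])
    centroids = np.array(centroids)
    for _ in range(iters):
        dists = np.linalg.norm(
            points[:, None, :] - centroids[None, :, :], axis=2
        )
        labels = np.argmin(dists, axis=1)
        updated = np.array([
            points[labels == j].mean(axis=0) if np.any(labels == j)
            else centroids[j]
            for j in range(k)
        ])
        if np.allclose(updated, centroids):
            break
        centroids = updated
    return labels


labels = kmeans(doc_vectors, k=2)
print(labels)
```

In this toy setting the two sentences on each topic share most of their vocabulary, so they land on the same point in the reduced space and fall into the same cluster. A real corpus would need stemming, stop-word removal, and a weighting scheme, none of which are specified in this record.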

Bibliographic Details
Main Authors: Ab Samat, Nordianah; Azmi Murad, Masrah Azrifah; Abdullah, Muhamad Taufik; Atan, Rodziah
Format: Article
Language: English
Published: Asian Research Publishing Network (ARPN) 2009
Online Access: http://psasir.upm.edu.my/id/eprint/15515/1/Malay%20documents%20clustering%20algorithm%20based%20on%20singular%20value%20decomposition.pdf
http://psasir.upm.edu.my/id/eprint/15515/
Institution: Universiti Putra Malaysia
id my.upm.eprints.15515
record_format eprints
spelling my.upm.eprints.15515 2015-11-24T06:40:03Z http://psasir.upm.edu.my/id/eprint/15515/ Ab Samat, Nordianah and Azmi Murad, Masrah Azrifah and Abdullah, Muhamad Taufik and Atan, Rodziah (2009) Malay documents clustering algorithm based on singular value decomposition. Journal of Theoretical and Applied Information Technology, 8 (2). pp. 180-186. ISSN 1992-8645. Article, PeerReviewed, application/pdf, en.
institution Universiti Putra Malaysia
building UPM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Putra Malaysia
content_source UPM Institutional Repository
url_provider http://psasir.upm.edu.my/
language English
description Document categorization is a widely researched area of information retrieval. Research on Malay natural language processing has reached the level of document retrieval but not automatic semantic categorization. Thus, this paper proposes an approach for clustering Malay documents based on semantic relations between words. The method uses Singular Value Decomposition (SVD) to build a vector representation of each document, so that familiar clustering techniques can be applied in the resulting space. The experimental results show that taking the semantics of the documents into account produced good document clustering, with relevant subjects appearing together in a cluster.
format Article
author Ab Samat, Nordianah
Azmi Murad, Masrah Azrifah
Abdullah, Muhamad Taufik
Atan, Rodziah
title Malay documents clustering algorithm based on singular value decomposition.
publisher Asian Research Publishing Network (ARPN)
publishDate 2009
url http://psasir.upm.edu.my/id/eprint/15515/1/Malay%20documents%20clustering%20algorithm%20based%20on%20singular%20value%20decomposition.pdf
http://psasir.upm.edu.my/id/eprint/15515/