Malay documents clustering algorithm based on singular value decomposition.
Document categorization is a widely researched area of information retrieval. A research on Malay natural language processing has been done up to the level of retrieving documents but not to the extent of automatic semantic categorization. Thus, an approach for the clustering of Malay documents bas...
Saved in:
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English English |
Published: |
Asian Research Publishing Network (ARPN)
2009
|
Online Access: | http://psasir.upm.edu.my/id/eprint/15515/1/Malay%20documents%20clustering%20algorithm%20based%20on%20singular%20value%20decomposition.pdf http://psasir.upm.edu.my/id/eprint/15515/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Putra Malaysia |
Language: | English English |
id |
my.upm.eprints.15515 |
---|---|
record_format |
eprints |
spelling |
my.upm.eprints.155152015-11-24T06:40:03Z http://psasir.upm.edu.my/id/eprint/15515/ Malay documents clustering algorithm based on singular value decomposition. Ab Samat, Nordianah Azmi Murad, Masrah Azrifah Abdullah, Muhamad Taufik Atan, Rodziah Document categorization is a widely researched area of information retrieval. A research on Malay natural language processing has been done up to the level of retrieving documents but not to the extent of automatic semantic categorization. Thus, an approach for the clustering of Malay documents based on semantic relations between words is proposed in this paper. The method described in this paper uses Singular Value Decomposition (SVD) technique for the vector representation of each document where familiar clustering techniques can be applied in this space. The experimental results we obtained taking into account the semantics of the document that performed good document clustering by obtaining relevant subjects appearing in a cluster. Asian Research Publishing Network (ARPN) 2009 Article PeerReviewed application/pdf en http://psasir.upm.edu.my/id/eprint/15515/1/Malay%20documents%20clustering%20algorithm%20based%20on%20singular%20value%20decomposition.pdf Ab Samat, Nordianah and Azmi Murad, Masrah Azrifah and Abdullah, Muhamad Taufik and Atan, Rodziah (2009) Malay documents clustering algorithm based on singular value decomposition. Journal of Theoretical and Applied Information Technology, 8 (2). pp. 180-186. ISSN 1992-8645 English |
institution |
Universiti Putra Malaysia |
building |
UPM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Putra Malaysia |
content_source |
UPM Institutional Repository |
url_provider |
http://psasir.upm.edu.my/ |
language |
English English |
description |
Document categorization is a widely researched area of information retrieval. A research on Malay natural
language processing has been done up to the level of retrieving documents but not to the extent of automatic semantic categorization. Thus, an approach for the clustering of Malay documents based on semantic relations between words is proposed in this paper. The method described in this paper uses Singular Value Decomposition (SVD) technique for the vector representation of each document where familiar clustering techniques can be applied in this space. The experimental results we obtained taking into account the semantics of the document that performed good document clustering by obtaining relevant
subjects appearing in a cluster. |
format |
Article |
author |
Ab Samat, Nordianah Azmi Murad, Masrah Azrifah Abdullah, Muhamad Taufik Atan, Rodziah |
spellingShingle |
Ab Samat, Nordianah Azmi Murad, Masrah Azrifah Abdullah, Muhamad Taufik Atan, Rodziah Malay documents clustering algorithm based on singular value decomposition. |
author_facet |
Ab Samat, Nordianah Azmi Murad, Masrah Azrifah Abdullah, Muhamad Taufik Atan, Rodziah |
author_sort |
Ab Samat, Nordianah |
title |
Malay documents clustering algorithm based on singular value decomposition. |
title_short |
Malay documents clustering algorithm based on singular value decomposition. |
title_full |
Malay documents clustering algorithm based on singular value decomposition. |
title_fullStr |
Malay documents clustering algorithm based on singular value decomposition. |
title_full_unstemmed |
Malay documents clustering algorithm based on singular value decomposition. |
title_sort |
malay documents clustering algorithm based on singular value decomposition. |
publisher |
Asian Research Publishing Network (ARPN) |
publishDate |
2009 |
url |
http://psasir.upm.edu.my/id/eprint/15515/1/Malay%20documents%20clustering%20algorithm%20based%20on%20singular%20value%20decomposition.pdf http://psasir.upm.edu.my/id/eprint/15515/ |
_version_ |
1643825952634961920 |