Relation and fuzzy clustering for document categorization and analysis

This thesis focuses on the investigations of using fuzzy clustering for automatic document categorization based on relations between document and other types of objects. Three approaches called Fk-Parts, LinkFCM and FC-MR are proposed to handle the document clustering problem under different scenari...

Full description

Saved in:
Bibliographic Details
Main Author: Mei, Jian-Ping
Other Authors: Chen Lihui
Format: Theses and Dissertations
Language:English
Published: 2012
Subjects:
Online Access:https://hdl.handle.net/10356/48627
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-48627
record_format dspace
spelling sg-ntu-dr.10356-486272023-07-04T16:12:57Z Relation and fuzzy clustering for document categorization and analysis Mei, Jian-Ping Chen Lihui School of Electrical and Electronic Engineering DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition DRNTU::Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence This thesis focuses on the investigations of using fuzzy clustering for automatic document categorization based on relations between document and other types of objects. Three approaches called Fk-Parts, LinkFCM and FC-MR are proposed to handle the document clustering problem under different scenarios. We start with a basic situation, and propose Fk-Parts to cluster documents based on document-document relation. The new mechanism of using multiple weighted medoids to represent each cluster makes Fk-Parts perform better than single medoid based approaches. After that, we consider situations where both vector representation of documents and document-document relation are available. LinkFCM is then formulated by incorporating relation into the well known fuzzy c-means approach, so that both types of data are considered in clustering. Finally we propose a fuzzy approach of multi-type relational data clustering FC-MR. This approach simultaneously clusters documents and other types of objects based on the relations among them. DOCTOR OF PHILOSOPHY (EEE) 2012-04-27T07:43:56Z 2012-04-27T07:43:56Z 2012 2012 Thesis Mei, J.-P. (2012). Relation and fuzzy clustering for document categorization and analysis. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/48627 10.32657/10356/48627 en 191 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition
DRNTU::Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
spellingShingle DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition
DRNTU::Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Mei, Jian-Ping
Relation and fuzzy clustering for document categorization and analysis
description This thesis focuses on the investigations of using fuzzy clustering for automatic document categorization based on relations between document and other types of objects. Three approaches called Fk-Parts, LinkFCM and FC-MR are proposed to handle the document clustering problem under different scenarios. We start with a basic situation, and propose Fk-Parts to cluster documents based on document-document relation. The new mechanism of using multiple weighted medoids to represent each cluster makes Fk-Parts perform better than single medoid based approaches. After that, we consider situations where both vector representation of documents and document-document relation are available. LinkFCM is then formulated by incorporating relation into the well known fuzzy c-means approach, so that both types of data are considered in clustering. Finally we propose a fuzzy approach of multi-type relational data clustering FC-MR. This approach simultaneously clusters documents and other types of objects based on the relations among them.
author2 Chen Lihui
author_facet Chen Lihui
Mei, Jian-Ping
format Theses and Dissertations
author Mei, Jian-Ping
author_sort Mei, Jian-Ping
title Relation and fuzzy clustering for document categorization and analysis
title_short Relation and fuzzy clustering for document categorization and analysis
title_full Relation and fuzzy clustering for document categorization and analysis
title_fullStr Relation and fuzzy clustering for document categorization and analysis
title_full_unstemmed Relation and fuzzy clustering for document categorization and analysis
title_sort relation and fuzzy clustering for document categorization and analysis
publishDate 2012
url https://hdl.handle.net/10356/48627
_version_ 1772825639123943424