Clique percolation for finding naturally cohesive and overlapping document clusters

Techniques for find document clusters mostly depend on models that impose strong explicit and/or implicit priori assumptions. As a consequence, the clustering effects tend to be unnatural and stray away from the intrinsic grouping natures of a document collection. We apply a novel graph-theoretic te...

Full description

Saved in:
Bibliographic Details
Main Authors: GAO, Wei, WONG, Kam-Fai, XIA, Yunqing, XU, Ruifeng
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2006
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/4602
https://ink.library.smu.edu.sg/context/sis_research/article/5605/viewcontent/Gao2006_Chapter_CliquePercolationMethodForFind.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-5605
record_format dspace
spelling sg-smu-ink.sis_research-56052019-12-26T07:42:26Z Clique percolation for finding naturally cohesive and overlapping document clusters GAO, Wei WONG, Kam-Fai XIA, Yunqing XU, Ruifeng Techniques for find document clusters mostly depend on models that impose strong explicit and/or implicit priori assumptions. As a consequence, the clustering effects tend to be unnatural and stray away from the intrinsic grouping natures of a document collection. We apply a novel graph-theoretic technique called Clique Percolation Method (CPM) for document clustering. In this method, a process of enumerating highly cohesive maximal document cliques is performed in a random graph, where those strongly adjacent cliques are mingled to form naturally overlapping clusters. Our clustering results can unveil the inherent structural connections of the underlying data. Experiments show that CPM can outperform some typical algorithms on benchmark data sets, and shed light on its advantages on natural document clustering. 2006-12-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/4602 info:doi/10.1007/11940098_10 https://ink.library.smu.edu.sg/context/sis_research/article/5605/viewcontent/Gao2006_Chapter_CliquePercolationMethodForFind.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Databases and Information Systems
spellingShingle Databases and Information Systems
GAO, Wei
WONG, Kam-Fai
XIA, Yunqing
XU, Ruifeng
Clique percolation for finding naturally cohesive and overlapping document clusters
description Techniques for find document clusters mostly depend on models that impose strong explicit and/or implicit priori assumptions. As a consequence, the clustering effects tend to be unnatural and stray away from the intrinsic grouping natures of a document collection. We apply a novel graph-theoretic technique called Clique Percolation Method (CPM) for document clustering. In this method, a process of enumerating highly cohesive maximal document cliques is performed in a random graph, where those strongly adjacent cliques are mingled to form naturally overlapping clusters. Our clustering results can unveil the inherent structural connections of the underlying data. Experiments show that CPM can outperform some typical algorithms on benchmark data sets, and shed light on its advantages on natural document clustering.
format text
author GAO, Wei
WONG, Kam-Fai
XIA, Yunqing
XU, Ruifeng
author_facet GAO, Wei
WONG, Kam-Fai
XIA, Yunqing
XU, Ruifeng
author_sort GAO, Wei
title Clique percolation for finding naturally cohesive and overlapping document clusters
title_short Clique percolation for finding naturally cohesive and overlapping document clusters
title_full Clique percolation for finding naturally cohesive and overlapping document clusters
title_fullStr Clique percolation for finding naturally cohesive and overlapping document clusters
title_full_unstemmed Clique percolation for finding naturally cohesive and overlapping document clusters
title_sort clique percolation for finding naturally cohesive and overlapping document clusters
publisher Institutional Knowledge at Singapore Management University
publishDate 2006
url https://ink.library.smu.edu.sg/sis_research/4602
https://ink.library.smu.edu.sg/context/sis_research/article/5605/viewcontent/Gao2006_Chapter_CliquePercolationMethodForFind.pdf
_version_ 1770574927222538240