Clique percolation for finding naturally cohesive and overlapping document clusters
Techniques for find document clusters mostly depend on models that impose strong explicit and/or implicit priori assumptions. As a consequence, the clustering effects tend to be unnatural and stray away from the intrinsic grouping natures of a document collection. We apply a novel graph-theoretic te...
Saved in:
Main Authors: | , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2006
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/4602 https://ink.library.smu.edu.sg/context/sis_research/article/5605/viewcontent/Gao2006_Chapter_CliquePercolationMethodForFind.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-5605 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-56052019-12-26T07:42:26Z Clique percolation for finding naturally cohesive and overlapping document clusters GAO, Wei WONG, Kam-Fai XIA, Yunqing XU, Ruifeng Techniques for find document clusters mostly depend on models that impose strong explicit and/or implicit priori assumptions. As a consequence, the clustering effects tend to be unnatural and stray away from the intrinsic grouping natures of a document collection. We apply a novel graph-theoretic technique called Clique Percolation Method (CPM) for document clustering. In this method, a process of enumerating highly cohesive maximal document cliques is performed in a random graph, where those strongly adjacent cliques are mingled to form naturally overlapping clusters. Our clustering results can unveil the inherent structural connections of the underlying data. Experiments show that CPM can outperform some typical algorithms on benchmark data sets, and shed light on its advantages on natural document clustering. 2006-12-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/4602 info:doi/10.1007/11940098_10 https://ink.library.smu.edu.sg/context/sis_research/article/5605/viewcontent/Gao2006_Chapter_CliquePercolationMethodForFind.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Databases and Information Systems |
spellingShingle |
Databases and Information Systems GAO, Wei WONG, Kam-Fai XIA, Yunqing XU, Ruifeng Clique percolation for finding naturally cohesive and overlapping document clusters |
description |
Techniques for find document clusters mostly depend on models that impose strong explicit and/or implicit priori assumptions. As a consequence, the clustering effects tend to be unnatural and stray away from the intrinsic grouping natures of a document collection. We apply a novel graph-theoretic technique called Clique Percolation Method (CPM) for document clustering. In this method, a process of enumerating highly cohesive maximal document cliques is performed in a random graph, where those strongly adjacent cliques are mingled to form naturally overlapping clusters. Our clustering results can unveil the inherent structural connections of the underlying data. Experiments show that CPM can outperform some typical algorithms on benchmark data sets, and shed light on its advantages on natural document clustering. |
format |
text |
author |
GAO, Wei WONG, Kam-Fai XIA, Yunqing XU, Ruifeng |
author_facet |
GAO, Wei WONG, Kam-Fai XIA, Yunqing XU, Ruifeng |
author_sort |
GAO, Wei |
title |
Clique percolation for finding naturally cohesive and overlapping document clusters |
title_short |
Clique percolation for finding naturally cohesive and overlapping document clusters |
title_full |
Clique percolation for finding naturally cohesive and overlapping document clusters |
title_fullStr |
Clique percolation for finding naturally cohesive and overlapping document clusters |
title_full_unstemmed |
Clique percolation for finding naturally cohesive and overlapping document clusters |
title_sort |
clique percolation for finding naturally cohesive and overlapping document clusters |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2006 |
url |
https://ink.library.smu.edu.sg/sis_research/4602 https://ink.library.smu.edu.sg/context/sis_research/article/5605/viewcontent/Gao2006_Chapter_CliquePercolationMethodForFind.pdf |
_version_ |
1770574927222538240 |