High-dimensional multi-view dataset pre-processing and clustering
The objective of this project is two-fold: the first one is to perform pre-processing on high-dimensional multi-view datasets and to investigate potential applications of new data mining techniques with the new datasets. The second part of the project is to design and implement an existing...
Saved in:
主要作者: | |
---|---|
其他作者: | |
格式: | Final Year Project |
語言: | English |
出版: |
2012
|
主題: | |
在線閱讀: | http://hdl.handle.net/10356/49881 |
標簽: |
添加標簽
沒有標簽, 成為第一個標記此記錄!
|
機構: | Nanyang Technological University |
語言: | English |
總結: | The objective of this project is two-fold: the first one is to perform pre-processing on
high-dimensional multi-view datasets and to investigate potential applications of new
data mining techniques with the new datasets.
The second part of the project is to design and implement an existing clustering
algorithm and conduct extensive experimental study to test the performances of the
clustering algorithm on various text benchmark datasets. This report highlights the
implementation of the pre-processing approach for the data processing. It also includes
the design and implementation ideas of the clustering algorithm Semi-Supervised
Spherical K-Means and the simulation results. The performances of SS-SKM are
documented and reported. |
---|