Unsupervised data clustering for energy efficiency monitoring and analysis
For the first half of the Final Year Project, the main research focus is to improve K-Means clustering, to make the K-Means algorithms stable, efficient and auto-determine number of K. Under this research, I build a program that combines density-based clustering techniques with K-Means clustering, e...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
2015
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/62693 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-62693 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-626932023-03-03T20:30:18Z Unsupervised data clustering for energy efficiency monitoring and analysis Fu, Rong Li Xiang Ng Wee Keong School of Computer Engineering A*STAR SIMTech DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition For the first half of the Final Year Project, the main research focus is to improve K-Means clustering, to make the K-Means algorithms stable, efficient and auto-determine number of K. Under this research, I build a program that combines density-based clustering techniques with K-Means clustering, enabling stable selection of K initial centroids. The drawback of this algorithm for stable selection is the time complexity O(n2). In addition, 2 ways of auto-determining number of clusters K are summarized in the Literature Review. Possible improvement for the second approach is also stated, but its validation and application could be investigated in the future research. For the second half of the Final Year Project, my main research focus is to explore the possibility of using data clustering techniques, especially K-Means clustering for energy efficiency monitoring and analysis. Under this research, two different approaches (Whole Batch Feature Extraction & Window Feature Extraction) are investigated. In addition, I build a system that could select features for K-Means, input training/testing split percentage, output training & testing accuracies, and save excel file of cluster results. Currently, the system uses K-Means to learn energy consumption patterns offline, and build models for each energy consumption pattern. Each model is actually a cluster with a cluster center. In the future, the models from the offline training could be used to classify online streaming data and identify their consumption pattern classes once one window data is ready. Our proposed method gives models with training and testing accuracies up to 78%, and reveals some interesting discoveries, relevant to our case study data. Bachelor of Engineering (Computer Science) 2015-04-27T07:18:52Z 2015-04-27T07:18:52Z 2015 2015 Final Year Project (FYP) http://hdl.handle.net/10356/62693 en Nanyang Technological University 72 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition |
spellingShingle |
DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition Fu, Rong Unsupervised data clustering for energy efficiency monitoring and analysis |
description |
For the first half of the Final Year Project, the main research focus is to improve K-Means clustering, to make the K-Means algorithms stable, efficient and auto-determine number of K. Under this research, I build a program that combines density-based clustering techniques with K-Means clustering, enabling stable selection of K initial centroids. The drawback of this algorithm for stable selection is the time complexity O(n2). In addition, 2 ways of auto-determining number of clusters K are summarized in the Literature Review. Possible improvement for the second approach is also stated, but its validation and application could be investigated in the future research. For the second half of the Final Year Project, my main research focus is to explore the possibility of using data clustering techniques, especially K-Means clustering for energy efficiency monitoring and analysis. Under this research, two different approaches (Whole Batch Feature Extraction & Window Feature Extraction) are investigated. In addition, I build a system that could select features for K-Means, input training/testing split percentage, output training & testing accuracies, and save excel file of cluster results. Currently, the system uses K-Means to learn energy consumption patterns offline, and build models for each energy consumption pattern. Each model is actually a cluster with a cluster center. In the future, the models from the offline training could be used to classify online streaming data and identify their consumption pattern classes once one window data is ready. Our proposed method gives models with training and testing accuracies up to 78%, and reveals some interesting discoveries, relevant to our case study data. |
author2 |
Li Xiang |
author_facet |
Li Xiang Fu, Rong |
format |
Final Year Project |
author |
Fu, Rong |
author_sort |
Fu, Rong |
title |
Unsupervised data clustering for energy efficiency monitoring and analysis |
title_short |
Unsupervised data clustering for energy efficiency monitoring and analysis |
title_full |
Unsupervised data clustering for energy efficiency monitoring and analysis |
title_fullStr |
Unsupervised data clustering for energy efficiency monitoring and analysis |
title_full_unstemmed |
Unsupervised data clustering for energy efficiency monitoring and analysis |
title_sort |
unsupervised data clustering for energy efficiency monitoring and analysis |
publishDate |
2015 |
url |
http://hdl.handle.net/10356/62693 |
_version_ |
1759857313155383296 |