Unsupervised data clustering for energy efficiency monitoring and analysis

For the first half of the Final Year Project, the main research focus is to improve K-Means clustering: to make the algorithm stable and efficient, and to determine the number of clusters K automatically. Under this research, I built a program that combines density-based clustering techniques with K-Means clustering, e...
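
As a rough illustration of the idea in the summary above (using density information to pick K initial centroids so that K-Means gives the same result on every run), here is a minimal Python sketch. It is not the author's program: the function names `density_seeds` and `kmeans`, the `radius` parameter, and the rule "take the densest points that lie more than `radius` apart" are all assumptions made for this example. The full pairwise distance matrix is one way such a scheme can end up with quadratic cost in the number of points.

```python
import numpy as np

def density_seeds(X, k, radius):
    """Pick up to k initial centroids deterministically (hypothetical rule):
    visit points from densest to sparsest and keep those that lie more
    than `radius` away from every seed chosen so far."""
    # Full pairwise Euclidean distance matrix: n-by-n, hence quadratic cost.
    diffs = X[:, None, :] - X[None, :, :]
    dist = np.sqrt((diffs ** 2).sum(axis=2))
    # Density of a point = number of points within `radius` (itself included).
    density = (dist <= radius).sum(axis=1)
    # A stable sort keeps ties in input order, so the result is repeatable.
    order = np.argsort(-density, kind="stable")
    chosen = []
    for i in order:
        if all(dist[i, j] > radius for j in chosen):
            chosen.append(i)
        if len(chosen) == k:
            break
    # May return fewer than k seeds if `radius` is too large for the data.
    return X[chosen]

def kmeans(X, centroids, iters=100):
    """Plain Lloyd iterations starting from the given centroids."""
    centroids = centroids.astype(float).copy()
    for _ in range(iters):
        # Assign every point to its nearest centroid.
        d = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        labels = d.argmin(axis=1)
        # Recompute each centroid; keep the old one if its cluster is empty.
        new = np.array([
            X[labels == c].mean(axis=0) if np.any(labels == c) else centroids[c]
            for c in range(len(centroids))
        ])
        if np.allclose(new, centroids):
            break
        centroids = new
    return centroids, labels

# Toy data: two well-separated groups of three points each.
X = np.array([[0, 0], [0, 1], [1, 0], [10, 10], [10, 11], [11, 10]], dtype=float)
seeds = density_seeds(X, k=2, radius=1.5)
centroids, labels = kmeans(X, seeds)
```

Because the seeds depend only on the data and on `radius`, repeated runs start from the same centroids, unlike random initialization.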

Bibliographic Details
Main Author: Fu, Rong
Other Authors: Li Xiang
Format: Final Year Project
Language: English
Published: 2015
Subjects: DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition
Online Access: http://hdl.handle.net/10356/62693
Institution: Nanyang Technological University
id sg-ntu-dr.10356-62693
record_format dspace
spelling sg-ntu-dr.10356-62693 2023-03-03T20:30:18Z Unsupervised data clustering for energy efficiency monitoring and analysis Fu, Rong Li Xiang Ng Wee Keong School of Computer Engineering A*STAR SIMTech DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition Bachelor of Engineering (Computer Science) 2015-04-27T07:18:52Z 2015-04-27T07:18:52Z 2015 2015 Final Year Project (FYP) http://hdl.handle.net/10356/62693 en Nanyang Technological University 72 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition
spellingShingle DRNTU::Engineering::Computer science and engineering::Computing methodologies::Pattern recognition
Fu, Rong
Unsupervised data clustering for energy efficiency monitoring and analysis
description For the first half of the Final Year Project, the main research focus is to improve K-Means clustering: to make the algorithm stable and efficient, and to determine the number of clusters K automatically. Under this research, I built a program that combines density-based clustering techniques with K-Means clustering, enabling stable selection of the K initial centroids. The drawback of this stable-selection algorithm is its O(n^2) time complexity. In addition, two ways of automatically determining the number of clusters K are summarized in the Literature Review. A possible improvement to the second approach is also proposed; its validation and application are left to future research. For the second half of the Final Year Project, my main research focus is to explore the possibility of using data clustering techniques, especially K-Means clustering, for energy efficiency monitoring and analysis. Under this research, two different approaches (Whole Batch Feature Extraction and Window Feature Extraction) are investigated. In addition, I built a system that selects features for K-Means, takes a training/testing split percentage as input, reports training and testing accuracies, and saves the cluster results to an Excel file. Currently, the system uses K-Means to learn energy consumption patterns offline and builds a model for each energy consumption pattern. Each model is a cluster with a cluster center. In the future, the models from the offline training could be used to classify online streaming data and identify the consumption pattern class of each window as soon as one window of data is ready. Our proposed method gives models with training and testing accuracies of up to 78%, and reveals some interesting findings about our case study data.
author2 Li Xiang
author_facet Li Xiang
Fu, Rong
format Final Year Project
author Fu, Rong
author_sort Fu, Rong
title Unsupervised data clustering for energy efficiency monitoring and analysis
title_short Unsupervised data clustering for energy efficiency monitoring and analysis
title_full Unsupervised data clustering for energy efficiency monitoring and analysis
title_fullStr Unsupervised data clustering for energy efficiency monitoring and analysis
title_full_unstemmed Unsupervised data clustering for energy efficiency monitoring and analysis
title_sort unsupervised data clustering for energy efficiency monitoring and analysis
publishDate 2015
url http://hdl.handle.net/10356/62693
_version_ 1759857313155383296
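
The description above mentions Window Feature Extraction and, as future work, classifying each window of streaming data against the cluster centers learned offline. The following Python sketch illustrates that workflow under stated assumptions. The record does not say which features the project extracted, so the four statistics used here (mean, standard deviation, minimum, maximum) are placeholders, and the names `window_features` and `classify_window` are hypothetical.

```python
import numpy as np

def window_features(series, window):
    """Split a 1-D series into non-overlapping windows of length `window`
    and return one feature row per window. The feature set (mean, std,
    min, max) is an assumption for illustration only."""
    n = len(series) // window
    # Drop any trailing samples that do not fill a whole window.
    w = np.asarray(series[: n * window], dtype=float).reshape(n, window)
    return np.column_stack([w.mean(axis=1), w.std(axis=1),
                            w.min(axis=1), w.max(axis=1)])

def classify_window(window_values, centers):
    """Assign one complete window to the nearest cluster center,
    where `centers` come from offline K-Means training."""
    feats = window_features(window_values, len(window_values))[0]
    return int(np.linalg.norm(centers - feats, axis=1).argmin())

# Toy example: one low-consumption window followed by one high-consumption window.
history = [1, 1, 1, 1, 9, 9, 9, 9]
feats = window_features(history, window=4)
# Stand-in for cluster centers learned offline (here, simply the two feature rows).
centers = feats.copy()
pattern_low = classify_window([1, 2, 1, 1], centers)
pattern_high = classify_window([8, 9, 10, 9], centers)
```

In a streaming setting, `classify_window` would be called each time a buffer of `window` new readings fills up, which matches the "once one window of data is ready" wording of the description.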