Environment data processing for a data centre
There are rising concerns over the carbon footprint of data centre (DC) in Singapore as their total electricity consumption has increased from 5.3% in 2019 to 7% in 2020. This figure is projected to increase, in line with the growing demand for DC. In this study, we will focus on the DC’s informa...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2022
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/163017 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-163017 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1630172022-11-16T06:15:44Z Environment data processing for a data centre Tan, Mei Xuan Tan Rui School of Computer Science and Engineering tanrui@ntu.edu.sg Engineering::Computer science and engineering::Data There are rising concerns over the carbon footprint of data centre (DC) in Singapore as their total electricity consumption has increased from 5.3% in 2019 to 7% in 2020. This figure is projected to increase, in line with the growing demand for DC. In this study, we will focus on the DC’s information technology (IT) system which is one of the major energy consumers. Five types of predictive models: Multilayer Perceptron, Linear Regression, Decision Tree, Support Vector Regression and Stacking-based Ensemble were developed to predict total IT power consumption using IT facilities' operating conditions and meteorological parameters as inputs. This research aims to investigate the relationship between a DC’s IT power consumption with its IT facilities and environmental factors outside the DC. This paper also presents a feature importance analysis and a detailed comparison of the performance of different models. The results of the feature importance analysis indicate that CPU utilization is the most significant factor that will affect the total IT power consumption in the DC. As for the model evaluation, the Stacking-based ensemble performs the best. Based on the Stacking-based ensemble model, a set of recommendations on the most optimal IT operating conditions was made. Bachelor of Engineering (Computer Science) 2022-11-16T06:07:59Z 2022-11-16T06:07:59Z 2022 Final Year Project (FYP) Tan, M. X. (2022). Environment data processing for a data centre. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/163017 https://hdl.handle.net/10356/163017 en SCSE21-0579 application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering::Data |
spellingShingle |
Engineering::Computer science and engineering::Data Tan, Mei Xuan Environment data processing for a data centre |
description |
There are rising concerns over the carbon footprint of data centre (DC) in Singapore as their total
electricity consumption has increased from 5.3% in 2019 to 7% in 2020. This figure is projected
to increase, in line with the growing demand for DC. In this study, we will focus on the DC’s
information technology (IT) system which is one of the major energy consumers. Five types of
predictive models: Multilayer Perceptron, Linear Regression, Decision Tree, Support Vector
Regression and Stacking-based Ensemble were developed to predict total IT power consumption
using IT facilities' operating conditions and meteorological parameters as inputs. This research
aims to investigate the relationship between a DC’s IT power consumption with its IT facilities
and environmental factors outside the DC. This paper also presents a feature importance analysis
and a detailed comparison of the performance of different models. The results of the feature
importance analysis indicate that CPU utilization is the most significant factor that will affect
the total IT power consumption in the DC. As for the model evaluation, the Stacking-based
ensemble performs the best. Based on the Stacking-based ensemble model, a set of
recommendations on the most optimal IT operating conditions was made. |
author2 |
Tan Rui |
author_facet |
Tan Rui Tan, Mei Xuan |
format |
Final Year Project |
author |
Tan, Mei Xuan |
author_sort |
Tan, Mei Xuan |
title |
Environment data processing for a data centre |
title_short |
Environment data processing for a data centre |
title_full |
Environment data processing for a data centre |
title_fullStr |
Environment data processing for a data centre |
title_full_unstemmed |
Environment data processing for a data centre |
title_sort |
environment data processing for a data centre |
publisher |
Nanyang Technological University |
publishDate |
2022 |
url |
https://hdl.handle.net/10356/163017 |
_version_ |
1751548580933402624 |