Environment data processing for a data centre

There are rising concerns over the carbon footprint of data centre (DC) in Singapore as their total electricity consumption has increased from 5.3% in 2019 to 7% in 2020. This figure is projected to increase, in line with the growing demand for DC. In this study, we will focus on the DC’s informa...

Full description

Saved in:
Bibliographic Details
Main Author: Tan, Mei Xuan
Other Authors: Tan Rui
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2022
Subjects:
Online Access:https://hdl.handle.net/10356/163017
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-163017
record_format dspace
spelling sg-ntu-dr.10356-1630172022-11-16T06:15:44Z Environment data processing for a data centre Tan, Mei Xuan Tan Rui School of Computer Science and Engineering tanrui@ntu.edu.sg Engineering::Computer science and engineering::Data There are rising concerns over the carbon footprint of data centre (DC) in Singapore as their total electricity consumption has increased from 5.3% in 2019 to 7% in 2020. This figure is projected to increase, in line with the growing demand for DC. In this study, we will focus on the DC’s information technology (IT) system which is one of the major energy consumers. Five types of predictive models: Multilayer Perceptron, Linear Regression, Decision Tree, Support Vector Regression and Stacking-based Ensemble were developed to predict total IT power consumption using IT facilities' operating conditions and meteorological parameters as inputs. This research aims to investigate the relationship between a DC’s IT power consumption with its IT facilities and environmental factors outside the DC. This paper also presents a feature importance analysis and a detailed comparison of the performance of different models. The results of the feature importance analysis indicate that CPU utilization is the most significant factor that will affect the total IT power consumption in the DC. As for the model evaluation, the Stacking-based ensemble performs the best. Based on the Stacking-based ensemble model, a set of recommendations on the most optimal IT operating conditions was made. Bachelor of Engineering (Computer Science) 2022-11-16T06:07:59Z 2022-11-16T06:07:59Z 2022 Final Year Project (FYP) Tan, M. X. (2022). Environment data processing for a data centre. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/163017 https://hdl.handle.net/10356/163017 en SCSE21-0579 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering::Data
spellingShingle Engineering::Computer science and engineering::Data
Tan, Mei Xuan
Environment data processing for a data centre
description There are rising concerns over the carbon footprint of data centre (DC) in Singapore as their total electricity consumption has increased from 5.3% in 2019 to 7% in 2020. This figure is projected to increase, in line with the growing demand for DC. In this study, we will focus on the DC’s information technology (IT) system which is one of the major energy consumers. Five types of predictive models: Multilayer Perceptron, Linear Regression, Decision Tree, Support Vector Regression and Stacking-based Ensemble were developed to predict total IT power consumption using IT facilities' operating conditions and meteorological parameters as inputs. This research aims to investigate the relationship between a DC’s IT power consumption with its IT facilities and environmental factors outside the DC. This paper also presents a feature importance analysis and a detailed comparison of the performance of different models. The results of the feature importance analysis indicate that CPU utilization is the most significant factor that will affect the total IT power consumption in the DC. As for the model evaluation, the Stacking-based ensemble performs the best. Based on the Stacking-based ensemble model, a set of recommendations on the most optimal IT operating conditions was made.
author2 Tan Rui
author_facet Tan Rui
Tan, Mei Xuan
format Final Year Project
author Tan, Mei Xuan
author_sort Tan, Mei Xuan
title Environment data processing for a data centre
title_short Environment data processing for a data centre
title_full Environment data processing for a data centre
title_fullStr Environment data processing for a data centre
title_full_unstemmed Environment data processing for a data centre
title_sort environment data processing for a data centre
publisher Nanyang Technological University
publishDate 2022
url https://hdl.handle.net/10356/163017
_version_ 1751548580933402624