Big data analytics
With the ease of access to connected devices and online services, data of a wide variety are constantly being collected by various service providers. These data can be used for trend-finding and the prediction of future values, such outcomes having an importance in optimization for a variety of indu...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
2019
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/78353 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-78353 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-783532023-07-07T16:06:21Z Big data analytics Chua, Zhen Hong Bi Guoan Chua Hock Chuan School of Electrical and Electronic Engineering DRNTU::Engineering::Electrical and electronic engineering::Computer hardware, software and systems With the ease of access to connected devices and online services, data of a wide variety are constantly being collected by various service providers. These data can be used for trend-finding and the prediction of future values, such outcomes having an importance in optimization for a variety of industrial, commercial and even consumer processes. With the widespread availability of highly capable computing systems and programming tools, resource-intensive tasks like the implementation of predictive machine learning is now possible at low cost for a determined user. The objective of this project is to produce a machine learning-based process, capable of predicting a numerical output based upon a set of mixed-type input data. This process is implemented with open-source programming tools. Furthermore, the project also seeks to predict the relative importance of the different data features. In this project, we have developed a machine-learning process capable of predicting a numerical output with up to 0.77 explained variance. The process encompasses the entire data analysis procedure, from data importation, data pre-processing, hyperparameter optimization and prediction. The process developed shows that a functional, and reasonably accurate data analysis model, can be produced using open-source software. Using a variety of machine-learning algorithms, the project also shows the relative accuracy of, and time taken by the different models in producing a predicted output. Bachelor of Engineering (Electrical and Electronic Engineering) 2019-06-19T01:25:05Z 2019-06-19T01:25:05Z 2019 Final Year Project (FYP) http://hdl.handle.net/10356/78353 en Nanyang Technological University 48 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering::Electrical and electronic engineering::Computer hardware, software and systems |
spellingShingle |
DRNTU::Engineering::Electrical and electronic engineering::Computer hardware, software and systems Chua, Zhen Hong Big data analytics |
description |
With the ease of access to connected devices and online services, data of a wide variety are constantly being collected by various service providers. These data can be used for trend-finding and the prediction of future values, such outcomes having an importance in optimization for a variety of industrial, commercial and even consumer processes. With the widespread availability of highly capable computing systems and programming tools, resource-intensive tasks like the implementation of predictive machine learning is now possible at low cost for a determined user. The objective of this project is to produce a machine learning-based process, capable of predicting a numerical output based upon a set of mixed-type input data. This process is implemented with open-source programming tools. Furthermore, the project also seeks to predict the relative importance of the different data features. In this project, we have developed a machine-learning process capable of predicting a numerical output with up to 0.77 explained variance. The process encompasses the entire data analysis procedure, from data importation, data pre-processing, hyperparameter optimization and prediction. The process developed shows that a functional, and reasonably accurate data analysis model, can be produced using open-source software. Using a variety of machine-learning algorithms, the project also shows the relative accuracy of, and time taken by the different models in producing a predicted output. |
author2 |
Bi Guoan |
author_facet |
Bi Guoan Chua, Zhen Hong |
format |
Final Year Project |
author |
Chua, Zhen Hong |
author_sort |
Chua, Zhen Hong |
title |
Big data analytics |
title_short |
Big data analytics |
title_full |
Big data analytics |
title_fullStr |
Big data analytics |
title_full_unstemmed |
Big data analytics |
title_sort |
big data analytics |
publishDate |
2019 |
url |
http://hdl.handle.net/10356/78353 |
_version_ |
1772828211015581696 |