Development of big data analytics tools for e-learning

Background: The purpose of this research is to predict students’ final outcome on a weekly basis, using machines learning algorithms. Data generated from online learning platforms can be analysed and make a prediction. Method: Analysed attributes for significant differences with the use of AVNOA...

Full description

Saved in:
Bibliographic Details
Main Author: Tan, Samuel Boon Hao
Other Authors: Chua Hock Chuan
Format: Final Year Project
Language:English
Published: 2018
Subjects:
Online Access:http://hdl.handle.net/10356/74683
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-74683
record_format dspace
spelling sg-ntu-dr.10356-746832023-07-07T16:05:40Z Development of big data analytics tools for e-learning Tan, Samuel Boon Hao Chua Hock Chuan School of Electrical and Electronic Engineering DRNTU::Engineering Background: The purpose of this research is to predict students’ final outcome on a weekly basis, using machines learning algorithms. Data generated from online learning platforms can be analysed and make a prediction. Method: Analysed attributes for significant differences with the use of AVNOA or Z-test. Classification machine learning models were used namely, K Nearest Neighbor (KNN), Support Vector Classifier (SVC), Gradient Boost Classifier (GBM), Logistic Regression (LR), Decision Tree Classifier (DTC), Naive Bayesian (NB) and Random Forest Classifier (RFC). Prediction for past batch was used to obtain models accuracy score. Result: ‘V’, ‘BG_S’ and ‘C3’ attributes do not have effects on final outcome (P>0.05). Only ‘BG_Cat’, ‘C1’ and ‘C2’ attributes have an effect. (P<0.05) Different classification machine learning models were used as each operates differently. The best model score was KNN with an accuracy score of 77.17%, followed by GBM, RFC, DTC, SVC, LR and NB. (77.17%., 72.83%, 71.74%, 71.74%, 60.87% and 46.74%, respectively.) Conclusion: Relevant data fitted for machine learning will have an impact on accuracy prediction rate and cross-validation can be used for all models to train data in different cases. Accuracy score can be further increased with the use of newer machine learning algorithms. ‘C3’ was not used although it does affect 13% to final, as almost every data points have a value that is close to average value. Feature engineering was used modified for any zero-valued attribute ‘H’, the predicted result should be ‘F’. Bachelor of Engineering 2018-05-23T02:51:23Z 2018-05-23T02:51:23Z 2018 Final Year Project (FYP) http://hdl.handle.net/10356/74683 en Nanyang Technological University 50 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering
spellingShingle DRNTU::Engineering
Tan, Samuel Boon Hao
Development of big data analytics tools for e-learning
description Background: The purpose of this research is to predict students’ final outcome on a weekly basis, using machines learning algorithms. Data generated from online learning platforms can be analysed and make a prediction. Method: Analysed attributes for significant differences with the use of AVNOA or Z-test. Classification machine learning models were used namely, K Nearest Neighbor (KNN), Support Vector Classifier (SVC), Gradient Boost Classifier (GBM), Logistic Regression (LR), Decision Tree Classifier (DTC), Naive Bayesian (NB) and Random Forest Classifier (RFC). Prediction for past batch was used to obtain models accuracy score. Result: ‘V’, ‘BG_S’ and ‘C3’ attributes do not have effects on final outcome (P>0.05). Only ‘BG_Cat’, ‘C1’ and ‘C2’ attributes have an effect. (P<0.05) Different classification machine learning models were used as each operates differently. The best model score was KNN with an accuracy score of 77.17%, followed by GBM, RFC, DTC, SVC, LR and NB. (77.17%., 72.83%, 71.74%, 71.74%, 60.87% and 46.74%, respectively.) Conclusion: Relevant data fitted for machine learning will have an impact on accuracy prediction rate and cross-validation can be used for all models to train data in different cases. Accuracy score can be further increased with the use of newer machine learning algorithms. ‘C3’ was not used although it does affect 13% to final, as almost every data points have a value that is close to average value. Feature engineering was used modified for any zero-valued attribute ‘H’, the predicted result should be ‘F’.
author2 Chua Hock Chuan
author_facet Chua Hock Chuan
Tan, Samuel Boon Hao
format Final Year Project
author Tan, Samuel Boon Hao
author_sort Tan, Samuel Boon Hao
title Development of big data analytics tools for e-learning
title_short Development of big data analytics tools for e-learning
title_full Development of big data analytics tools for e-learning
title_fullStr Development of big data analytics tools for e-learning
title_full_unstemmed Development of big data analytics tools for e-learning
title_sort development of big data analytics tools for e-learning
publishDate 2018
url http://hdl.handle.net/10356/74683
_version_ 1772826350207369216