Machine learning for mathematical question difficulty classification


Bibliographic Details
Main Author: Pang, Jarald Qi Kai
Other Authors: Hui Siu Cheung
Format: Final Year Project
Language: English
Published: 2019
Online Access: http://hdl.handle.net/10356/76982
Institution: Nanyang Technological University
Summary: This project is an experimental study of how machine learning models can be used to classify GCE ‘A’ Level mathematical questions. Classification is carried out at two levels: first by topic, then by difficulty level. The report explains each step of the experiment in detail. The evaluation metrics used are F1 score, precision, recall and accuracy. For data pre-processing, three text vectorization methods were tested: count vectors, word-level TF-IDF and n-gram-level TF-IDF. Four machine learning methods (Support Vector Machines, Naïve Bayes, Random Forest and Extreme Gradient Boosting) were then used to classify each question by topic, and the models’ performance on each topic was analysed. The same four methods were then used to classify the difficulty of each question from the vectorized question and its predicted topic, followed by a final analysis of the models’ performance in difficulty classification.
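
The four evaluation metrics named in the summary can be illustrated with a minimal sketch. The record does not say how the report averages per-class scores, so macro-averaging is assumed here; the function name `classification_report` and the topic labels are hypothetical and used for illustration only.

```python
from collections import Counter


def classification_report(y_true, y_pred):
    """Return accuracy plus macro-averaged precision, recall and F1.

    Macro-averaging is an assumption; the source does not state
    which averaging scheme the report uses.
    """
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("y_true and y_pred must be non-empty and of equal length")

    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, guess in zip(y_true, y_pred):
        if truth == guess:
            tp[truth] += 1   # correct prediction for this class
        else:
            fp[guess] += 1   # predicted class gains a false positive
            fn[truth] += 1   # true class gains a false negative

    def safe_div(numerator, denominator):
        # Treat an undefined ratio (0/0) as 0.0.
        return numerator / denominator if denominator else 0.0

    precisions, recalls, f1_scores = [], [], []
    for label in labels:
        p = safe_div(tp[label], tp[label] + fp[label])
        r = safe_div(tp[label], tp[label] + fn[label])
        precisions.append(p)
        recalls.append(r)
        f1_scores.append(safe_div(2 * p * r, p + r))

    n_labels = len(labels)
    return {
        "accuracy": sum(tp.values()) / len(y_true),
        "precision": sum(precisions) / n_labels,
        "recall": sum(recalls) / n_labels,
        "f1": sum(f1_scores) / n_labels,
    }


# Hypothetical topic labels, not taken from the project's data set.
y_true = ["calculus", "calculus", "vectors", "statistics"]
y_pred = ["calculus", "vectors", "vectors", "statistics"]
report = classification_report(y_true, y_pred)
```

The same function would apply unchanged to the second stage, with difficulty levels in place of topic labels.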