Gated convolutional neural network for fine-grained automatic essay scoring
This report proposes a novel method for incorporating character level information into neural models for Automatic Essay Scoring (AES). Intuitively, character level information is highly important to the AES task but is unfortunately neglected in many newly proposed neural models. Notably, the us...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
2017
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/72840 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-72840 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-728402023-03-03T20:47:21Z Gated convolutional neural network for fine-grained automatic essay scoring Lee, Xing Zhao Hui Siu Cheung School of Computer Science and Engineering DRNTU::Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence This report proposes a novel method for incorporating character level information into neural models for Automatic Essay Scoring (AES). Intuitively, character level information is highly important to the AES task but is unfortunately neglected in many newly proposed neural models. Notably, the usage of character level information is also an issue of correctness aside from performance. Firstly, character level information is capable of modelling fine-grained features that are predictive of writing ability such as spelling or grammatical errors. Secondly, it also able to mitigate and handle out-of-vocabulary (OOV) words. Given that vocabularies are often constructed based on frequent words, assigning unknown tokens (UNK-ing) can be unfair to students who use rare and difficult words. Additionally, spelling mistakes tend to be UNKed which might cause an essay to become semantically incomprehensible. However, using both characters and words concurrently may incur a risk of overfitting. As such, a novel neural mechanism that incorporates character level information and models the relationship between words and characters using novel neural gating mechanism has been designed. To this end, two variations of word-char gating for adaptively controlling information fusion between words and characters are proposed. Extensive experiments show that the proposed neural word-char gating achieves state-of-the-art performance on the ASAP Kaggle dataset for AES while containing significantly lesser parameters than many other published works and neural network architectures. Ablation studies confirm the effectiveness of the proposed neural gating mechanism over numerous neural and feature engineering baselines. A web system prototype is also built to demonstrate how the system works. Bachelor of Engineering (Computer Science) 2017-11-23T12:53:19Z 2017-11-23T12:53:19Z 2017 Final Year Project (FYP) http://hdl.handle.net/10356/72840 en Nanyang Technological University 46 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence |
spellingShingle |
DRNTU::Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Lee, Xing Zhao Gated convolutional neural network for fine-grained automatic essay scoring |
description |
This report proposes a novel method for incorporating character level information into neural
models for Automatic Essay Scoring (AES). Intuitively, character level information is highly
important to the AES task but is unfortunately neglected in many newly proposed neural
models. Notably, the usage of character level information is also an issue of correctness aside
from performance. Firstly, character level information is capable of modelling fine-grained
features that are predictive of writing ability such as spelling or grammatical errors. Secondly,
it also able to mitigate and handle out-of-vocabulary (OOV) words. Given that vocabularies are
often constructed based on frequent words, assigning unknown tokens (UNK-ing) can be unfair
to students who use rare and difficult words. Additionally, spelling mistakes tend to be UNKed
which might cause an essay to become semantically incomprehensible. However, using both
characters and words concurrently may incur a risk of overfitting. As such, a novel neural
mechanism that incorporates character level information and models the relationship between
words and characters using novel neural gating mechanism has been designed. To this end, two
variations of word-char gating for adaptively controlling information fusion between words
and characters are proposed. Extensive experiments show that the proposed neural word-char
gating achieves state-of-the-art performance on the ASAP Kaggle dataset for AES while
containing significantly lesser parameters than many other published works and neural network
architectures. Ablation studies confirm the effectiveness of the proposed neural gating
mechanism over numerous neural and feature engineering baselines. A web system prototype
is also built to demonstrate how the system works. |
author2 |
Hui Siu Cheung |
author_facet |
Hui Siu Cheung Lee, Xing Zhao |
format |
Final Year Project |
author |
Lee, Xing Zhao |
author_sort |
Lee, Xing Zhao |
title |
Gated convolutional neural network for fine-grained automatic essay scoring |
title_short |
Gated convolutional neural network for fine-grained automatic essay scoring |
title_full |
Gated convolutional neural network for fine-grained automatic essay scoring |
title_fullStr |
Gated convolutional neural network for fine-grained automatic essay scoring |
title_full_unstemmed |
Gated convolutional neural network for fine-grained automatic essay scoring |
title_sort |
gated convolutional neural network for fine-grained automatic essay scoring |
publishDate |
2017 |
url |
http://hdl.handle.net/10356/72840 |
_version_ |
1759856288735428608 |