Enhancing GAN-LCS Performance Using an Abbreviations Checker in Automatic Short Answer Scoring

Automatic short answer scoring methods have been developed with various algorithms over the decades. In the Indonesian language, the string-based similarity is more commonly used. This method is difficult to accurately measure the similarity of two sentences with significantly different word lengths...

Full description

Saved in:
Bibliographic Details
Main Authors: Muhammad, Ar-Razy, Permanasari, Adhistya Erna, Hidayah, Indriana
Format: Article PeerReviewed
Language:English
Published: MDPI 2022
Subjects:
Online Access:https://repository.ugm.ac.id/278927/1/Muhammad_TK.pdf
https://repository.ugm.ac.id/278927/
https://www.mdpi.com/2073-431X/11/7/108
https://doi.org/10.3390/computers11070108
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universitas Gadjah Mada
Language: English
id id-ugm-repo.278927
record_format dspace
spelling id-ugm-repo.2789272023-10-20T06:20:05Z https://repository.ugm.ac.id/278927/ Enhancing GAN-LCS Performance Using an Abbreviations Checker in Automatic Short Answer Scoring Muhammad, Ar-Razy Permanasari, Adhistya Erna Hidayah, Indriana Engineering Automatic short answer scoring methods have been developed with various algorithms over the decades. In the Indonesian language, the string-based similarity is more commonly used. This method is difficult to accurately measure the similarity of two sentences with significantly different word lengths. This problem has been handled by the Geometric Average Normalized-Longest Common Subsequence (GAN-LCS) method by eliminating non-contributive words utilizing the Longest Common Subsequence method. However, students’ answers may vary not only in character length but also in the words they choose. For instance, some students tend only to write the abbreviations or acronyms of the phrase instead of writing meaningful words. As a result, it will reduce the intersection character between the reference answer and the student answer. Moreover, it can change the sentence structure even though it has the same meaning by definition. Therefore, this study aims to improve GAN-LCS method performance by incorporating the abbreviation checker to handle the abbreviations or acronyms found in the reference answer or student answer. The dataset used in this study consisted of 10 questions with 1 reference answer for each question and 585 student answers. The experimental results show an improvement in GAN-LCS performance that could run 34.43% faster. Meanwhile, the Root Mean Square Error (RSME) value became lower by 7.65% and the correlation value was increased by 8%. Looking forward, future studies may continue to investigate a method for automatically generate the abbreviations dictionary. MDPI 2022-07-01 Article PeerReviewed application/pdf en https://repository.ugm.ac.id/278927/1/Muhammad_TK.pdf Muhammad, Ar-Razy and Permanasari, Adhistya Erna and Hidayah, Indriana (2022) Enhancing GAN-LCS Performance Using an Abbreviations Checker in Automatic Short Answer Scoring. MDPI, 11 (7). pp. 1-13. ISSN 2073431X https://www.mdpi.com/2073-431X/11/7/108 https://doi.org/10.3390/computers11070108
institution Universitas Gadjah Mada
building UGM Library
continent Asia
country Indonesia
Indonesia
content_provider UGM Library
collection Repository Civitas UGM
language English
topic Engineering
spellingShingle Engineering
Muhammad, Ar-Razy
Permanasari, Adhistya Erna
Hidayah, Indriana
Enhancing GAN-LCS Performance Using an Abbreviations Checker in Automatic Short Answer Scoring
description Automatic short answer scoring methods have been developed with various algorithms over the decades. In the Indonesian language, the string-based similarity is more commonly used. This method is difficult to accurately measure the similarity of two sentences with significantly different word lengths. This problem has been handled by the Geometric Average Normalized-Longest Common Subsequence (GAN-LCS) method by eliminating non-contributive words utilizing the Longest Common Subsequence method. However, students’ answers may vary not only in character length but also in the words they choose. For instance, some students tend only to write the abbreviations or acronyms of the phrase instead of writing meaningful words. As a result, it will reduce the intersection character between the reference answer and the student answer. Moreover, it can change the sentence structure even though it has the same meaning by definition. Therefore, this study aims to improve GAN-LCS method performance by incorporating the abbreviation checker to handle the abbreviations or acronyms found in the reference answer or student answer. The dataset used in this study consisted of 10 questions with 1 reference answer for each question and 585 student answers. The experimental results show an improvement in GAN-LCS performance that could run 34.43% faster. Meanwhile, the Root Mean Square Error (RSME) value became lower by 7.65% and the correlation value was increased by 8%. Looking forward, future studies may continue to investigate a method for automatically generate the abbreviations dictionary.
format Article
PeerReviewed
author Muhammad, Ar-Razy
Permanasari, Adhistya Erna
Hidayah, Indriana
author_facet Muhammad, Ar-Razy
Permanasari, Adhistya Erna
Hidayah, Indriana
author_sort Muhammad, Ar-Razy
title Enhancing GAN-LCS Performance Using an Abbreviations Checker in Automatic Short Answer Scoring
title_short Enhancing GAN-LCS Performance Using an Abbreviations Checker in Automatic Short Answer Scoring
title_full Enhancing GAN-LCS Performance Using an Abbreviations Checker in Automatic Short Answer Scoring
title_fullStr Enhancing GAN-LCS Performance Using an Abbreviations Checker in Automatic Short Answer Scoring
title_full_unstemmed Enhancing GAN-LCS Performance Using an Abbreviations Checker in Automatic Short Answer Scoring
title_sort enhancing gan-lcs performance using an abbreviations checker in automatic short answer scoring
publisher MDPI
publishDate 2022
url https://repository.ugm.ac.id/278927/1/Muhammad_TK.pdf
https://repository.ugm.ac.id/278927/
https://www.mdpi.com/2073-431X/11/7/108
https://doi.org/10.3390/computers11070108
_version_ 1781413316259741696