Singlish Checker: A tool for understanding and analysing an English Creole language

As English is a widely used language in many countries of different cultures, variants of English also known as English creoles have also been created. Singlish is one such English creole used by people in Singapore. Nevertheless, unlike English, Singlish is not taught in schools nor encouraged to b...

Full description

Saved in:
Bibliographic Details
Main Authors: HSIEH, Lee-Hsun, CHUA, Nam Chew, KWEE, Agus Trisnajaya, LO, Pei-Chi, LEE, Yang-Yin, LIM, Ee-peng
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2022
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/7775
https://ink.library.smu.edu.sg/context/sis_research/article/8778/viewcontent/978_3_031_21756_2_9_pv.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
Description
Summary:As English is a widely used language in many countries of different cultures, variants of English also known as English creoles have also been created. Singlish is one such English creole used by people in Singapore. Nevertheless, unlike English, Singlish is not taught in schools nor encouraged to be used in formal communications. Hence, it remains to be a low resource language with a lack of up-to-date Singlish word dictionary and computational tools to analyse the language. In this paper, we therefore propose Singlish Checker, a tool that is able to help detecting Singlish text, Singlish words and phrases. To develop this tool, we first construct a large set of Singlish words and phrases by identifying different sources of Singlish words and their definitions and integrating them. We later propose a Singlish classifier model based on a BERT model fine-tuned with a large number of classified Singlish sentences. Our experiment show that the BERT-based classifier can achieved very high F1 performance, outperforming the baseline.