Using text mining algorithm to detect gender deception based on Malaysian chatroom lingo / Dianne L.M. Cheong and Nur Atiqah Sia Abdullah@Sia Sze Yieng
E-mail is used for communication between strangers and friends. It can be a fantasy playground for identity experimentation where players take on an imaginary persona and interact with each other in the virtual world. In communication, knowing the identity of those with whom you communicate is essential...
Main Authors: | Dianne L.M. Cheong, Nur Atiqah Sia Abdullah@Sia Sze Yieng |
---|---|
Format: | Research Reports |
Language: | English |
Published: | 2006 |
Subjects: | Analysis Algorithms |
Online Access: | https://ir.uitm.edu.my/id/eprint/49538/1/49538.pdf https://ir.uitm.edu.my/id/eprint/49538/ |
Institution: | Universiti Teknologi Mara |
id | my.uitm.ir.49538 |
---|---|
record_format | eprints |
spelling | my.uitm.ir.49538 2023-04-10T06:44:07Z https://ir.uitm.edu.my/id/eprint/49538/ Using text mining algorithm to detect gender deception based on Malaysian chatroom lingo / Dianne L.M. Cheong and Nur Atiqah Sia Abdullah@Sia Sze Yieng L., Dianne M. Cheong; Sia Abdullah, Nur Atiqah Analysis Algorithms E-mail is used for communication between strangers and friends. It can be a fantasy playground for identity experimentation where players take on an imaginary persona and interact with each other in the virtual world. In communication, knowing the identity of those with whom you communicate is essential for understanding and evaluating an interaction. However, the presentation of self in the virtual world is often a conscious and deliberate endeavour. Therefore, gender deception is difficult and risky and it can be abandoned at will. Inference can be made both from writing style and from clues hidden in the posting data. A text-mining algorithm was designed to detect gender deception based on gender-preferential features at the word or clause level of Malaysian e-mail users. Based on this algorithm, a prototype was developed in Visual Basic. The prototype was tested with 16 documents, each consisting of 5 e-mail exchanges of the respective individuals. Our tests show that the prototype reaches an accuracy of 81.3%. This is consistent with a human reader of the documents. The tested prototype will be a tool to assist interested parties such as the Criminology and Forensic Department, e-mail users and virtual communities to successfully identify gender deception. 2006 Research Reports NonPeerReviewed text en https://ir.uitm.edu.my/id/eprint/49538/1/49538.pdf Using text mining algorithm to detect gender deception based on Malaysian chatroom lingo / Dianne L.M. Cheong and Nur Atiqah Sia Abdullah@Sia Sze Yieng. (2006) [Research Reports] (Unpublished) |
institution | Universiti Teknologi Mara |
building | Tun Abdul Razak Library |
collection | Institutional Repository |
continent | Asia |
country | Malaysia |
content_provider | Universiti Teknologi Mara |
content_source | UiTM Institutional Repository |
url_provider | http://ir.uitm.edu.my/ |
language | English |
topic | Analysis Algorithms |
description | E-mail is used for communication between strangers and friends. It can be a fantasy playground for identity experimentation where players take on an imaginary persona and interact with each other in the virtual world. In communication, knowing the identity of those with whom you communicate is essential for understanding and evaluating an interaction. However, the presentation of self in the virtual world is often a conscious and deliberate endeavour. Therefore, gender deception is difficult and risky and it can be abandoned at will. Inference can be made both from writing style and from clues hidden in the posting data. A text-mining algorithm was designed to detect gender deception based on gender-preferential features at the word or clause level of Malaysian e-mail users. Based on this algorithm, a prototype was developed in Visual Basic. The prototype was tested with 16 documents, each consisting of 5 e-mail exchanges of the respective individuals. Our tests show that the prototype reaches an accuracy of 81.3%. This is consistent with a human reader of the documents. The tested prototype will be a tool to assist interested parties such as the Criminology and Forensic Department, e-mail users and virtual communities to successfully identify gender deception. |
format | Research Reports |
author | L., Dianne M. Cheong; Sia Abdullah, Nur Atiqah |
author_facet | L., Dianne M. Cheong; Sia Abdullah, Nur Atiqah |
author_sort | L., Dianne M. Cheong |
title | Using text mining algorithm to detect gender deception based on Malaysian chatroom lingo / Dianne L.M. Cheong and Nur Atiqah Sia Abdullah@Sia Sze Yieng |
publishDate | 2006 |
url | https://ir.uitm.edu.my/id/eprint/49538/1/49538.pdf https://ir.uitm.edu.my/id/eprint/49538/ |
_version_ | 1762841149859430400 |
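
The description mentions detecting gender deception from gender-preferential features at the word level, and an accuracy of 81.3% over 16 documents. The following is a minimal sketch of that general idea in Python, not the authors' Visual Basic prototype. The marker word lists, the function names, and the tie-handling rule are all hypothetical and chosen only for illustration; the record does not list the features the report actually used. The figure of 13 correct documents is likewise an assumption: 13 of 16 is 81.25%, which agrees with the reported 81.3% when rounded half up, but the record does not state the count.

```python
import re
from collections import Counter

# Hypothetical marker lists, invented for illustration only.
# The record does not give the report's actual gender-preferential features.
FEMALE_MARKERS = {"lovely", "so", "really", "sorry"}
MALE_MARKERS = {"bro", "game", "match", "deal"}


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def predict_gender(text):
    """Guess a gender from marker-word counts; 'unknown' on a tie."""
    counts = Counter(tokenize(text))
    female = sum(counts[w] for w in FEMALE_MARKERS)
    male = sum(counts[w] for w in MALE_MARKERS)
    if female == male:
        return "unknown"
    return "female" if female > male else "male"


def is_deceptive(text, claimed_gender):
    """Flag a document when the predicted gender contradicts the claim."""
    predicted = predict_gender(text)
    return predicted != "unknown" and predicted != claimed_gender


def accuracy_percent(correct, total):
    """Share of correctly classified documents, as an unrounded percentage."""
    return 100 * correct / total
```

For example, `is_deceptive("That is so lovely, really", "male")` flags the text because three of the hypothetical female markers appear and none of the male ones, while a text with no markers at all is left unflagged rather than guessed at.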