Split over-training for unsupervised purchase intention identification

Recognizing user-expressed intentions in social media can be useful for many applications such as business intelligence, as intentions are intimately linked to potential actions or behaviors. This paper focuses on a binary classification problem: whether a text expresses purchase intention (PI) or n...

Full description

Saved in:
Bibliographic Details
Main Authors: Abd Yusof, Noor Fazilla, Lin, Chenghua, Han, Xiwu, Barawi, Mohamad Hardyman
Format: Article
Language:English
Published: World Academy of Research in Science and Engineering 2020
Online Access:http://eprints.utem.edu.my/id/eprint/24902/2/IJATCSE214932020.PDF
http://eprints.utem.edu.my/id/eprint/24902/
http://www.warse.org/IJATCSE/static/pdf/file/ijatcse214932020.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Teknikal Malaysia Melaka
Language: English
id my.utem.eprints.24902
record_format eprints
spelling my.utem.eprints.249022023-07-05T16:02:30Z http://eprints.utem.edu.my/id/eprint/24902/ Split over-training for unsupervised purchase intention identification Abd Yusof, Noor Fazilla Lin, Chenghua Han, Xiwu Barawi, Mohamad Hardyman Recognizing user-expressed intentions in social media can be useful for many applications such as business intelligence, as intentions are intimately linked to potential actions or behaviors. This paper focuses on a binary classification problem: whether a text expresses purchase intention (PI) or not (non-PI). In contrast to existing research, which relies on labeled intention corpus or linguistic knowledge, we proposed an unsupervised method called split over-training for the PI identification task. Experiments on PI identification from tweets showed that our approach was effective and promising. The best classifying accuracy of 84.6% and PI F-measure of 70.4% was achieved, which are only 7.7% and 4.9% respectively lower than fully supervised models. This means our unsupervised method may provide reasonable preprocessing for intention corpus labeling or intention knowledge acquisition. World Academy of Research in Science and Engineering 2020-06 Article PeerReviewed text en http://eprints.utem.edu.my/id/eprint/24902/2/IJATCSE214932020.PDF Abd Yusof, Noor Fazilla and Lin, Chenghua and Han, Xiwu and Barawi, Mohamad Hardyman (2020) Split over-training for unsupervised purchase intention identification. International Journal of Advanced Trends in Computer Science and Engineering, 9 (3). pp. 3921-3928. ISSN 2278–3091 http://www.warse.org/IJATCSE/static/pdf/file/ijatcse214932020.pdf 10.30534/ijatcse/2020/214932020
institution Universiti Teknikal Malaysia Melaka
building UTEM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknikal Malaysia Melaka
content_source UTEM Institutional Repository
url_provider http://eprints.utem.edu.my/
language English
description Recognizing user-expressed intentions in social media can be useful for many applications such as business intelligence, as intentions are intimately linked to potential actions or behaviors. This paper focuses on a binary classification problem: whether a text expresses purchase intention (PI) or not (non-PI). In contrast to existing research, which relies on labeled intention corpus or linguistic knowledge, we proposed an unsupervised method called split over-training for the PI identification task. Experiments on PI identification from tweets showed that our approach was effective and promising. The best classifying accuracy of 84.6% and PI F-measure of 70.4% was achieved, which are only 7.7% and 4.9% respectively lower than fully supervised models. This means our unsupervised method may provide reasonable preprocessing for intention corpus labeling or intention knowledge acquisition.
format Article
author Abd Yusof, Noor Fazilla
Lin, Chenghua
Han, Xiwu
Barawi, Mohamad Hardyman
spellingShingle Abd Yusof, Noor Fazilla
Lin, Chenghua
Han, Xiwu
Barawi, Mohamad Hardyman
Split over-training for unsupervised purchase intention identification
author_facet Abd Yusof, Noor Fazilla
Lin, Chenghua
Han, Xiwu
Barawi, Mohamad Hardyman
author_sort Abd Yusof, Noor Fazilla
title Split over-training for unsupervised purchase intention identification
title_short Split over-training for unsupervised purchase intention identification
title_full Split over-training for unsupervised purchase intention identification
title_fullStr Split over-training for unsupervised purchase intention identification
title_full_unstemmed Split over-training for unsupervised purchase intention identification
title_sort split over-training for unsupervised purchase intention identification
publisher World Academy of Research in Science and Engineering
publishDate 2020
url http://eprints.utem.edu.my/id/eprint/24902/2/IJATCSE214932020.PDF
http://eprints.utem.edu.my/id/eprint/24902/
http://www.warse.org/IJATCSE/static/pdf/file/ijatcse214932020.pdf
_version_ 1772816019473039360