One seed to find them all : mining opinion features via association

Feature-based opinion analysis has attracted extensive attention recently. Identifying features associated with opinions expressed in reviews is essential for fine-grained opinion mining. One approach is to exploit the dependency relations that occur naturally between features and opinion words, and...

Full description

Saved in:
Bibliographic Details
Main Authors: Hai, Zhen, Chang, Kuiyu, Cong, Gao
Other Authors: School of Computer Engineering
Format: Conference or Workshop Item
Language:English
Published: 2013
Online Access:https://hdl.handle.net/10356/98037
http://hdl.handle.net/10220/12292
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-98037
record_format dspace
spelling sg-ntu-dr.10356-980372020-05-28T07:18:24Z One seed to find them all : mining opinion features via association Hai, Zhen Chang, Kuiyu Cong, Gao School of Computer Engineering International conference on Information and knowledge management (21st : 2012 : Maui, USA) Feature-based opinion analysis has attracted extensive attention recently. Identifying features associated with opinions expressed in reviews is essential for fine-grained opinion mining. One approach is to exploit the dependency relations that occur naturally between features and opinion words, and among features (or opinion words) themselves. In this paper, we propose a generalized approach to opinion feature extraction by incorporating robust statistical association analysis in a bootstrapping framework. The new approach starts with a small set of feature seeds, on which it iteratively enlarges by mining feature-opinion, feature-feature, and opinion-opinion dependency relations. Two association model types, namely likelihood ratio tests (LRT) and latent semantic analysis (LSA), are proposed for computing the pair-wise associations between terms (features or opinions). We accordingly propose two robust bootstrapping approaches, LRTBOOT and LSABOOT, both of which need just a handful of initial feature seeds to bootstrap opinion feature extraction. We benchmarked LRTBOOT and LSABOOT against existing approaches on a large number of real-life reviews crawled from the cellphone and hotel domains. Experimental results using varying number of feature seeds show that the proposed association-based bootstrapping approach significantly outperforms the competitors. In fact, one seed feature is all that is needed for LRTBOOT to significantly outperform the other methods. This seed feature can simply be the domain feature, e.g., "cellphone" or "hotel". The consequence of our discovery is far reaching: starting with just one feature seed, typically just the domain concept word, LRTBOOT can automatically extract a large set of high-quality opinion features from the corpus without any supervision or labeled features. This means that the automatic creation of a set of domain features is no longer a pipe dream! 2013-07-25T07:53:31Z 2019-12-06T19:49:55Z 2013-07-25T07:53:31Z 2019-12-06T19:49:55Z 2012 2012 Conference Paper Hai, Z., Chang, K., & Cong, G. (2012). One seed to find them all: mining opinion features via association. Proceedings of the 21st ACM international conference on Information and knowledge management. https://hdl.handle.net/10356/98037 http://hdl.handle.net/10220/12292 10.1145/2396761.2396797 en © 2012 ACM.
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
description Feature-based opinion analysis has attracted extensive attention recently. Identifying features associated with opinions expressed in reviews is essential for fine-grained opinion mining. One approach is to exploit the dependency relations that occur naturally between features and opinion words, and among features (or opinion words) themselves. In this paper, we propose a generalized approach to opinion feature extraction by incorporating robust statistical association analysis in a bootstrapping framework. The new approach starts with a small set of feature seeds, on which it iteratively enlarges by mining feature-opinion, feature-feature, and opinion-opinion dependency relations. Two association model types, namely likelihood ratio tests (LRT) and latent semantic analysis (LSA), are proposed for computing the pair-wise associations between terms (features or opinions). We accordingly propose two robust bootstrapping approaches, LRTBOOT and LSABOOT, both of which need just a handful of initial feature seeds to bootstrap opinion feature extraction. We benchmarked LRTBOOT and LSABOOT against existing approaches on a large number of real-life reviews crawled from the cellphone and hotel domains. Experimental results using varying number of feature seeds show that the proposed association-based bootstrapping approach significantly outperforms the competitors. In fact, one seed feature is all that is needed for LRTBOOT to significantly outperform the other methods. This seed feature can simply be the domain feature, e.g., "cellphone" or "hotel". The consequence of our discovery is far reaching: starting with just one feature seed, typically just the domain concept word, LRTBOOT can automatically extract a large set of high-quality opinion features from the corpus without any supervision or labeled features. This means that the automatic creation of a set of domain features is no longer a pipe dream!
author2 School of Computer Engineering
author_facet School of Computer Engineering
Hai, Zhen
Chang, Kuiyu
Cong, Gao
format Conference or Workshop Item
author Hai, Zhen
Chang, Kuiyu
Cong, Gao
spellingShingle Hai, Zhen
Chang, Kuiyu
Cong, Gao
One seed to find them all : mining opinion features via association
author_sort Hai, Zhen
title One seed to find them all : mining opinion features via association
title_short One seed to find them all : mining opinion features via association
title_full One seed to find them all : mining opinion features via association
title_fullStr One seed to find them all : mining opinion features via association
title_full_unstemmed One seed to find them all : mining opinion features via association
title_sort one seed to find them all : mining opinion features via association
publishDate 2013
url https://hdl.handle.net/10356/98037
http://hdl.handle.net/10220/12292
_version_ 1681057076138213376