Aspect extraction from product reviews using category hierarchy information

Aspect extraction is a task to abstract the common properties of objects from corpora discussing them, such as reviews of products. Recent work on aspect extraction is leveraging the hierarchical relationship between products and their categories. However, such effort focuses on the aspects of child...

Full description

Saved in:
Bibliographic Details
Main Authors: YANG, Yifeng, CHEN CEN, QIU, Minghui, BAO, Forrest Sheng
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2017
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/3803
https://ink.library.smu.edu.sg/context/sis_research/article/4805/viewcontent/E17_2107.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-4805
record_format dspace
spelling sg-smu-ink.sis_research-48052017-10-30T05:49:34Z Aspect extraction from product reviews using category hierarchy information YANG, Yifeng CHEN CEN, QIU, Minghui BAO, Forrest Sheng Aspect extraction is a task to abstract the common properties of objects from corpora discussing them, such as reviews of products. Recent work on aspect extraction is leveraging the hierarchical relationship between products and their categories. However, such effort focuses on the aspects of child categories but ignores those from parent categories. Hence, we propose an LDA-based generative topic model inducing the two-layer categorical information (CAT-LDA), to balance the aspects of both a parent category and its child categories. Our hypothesis is that child categories inherit aspects from parent categories, controlled by the hierarchy between them. Experimental results on 5 categories of Amazon.com products show that both common aspects of parent category and the individual aspects of subcategories can be extracted to align well with the common sense. We further evaluate the manually extracted aspects of 16 products, resulting in an average hit rate of 79.10%. 2017-04-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/3803 info:doi/10.18653/v1/E17-2107 https://ink.library.smu.edu.sg/context/sis_research/article/4805/viewcontent/E17_2107.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Computational linguistics Linguistics Computational Engineering Databases and Information Systems
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Computational linguistics
Linguistics
Computational Engineering
Databases and Information Systems
spellingShingle Computational linguistics
Linguistics
Computational Engineering
Databases and Information Systems
YANG, Yifeng
CHEN CEN,
QIU, Minghui
BAO, Forrest Sheng
Aspect extraction from product reviews using category hierarchy information
description Aspect extraction is a task to abstract the common properties of objects from corpora discussing them, such as reviews of products. Recent work on aspect extraction is leveraging the hierarchical relationship between products and their categories. However, such effort focuses on the aspects of child categories but ignores those from parent categories. Hence, we propose an LDA-based generative topic model inducing the two-layer categorical information (CAT-LDA), to balance the aspects of both a parent category and its child categories. Our hypothesis is that child categories inherit aspects from parent categories, controlled by the hierarchy between them. Experimental results on 5 categories of Amazon.com products show that both common aspects of parent category and the individual aspects of subcategories can be extracted to align well with the common sense. We further evaluate the manually extracted aspects of 16 products, resulting in an average hit rate of 79.10%.
format text
author YANG, Yifeng
CHEN CEN,
QIU, Minghui
BAO, Forrest Sheng
author_facet YANG, Yifeng
CHEN CEN,
QIU, Minghui
BAO, Forrest Sheng
author_sort YANG, Yifeng
title Aspect extraction from product reviews using category hierarchy information
title_short Aspect extraction from product reviews using category hierarchy information
title_full Aspect extraction from product reviews using category hierarchy information
title_fullStr Aspect extraction from product reviews using category hierarchy information
title_full_unstemmed Aspect extraction from product reviews using category hierarchy information
title_sort aspect extraction from product reviews using category hierarchy information
publisher Institutional Knowledge at Singapore Management University
publishDate 2017
url https://ink.library.smu.edu.sg/sis_research/3803
https://ink.library.smu.edu.sg/context/sis_research/article/4805/viewcontent/E17_2107.pdf
_version_ 1770573764407328768