Topic-driven reader comments summarization
Readers of a news article often read the comments contributed by other readers. From comments, readers obtain not only information complementary to the news article but also the opinions of other readers. However, existing ranking mechanisms for comments (e.g., by recency or by user rating) fail to offer an overall picture of the topics discussed in the comments. In this paper, we propose to study the Topic-driven Reader Comments Summarization (Torcs) problem. We observe that many news articles in a news stream are related to each other, and so are their comments; hence, news articles and their associated comments provide context information for user commenting. To implicitly capture this context information, we propose two topic models to address the Torcs problem: the Master-Slave Topic Model (MSTM) and the Extended Master-Slave Topic Model (EXTM). Both models treat a news article as a master document and each of its comments as a slave document. MSTM constrains the topics discussed in comments to be derived from the commented news article, whereas EXTM allows the words of comments to be generated both from topics derived from the commented news article and from topics derived from the comments themselves. Both models are used to group comments into topic clusters. We then use two ranking mechanisms, Maximal Marginal Relevance (MMR) and Rating & Length (RL), to select the most representative comments from each cluster. To evaluate the two models, we conducted experiments on 1,005 Yahoo! News articles with more than one million comments. Our experimental results show that EXTM significantly outperforms MSTM in perplexity. Through a user study, we also confirm that the comment summaries generated by EXTM achieve better intra-cluster topic cohesion and inter-cluster topic diversity.
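The selection step named in the abstract, Maximal Marginal Relevance over each topic cluster, can be sketched as follows. This is a minimal illustration, not the authors' implementation: the Jaccard word-overlap similarity and the lambda trade-off value are assumptions made for the sketch.

```python
# Sketch of MMR-based comment selection from one topic cluster.
# Assumptions (not from the paper): Jaccard word overlap as the
# similarity function, and an illustrative lambda weight.

def jaccard(a, b):
    """Word-overlap similarity between two texts (illustrative choice)."""
    wa, wb = set(a.lower().split()), set(b.lower().split())
    return len(wa & wb) / len(wa | wb) if wa | wb else 0.0

def mmr_select(comments, topic_repr, k=3, lam=0.7):
    """Pick k comments that are relevant to the cluster's topic
    representation yet not redundant with comments already selected."""
    selected = []
    candidates = list(comments)
    while candidates and len(selected) < k:
        def score(c):
            relevance = jaccard(c, topic_repr)
            redundancy = max((jaccard(c, s) for s in selected), default=0.0)
            return lam * relevance - (1 - lam) * redundancy
        best = max(candidates, key=score)
        selected.append(best)
        candidates.remove(best)
    return selected
```

Lowering `lam` penalizes redundancy more heavily, so a near-duplicate of an already-selected comment loses to a less relevant but novel one.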
Saved in:
Main Authors: | Ma, Zongyang; Sun, Aixin; Yuan, Quan; Cong, Gao |
---|---|
Other Authors: | School of Computer Engineering |
Format: | Conference or Workshop Item |
Language: | English |
Published: | 2013 |
Online Access: | https://hdl.handle.net/10356/97966 http://hdl.handle.net/10220/12257 |
Institution: | Nanyang Technological University |
id | sg-ntu-dr.10356-97966 |
---|---|
record_format | dspace |
conference | 21st ACM International Conference on Information and Knowledge Management (2012, Maui, USA) |
authors | Ma, Zongyang; Sun, Aixin; Yuan, Quan; Cong, Gao (School of Computer Engineering) |
citation | Ma, Z., Sun, A., Yuan, Q., & Cong, G. (2012). Topic-driven reader comments summarization. Proceedings of the 21st ACM International Conference on Information and Knowledge Management. |
doi | 10.1145/2396761.2396798 |
online_access | https://hdl.handle.net/10356/97966 http://hdl.handle.net/10220/12257 |
rights | © 2012 ACM. |
building | NTU Library |
country | Singapore |
collection | DR-NTU |