Representative entry selection for profiling blogs
Many applications on blog search and mining often meet the challenge of handling huge volume of blog data, in which one single blog could contain hundreds or even thousands of entries. We investigate novel techniques for profiling blogs by selecting a subset of representative entries for each blog....
Saved in:
Main Authors: | , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2008
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/2382 https://ink.library.smu.edu.sg/context/sis_research/article/3382/viewcontent/p1387_zhuang.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-3382 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-33822018-12-05T02:54:51Z Representative entry selection for profiling blogs ZHUANG, Jinfeng HOI, Steven C. H. SUN, Aixin JIN, Rong Many applications on blog search and mining often meet the challenge of handling huge volume of blog data, in which one single blog could contain hundreds or even thousands of entries. We investigate novel techniques for profiling blogs by selecting a subset of representative entries for each blog. We propose two principles for guiding the entry selection task: representativeness and diversity. Further, we formulate the entry selection task into a combinatorial optimization problem and propose a greedy yet effective algorithm for finding a good approximate solution by exploiting the theory of submodular functions. We suggest blog classification for judging the performance of the proposed entry selection techniques and evaluate their performance on a real blog dataset, in which encouraging results were obtained. 2008-10-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/2382 info:doi/10.1145/1458082.1458293 https://ink.library.smu.edu.sg/context/sis_research/article/3382/viewcontent/p1387_zhuang.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Blog profiling Blog classification Entry selection Computer Sciences Databases and Information Systems |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Blog profiling Blog classification Entry selection Computer Sciences Databases and Information Systems |
spellingShingle |
Blog profiling Blog classification Entry selection Computer Sciences Databases and Information Systems ZHUANG, Jinfeng HOI, Steven C. H. SUN, Aixin JIN, Rong Representative entry selection for profiling blogs |
description |
Many applications on blog search and mining often meet the challenge of handling huge volume of blog data, in which one single blog could contain hundreds or even thousands of entries. We investigate novel techniques for profiling blogs by selecting a subset of representative entries for each blog. We propose two principles for guiding the entry selection task: representativeness and diversity. Further, we formulate the entry selection task into a combinatorial optimization problem and propose a greedy yet effective algorithm for finding a good approximate solution by exploiting the theory of submodular functions. We suggest blog classification for judging the performance of the proposed entry selection techniques and evaluate their performance on a real blog dataset, in which encouraging results were obtained. |
format |
text |
author |
ZHUANG, Jinfeng HOI, Steven C. H. SUN, Aixin JIN, Rong |
author_facet |
ZHUANG, Jinfeng HOI, Steven C. H. SUN, Aixin JIN, Rong |
author_sort |
ZHUANG, Jinfeng |
title |
Representative entry selection for profiling blogs |
title_short |
Representative entry selection for profiling blogs |
title_full |
Representative entry selection for profiling blogs |
title_fullStr |
Representative entry selection for profiling blogs |
title_full_unstemmed |
Representative entry selection for profiling blogs |
title_sort |
representative entry selection for profiling blogs |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2008 |
url |
https://ink.library.smu.edu.sg/sis_research/2382 https://ink.library.smu.edu.sg/context/sis_research/article/3382/viewcontent/p1387_zhuang.pdf |
_version_ |
1770572129345994752 |