Evolutionary taxonomy construction from dynamic tag space

Collaborative tagging becomes a common feature of current web sites, facilitating ordinary users to annotate and represent online resources. The large collection of tags and their relationships form a tag space. In this kind of tag space, the popularity and correlation amongst tags capture the curre...

Full description

Saved in:
Bibliographic Details
Main Authors: Yao, Junjie, Cui, Bin, Cong, Gao, Huang, Yuxin
Other Authors: School of Computer Engineering
Format: Article
Language:English
Published: 2013
Subjects:
Online Access:https://hdl.handle.net/10356/97003
http://hdl.handle.net/10220/11688
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
Description
Summary:Collaborative tagging becomes a common feature of current web sites, facilitating ordinary users to annotate and represent online resources. The large collection of tags and their relationships form a tag space. In this kind of tag space, the popularity and correlation amongst tags capture the current social interests. Tags are freely chosen keywords and difficult to organize. As a hierarchical concept structure to represent the subsumption relationships, automatically extracted taxonomies become a viable method to manage collaborative tags. However, tags change over time, and it is also imperative to incorporate the temporal tag evolution into the extracted taxonomies. In this paper, we formalize the problem of evolutionary taxonomy generation over a large collection of tags. A line of taxonomies are generated to reflect the temporal changes of underlying tag space. The proposed evolutionary taxonomy framework consists of two novel contributions. First, we develop a context-aware edge selection algorithm for taxonomy extraction. This method is built on seminal association-rule mining algorithm. Second, we propose several strategies for evolutionary taxonomy fusion, which smooths the newly generated taxonomy with prior ones. We conduct an extensive performance study using a large real-life web page tagging dataset (i.e., Del.ici.ous). The empirical results clearly verify the effectiveness and efficiency of the proposed approach.