Adaptive micro- and macro-knowledge incorporation for hierarchical text classification

Hierarchical text classification (HTC) aims to classify a text into multiple categories organized in a hierarchical structure. The state-of-the-art HTC methods usually employ graph networks, where label graphs are constructed and label representation is learned to interact with text representations...

Full description

Saved in:
Bibliographic Details
Main Authors: Feng, Zijian, Mao, Kezhi, Zhou, Hanzhang
Other Authors: School of Electrical and Electronic Engineering
Format: Article
Language:English
Published: 2024
Subjects:
Online Access:https://hdl.handle.net/10356/175778
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-175778
record_format dspace
spelling sg-ntu-dr.10356-1757782024-05-06T08:07:44Z Adaptive micro- and macro-knowledge incorporation for hierarchical text classification Feng, Zijian Mao, Kezhi Zhou, Hanzhang School of Electrical and Electronic Engineering Interdisciplinary Graduate School (IGS) Future Resilient Systems Programme, Singapore-ETH Centre Engineering Hierarchical text classification Knowledge incorporation Hierarchical text classification (HTC) aims to classify a text into multiple categories organized in a hierarchical structure. The state-of-the-art HTC methods usually employ graph networks, where label graphs are constructed and label representation is learned to interact with text representations for classification. In general, label graphs are built on the intrinsic label hierarchy, label semantic similarity, or label co-occurrence. Such graphs have been proven to be effective, but they only exploit knowledge from training data or simple label descriptions, without considering the vast external knowledge in the open sources. Actually, external knowledge from open sources could bring in complementary information to enhance the label graph's representation power. Motivated by the above considerations, we explore the use of external knowledge for improving HTC in this paper. We categorize knowledge into micro-knowledge and macro-knowledge, which are defined as the fundamental concepts related to a single class label and the correlations among class labels, respectively. For tailor-made incorporation of the two types of knowledge into representation learning and classification, we propose Adaptive Micro- and Macro-Knowledge Incorporation for Hierarchical Text Classification (AMKI-HTC) model in this paper. The micro-knowledge incorporation helps capture class-relevant keywords in the text and hence produce discriminative representations, while the macro-knowledge incorporation improves the accuracy of label graphs. Finally, a confidence maximization fusion strategy is developed for adaptive aggregation of multi-view features. Extensive experiments on three benchmark HTC datasets demonstrate that AMKI-HTC consistently outperforms state-of-the-art models. National Research Foundation (NRF) We would like to acknowledge that this project is a product of the Future Resilient Systems initiative at the Singapore-ETH Centre (SEC), and it has been made possible thanks to the support of the National Research Foundation, Prime Minister’s Office, Singapore, through its Campus for Research Excellence and Technological Enterprise (CREATE) programme. 2024-05-06T08:07:44Z 2024-05-06T08:07:44Z 2024 Journal Article Feng, Z., Mao, K. & Zhou, H. (2024). Adaptive micro- and macro-knowledge incorporation for hierarchical text classification. Expert Systems With Applications, 248, 123374-. https://dx.doi.org/10.1016/j.eswa.2024.123374 0957-4174 https://hdl.handle.net/10356/175778 10.1016/j.eswa.2024.123374 2-s2.0-85185280823 248 123374 en CREATE Expert Systems with Applications © 2024 Elsevier Ltd. All rights reserved.
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering
Hierarchical text classification
Knowledge incorporation
spellingShingle Engineering
Hierarchical text classification
Knowledge incorporation
Feng, Zijian
Mao, Kezhi
Zhou, Hanzhang
Adaptive micro- and macro-knowledge incorporation for hierarchical text classification
description Hierarchical text classification (HTC) aims to classify a text into multiple categories organized in a hierarchical structure. The state-of-the-art HTC methods usually employ graph networks, where label graphs are constructed and label representation is learned to interact with text representations for classification. In general, label graphs are built on the intrinsic label hierarchy, label semantic similarity, or label co-occurrence. Such graphs have been proven to be effective, but they only exploit knowledge from training data or simple label descriptions, without considering the vast external knowledge in the open sources. Actually, external knowledge from open sources could bring in complementary information to enhance the label graph's representation power. Motivated by the above considerations, we explore the use of external knowledge for improving HTC in this paper. We categorize knowledge into micro-knowledge and macro-knowledge, which are defined as the fundamental concepts related to a single class label and the correlations among class labels, respectively. For tailor-made incorporation of the two types of knowledge into representation learning and classification, we propose Adaptive Micro- and Macro-Knowledge Incorporation for Hierarchical Text Classification (AMKI-HTC) model in this paper. The micro-knowledge incorporation helps capture class-relevant keywords in the text and hence produce discriminative representations, while the macro-knowledge incorporation improves the accuracy of label graphs. Finally, a confidence maximization fusion strategy is developed for adaptive aggregation of multi-view features. Extensive experiments on three benchmark HTC datasets demonstrate that AMKI-HTC consistently outperforms state-of-the-art models.
author2 School of Electrical and Electronic Engineering
author_facet School of Electrical and Electronic Engineering
Feng, Zijian
Mao, Kezhi
Zhou, Hanzhang
format Article
author Feng, Zijian
Mao, Kezhi
Zhou, Hanzhang
author_sort Feng, Zijian
title Adaptive micro- and macro-knowledge incorporation for hierarchical text classification
title_short Adaptive micro- and macro-knowledge incorporation for hierarchical text classification
title_full Adaptive micro- and macro-knowledge incorporation for hierarchical text classification
title_fullStr Adaptive micro- and macro-knowledge incorporation for hierarchical text classification
title_full_unstemmed Adaptive micro- and macro-knowledge incorporation for hierarchical text classification
title_sort adaptive micro- and macro-knowledge incorporation for hierarchical text classification
publishDate 2024
url https://hdl.handle.net/10356/175778
_version_ 1800916163699408896