JCBIE: a joint continual learning neural network for biomedical information extraction

Extracting knowledge from heterogeneous data sources is fundamental for the construction of structured biomedical knowledge graphs (BKGs), where entities and relations are represented as nodes and edges in the graphs, respectively. Previous biomedical knowledge extraction methods simply considered l...

Full description

Saved in:
Bibliographic Details
Main Authors: He, Kai, Mao, Rui, Gong, Tieliang, Cambria, Erik, Li, Chen
Other Authors: School of Computer Science and Engineering
Format: Article
Language:English
Published: 2023
Subjects:
Online Access:https://hdl.handle.net/10356/169543
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-169543
record_format dspace
spelling sg-ntu-dr.10356-1695432023-07-28T15:35:55Z JCBIE: a joint continual learning neural network for biomedical information extraction He, Kai Mao, Rui Gong, Tieliang Cambria, Erik Li, Chen School of Computer Science and Engineering Engineering::Computer science and engineering Biomedical Information Extraction Continual Learning Extracting knowledge from heterogeneous data sources is fundamental for the construction of structured biomedical knowledge graphs (BKGs), where entities and relations are represented as nodes and edges in the graphs, respectively. Previous biomedical knowledge extraction methods simply considered limited entity types and relations by using a task-specific training set, which is insufficient for large-scale BKGs development and downstream task applications in different scenarios. To alleviate this issue, we propose a joint continual learning biomedical information extraction (JCBIE) network to extract entities and relations from different biomedical information datasets. By empirically studying different joint learning and continual learning strategies, the proposed JCBIE can learn and expand different types of entities and relations from different datasets. JCBIE uses two separated encoders in joint-feature extraction, hence can effectively avoid the feature confusion problem comparing with using one hard-parameter sharing encoder. Specifically, it allows us to adopt entity augmented inputs to establish the interaction between named entity recognition and relation extraction. Finally, a novel evaluation mechanism is proposed for measuring cross-corpus generalization errors, which was ignored by traditional evaluation methods. Our empirical studies show that JCBIE achieves promising performance when continual learning strategy is adopted with multiple corpora. Published version This work has been supported by the Key Research and Development Program of Ningxia Hui Nationality Autonomous Region (2022BEG02025); The Key Research and Development Program of Shaanxi Province (2021GXLH-Z-095); The Innovative Research Group of the National Natural Science Foundation of China (61721002); The consulting research project of the Chinese Academy of Engineering (The Online and Offline Mixed Educational Service System for The Belt and Road Training in MOOC China); Project of China Knowledge Centre for Engineering Science and Technology; The innovation team from the Ministry of Education (IRT_17R86). 2023-07-24T02:33:18Z 2023-07-24T02:33:18Z 2022 Journal Article He, K., Mao, R., Gong, T., Cambria, E. & Li, C. (2022). JCBIE: a joint continual learning neural network for biomedical information extraction. BMC Bioinformatics, 23(1), 549-. https://dx.doi.org/10.1186/s12859-022-05096-w 1471-2105 https://hdl.handle.net/10356/169543 10.1186/s12859-022-05096-w 36536280 2-s2.0-85144324849 1 23 549 en BMC Bioinformatics © The Author(s) 2022. Open Access. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http:// creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publi cdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering
Biomedical Information Extraction
Continual Learning
spellingShingle Engineering::Computer science and engineering
Biomedical Information Extraction
Continual Learning
He, Kai
Mao, Rui
Gong, Tieliang
Cambria, Erik
Li, Chen
JCBIE: a joint continual learning neural network for biomedical information extraction
description Extracting knowledge from heterogeneous data sources is fundamental for the construction of structured biomedical knowledge graphs (BKGs), where entities and relations are represented as nodes and edges in the graphs, respectively. Previous biomedical knowledge extraction methods simply considered limited entity types and relations by using a task-specific training set, which is insufficient for large-scale BKGs development and downstream task applications in different scenarios. To alleviate this issue, we propose a joint continual learning biomedical information extraction (JCBIE) network to extract entities and relations from different biomedical information datasets. By empirically studying different joint learning and continual learning strategies, the proposed JCBIE can learn and expand different types of entities and relations from different datasets. JCBIE uses two separated encoders in joint-feature extraction, hence can effectively avoid the feature confusion problem comparing with using one hard-parameter sharing encoder. Specifically, it allows us to adopt entity augmented inputs to establish the interaction between named entity recognition and relation extraction. Finally, a novel evaluation mechanism is proposed for measuring cross-corpus generalization errors, which was ignored by traditional evaluation methods. Our empirical studies show that JCBIE achieves promising performance when continual learning strategy is adopted with multiple corpora.
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
He, Kai
Mao, Rui
Gong, Tieliang
Cambria, Erik
Li, Chen
format Article
author He, Kai
Mao, Rui
Gong, Tieliang
Cambria, Erik
Li, Chen
author_sort He, Kai
title JCBIE: a joint continual learning neural network for biomedical information extraction
title_short JCBIE: a joint continual learning neural network for biomedical information extraction
title_full JCBIE: a joint continual learning neural network for biomedical information extraction
title_fullStr JCBIE: a joint continual learning neural network for biomedical information extraction
title_full_unstemmed JCBIE: a joint continual learning neural network for biomedical information extraction
title_sort jcbie: a joint continual learning neural network for biomedical information extraction
publishDate 2023
url https://hdl.handle.net/10356/169543
_version_ 1773551305440100352