JCBIE: a joint continual learning neural network for biomedical information extraction
Extracting knowledge from heterogeneous data sources is fundamental for the construction of structured biomedical knowledge graphs (BKGs), where entities and relations are represented as nodes and edges in the graphs, respectively. Previous biomedical knowledge extraction methods simply considered l...
Saved in:
Main Authors: | , , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2023
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/169543 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-169543 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1695432023-07-28T15:35:55Z JCBIE: a joint continual learning neural network for biomedical information extraction He, Kai Mao, Rui Gong, Tieliang Cambria, Erik Li, Chen School of Computer Science and Engineering Engineering::Computer science and engineering Biomedical Information Extraction Continual Learning Extracting knowledge from heterogeneous data sources is fundamental for the construction of structured biomedical knowledge graphs (BKGs), where entities and relations are represented as nodes and edges in the graphs, respectively. Previous biomedical knowledge extraction methods simply considered limited entity types and relations by using a task-specific training set, which is insufficient for large-scale BKGs development and downstream task applications in different scenarios. To alleviate this issue, we propose a joint continual learning biomedical information extraction (JCBIE) network to extract entities and relations from different biomedical information datasets. By empirically studying different joint learning and continual learning strategies, the proposed JCBIE can learn and expand different types of entities and relations from different datasets. JCBIE uses two separated encoders in joint-feature extraction, hence can effectively avoid the feature confusion problem comparing with using one hard-parameter sharing encoder. Specifically, it allows us to adopt entity augmented inputs to establish the interaction between named entity recognition and relation extraction. Finally, a novel evaluation mechanism is proposed for measuring cross-corpus generalization errors, which was ignored by traditional evaluation methods. Our empirical studies show that JCBIE achieves promising performance when continual learning strategy is adopted with multiple corpora. Published version This work has been supported by the Key Research and Development Program of Ningxia Hui Nationality Autonomous Region (2022BEG02025); The Key Research and Development Program of Shaanxi Province (2021GXLH-Z-095); The Innovative Research Group of the National Natural Science Foundation of China (61721002); The consulting research project of the Chinese Academy of Engineering (The Online and Offline Mixed Educational Service System for The Belt and Road Training in MOOC China); Project of China Knowledge Centre for Engineering Science and Technology; The innovation team from the Ministry of Education (IRT_17R86). 2023-07-24T02:33:18Z 2023-07-24T02:33:18Z 2022 Journal Article He, K., Mao, R., Gong, T., Cambria, E. & Li, C. (2022). JCBIE: a joint continual learning neural network for biomedical information extraction. BMC Bioinformatics, 23(1), 549-. https://dx.doi.org/10.1186/s12859-022-05096-w 1471-2105 https://hdl.handle.net/10356/169543 10.1186/s12859-022-05096-w 36536280 2-s2.0-85144324849 1 23 549 en BMC Bioinformatics © The Author(s) 2022. Open Access. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http:// creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publi cdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering Biomedical Information Extraction Continual Learning |
spellingShingle |
Engineering::Computer science and engineering Biomedical Information Extraction Continual Learning He, Kai Mao, Rui Gong, Tieliang Cambria, Erik Li, Chen JCBIE: a joint continual learning neural network for biomedical information extraction |
description |
Extracting knowledge from heterogeneous data sources is fundamental for the construction of structured biomedical knowledge graphs (BKGs), where entities and relations are represented as nodes and edges in the graphs, respectively. Previous biomedical knowledge extraction methods simply considered limited entity types and relations by using a task-specific training set, which is insufficient for large-scale BKGs development and downstream task applications in different scenarios. To alleviate this issue, we propose a joint continual learning biomedical information extraction (JCBIE) network to extract entities and relations from different biomedical information datasets. By empirically studying different joint learning and continual learning strategies, the proposed JCBIE can learn and expand different types of entities and relations from different datasets. JCBIE uses two separated encoders in joint-feature extraction, hence can effectively avoid the feature confusion problem comparing with using one hard-parameter sharing encoder. Specifically, it allows us to adopt entity augmented inputs to establish the interaction between named entity recognition and relation extraction. Finally, a novel evaluation mechanism is proposed for measuring cross-corpus generalization errors, which was ignored by traditional evaluation methods. Our empirical studies show that JCBIE achieves promising performance when continual learning strategy is adopted with multiple corpora. |
author2 |
School of Computer Science and Engineering |
author_facet |
School of Computer Science and Engineering He, Kai Mao, Rui Gong, Tieliang Cambria, Erik Li, Chen |
format |
Article |
author |
He, Kai Mao, Rui Gong, Tieliang Cambria, Erik Li, Chen |
author_sort |
He, Kai |
title |
JCBIE: a joint continual learning neural network for biomedical information extraction |
title_short |
JCBIE: a joint continual learning neural network for biomedical information extraction |
title_full |
JCBIE: a joint continual learning neural network for biomedical information extraction |
title_fullStr |
JCBIE: a joint continual learning neural network for biomedical information extraction |
title_full_unstemmed |
JCBIE: a joint continual learning neural network for biomedical information extraction |
title_sort |
jcbie: a joint continual learning neural network for biomedical information extraction |
publishDate |
2023 |
url |
https://hdl.handle.net/10356/169543 |
_version_ |
1773551305440100352 |