Extracting causal relations on HIV drug resistance from literature

In HIV treatment it is critical to have up-to-date resistance data of applicable drugs since HIV has a very high rate of mutation. These data are made available through scientific publications and must be extracted manually by experts in order to be used by virologists and medical doctors. Therefore...

Full description

Saved in:
Bibliographic Details
Main Authors: Bui, Quoc-Chinh, Nuallain, Breanndan O., Boucher, Charles A. B., Sloot, Peter M. A.
Other Authors: School of Computer Engineering
Format: Article
Language:English
Published: 2013
Subjects:
Online Access:https://hdl.handle.net/10356/96395
http://hdl.handle.net/10220/9899
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-96395
record_format dspace
spelling sg-ntu-dr.10356-963952022-02-16T16:29:58Z Extracting causal relations on HIV drug resistance from literature Bui, Quoc-Chinh Nuallain, Breanndan O. Boucher, Charles A. B. Sloot, Peter M. A. School of Computer Engineering DRNTU::Engineering::Computer science and engineering In HIV treatment it is critical to have up-to-date resistance data of applicable drugs since HIV has a very high rate of mutation. These data are made available through scientific publications and must be extracted manually by experts in order to be used by virologists and medical doctors. Therefore there is an urgent need for a tool that partially automates this process and is able to retrieve relations between drugs and virus mutations from literature. Results In this work we present a novel method to extract and combine relationships between HIV drugs and mutations in viral genomes. Our extraction method is based on natural language processing (NLP) which produces grammatical relations and applies a set of rules to these relations. We applied our method to a relevant set of PubMed abstracts and obtained 2,434 extracted relations with an estimated performance of 84% for F-score. We then combined the extracted relations using logistic regression to generate resistance values for each <drug, mutation> pair. The results of this relation combination show more than 85% agreement with the Stanford HIVDB for the ten most frequently occurring mutations. The system is used in 5 hospitals from the Virolab project http://www.virolab.org webcite to preselect the most relevant novel resistance data from literature and present those to virologists and medical doctors for further evaluation. Conclusions The proposed relation extraction and combination method has a good performance on extracting HIV drug resistance data. It can be used in large-scale relation extraction experiments. The developed methods can also be applied to extract other type of relations such as gene-protein, gene-disease, and disease-mutation. Published version 2013-05-07T07:46:09Z 2019-12-06T19:29:55Z 2013-05-07T07:46:09Z 2019-12-06T19:29:55Z 2010 2010 Journal Article Bui, Q. C., Nuallain, B. O., Boucher, C. A., & Sloot, P. M. (2010). Extracting causal relations on HIV drug resistance from literature. BMC Bioinformatics, 11(1), 101. 1471-2105 https://hdl.handle.net/10356/96395 http://hdl.handle.net/10220/9899 10.1186/1471-2105-11-101 20178611 en BMC bioinformatics © 2010 Bui et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License(http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering
spellingShingle DRNTU::Engineering::Computer science and engineering
Bui, Quoc-Chinh
Nuallain, Breanndan O.
Boucher, Charles A. B.
Sloot, Peter M. A.
Extracting causal relations on HIV drug resistance from literature
description In HIV treatment it is critical to have up-to-date resistance data of applicable drugs since HIV has a very high rate of mutation. These data are made available through scientific publications and must be extracted manually by experts in order to be used by virologists and medical doctors. Therefore there is an urgent need for a tool that partially automates this process and is able to retrieve relations between drugs and virus mutations from literature. Results In this work we present a novel method to extract and combine relationships between HIV drugs and mutations in viral genomes. Our extraction method is based on natural language processing (NLP) which produces grammatical relations and applies a set of rules to these relations. We applied our method to a relevant set of PubMed abstracts and obtained 2,434 extracted relations with an estimated performance of 84% for F-score. We then combined the extracted relations using logistic regression to generate resistance values for each <drug, mutation> pair. The results of this relation combination show more than 85% agreement with the Stanford HIVDB for the ten most frequently occurring mutations. The system is used in 5 hospitals from the Virolab project http://www.virolab.org webcite to preselect the most relevant novel resistance data from literature and present those to virologists and medical doctors for further evaluation. Conclusions The proposed relation extraction and combination method has a good performance on extracting HIV drug resistance data. It can be used in large-scale relation extraction experiments. The developed methods can also be applied to extract other type of relations such as gene-protein, gene-disease, and disease-mutation.
author2 School of Computer Engineering
author_facet School of Computer Engineering
Bui, Quoc-Chinh
Nuallain, Breanndan O.
Boucher, Charles A. B.
Sloot, Peter M. A.
format Article
author Bui, Quoc-Chinh
Nuallain, Breanndan O.
Boucher, Charles A. B.
Sloot, Peter M. A.
author_sort Bui, Quoc-Chinh
title Extracting causal relations on HIV drug resistance from literature
title_short Extracting causal relations on HIV drug resistance from literature
title_full Extracting causal relations on HIV drug resistance from literature
title_fullStr Extracting causal relations on HIV drug resistance from literature
title_full_unstemmed Extracting causal relations on HIV drug resistance from literature
title_sort extracting causal relations on hiv drug resistance from literature
publishDate 2013
url https://hdl.handle.net/10356/96395
http://hdl.handle.net/10220/9899
_version_ 1725985544191606784