A hybrid approach to extract protein-protein interactions

Motivation: Protein–protein interactions (PPIs) play an important role in understanding biological processes. Although recent research in text mining has achieved a signiﬁcant progress in automatic PPI extraction from literature, performance of existing systems still needs to be improved. Results: I...

Full description

Saved in:

Bibliographic Details
Main Authors:	Sloot, Peter M. A., Bui, Quoc-Chinh, Katrenko, Sophia
Other Authors:	School of Computer Engineering
Format:	Article
Language:	English
Published:	2014
Subjects:	DRNTU::Engineering::Computer science and engineering::Computer applications::Life and medical sciences
Online Access:	https://hdl.handle.net/10356/96394 http://hdl.handle.net/10220/18848
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

Description
Summary:	Motivation: Protein–protein interactions (PPIs) play an important role in understanding biological processes. Although recent research in text mining has achieved a signiﬁcant progress in automatic PPI extraction from literature, performance of existing systems still needs to be improved. Results: In this study, we propose a novel algorithm for extracting PPIs from literature which consists of two phases. First, we automatically categorize the data into subsets based on its semantic properties and extract candidate PPI pairs from these subsets. Second, we apply support vector machines (SVMs) to classify candidate PPI pairs using features speciﬁc for each subset. We obtain promising results on ﬁve benchmark datasets: AIMed, BioInfer, HPRD50, IEPA and LLL with F-scores ranging from 60% to 84%, which are comparable with the state-of-the-art PPI extraction systems. Furthermore, our system achieves the best performance on cross-corpora evaluation and comparative performance in terms of computational efﬁciency. Availability: The source code and scripts used in this article are available for academic use at http://staff.science.uva.nl/∼bui/PPIs.zip

A hybrid approach to extract protein-protein interactions

Similar Items