DeepBindPoc : a deep learning method to rank ligand binding pockets using molecular vector representation

Accurate identification of ligand-binding pockets in a protein is important for structure-based drug design. In recent years, several deep learning models were developed to learn important physical-chemical and spatial information to predict ligand-binding pockets in a protein. However, ranking the...

Full description

Saved in:
Bibliographic Details
Main Authors: Zhang, Haiping, Saravanan, Konda Mani, Lin, Jinzhi, Liao, Linbu, Ng, Justin Tze-Yang, Zhou, Jiaxiu, Wei, Yanjie
Other Authors: School of Biological Sciences
Format: Article
Language:English
Published: 2020
Subjects:
Online Access:https://hdl.handle.net/10356/145360
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-145360
record_format dspace
spelling sg-ntu-dr.10356-1453602023-02-28T16:56:57Z DeepBindPoc : a deep learning method to rank ligand binding pockets using molecular vector representation Zhang, Haiping Saravanan, Konda Mani Lin, Jinzhi Liao, Linbu Ng, Justin Tze-Yang Zhou, Jiaxiu Wei, Yanjie School of Biological Sciences Science::Biological sciences Ligand Pocket Identification Deep Neural Network Accurate identification of ligand-binding pockets in a protein is important for structure-based drug design. In recent years, several deep learning models were developed to learn important physical-chemical and spatial information to predict ligand-binding pockets in a protein. However, ranking the native ligand binding pockets from a pool of predicted pockets is still a hard task for computational molecular biologists using a single web-based tool. Hence, we believe, by using closer to real application data set as training and by providing ligand information, an enhanced model to identify accurate pockets can be obtained. In this article, we propose a new deep learning method called DeepBindPoc for identifying and ranking ligand-binding pockets in proteins. The model is built by using information about the binding pocket and associated ligand. We take advantage of the mol2vec tool to represent both the given ligand and pocket as vectors to construct a densely fully connected layer model. During the training, important features for pocket-ligand binding are automatically extracted and high-level information is preserved appropriately. DeepBindPoc demonstrated a strong complementary advantage for the detection of native-like pockets when combined with traditional popular methods, such as fpocket and P2Rank. The proposed method is extensively tested and validated with standard procedures on multiple datasets, including a dataset with G-protein Coupled receptors. The systematic testing and validation of our method suggest that DeepBindPoc is a valuable tool to rank near-native pockets for theoretically modeled protein with unknown experimental active site but have known ligand. The DeepBindPoc model described in this article is available at GitHub (https://github.com/haiping1010/DeepBindPoc) and the webserver is available at (http://cbblab.siat.ac.cn/DeepBindPoc/index.php). Published version 2020-12-18T04:32:10Z 2020-12-18T04:32:10Z 2020 Journal Article Zhang, H., Saravanan, K. M., Lin. J., Liao, L., Ng, J. T.-Y., Zhou, J., & Wei, Y. (2020). DeepBindPoc : a deep learning method to rank ligand binding pockets using molecular vector representation. PeerJ, 8, e8864-. doi:10.7717/peerj.8864 2167-8359 https://hdl.handle.net/10356/145360 10.7717/peerj.8864 32292649 8 en PeerJ © 2020 Zhang et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Science::Biological sciences
Ligand Pocket Identification
Deep Neural Network
spellingShingle Science::Biological sciences
Ligand Pocket Identification
Deep Neural Network
Zhang, Haiping
Saravanan, Konda Mani
Lin, Jinzhi
Liao, Linbu
Ng, Justin Tze-Yang
Zhou, Jiaxiu
Wei, Yanjie
DeepBindPoc : a deep learning method to rank ligand binding pockets using molecular vector representation
description Accurate identification of ligand-binding pockets in a protein is important for structure-based drug design. In recent years, several deep learning models were developed to learn important physical-chemical and spatial information to predict ligand-binding pockets in a protein. However, ranking the native ligand binding pockets from a pool of predicted pockets is still a hard task for computational molecular biologists using a single web-based tool. Hence, we believe, by using closer to real application data set as training and by providing ligand information, an enhanced model to identify accurate pockets can be obtained. In this article, we propose a new deep learning method called DeepBindPoc for identifying and ranking ligand-binding pockets in proteins. The model is built by using information about the binding pocket and associated ligand. We take advantage of the mol2vec tool to represent both the given ligand and pocket as vectors to construct a densely fully connected layer model. During the training, important features for pocket-ligand binding are automatically extracted and high-level information is preserved appropriately. DeepBindPoc demonstrated a strong complementary advantage for the detection of native-like pockets when combined with traditional popular methods, such as fpocket and P2Rank. The proposed method is extensively tested and validated with standard procedures on multiple datasets, including a dataset with G-protein Coupled receptors. The systematic testing and validation of our method suggest that DeepBindPoc is a valuable tool to rank near-native pockets for theoretically modeled protein with unknown experimental active site but have known ligand. The DeepBindPoc model described in this article is available at GitHub (https://github.com/haiping1010/DeepBindPoc) and the webserver is available at (http://cbblab.siat.ac.cn/DeepBindPoc/index.php).
author2 School of Biological Sciences
author_facet School of Biological Sciences
Zhang, Haiping
Saravanan, Konda Mani
Lin, Jinzhi
Liao, Linbu
Ng, Justin Tze-Yang
Zhou, Jiaxiu
Wei, Yanjie
format Article
author Zhang, Haiping
Saravanan, Konda Mani
Lin, Jinzhi
Liao, Linbu
Ng, Justin Tze-Yang
Zhou, Jiaxiu
Wei, Yanjie
author_sort Zhang, Haiping
title DeepBindPoc : a deep learning method to rank ligand binding pockets using molecular vector representation
title_short DeepBindPoc : a deep learning method to rank ligand binding pockets using molecular vector representation
title_full DeepBindPoc : a deep learning method to rank ligand binding pockets using molecular vector representation
title_fullStr DeepBindPoc : a deep learning method to rank ligand binding pockets using molecular vector representation
title_full_unstemmed DeepBindPoc : a deep learning method to rank ligand binding pockets using molecular vector representation
title_sort deepbindpoc : a deep learning method to rank ligand binding pockets using molecular vector representation
publishDate 2020
url https://hdl.handle.net/10356/145360
_version_ 1759855427867115520