Adapting document similarity measures for ligand-based virtual screening
Quantifying molecular similarity is one of the major tasks in virtual screening. Many similarity measures have been proposed for this purpose, some of them derived from document and text retrieval, as these similarity methods most often give good res...
Main Authors: | Himmat, Mubarak; Salim, Naomie; Al-Dabbagh, Mohammed Mumtaz; Saeed, Faisal; Ahmed, Ali |
---|---|
Format: | Article |
Published: | 2016 |
Subjects: | QA75 Electronic computers. Computer science |
Online Access: | http://eprints.utm.my/id/eprint/68740/ http://dx.doi.org/10.3390/molecules21040476 |
Institution: | Universiti Teknologi Malaysia |
id |
my.utm.68740 |
---|---|
record_format |
eprints |
spelling |
my.utm.68740 2017-11-20T08:52:13Z http://eprints.utm.my/id/eprint/68740/ 2016 Article PeerReviewed Himmat, Mubarak and Salim, Naomie and Al-Dabbagh, Mohammed Mumtaz and Saeed, Faisal and Ahmed, Ali (2016) Adapting document similarity measures for ligand-based virtual screening. Molecules, 21 (4). pp. 1-13.
http://dx.doi.org/10.3390/molecules21040476 DOI:10.3390/molecules21040476 |
institution |
Universiti Teknologi Malaysia |
building |
UTM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Teknologi Malaysia |
content_source |
UTM Institutional Repository |
url_provider |
http://eprints.utm.my/ |
topic |
QA75 Electronic computers. Computer science |
description |
Quantifying molecular similarity is one of the major tasks in virtual screening. Many similarity measures have been proposed for this purpose, some of them derived from document and text retrieval, because measures that perform well in document retrieval often also perform well in virtual screening. In this work, we propose a similarity measure for ligand-based virtual screening that is derived from a text-processing similarity measure and adapted to suit virtual screening; we call it the Adapted Similarity Measure of Text Processing (ASMTP). To evaluate ASMTP, we conducted several experiments on two benchmark datasets: the Maximum Unbiased Validation (MUV) and the MDL Drug Data Report (MDDR). In each experiment, 10 reference structures were chosen at random from each class as queries, and recall was evaluated at the 1% and 5% cut-offs. The results were compared with those of other similarity methods, including the Tanimoto coefficient, which is considered the conventional, standard similarity coefficient for fingerprint-based similarity calculations. The results show that ASMTP improves the performance of ligand-based virtual screening and outperforms the Tanimoto coefficient and the other methods. |
format |
Article |
author |
Himmat, Mubarak; Salim, Naomie; Al-Dabbagh, Mohammed Mumtaz; Saeed, Faisal; Ahmed, Ali |
author_sort |
Himmat, Mubarak |
title |
Adapting document similarity measures for ligand-based virtual screening |
publishDate |
2016 |
url |
http://eprints.utm.my/id/eprint/68740/ http://dx.doi.org/10.3390/molecules21040476 |
_version_ |
1643655966438195200 |
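
The evaluation protocol summarized in the description (rank a library against a query, then measure recall at the 1% and 5% cut-offs) can be sketched as follows. This is only an illustration of the Tanimoto baseline and the recall metric; the ASMTP formula itself is not given in this record and is not reproduced here. The function names, the set-of-on-bits fingerprint representation, and the toy data are all assumptions made for the example.

```python
def tanimoto(a: set[int], b: set[int]) -> float:
    """Tanimoto coefficient of two binary fingerprints given as sets of on-bit indices."""
    common = len(a & b)
    union = len(a) + len(b) - common
    return common / union if union else 0.0


def recall_at_cutoff(query: set[int], library: dict[str, set[int]],
                     actives: set[str], percent: int) -> float:
    """Share of the known actives found in the top `percent`% of the ranked library."""
    # Rank library molecules by decreasing similarity to the query.
    ranked = sorted(library, key=lambda mol: tanimoto(query, library[mol]), reverse=True)
    top_n = max(1, len(ranked) * percent // 100)
    hits = sum(1 for mol in ranked[:top_n] if mol in actives)
    return hits / len(actives)


# Hypothetical toy data: 100 molecules, each with three consecutive on-bits.
library = {f"m{i}": {i, i + 1, i + 2} for i in range(100)}
query = {0, 1, 2, 3}
actives = {"m0", "m1", "m2", "m50"}

recall_1 = recall_at_cutoff(query, library, actives, 1)
recall_5 = recall_at_cutoff(query, library, actives, 5)
print(recall_1, recall_5)
```

In the study described above, such recall values would be collected for 10 queries per activity class; the sketch handles a single query, and another similarity measure could be swapped in for `tanimoto` to compare rankings.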