Feature reduction for molecular similarity searching based on autoencoder deep learning
The concept of molecular similarity has been commonly used in rational drug design, where structurally similar molecules are examined in molecular databases to retrieve functionally similar molecules. The most used conventional similarity methods used two-dimensional (2D) fingerprints to evaluate th...
Saved in:
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI
2022
|
Subjects: | |
Online Access: | http://eprints.utm.my/id/eprint/101240/1/NaomieSalim2022_FeatureReductionforMolecularSimilaritySearching.pdf http://eprints.utm.my/id/eprint/101240/ http://dx.doi.org/10.3390/biom12040508 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Teknologi Malaysia |
Language: | English |
id |
my.utm.101240 |
---|---|
record_format |
eprints |
spelling |
my.utm.1012402023-06-01T10:07:25Z http://eprints.utm.my/id/eprint/101240/ Feature reduction for molecular similarity searching based on autoencoder deep learning Nasser, Maged Salim, Naomie Saeed, Faisal Basurra, Shadi Rabiu, Idris Hamza, Hentabli Alsoufi, Muaadh A. QA75 Electronic computers. Computer science The concept of molecular similarity has been commonly used in rational drug design, where structurally similar molecules are examined in molecular databases to retrieve functionally similar molecules. The most used conventional similarity methods used two-dimensional (2D) fingerprints to evaluate the similarity of molecules towards a target query. However, these descriptors include redundant and irrelevant features that might impact the performance of similarity searching methods. Thus, this study proposed a new approach for identifying the important features of molecules in chemical datasets based on the representation of the molecular features using Autoencoder (AE), with the aim of removing irrelevant and redundant features. The proposed approach experimented using the MDL Data Drug Report standard dataset (MDDR). Based on experimental findings, the proposed approach performed better than several existing benchmark similarity methods such as Tanimoto Similarity Method (TAN), Adapted Similarity Measure of Text Processing (ASMTP), and Quantum-Based Similarity Method (SQB). The results demonstrated that the performance achieved by the proposed approach has proven to be superior, particularly with the use of structurally heterogeneous datasets, where it yielded improved results compared to other previously used methods with the similar goal of improving molecular similarity searching. MDPI 2022 Article PeerReviewed application/pdf en http://eprints.utm.my/id/eprint/101240/1/NaomieSalim2022_FeatureReductionforMolecularSimilaritySearching.pdf Nasser, Maged and Salim, Naomie and Saeed, Faisal and Basurra, Shadi and Rabiu, Idris and Hamza, Hentabli and Alsoufi, Muaadh A. (2022) Feature reduction for molecular similarity searching based on autoencoder deep learning. Biomolecules, 12 (4). pp. 1-23. ISSN 2218-273X http://dx.doi.org/10.3390/biom12040508 DOI : 10.3390/biom12040508 |
institution |
Universiti Teknologi Malaysia |
building |
UTM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Teknologi Malaysia |
content_source |
UTM Institutional Repository |
url_provider |
http://eprints.utm.my/ |
language |
English |
topic |
QA75 Electronic computers. Computer science |
spellingShingle |
QA75 Electronic computers. Computer science Nasser, Maged Salim, Naomie Saeed, Faisal Basurra, Shadi Rabiu, Idris Hamza, Hentabli Alsoufi, Muaadh A. Feature reduction for molecular similarity searching based on autoencoder deep learning |
description |
The concept of molecular similarity has been commonly used in rational drug design, where structurally similar molecules are examined in molecular databases to retrieve functionally similar molecules. The most used conventional similarity methods used two-dimensional (2D) fingerprints to evaluate the similarity of molecules towards a target query. However, these descriptors include redundant and irrelevant features that might impact the performance of similarity searching methods. Thus, this study proposed a new approach for identifying the important features of molecules in chemical datasets based on the representation of the molecular features using Autoencoder (AE), with the aim of removing irrelevant and redundant features. The proposed approach experimented using the MDL Data Drug Report standard dataset (MDDR). Based on experimental findings, the proposed approach performed better than several existing benchmark similarity methods such as Tanimoto Similarity Method (TAN), Adapted Similarity Measure of Text Processing (ASMTP), and Quantum-Based Similarity Method (SQB). The results demonstrated that the performance achieved by the proposed approach has proven to be superior, particularly with the use of structurally heterogeneous datasets, where it yielded improved results compared to other previously used methods with the similar goal of improving molecular similarity searching. |
format |
Article |
author |
Nasser, Maged Salim, Naomie Saeed, Faisal Basurra, Shadi Rabiu, Idris Hamza, Hentabli Alsoufi, Muaadh A. |
author_facet |
Nasser, Maged Salim, Naomie Saeed, Faisal Basurra, Shadi Rabiu, Idris Hamza, Hentabli Alsoufi, Muaadh A. |
author_sort |
Nasser, Maged |
title |
Feature reduction for molecular similarity searching based on autoencoder deep learning |
title_short |
Feature reduction for molecular similarity searching based on autoencoder deep learning |
title_full |
Feature reduction for molecular similarity searching based on autoencoder deep learning |
title_fullStr |
Feature reduction for molecular similarity searching based on autoencoder deep learning |
title_full_unstemmed |
Feature reduction for molecular similarity searching based on autoencoder deep learning |
title_sort |
feature reduction for molecular similarity searching based on autoencoder deep learning |
publisher |
MDPI |
publishDate |
2022 |
url |
http://eprints.utm.my/id/eprint/101240/1/NaomieSalim2022_FeatureReductionforMolecularSimilaritySearching.pdf http://eprints.utm.my/id/eprint/101240/ http://dx.doi.org/10.3390/biom12040508 |
_version_ |
1768006628986060800 |