Application of fuzzy clustering analysis to compound datasets for drug lead identification

Recently, the increasing number of chemical compound datasets to be screened has been growing rapidly due to the fast developments of high-throughput screening in drug discovery. These compound datasets requires compound selection methods which have become one of the main technique in drug discove...

Full description

Saved in:
Bibliographic Details
Main Authors: Sinarwati, Mohamad Suhaili, Mohamad Nazim, Jambli, Abdul Rahman, Mat
Format: Proceeding
Language:English
Published: IEEE 2012
Subjects:
Online Access:http://ir.unimas.my/id/eprint/16367/1/Application%20of%20Fuzzy%20Clustering%20Analysis%20%28abstract%29.pdf
http://ir.unimas.my/id/eprint/16367/
http://ieeexplore.ieee.org/document/6297272/
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Malaysia Sarawak
Language: English
id my.unimas.ir.16367
record_format eprints
spelling my.unimas.ir.163672023-05-29T03:37:44Z http://ir.unimas.my/id/eprint/16367/ Application of fuzzy clustering analysis to compound datasets for drug lead identification Sinarwati, Mohamad Suhaili Mohamad Nazim, Jambli Abdul Rahman, Mat R Medicine (General) Recently, the increasing number of chemical compound datasets to be screened has been growing rapidly due to the fast developments of high-throughput screening in drug discovery. These compound datasets requires compound selection methods which have become one of the main technique in drug discovery especially in drug lead identification process. Thus, finding the best method in compound selection is needed to the pharmaceutical industry to ensure the accurate results of this process. One of most used compound selection method is clusterbased compound selection, which involves subdividing a set of compounds into clusters and choosing one compound or a small number of compounds from each cluster. In this cluster-based compound selection, non-overlapping methods such as Ward's, Group Average, Jarvis Patrick's and K-means are preferred methods to cluster the diverse set of compounds. However, there are little study on overlapping method such as fuzzy cmean (FCM) and fuzzy c-varieties (FCV) clustering algorithms. Therefore, these two clustering algorithms are applied and their performance is compared based on the effectiveness of the clustering results in terms of separation between actives and inactives (Pa) into different clusters and mean intercluster molecular dissimilarity (MIMDS). The analysis shows FCM gives the best results compare to FCV in terms of Pa indicating that FCM has a promising use in compound selection algorithms. But, FCV is perform better than the FCM in term of MIMDS when a higher number of compounds and higher fuzziness index value are concerned. IEEE 2012 Proceeding PeerReviewed text en http://ir.unimas.my/id/eprint/16367/1/Application%20of%20Fuzzy%20Clustering%20Analysis%20%28abstract%29.pdf Sinarwati, Mohamad Suhaili and Mohamad Nazim, Jambli and Abdul Rahman, Mat (2012) Application of fuzzy clustering analysis to compound datasets for drug lead identification. In: 2012 International Conference on Computer & Information Science (ICCIS), 12-14 June 2012, Kuala Lumpur, Malaysia. http://ieeexplore.ieee.org/document/6297272/ DOI: 10.1109/ICCISci.2012.6297272
institution Universiti Malaysia Sarawak
building Centre for Academic Information Services (CAIS)
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaysia Sarawak
content_source UNIMAS Institutional Repository
url_provider http://ir.unimas.my/
language English
topic R Medicine (General)
spellingShingle R Medicine (General)
Sinarwati, Mohamad Suhaili
Mohamad Nazim, Jambli
Abdul Rahman, Mat
Application of fuzzy clustering analysis to compound datasets for drug lead identification
description Recently, the increasing number of chemical compound datasets to be screened has been growing rapidly due to the fast developments of high-throughput screening in drug discovery. These compound datasets requires compound selection methods which have become one of the main technique in drug discovery especially in drug lead identification process. Thus, finding the best method in compound selection is needed to the pharmaceutical industry to ensure the accurate results of this process. One of most used compound selection method is clusterbased compound selection, which involves subdividing a set of compounds into clusters and choosing one compound or a small number of compounds from each cluster. In this cluster-based compound selection, non-overlapping methods such as Ward's, Group Average, Jarvis Patrick's and K-means are preferred methods to cluster the diverse set of compounds. However, there are little study on overlapping method such as fuzzy cmean (FCM) and fuzzy c-varieties (FCV) clustering algorithms. Therefore, these two clustering algorithms are applied and their performance is compared based on the effectiveness of the clustering results in terms of separation between actives and inactives (Pa) into different clusters and mean intercluster molecular dissimilarity (MIMDS). The analysis shows FCM gives the best results compare to FCV in terms of Pa indicating that FCM has a promising use in compound selection algorithms. But, FCV is perform better than the FCM in term of MIMDS when a higher number of compounds and higher fuzziness index value are concerned.
format Proceeding
author Sinarwati, Mohamad Suhaili
Mohamad Nazim, Jambli
Abdul Rahman, Mat
author_facet Sinarwati, Mohamad Suhaili
Mohamad Nazim, Jambli
Abdul Rahman, Mat
author_sort Sinarwati, Mohamad Suhaili
title Application of fuzzy clustering analysis to compound datasets for drug lead identification
title_short Application of fuzzy clustering analysis to compound datasets for drug lead identification
title_full Application of fuzzy clustering analysis to compound datasets for drug lead identification
title_fullStr Application of fuzzy clustering analysis to compound datasets for drug lead identification
title_full_unstemmed Application of fuzzy clustering analysis to compound datasets for drug lead identification
title_sort application of fuzzy clustering analysis to compound datasets for drug lead identification
publisher IEEE
publishDate 2012
url http://ir.unimas.my/id/eprint/16367/1/Application%20of%20Fuzzy%20Clustering%20Analysis%20%28abstract%29.pdf
http://ir.unimas.my/id/eprint/16367/
http://ieeexplore.ieee.org/document/6297272/
_version_ 1767209810344804352