Optimized subtractive clustering for cluster-based compound selection

Compound selection algorithm has become a need to pharmaceutical industry due to the increasing number of chemical compounds to be screened. One of the widely used methods in compound selection is cluster-based selection where the compound datasets are grouped into clusters and representative compou...

Full description

Saved in:

Bibliographic Details
Main Authors:	Kuik, Sok Ping, Salim, Naomie
Format:	Conference or Workshop Item
Language:	English
Published:	2006
Subjects:	TP Chemical technology
Online Access:	http://eprints.utm.my/id/eprint/301/1/KuikSokPing2006_Optimizedsubtractiveclusteringforcluster-based.pdf http://eprints.utm.my/id/eprint/301/
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Universiti Teknologi Malaysia
Language:	English

id	my.utm.301
record_format	eprints
spelling	my.utm.3012017-08-23T08:35:23Z http://eprints.utm.my/id/eprint/301/ Optimized subtractive clustering for cluster-based compound selection Kuik, Sok Ping Salim, Naomie TP Chemical technology Compound selection algorithm has become a need to pharmaceutical industry due to the increasing number of chemical compounds to be screened. One of the widely used methods in compound selection is cluster-based selection where the compound datasets are grouped into clusters and representative compounds are selected from each cluster. This paper proposes the use subtractive clustering in compound clustering by finding the optimal data points to be defined as a cluster centers based on the density of surrounding data points. The technique resolves the problem of determining the suitable number of clusters for the data. Different values of cluster radii and inter-cluster squash factor have been evaluated. For subtractive clustering, good values of squash factor are between 0.375 and 0.45 and the cluster radii from 0.35 to 0.45 because they always give the highest proportion of active structures in active cluster datasets. The results obtained from subtractive clustering has also been used in fuzzy c-mean (FMC) and k-means. We found that the proportion of actives in active cluster subsets are better when fcm and k-means are based on the results produced by subtractive clustering compared to results from subtractive clustering. K-means produced the best results among the three clustering methods. 2006-07 Conference or Workshop Item PeerReviewed application/pdf en http://eprints.utm.my/id/eprint/301/1/KuikSokPing2006_Optimizedsubtractiveclusteringforcluster-based.pdf Kuik, Sok Ping and Salim, Naomie (2006) Optimized subtractive clustering for cluster-based compound selection. In: 1st International Conference on Natural Resources Engineering & Technology 2006, 24-25th July 2006, Putrajaya, Malaysia.
institution	Universiti Teknologi Malaysia
building	UTM Library
collection	Institutional Repository
continent	Asia
country	Malaysia
content_provider	Universiti Teknologi Malaysia
content_source	UTM Institutional Repository
url_provider	http://eprints.utm.my/
language	English
topic	TP Chemical technology
spellingShingle	TP Chemical technology Kuik, Sok Ping Salim, Naomie Optimized subtractive clustering for cluster-based compound selection
description	Compound selection algorithm has become a need to pharmaceutical industry due to the increasing number of chemical compounds to be screened. One of the widely used methods in compound selection is cluster-based selection where the compound datasets are grouped into clusters and representative compounds are selected from each cluster. This paper proposes the use subtractive clustering in compound clustering by finding the optimal data points to be defined as a cluster centers based on the density of surrounding data points. The technique resolves the problem of determining the suitable number of clusters for the data. Different values of cluster radii and inter-cluster squash factor have been evaluated. For subtractive clustering, good values of squash factor are between 0.375 and 0.45 and the cluster radii from 0.35 to 0.45 because they always give the highest proportion of active structures in active cluster datasets. The results obtained from subtractive clustering has also been used in fuzzy c-mean (FMC) and k-means. We found that the proportion of actives in active cluster subsets are better when fcm and k-means are based on the results produced by subtractive clustering compared to results from subtractive clustering. K-means produced the best results among the three clustering methods.
format	Conference or Workshop Item
author	Kuik, Sok Ping Salim, Naomie
author_facet	Kuik, Sok Ping Salim, Naomie
author_sort	Kuik, Sok Ping
title	Optimized subtractive clustering for cluster-based compound selection
title_short	Optimized subtractive clustering for cluster-based compound selection
title_full	Optimized subtractive clustering for cluster-based compound selection
title_fullStr	Optimized subtractive clustering for cluster-based compound selection
title_full_unstemmed	Optimized subtractive clustering for cluster-based compound selection
title_sort	optimized subtractive clustering for cluster-based compound selection
publishDate	2006
url	http://eprints.utm.my/id/eprint/301/1/KuikSokPing2006_Optimizedsubtractiveclusteringforcluster-based.pdf http://eprints.utm.my/id/eprint/301/
_version_	1643643068010725376

Optimized subtractive clustering for cluster-based compound selection

Similar Items