Fuzzy K-means clustering with missing values

Fuzzy K-means clustering algorithm is a popular approach for exploring the structure of a set of patterns, especially when the clusters are overlapping or fuzzy. However, the fuzzy K-means clustering algorithm cannot be applied when the real-life data contain missing values. In many cases, the numbe...

Full description

Saved in:
Bibliographic Details
Main Authors: Sarkar M., Tze-Yun LEONG
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2001
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/3020
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-4020
record_format dspace
spelling sg-smu-ink.sis_research-40202016-02-05T06:30:05Z Fuzzy K-means clustering with missing values Sarkar M., Tze-Yun LEONG, Fuzzy K-means clustering algorithm is a popular approach for exploring the structure of a set of patterns, especially when the clusters are overlapping or fuzzy. However, the fuzzy K-means clustering algorithm cannot be applied when the real-life data contain missing values. In many cases, the number of patterns with missing values is so large that if these patterns are removed, then sufficient number of patterns is not available to characterize the data set. This paper proposes a technique to exploit the information provided by the patterns with the missing values so that the clustering results are enhanced. There are various preprocessing methods to substitute the missing values before clustering the data. However, instead of repairing the data set at the beginning, the repairing can be carried out incrementally in each iteration based on the context. In that case, it is more likely that less uncertainty is added while incorporating the repair work. This scheme is further consolidated in this paper by fine-tuning the missing values using the information from other attributes. The applications of the proposed method in medical domain have produced good performance. 2001-01-01T08:00:00Z text https://ink.library.smu.edu.sg/sis_research/3020 Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Computer Sciences Health Information Technology
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Computer Sciences
Health Information Technology
spellingShingle Computer Sciences
Health Information Technology
Sarkar M.,
Tze-Yun LEONG,
Fuzzy K-means clustering with missing values
description Fuzzy K-means clustering algorithm is a popular approach for exploring the structure of a set of patterns, especially when the clusters are overlapping or fuzzy. However, the fuzzy K-means clustering algorithm cannot be applied when the real-life data contain missing values. In many cases, the number of patterns with missing values is so large that if these patterns are removed, then sufficient number of patterns is not available to characterize the data set. This paper proposes a technique to exploit the information provided by the patterns with the missing values so that the clustering results are enhanced. There are various preprocessing methods to substitute the missing values before clustering the data. However, instead of repairing the data set at the beginning, the repairing can be carried out incrementally in each iteration based on the context. In that case, it is more likely that less uncertainty is added while incorporating the repair work. This scheme is further consolidated in this paper by fine-tuning the missing values using the information from other attributes. The applications of the proposed method in medical domain have produced good performance.
format text
author Sarkar M.,
Tze-Yun LEONG,
author_facet Sarkar M.,
Tze-Yun LEONG,
author_sort Sarkar M.,
title Fuzzy K-means clustering with missing values
title_short Fuzzy K-means clustering with missing values
title_full Fuzzy K-means clustering with missing values
title_fullStr Fuzzy K-means clustering with missing values
title_full_unstemmed Fuzzy K-means clustering with missing values
title_sort fuzzy k-means clustering with missing values
publisher Institutional Knowledge at Singapore Management University
publishDate 2001
url https://ink.library.smu.edu.sg/sis_research/3020
_version_ 1770572781514129408