Analysis of K-Mean and X-Mean Clustering Algorithms Using Ontology-Based Dataset Filtering

In the field of computer science, data mining facilitates the extraction of useful knowledge and patterns from a huge amount of data. Various techniques exist in the data mining domain to explore the links, associations, and patterns from data in data warehouses. Among these techniques, clustering...

Full description

Saved in:
Bibliographic Details
Main Authors: Rahmah, Mokhtar, Raza, Muhammad Ahsan, Fauziah, Zainuddin, Nor Azhar, Ahmad, Raza, Muhammad Fahad, Raza, Binish
Format: Article
Language:English
Published: IJCSNS 2021
Subjects:
Online Access:http://umpir.ump.edu.my/id/eprint/32993/1/analysis%20of%20kmean.pdf
http://umpir.ump.edu.my/id/eprint/32993/
http://paper.ijcsns.org/07_book/202110/20211040.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Malaysia Pahang
Language: English
id my.ump.umpir.32993
record_format eprints
spelling my.ump.umpir.329932022-01-11T09:03:05Z http://umpir.ump.edu.my/id/eprint/32993/ Analysis of K-Mean and X-Mean Clustering Algorithms Using Ontology-Based Dataset Filtering Rahmah, Mokhtar Raza, Muhammad Ahsan Fauziah, Zainuddin Nor Azhar, Ahmad Raza, Muhammad Fahad Raza, Binish QA75 Electronic computers. Computer science In the field of computer science, data mining facilitates the extraction of useful knowledge and patterns from a huge amount of data. Various techniques exist in the data mining domain to explore the links, associations, and patterns from data in data warehouses. Among these techniques, clustering is more prominent in analyzing raw and unlabeled data from a large volume of datasets. The clustering mechanism identifies similar features between data objects and arranges them into clusters. In this paper, we have compared the performance of K-Mean and XMean clustering algorithms using two datasets of student enrollment in higher education institutions. Our methodology incorporated ontology to filter the datasets and exploited Rapidminer environment to evaluate the performance of clustering algorithms. The results showed that X-Mean is more suitable for large datasets in terms of discovery and accuracy of clusters. IJCSNS 2021-10 Article PeerReviewed pdf en http://umpir.ump.edu.my/id/eprint/32993/1/analysis%20of%20kmean.pdf Rahmah, Mokhtar and Raza, Muhammad Ahsan and Fauziah, Zainuddin and Nor Azhar, Ahmad and Raza, Muhammad Fahad and Raza, Binish (2021) Analysis of K-Mean and X-Mean Clustering Algorithms Using Ontology-Based Dataset Filtering. International Journal of Computer Science and Network Security, 21 (10). pp. 283-287. ISSN 1738-7906 http://paper.ijcsns.org/07_book/202110/20211040.pdf
institution Universiti Malaysia Pahang
building UMP Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaysia Pahang
content_source UMP Institutional Repository
url_provider http://umpir.ump.edu.my/
language English
topic QA75 Electronic computers. Computer science
spellingShingle QA75 Electronic computers. Computer science
Rahmah, Mokhtar
Raza, Muhammad Ahsan
Fauziah, Zainuddin
Nor Azhar, Ahmad
Raza, Muhammad Fahad
Raza, Binish
Analysis of K-Mean and X-Mean Clustering Algorithms Using Ontology-Based Dataset Filtering
description In the field of computer science, data mining facilitates the extraction of useful knowledge and patterns from a huge amount of data. Various techniques exist in the data mining domain to explore the links, associations, and patterns from data in data warehouses. Among these techniques, clustering is more prominent in analyzing raw and unlabeled data from a large volume of datasets. The clustering mechanism identifies similar features between data objects and arranges them into clusters. In this paper, we have compared the performance of K-Mean and XMean clustering algorithms using two datasets of student enrollment in higher education institutions. Our methodology incorporated ontology to filter the datasets and exploited Rapidminer environment to evaluate the performance of clustering algorithms. The results showed that X-Mean is more suitable for large datasets in terms of discovery and accuracy of clusters.
format Article
author Rahmah, Mokhtar
Raza, Muhammad Ahsan
Fauziah, Zainuddin
Nor Azhar, Ahmad
Raza, Muhammad Fahad
Raza, Binish
author_facet Rahmah, Mokhtar
Raza, Muhammad Ahsan
Fauziah, Zainuddin
Nor Azhar, Ahmad
Raza, Muhammad Fahad
Raza, Binish
author_sort Rahmah, Mokhtar
title Analysis of K-Mean and X-Mean Clustering Algorithms Using Ontology-Based Dataset Filtering
title_short Analysis of K-Mean and X-Mean Clustering Algorithms Using Ontology-Based Dataset Filtering
title_full Analysis of K-Mean and X-Mean Clustering Algorithms Using Ontology-Based Dataset Filtering
title_fullStr Analysis of K-Mean and X-Mean Clustering Algorithms Using Ontology-Based Dataset Filtering
title_full_unstemmed Analysis of K-Mean and X-Mean Clustering Algorithms Using Ontology-Based Dataset Filtering
title_sort analysis of k-mean and x-mean clustering algorithms using ontology-based dataset filtering
publisher IJCSNS
publishDate 2021
url http://umpir.ump.edu.my/id/eprint/32993/1/analysis%20of%20kmean.pdf
http://umpir.ump.edu.my/id/eprint/32993/
http://paper.ijcsns.org/07_book/202110/20211040.pdf
_version_ 1724073493366046720