CLUSTER ANALYSIS USING MINKOWSKI DISTANCE WITH POWERS BETWEEN [1, 4] BETWEEN TWO OBJECTS/INDIVIDUALS AND THE UTILIZATION OF PRINCIPAL COMPONENT ANALYSIS AND LINEAR DISCRIMINANT ANALYSIS CASE STUDY: GROUNDWATER QUALITY DATA FROM SUBDISTRICTS IN BANDUNG REGENCY, 2023
This study investigates the impact of cluster analysis using the Minkowski distance metric with varying powers, as well as the utilization of dimensionality reduction techniques such as Principal Component Analysis (PCA) and Linear Discriminant Analysis (LDA) to enhance the robustness of clusteri...
Main Author: | Arbi Wijaya, Mohammad |
---|---|
Format: | Final Project |
Language: | Indonesian |
Online Access: | https://digilib.itb.ac.id/gdl/view/84068 |
Institution: | Institut Teknologi Bandung |
id |
id-itb.:84068 |
---|---|
spelling |
id-itb.:84068; 2024-08-13T21:33:12Z; Arbi Wijaya, Mohammad; Indonesian; Final Project; Cluster Analysis, Minkowski Distance, Principal Component Analysis (PCA), Linear Discriminant Analysis (LDA), Dimensionality Reduction, Groundwater Quality; INSTITUT TEKNOLOGI BANDUNG; https://digilib.itb.ac.id/gdl/view/84068; text |
institution |
Institut Teknologi Bandung |
building |
Institut Teknologi Bandung Library |
continent |
Asia |
country |
Indonesia |
content_provider |
Institut Teknologi Bandung |
collection |
Digital ITB |
language |
Indonesian |
description |
This study investigates cluster analysis using the Minkowski distance metric with
varying powers r in [1, 4], together with the use of dimensionality reduction
techniques, namely Principal Component Analysis (PCA) and Linear Discriminant
Analysis (LDA), to make clustering results more robust. Groundwater quality data
covering 10 metal contaminants from sub-districts in Bandung Regency in 2023
serve as the worked example. Multivariate analysis is critical for decision-making
in this context but becomes difficult with large data sets. Cluster analysis helps
identify patterns by grouping objects according to similarity, as determined by a
distance measure. The Minkowski distance, which includes a power parameter r,
generalizes the Euclidean distance (the case r = 2) and allows sensitivity to
variations in the data to be adjusted. PCA reduces the dimensionality of the data
while preserving variance, using the eigenvalues and eigenvectors of the covariance
matrix to select the relevant principal components. LDA is then applied to enhance
class separability by maximizing the ratio of between-cluster scatter to
within-cluster scatter. This transformation yields more stable data for cluster
analysis. The results show that combining PCA and LDA can improve the stability
and interpretability of clustering outcomes even when the Minkowski distance
parameter changes. The approach provides a stronger framework for analyzing
groundwater quality data, supporting a better understanding of the relationships
among metal contaminants while keeping the clustering robust to different
distance definitions.
|
format |
Final Project |
author |
Arbi Wijaya, Mohammad |
title |
CLUSTER ANALYSIS USING MINKOWSKI DISTANCE WITH POWERS BETWEEN [1, 4] BETWEEN TWO OBJECTS/INDIVIDUALS AND THE UTILIZATION OF PRINCIPAL COMPONENT ANALYSIS AND LINEAR DISCRIMINANT ANALYSIS CASE STUDY: GROUNDWATER QUALITY DATA FROM SUBDISTRICTS IN BANDUNG REGENCY, 2023 |
url |
https://digilib.itb.ac.id/gdl/view/84068 |
_version_ |
1822998396046147584 |
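
The abstract describes the Minkowski distance with a power parameter r, which for two vectors x and y is (sum over i of |x_i - y_i|^r)^(1/r). The sketch below is a minimal illustration of that formula and of how one might check whether the closest pair of objects stays the same as r moves through 1, 2, 3 and 4. It is not the author's implementation: the function names `minkowski` and `closest_pair`, the site labels, and all numeric values are hypothetical and are not taken from the study's data.

```python
from itertools import combinations


def minkowski(x, y, r):
    """Minkowski distance of order r between two equal-length vectors."""
    if r < 1:
        raise ValueError("r must be >= 1 for the distance to be a metric")
    if len(x) != len(y):
        raise ValueError("vectors must have the same length")
    return sum(abs(a - b) ** r for a, b in zip(x, y)) ** (1.0 / r)


# Hypothetical standardized concentrations of three metals at four sites
# (illustrative values only, NOT data from the study).
sites = {
    "A": (0.1, 0.2, 0.0),
    "B": (0.3, 0.1, 0.1),
    "C": (2.0, 1.8, 2.2),
    "D": (2.1, 2.0, 1.9),
}


def closest_pair(points, r):
    """Return the pair of labels with the smallest order-r Minkowski distance."""
    return min(
        combinations(sorted(points), 2),
        key=lambda pair: minkowski(points[pair[0]], points[pair[1]], r),
    )


# Report the closest pair for each power considered in the title's range.
for r in (1, 2, 3, 4):
    pair = closest_pair(sites, r)
    d = minkowski(sites[pair[0]], sites[pair[1]], r)
    print(f"r={r}: closest pair {pair}, distance {d:.4f}")
```

With r = 1 this is the Manhattan distance and with r = 2 the Euclidean distance. If the closest pair (or, in a full analysis, the cluster assignment) is the same for every r, the grouping can be considered insensitive to the choice of power on that data.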