HYBRID SAMPLING METHOD BASED ON DBSCAN AND PARTICLE SWARM OPTIMIZATION (PSO) FOR IMBALANCED DATA CLASSIFICATION
Imbalanced data refer to data condition whose significant disparity between the number of data points in one class compared to another class. In some cases of imbalanced data, classification algorithms may not accurately predict the minority class even though they achieve high accuracy. However,...
Saved in:
Main Author: | |
---|---|
Format: | Theses |
Language: | Indonesia |
Online Access: | https://digilib.itb.ac.id/gdl/view/78368 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Institut Teknologi Bandung |
Language: | Indonesia |
id |
id-itb.:78368 |
---|---|
spelling |
id-itb.:783682023-09-19T12:49:47ZHYBRID SAMPLING METHOD BASED ON DBSCAN AND PARTICLE SWARM OPTIMIZATION (PSO) FOR IMBALANCED DATA CLASSIFICATION Fauzi, Ihsan Indonesia Theses imbalanced data, undersampling, oversampling, hybridsampling, DBSCAN, PSO INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/78368 Imbalanced data refer to data condition whose significant disparity between the number of data points in one class compared to another class. In some cases of imbalanced data, classification algorithms may not accurately predict the minority class even though they achieve high accuracy. However, accurate prediction of the minority class is most important, for example in cases of rare medical disease diagnosis where it is crucial to detect the disease. To address the issue of imbalanced data, this research proposes a hybridsampling method that combines the undersampling method proposed by Mirzaei et al. and the oversampling method proposed by Xiaolong et al., where both methods are performed based on density using the DBSCAN algorithm for resampling. However, the DBSCAN algorithm is highly sensitive to the minPts and Eps values, so other research has used Particle Swarm Optimization (PSO) to determine these two parameters. Therefore, the hybridsampling method that proposed in this research uses Particle Swarm Optimization (PSO) to determine the minPts and Eps paramters values in the DBSCAN algorithm used for both undersampling and oversampling. text |
institution |
Institut Teknologi Bandung |
building |
Institut Teknologi Bandung Library |
continent |
Asia |
country |
Indonesia Indonesia |
content_provider |
Institut Teknologi Bandung |
collection |
Digital ITB |
language |
Indonesia |
description |
Imbalanced data refer to data condition whose significant disparity between the
number of data points in one class compared to another class. In some cases of
imbalanced data, classification algorithms may not accurately predict the minority
class even though they achieve high accuracy. However, accurate prediction of the
minority class is most important, for example in cases of rare medical disease
diagnosis where it is crucial to detect the disease. To address the issue of
imbalanced data, this research proposes a hybridsampling method that combines
the undersampling method proposed by Mirzaei et al. and the oversampling method
proposed by Xiaolong et al., where both methods are performed based on density
using the DBSCAN algorithm for resampling. However, the DBSCAN algorithm is
highly sensitive to the minPts and Eps values, so other research has used Particle
Swarm Optimization (PSO) to determine these two parameters. Therefore, the
hybridsampling method that proposed in this research uses Particle Swarm
Optimization (PSO) to determine the minPts and Eps paramters values in the
DBSCAN algorithm used for both undersampling and oversampling. |
format |
Theses |
author |
Fauzi, Ihsan |
spellingShingle |
Fauzi, Ihsan HYBRID SAMPLING METHOD BASED ON DBSCAN AND PARTICLE SWARM OPTIMIZATION (PSO) FOR IMBALANCED DATA CLASSIFICATION |
author_facet |
Fauzi, Ihsan |
author_sort |
Fauzi, Ihsan |
title |
HYBRID SAMPLING METHOD BASED ON DBSCAN AND PARTICLE SWARM OPTIMIZATION (PSO) FOR IMBALANCED DATA CLASSIFICATION |
title_short |
HYBRID SAMPLING METHOD BASED ON DBSCAN AND PARTICLE SWARM OPTIMIZATION (PSO) FOR IMBALANCED DATA CLASSIFICATION |
title_full |
HYBRID SAMPLING METHOD BASED ON DBSCAN AND PARTICLE SWARM OPTIMIZATION (PSO) FOR IMBALANCED DATA CLASSIFICATION |
title_fullStr |
HYBRID SAMPLING METHOD BASED ON DBSCAN AND PARTICLE SWARM OPTIMIZATION (PSO) FOR IMBALANCED DATA CLASSIFICATION |
title_full_unstemmed |
HYBRID SAMPLING METHOD BASED ON DBSCAN AND PARTICLE SWARM OPTIMIZATION (PSO) FOR IMBALANCED DATA CLASSIFICATION |
title_sort |
hybrid sampling method based on dbscan and particle swarm optimization (pso) for imbalanced data classification |
url |
https://digilib.itb.ac.id/gdl/view/78368 |
_version_ |
1822995727731654656 |