HYBRID SAMPLING METHOD BASED ON DBSCAN AND PARTICLE SWARM OPTIMIZATION (PSO) FOR IMBALANCED DATA CLASSIFICATION

Imbalanced data refer to data condition whose significant disparity between the number of data points in one class compared to another class. In some cases of imbalanced data, classification algorithms may not accurately predict the minority class even though they achieve high accuracy. However,...

Full description

Saved in:
Bibliographic Details
Main Author: Fauzi, Ihsan
Format: Theses
Language:Indonesia
Online Access:https://digilib.itb.ac.id/gdl/view/78368
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Institut Teknologi Bandung
Language: Indonesia
id id-itb.:78368
spelling id-itb.:783682023-09-19T12:49:47ZHYBRID SAMPLING METHOD BASED ON DBSCAN AND PARTICLE SWARM OPTIMIZATION (PSO) FOR IMBALANCED DATA CLASSIFICATION Fauzi, Ihsan Indonesia Theses imbalanced data, undersampling, oversampling, hybridsampling, DBSCAN, PSO INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/78368 Imbalanced data refer to data condition whose significant disparity between the number of data points in one class compared to another class. In some cases of imbalanced data, classification algorithms may not accurately predict the minority class even though they achieve high accuracy. However, accurate prediction of the minority class is most important, for example in cases of rare medical disease diagnosis where it is crucial to detect the disease. To address the issue of imbalanced data, this research proposes a hybridsampling method that combines the undersampling method proposed by Mirzaei et al. and the oversampling method proposed by Xiaolong et al., where both methods are performed based on density using the DBSCAN algorithm for resampling. However, the DBSCAN algorithm is highly sensitive to the minPts and Eps values, so other research has used Particle Swarm Optimization (PSO) to determine these two parameters. Therefore, the hybridsampling method that proposed in this research uses Particle Swarm Optimization (PSO) to determine the minPts and Eps paramters values in the DBSCAN algorithm used for both undersampling and oversampling. text
institution Institut Teknologi Bandung
building Institut Teknologi Bandung Library
continent Asia
country Indonesia
Indonesia
content_provider Institut Teknologi Bandung
collection Digital ITB
language Indonesia
description Imbalanced data refer to data condition whose significant disparity between the number of data points in one class compared to another class. In some cases of imbalanced data, classification algorithms may not accurately predict the minority class even though they achieve high accuracy. However, accurate prediction of the minority class is most important, for example in cases of rare medical disease diagnosis where it is crucial to detect the disease. To address the issue of imbalanced data, this research proposes a hybridsampling method that combines the undersampling method proposed by Mirzaei et al. and the oversampling method proposed by Xiaolong et al., where both methods are performed based on density using the DBSCAN algorithm for resampling. However, the DBSCAN algorithm is highly sensitive to the minPts and Eps values, so other research has used Particle Swarm Optimization (PSO) to determine these two parameters. Therefore, the hybridsampling method that proposed in this research uses Particle Swarm Optimization (PSO) to determine the minPts and Eps paramters values in the DBSCAN algorithm used for both undersampling and oversampling.
format Theses
author Fauzi, Ihsan
spellingShingle Fauzi, Ihsan
HYBRID SAMPLING METHOD BASED ON DBSCAN AND PARTICLE SWARM OPTIMIZATION (PSO) FOR IMBALANCED DATA CLASSIFICATION
author_facet Fauzi, Ihsan
author_sort Fauzi, Ihsan
title HYBRID SAMPLING METHOD BASED ON DBSCAN AND PARTICLE SWARM OPTIMIZATION (PSO) FOR IMBALANCED DATA CLASSIFICATION
title_short HYBRID SAMPLING METHOD BASED ON DBSCAN AND PARTICLE SWARM OPTIMIZATION (PSO) FOR IMBALANCED DATA CLASSIFICATION
title_full HYBRID SAMPLING METHOD BASED ON DBSCAN AND PARTICLE SWARM OPTIMIZATION (PSO) FOR IMBALANCED DATA CLASSIFICATION
title_fullStr HYBRID SAMPLING METHOD BASED ON DBSCAN AND PARTICLE SWARM OPTIMIZATION (PSO) FOR IMBALANCED DATA CLASSIFICATION
title_full_unstemmed HYBRID SAMPLING METHOD BASED ON DBSCAN AND PARTICLE SWARM OPTIMIZATION (PSO) FOR IMBALANCED DATA CLASSIFICATION
title_sort hybrid sampling method based on dbscan and particle swarm optimization (pso) for imbalanced data classification
url https://digilib.itb.ac.id/gdl/view/78368
_version_ 1822995727731654656