CNN-BASED MODEL COMPARISON FOR COVID-19 DETECTION USING COUGH SOUND
Sars-CoV-2 virus or widely knwon as COVID-19 was classified as a pandemic throughout all over the world in 2019-2022. This phenomenon motivated a lot of researchers to study a machine learning implementation for detecting this virus by using cough sound. The difficulties of the researches regardi...
Saved in:
Main Author: | |
---|---|
Format: | Final Project |
Language: | Indonesia |
Online Access: | https://digilib.itb.ac.id/gdl/view/76671 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Institut Teknologi Bandung |
Language: | Indonesia |
id |
id-itb.:76671 |
---|---|
spelling |
id-itb.:766712023-08-17T08:43:41ZCNN-BASED MODEL COMPARISON FOR COVID-19 DETECTION USING COUGH SOUND Lucky Mahendra, Dimas Indonesia Final Project COVID-19, cough, imbalance data handling, feature, CNN INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/76671 Sars-CoV-2 virus or widely knwon as COVID-19 was classified as a pandemic throughout all over the world in 2019-2022. This phenomenon motivated a lot of researchers to study a machine learning implementation for detecting this virus by using cough sound. The difficulties of the researches regarding this topic is the limited amount of COVID-19 positive patient data available to be used for machine learning compared to the COVID-19 negative patient data, and also there are many variations of model architecture that could be used. Some researches shows that CNN-based model results in a higher performance score when compared to shallow learning-based model, but the combination between the preprocessing data method and the model used could be studied further to achieve the most optimal result. On this research, an experiment was conducted by using two types of imbalance data handling method, which are Random Undersampling and data augmentation. The experiment also used two types of feature, which are Mel Frequency Cepstral Coefficient (MFCC) and Log-Mel Spectrogram, and also three types of model architecture, which are CNN, ResNet-50, and VGGish. For the baseline model, a combination of Random Undersampling, MFCC, and CNN model was used. The experiment is the continued by using a variation of the total positive training data after the preprocessing phase is done with the amount of 1000-1500 data. The best experiment result is achieved by using a combination of White Noise Injection as the imbalance data handling method, MFCC for the feature, and CNN model architecture. This variation results in 83% specificity score, 44% sensitivity score, and 79% F1-Score. This variation gives an improvement of 0.248 specificity score, 0.0178 sensitivity score, and 0.02 F1-Score when compared with the baseline model. In conclusion, this combination is the best combination to classify COVID-19 by using cough sound in this research. text |
institution |
Institut Teknologi Bandung |
building |
Institut Teknologi Bandung Library |
continent |
Asia |
country |
Indonesia Indonesia |
content_provider |
Institut Teknologi Bandung |
collection |
Digital ITB |
language |
Indonesia |
description |
Sars-CoV-2 virus or widely knwon as COVID-19 was classified as a pandemic throughout all
over the world in 2019-2022. This phenomenon motivated a lot of researchers to study a
machine learning implementation for detecting this virus by using cough sound. The difficulties
of the researches regarding this topic is the limited amount of COVID-19 positive patient data
available to be used for machine learning compared to the COVID-19 negative patient data,
and also there are many variations of model architecture that could be used. Some researches
shows that CNN-based model results in a higher performance score when compared to shallow
learning-based model, but the combination between the preprocessing data method and the
model used could be studied further to achieve the most optimal result.
On this research, an experiment was conducted by using two types of imbalance data handling
method, which are Random Undersampling and data augmentation. The experiment also used
two types of feature, which are Mel Frequency Cepstral Coefficient (MFCC) and Log-Mel
Spectrogram, and also three types of model architecture, which are CNN, ResNet-50, and
VGGish. For the baseline model, a combination of Random Undersampling, MFCC, and CNN
model was used. The experiment is the continued by using a variation of the total positive
training data after the preprocessing phase is done with the amount of 1000-1500 data.
The best experiment result is achieved by using a combination of White Noise Injection as the
imbalance data handling method, MFCC for the feature, and CNN model architecture. This
variation results in 83% specificity score, 44% sensitivity score, and 79% F1-Score. This
variation gives an improvement of 0.248 specificity score, 0.0178 sensitivity score, and 0.02
F1-Score when compared with the baseline model. In conclusion, this combination is the best
combination to classify COVID-19 by using cough sound in this research. |
format |
Final Project |
author |
Lucky Mahendra, Dimas |
spellingShingle |
Lucky Mahendra, Dimas CNN-BASED MODEL COMPARISON FOR COVID-19 DETECTION USING COUGH SOUND |
author_facet |
Lucky Mahendra, Dimas |
author_sort |
Lucky Mahendra, Dimas |
title |
CNN-BASED MODEL COMPARISON FOR COVID-19 DETECTION USING COUGH SOUND |
title_short |
CNN-BASED MODEL COMPARISON FOR COVID-19 DETECTION USING COUGH SOUND |
title_full |
CNN-BASED MODEL COMPARISON FOR COVID-19 DETECTION USING COUGH SOUND |
title_fullStr |
CNN-BASED MODEL COMPARISON FOR COVID-19 DETECTION USING COUGH SOUND |
title_full_unstemmed |
CNN-BASED MODEL COMPARISON FOR COVID-19 DETECTION USING COUGH SOUND |
title_sort |
cnn-based model comparison for covid-19 detection using cough sound |
url |
https://digilib.itb.ac.id/gdl/view/76671 |
_version_ |
1822008048678338560 |