Supervised contrastive pretrained ResNet with MixUp to enhance respiratory sound classification on imbalanced and limited dataset
This paper proposes a strategy of combining multiple techniques to classify paediatric respiratory sound (PRS) from the Open-Source SJTU Paediatric Respiratory Sound Database. Inspired by recent successes in image classification, this work focuses on improving audio classification with limited and i...
Saved in:
Main Authors: | , , , , |
---|---|
Other Authors: | |
Format: | Conference or Workshop Item |
Language: | English |
Published: |
2024
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/179114 https://ieeexplore.ieee.org/abstract/document/10389029 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-179114 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1791142024-07-19T15:39:07Z Supervised contrastive pretrained ResNet with MixUp to enhance respiratory sound classification on imbalanced and limited dataset Hu, Jinhai Leow, Cong Sheng Tao, Shuailin Goh, Wang Ling Gao, Yuan School of Electrical and Electronic Engineering 2023 IEEE Biomedical Circuits and Systems Conference (BioCAS) Institute of Microelectronics, A*STAR Centre for Integrated Circuits and Systems Engineering Supervised contrastive learning Respiratory sound classification MixUp finetuning This paper proposes a strategy of combining multiple techniques to classify paediatric respiratory sound (PRS) from the Open-Source SJTU Paediatric Respiratory Sound Database. Inspired by recent successes in image classification, this work focuses on improving audio classification with limited and imbalanced datasets through Residual Networks (ResNet). These techniques include augmentations applied to audio features, supervised contrastive (SupCon) pretraining, and MixUp. These three techniques helped reduced overfitting due to imbalanced dataset. To further enhance accuracy, pre-processing, and training hyperparameters were optimized through Bayesian Optimization. The proposed strategy achieved over 95% training accuracies for the four tasks (11, 12, 21, and 22) in the IEEE BioCAS 2023 grand challenge. Through this strategy, the four tasks achieved calculated scores of 0.769, 0.632, 0.662 and 0.512 respectively using the test dataset. The total score is 0.729 including 0.1 obtained from the runtime bonus. Agency for Science, Technology and Research (A*STAR) Submitted/Accepted version . This work was supported by the Agency for Science, Technology and Research (A*STAR), Singapore under the Cyber-Physiochemical Interface programme, grant No. A18A1b0045 and the Nanosystems at the Edge programme, grant No. A18A1b0055. 2024-07-19T05:45:24Z 2024-07-19T05:45:24Z 2023 Conference Paper Hu, J., Leow, C. S., Tao, S., Goh, W. L. & Gao, Y. (2023). Supervised contrastive pretrained ResNet with MixUp to enhance respiratory sound classification on imbalanced and limited dataset. 2023 IEEE Biomedical Circuits and Systems Conference (BioCAS). https://dx.doi.org/10.1109/BioCAS58349.2023.10389029 979-8-3503-0026-0 https://hdl.handle.net/10356/179114 10.1109/BioCAS58349.2023.10389029 https://ieeexplore.ieee.org/abstract/document/10389029 en A18A1b0045 A18A1b0055 © 2023 IEEE. All rights reserved. This article may be downloaded for personal use only. Any other use requires prior permission of the copyright holder. The Version of Record is available online at http://doi.org/10.1109/BioCAS58349.2023.10389029. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering Supervised contrastive learning Respiratory sound classification MixUp finetuning |
spellingShingle |
Engineering Supervised contrastive learning Respiratory sound classification MixUp finetuning Hu, Jinhai Leow, Cong Sheng Tao, Shuailin Goh, Wang Ling Gao, Yuan Supervised contrastive pretrained ResNet with MixUp to enhance respiratory sound classification on imbalanced and limited dataset |
description |
This paper proposes a strategy of combining multiple techniques to classify paediatric respiratory sound (PRS) from the Open-Source SJTU Paediatric Respiratory Sound Database. Inspired by recent successes in image classification, this work focuses on improving audio classification with limited and imbalanced datasets through Residual Networks (ResNet). These techniques include augmentations applied to audio features, supervised contrastive (SupCon) pretraining, and MixUp. These three techniques helped reduced overfitting due to imbalanced dataset. To further enhance accuracy, pre-processing, and training hyperparameters were optimized through Bayesian Optimization. The proposed strategy achieved over 95% training accuracies for the four tasks (11, 12, 21, and 22) in the IEEE BioCAS 2023 grand challenge. Through this strategy, the four tasks achieved calculated scores of 0.769, 0.632, 0.662 and 0.512 respectively using the test dataset. The total score is 0.729 including 0.1 obtained from the runtime bonus. |
author2 |
School of Electrical and Electronic Engineering |
author_facet |
School of Electrical and Electronic Engineering Hu, Jinhai Leow, Cong Sheng Tao, Shuailin Goh, Wang Ling Gao, Yuan |
format |
Conference or Workshop Item |
author |
Hu, Jinhai Leow, Cong Sheng Tao, Shuailin Goh, Wang Ling Gao, Yuan |
author_sort |
Hu, Jinhai |
title |
Supervised contrastive pretrained ResNet with MixUp to enhance respiratory sound classification on imbalanced and limited dataset |
title_short |
Supervised contrastive pretrained ResNet with MixUp to enhance respiratory sound classification on imbalanced and limited dataset |
title_full |
Supervised contrastive pretrained ResNet with MixUp to enhance respiratory sound classification on imbalanced and limited dataset |
title_fullStr |
Supervised contrastive pretrained ResNet with MixUp to enhance respiratory sound classification on imbalanced and limited dataset |
title_full_unstemmed |
Supervised contrastive pretrained ResNet with MixUp to enhance respiratory sound classification on imbalanced and limited dataset |
title_sort |
supervised contrastive pretrained resnet with mixup to enhance respiratory sound classification on imbalanced and limited dataset |
publishDate |
2024 |
url |
https://hdl.handle.net/10356/179114 https://ieeexplore.ieee.org/abstract/document/10389029 |
_version_ |
1814047075822731264 |