Supervised contrastive pretrained ResNet with MixUp to enhance respiratory sound classification on imbalanced and limited dataset

This paper proposes a strategy of combining multiple techniques to classify paediatric respiratory sound (PRS) from the Open-Source SJTU Paediatric Respiratory Sound Database. Inspired by recent successes in image classification, this work focuses on improving audio classification with limited and i...

Full description

Saved in:
Bibliographic Details
Main Authors: Hu, Jinhai, Leow, Cong Sheng, Tao, Shuailin, Goh, Wang Ling, Gao, Yuan
Other Authors: School of Electrical and Electronic Engineering
Format: Conference or Workshop Item
Language:English
Published: 2024
Subjects:
Online Access:https://hdl.handle.net/10356/179114
https://ieeexplore.ieee.org/abstract/document/10389029
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-179114
record_format dspace
spelling sg-ntu-dr.10356-1791142024-07-19T15:39:07Z Supervised contrastive pretrained ResNet with MixUp to enhance respiratory sound classification on imbalanced and limited dataset Hu, Jinhai Leow, Cong Sheng Tao, Shuailin Goh, Wang Ling Gao, Yuan School of Electrical and Electronic Engineering 2023 IEEE Biomedical Circuits and Systems Conference (BioCAS) Institute of Microelectronics, A*STAR Centre for Integrated Circuits and Systems Engineering Supervised contrastive learning Respiratory sound classification MixUp finetuning This paper proposes a strategy of combining multiple techniques to classify paediatric respiratory sound (PRS) from the Open-Source SJTU Paediatric Respiratory Sound Database. Inspired by recent successes in image classification, this work focuses on improving audio classification with limited and imbalanced datasets through Residual Networks (ResNet). These techniques include augmentations applied to audio features, supervised contrastive (SupCon) pretraining, and MixUp. These three techniques helped reduced overfitting due to imbalanced dataset. To further enhance accuracy, pre-processing, and training hyperparameters were optimized through Bayesian Optimization. The proposed strategy achieved over 95% training accuracies for the four tasks (11, 12, 21, and 22) in the IEEE BioCAS 2023 grand challenge. Through this strategy, the four tasks achieved calculated scores of 0.769, 0.632, 0.662 and 0.512 respectively using the test dataset. The total score is 0.729 including 0.1 obtained from the runtime bonus. Agency for Science, Technology and Research (A*STAR) Submitted/Accepted version . This work was supported by the Agency for Science, Technology and Research (A*STAR), Singapore under the Cyber-Physiochemical Interface programme, grant No. A18A1b0045 and the Nanosystems at the Edge programme, grant No. A18A1b0055. 2024-07-19T05:45:24Z 2024-07-19T05:45:24Z 2023 Conference Paper Hu, J., Leow, C. S., Tao, S., Goh, W. L. & Gao, Y. (2023). Supervised contrastive pretrained ResNet with MixUp to enhance respiratory sound classification on imbalanced and limited dataset. 2023 IEEE Biomedical Circuits and Systems Conference (BioCAS). https://dx.doi.org/10.1109/BioCAS58349.2023.10389029 979-8-3503-0026-0 https://hdl.handle.net/10356/179114 10.1109/BioCAS58349.2023.10389029 https://ieeexplore.ieee.org/abstract/document/10389029 en A18A1b0045 A18A1b0055 © 2023 IEEE. All rights reserved. This article may be downloaded for personal use only. Any other use requires prior permission of the copyright holder. The Version of Record is available online at http://doi.org/10.1109/BioCAS58349.2023.10389029. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering
Supervised contrastive learning
Respiratory sound classification
MixUp finetuning
spellingShingle Engineering
Supervised contrastive learning
Respiratory sound classification
MixUp finetuning
Hu, Jinhai
Leow, Cong Sheng
Tao, Shuailin
Goh, Wang Ling
Gao, Yuan
Supervised contrastive pretrained ResNet with MixUp to enhance respiratory sound classification on imbalanced and limited dataset
description This paper proposes a strategy of combining multiple techniques to classify paediatric respiratory sound (PRS) from the Open-Source SJTU Paediatric Respiratory Sound Database. Inspired by recent successes in image classification, this work focuses on improving audio classification with limited and imbalanced datasets through Residual Networks (ResNet). These techniques include augmentations applied to audio features, supervised contrastive (SupCon) pretraining, and MixUp. These three techniques helped reduced overfitting due to imbalanced dataset. To further enhance accuracy, pre-processing, and training hyperparameters were optimized through Bayesian Optimization. The proposed strategy achieved over 95% training accuracies for the four tasks (11, 12, 21, and 22) in the IEEE BioCAS 2023 grand challenge. Through this strategy, the four tasks achieved calculated scores of 0.769, 0.632, 0.662 and 0.512 respectively using the test dataset. The total score is 0.729 including 0.1 obtained from the runtime bonus.
author2 School of Electrical and Electronic Engineering
author_facet School of Electrical and Electronic Engineering
Hu, Jinhai
Leow, Cong Sheng
Tao, Shuailin
Goh, Wang Ling
Gao, Yuan
format Conference or Workshop Item
author Hu, Jinhai
Leow, Cong Sheng
Tao, Shuailin
Goh, Wang Ling
Gao, Yuan
author_sort Hu, Jinhai
title Supervised contrastive pretrained ResNet with MixUp to enhance respiratory sound classification on imbalanced and limited dataset
title_short Supervised contrastive pretrained ResNet with MixUp to enhance respiratory sound classification on imbalanced and limited dataset
title_full Supervised contrastive pretrained ResNet with MixUp to enhance respiratory sound classification on imbalanced and limited dataset
title_fullStr Supervised contrastive pretrained ResNet with MixUp to enhance respiratory sound classification on imbalanced and limited dataset
title_full_unstemmed Supervised contrastive pretrained ResNet with MixUp to enhance respiratory sound classification on imbalanced and limited dataset
title_sort supervised contrastive pretrained resnet with mixup to enhance respiratory sound classification on imbalanced and limited dataset
publishDate 2024
url https://hdl.handle.net/10356/179114
https://ieeexplore.ieee.org/abstract/document/10389029
_version_ 1814047075822731264