Identifying individuals with recent COVID-19 through voice classification using deep learning

Recently deep learning has attained a breakthrough in model accuracy for the classification of images due mainly to convolutional neural networks. In the present study, we attempted to investigate the presence of subclinical voice feature alteration in COVID-19 patients after the recent resolution o...

Full description

Saved in:
Bibliographic Details
Main Authors: Pichatorn Suppakitjanusant, Somnuek Sungkanuparph, Thananya Wongsinin, Sirapong Virapongsiri, Nittaya Kasemkosin, Laor Chailurkit, Boonsong Ongphiphadhanakul
Other Authors: Faculty of Medicine Ramathibodi Hospital, Mahidol University
Format: Article
Published: 2022
Subjects:
Online Access:https://repository.li.mahidol.ac.th/handle/123456789/79211
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Mahidol University
id th-mahidol.79211
record_format dspace
spelling th-mahidol.792112022-08-04T18:37:57Z Identifying individuals with recent COVID-19 through voice classification using deep learning Pichatorn Suppakitjanusant Somnuek Sungkanuparph Thananya Wongsinin Sirapong Virapongsiri Nittaya Kasemkosin Laor Chailurkit Boonsong Ongphiphadhanakul Faculty of Medicine Ramathibodi Hospital, Mahidol University Multidisciplinary Recently deep learning has attained a breakthrough in model accuracy for the classification of images due mainly to convolutional neural networks. In the present study, we attempted to investigate the presence of subclinical voice feature alteration in COVID-19 patients after the recent resolution of disease using deep learning. The study was a prospective study of 76 post COVID-19 patients and 40 healthy individuals. The diagnoses of post COVID-19 patients were based on more than the eighth week after onset of symptoms. Voice samples of an ‘ah’ sound, coughing sound and a polysyllabic sentence were collected and preprocessed to log-mel spectrogram. Transfer learning using the VGG19 pre-trained convolutional neural network was performed with all voice samples. The performance of the model using the polysyllabic sentence yielded the highest classification performance of all models. The coughing sound produced the lowest classification performance while the ability of the monosyllabic ‘ah’ sound to predict the recent COVID-19 fell between the other two vocalizations. The model using the polysyllabic sentence achieved 85% accuracy, 89% sensitivity, and 77% specificity. In conclusion, deep learning is able to detect the subtle change in voice features of COVID-19 patients after recent resolution of the disease. 2022-08-04T11:37:57Z 2022-08-04T11:37:57Z 2021-12-01 Article Scientific Reports. Vol.11, No.1 (2021) 10.1038/s41598-021-98742-x 20452322 2-s2.0-85115789818 https://repository.li.mahidol.ac.th/handle/123456789/79211 Mahidol University SCOPUS https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85115789818&origin=inward
institution Mahidol University
building Mahidol University Library
continent Asia
country Thailand
Thailand
content_provider Mahidol University Library
collection Mahidol University Institutional Repository
topic Multidisciplinary
spellingShingle Multidisciplinary
Pichatorn Suppakitjanusant
Somnuek Sungkanuparph
Thananya Wongsinin
Sirapong Virapongsiri
Nittaya Kasemkosin
Laor Chailurkit
Boonsong Ongphiphadhanakul
Identifying individuals with recent COVID-19 through voice classification using deep learning
description Recently deep learning has attained a breakthrough in model accuracy for the classification of images due mainly to convolutional neural networks. In the present study, we attempted to investigate the presence of subclinical voice feature alteration in COVID-19 patients after the recent resolution of disease using deep learning. The study was a prospective study of 76 post COVID-19 patients and 40 healthy individuals. The diagnoses of post COVID-19 patients were based on more than the eighth week after onset of symptoms. Voice samples of an ‘ah’ sound, coughing sound and a polysyllabic sentence were collected and preprocessed to log-mel spectrogram. Transfer learning using the VGG19 pre-trained convolutional neural network was performed with all voice samples. The performance of the model using the polysyllabic sentence yielded the highest classification performance of all models. The coughing sound produced the lowest classification performance while the ability of the monosyllabic ‘ah’ sound to predict the recent COVID-19 fell between the other two vocalizations. The model using the polysyllabic sentence achieved 85% accuracy, 89% sensitivity, and 77% specificity. In conclusion, deep learning is able to detect the subtle change in voice features of COVID-19 patients after recent resolution of the disease.
author2 Faculty of Medicine Ramathibodi Hospital, Mahidol University
author_facet Faculty of Medicine Ramathibodi Hospital, Mahidol University
Pichatorn Suppakitjanusant
Somnuek Sungkanuparph
Thananya Wongsinin
Sirapong Virapongsiri
Nittaya Kasemkosin
Laor Chailurkit
Boonsong Ongphiphadhanakul
format Article
author Pichatorn Suppakitjanusant
Somnuek Sungkanuparph
Thananya Wongsinin
Sirapong Virapongsiri
Nittaya Kasemkosin
Laor Chailurkit
Boonsong Ongphiphadhanakul
author_sort Pichatorn Suppakitjanusant
title Identifying individuals with recent COVID-19 through voice classification using deep learning
title_short Identifying individuals with recent COVID-19 through voice classification using deep learning
title_full Identifying individuals with recent COVID-19 through voice classification using deep learning
title_fullStr Identifying individuals with recent COVID-19 through voice classification using deep learning
title_full_unstemmed Identifying individuals with recent COVID-19 through voice classification using deep learning
title_sort identifying individuals with recent covid-19 through voice classification using deep learning
publishDate 2022
url https://repository.li.mahidol.ac.th/handle/123456789/79211
_version_ 1763495750981910528