Prototype development of speech depression prediction system using TensorFlow lite on edge computing

Depression, a significant contributor to global suicide rates, poses unique diagnostic challenges in traditional clinical settings, resulting in frequently delayed diagnoses and potential patient misrepresentation. To address this issue, we presented an innovative prototype that combines edge comput...

Full description

Saved in:
Bibliographic Details
Main Authors: Gunawan, Teddy Surya, Ibrahim, Nur Firzanah Iwani, Kartiwi, Mira, Ismail, Nanang
Format: Proceeding Paper
Language:English
English
Published: IEEE 2023
Subjects:
Online Access:http://irep.iium.edu.my/109827/7/109827_Prototype%20development%20of%20speech%20depression.pdf
http://irep.iium.edu.my/109827/13/109827_Prototype%20development%20of%20speech%20depression_SCOPUS.pdf
http://irep.iium.edu.my/109827/
https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10335415
https://doi.org/10.1109/ICWT58823.2023.10335415
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Islam Antarabangsa Malaysia
Language: English
English
Description
Summary:Depression, a significant contributor to global suicide rates, poses unique diagnostic challenges in traditional clinical settings, resulting in frequently delayed diagnoses and potential patient misrepresentation. To address this issue, we presented an innovative prototype that combines edge computing and deep learning for improved and faster detection of depression through speech behavior analysis. Our model used a one-dimensional Convolutional Neural Network (CNN) with TensorFlow Lite on an NVIDIA Jetson Nano platform. A MAONO AU903 Studio-Quality USB Microphone was used to achieve optimal audio quality. By analyzing speech behavior, this setup effectively distinguished between depressive and non-depressive speech patterns. A data augmentation procedure that included noise in audio data increased the model's robustness. Because of its suitability and an extensive collection of interviews with subjects in various depressive states, the Distress Analysis Interview Corpus - Wizard of Oz (DAIC-WOZ) database was used for training and testing. The successful operation of the prototype demonstrates the method's diagnostic potential for clinical depression. While the developed model's accuracy could be improved by investigating alternative deep learning architectures, it provided a solid foundation for future development. The study emphasized the importance of further research into real-time depression prediction using speech analysis, which has the potential to revolutionize mental health diagnostics.