Prototype development of speech depression prediction system using TensorFlow lite on edge computing
Depression, a significant contributor to global suicide rates, poses unique diagnostic challenges in traditional clinical settings, resulting in frequently delayed diagnoses and potential patient misrepresentation. To address this issue, we presented an innovative prototype that combines edge comput...
Saved in:
Main Authors: | , , , |
---|---|
Format: | Proceeding Paper |
Language: | English English |
Published: |
IEEE
2023
|
Subjects: | |
Online Access: | http://irep.iium.edu.my/109827/7/109827_Prototype%20development%20of%20speech%20depression.pdf http://irep.iium.edu.my/109827/13/109827_Prototype%20development%20of%20speech%20depression_SCOPUS.pdf http://irep.iium.edu.my/109827/ https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10335415 https://doi.org/10.1109/ICWT58823.2023.10335415 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Islam Antarabangsa Malaysia |
Language: | English English |
Summary: | Depression, a significant contributor to global suicide rates, poses unique diagnostic challenges in traditional clinical settings, resulting in frequently delayed diagnoses and potential patient misrepresentation. To address this issue, we presented an innovative prototype that combines edge computing and deep learning for improved and faster detection of depression through speech behavior analysis. Our model used a one-dimensional Convolutional Neural Network (CNN) with TensorFlow Lite on an NVIDIA Jetson Nano platform. A MAONO AU903 Studio-Quality USB Microphone was used to achieve optimal audio quality. By analyzing speech behavior, this setup effectively distinguished between depressive and non-depressive speech patterns. A data augmentation procedure that included noise in audio data increased the model's robustness. Because of its suitability and an extensive collection of interviews with subjects in various depressive states, the Distress Analysis Interview Corpus - Wizard of Oz (DAIC-WOZ) database was used for training and testing. The successful operation of the prototype demonstrates the method's diagnostic potential for clinical depression. While the developed model's accuracy could be improved by investigating alternative deep learning architectures, it provided a solid foundation for future development. The study emphasized the importance of further research into real-time depression prediction using speech analysis, which has the potential to revolutionize mental health diagnostics. |
---|