Deep learning for fusing speech and text for detection of Alzheimer’s Disease

Alzheimer’s Disease (AD) is one of the most common forms of dementia which occurs mainly in elderlies. Extensive research has been done to find a cure. However, early intervention is just as important. Impaired speech, which occurs in the early stages of AD, could be used as a biomarker for early...

Full description

Saved in:
Bibliographic Details
Main Author: Tan, Jun Xian
Other Authors: Jagath C Rajapakse
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2022
Subjects:
Online Access:https://hdl.handle.net/10356/156350
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-156350
record_format dspace
spelling sg-ntu-dr.10356-1563502022-04-14T12:24:59Z Deep learning for fusing speech and text for detection of Alzheimer’s Disease Tan, Jun Xian Jagath C Rajapakse School of Computer Science and Engineering ASJagath@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Alzheimer’s Disease (AD) is one of the most common forms of dementia which occurs mainly in elderlies. Extensive research has been done to find a cure. However, early intervention is just as important. Impaired speech, which occurs in the early stages of AD, could be used as a biomarker for early detection of AD. Impaired speech could provide useful information, such as audio and text features, to be used for detection of AD. In this project, we will be using speech and text separately to detect AD. Then, speech and text will be combined to detect AD. Audio features like extended Geneva Minimalistic Acoustic Parameter Set (eGeMAPS) and text features like document embeddings, word embeddings, will be extracted. These features will then be used for detection of AD, using different machine learning and deep learning methods, such as Decision Tree, Random Forest, Support Vector Machine, Logistic Regression, Neural Networks. To obtain better results, the best models achieved from using speech and text will be combined. The fusion of speech and text will be done by using fusion mechanisms such as Concatenation and Bilinear Pooling. Our results indicated that by using the bilinear pooling mechanism, it produced better results compared to the concatenation mechanism. Bachelor of Engineering (Computer Science) 2022-04-14T12:24:59Z 2022-04-14T12:24:59Z 2022 Final Year Project (FYP) Tan, J. X. (2022). Deep learning for fusing speech and text for detection of Alzheimer’s Disease. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/156350 https://hdl.handle.net/10356/156350 en SCSE21-0419 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
spellingShingle Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Tan, Jun Xian
Deep learning for fusing speech and text for detection of Alzheimer’s Disease
description Alzheimer’s Disease (AD) is one of the most common forms of dementia which occurs mainly in elderlies. Extensive research has been done to find a cure. However, early intervention is just as important. Impaired speech, which occurs in the early stages of AD, could be used as a biomarker for early detection of AD. Impaired speech could provide useful information, such as audio and text features, to be used for detection of AD. In this project, we will be using speech and text separately to detect AD. Then, speech and text will be combined to detect AD. Audio features like extended Geneva Minimalistic Acoustic Parameter Set (eGeMAPS) and text features like document embeddings, word embeddings, will be extracted. These features will then be used for detection of AD, using different machine learning and deep learning methods, such as Decision Tree, Random Forest, Support Vector Machine, Logistic Regression, Neural Networks. To obtain better results, the best models achieved from using speech and text will be combined. The fusion of speech and text will be done by using fusion mechanisms such as Concatenation and Bilinear Pooling. Our results indicated that by using the bilinear pooling mechanism, it produced better results compared to the concatenation mechanism.
author2 Jagath C Rajapakse
author_facet Jagath C Rajapakse
Tan, Jun Xian
format Final Year Project
author Tan, Jun Xian
author_sort Tan, Jun Xian
title Deep learning for fusing speech and text for detection of Alzheimer’s Disease
title_short Deep learning for fusing speech and text for detection of Alzheimer’s Disease
title_full Deep learning for fusing speech and text for detection of Alzheimer’s Disease
title_fullStr Deep learning for fusing speech and text for detection of Alzheimer’s Disease
title_full_unstemmed Deep learning for fusing speech and text for detection of Alzheimer’s Disease
title_sort deep learning for fusing speech and text for detection of alzheimer’s disease
publisher Nanyang Technological University
publishDate 2022
url https://hdl.handle.net/10356/156350
_version_ 1731235742218190848