Deep learning for fusing speech and text for detection of Alzheimer’s Disease
Alzheimer’s Disease (AD) is one of the most common forms of dementia which occurs mainly in elderlies. Extensive research has been done to find a cure. However, early intervention is just as important. Impaired speech, which occurs in the early stages of AD, could be used as a biomarker for early...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2022
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/156350 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-156350 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1563502022-04-14T12:24:59Z Deep learning for fusing speech and text for detection of Alzheimer’s Disease Tan, Jun Xian Jagath C Rajapakse School of Computer Science and Engineering ASJagath@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Alzheimer’s Disease (AD) is one of the most common forms of dementia which occurs mainly in elderlies. Extensive research has been done to find a cure. However, early intervention is just as important. Impaired speech, which occurs in the early stages of AD, could be used as a biomarker for early detection of AD. Impaired speech could provide useful information, such as audio and text features, to be used for detection of AD. In this project, we will be using speech and text separately to detect AD. Then, speech and text will be combined to detect AD. Audio features like extended Geneva Minimalistic Acoustic Parameter Set (eGeMAPS) and text features like document embeddings, word embeddings, will be extracted. These features will then be used for detection of AD, using different machine learning and deep learning methods, such as Decision Tree, Random Forest, Support Vector Machine, Logistic Regression, Neural Networks. To obtain better results, the best models achieved from using speech and text will be combined. The fusion of speech and text will be done by using fusion mechanisms such as Concatenation and Bilinear Pooling. Our results indicated that by using the bilinear pooling mechanism, it produced better results compared to the concatenation mechanism. Bachelor of Engineering (Computer Science) 2022-04-14T12:24:59Z 2022-04-14T12:24:59Z 2022 Final Year Project (FYP) Tan, J. X. (2022). Deep learning for fusing speech and text for detection of Alzheimer’s Disease. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/156350 https://hdl.handle.net/10356/156350 en SCSE21-0419 application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence |
spellingShingle |
Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Tan, Jun Xian Deep learning for fusing speech and text for detection of Alzheimer’s Disease |
description |
Alzheimer’s Disease (AD) is one of the most common forms of dementia which occurs mainly in
elderlies. Extensive research has been done to find a cure. However, early intervention is just as
important. Impaired speech, which occurs in the early stages of AD, could be used as a
biomarker for early detection of AD. Impaired speech could provide useful information, such as
audio and text features, to be used for detection of AD.
In this project, we will be using speech and text separately to detect AD. Then, speech and text
will be combined to detect AD. Audio features like extended Geneva Minimalistic Acoustic
Parameter Set (eGeMAPS) and text features like document embeddings, word embeddings, will
be extracted. These features will then be used for detection of AD, using different machine
learning and deep learning methods, such as Decision Tree, Random Forest, Support Vector
Machine, Logistic Regression, Neural Networks.
To obtain better results, the best models achieved from using speech and text will be
combined. The fusion of speech and text will be done by using fusion mechanisms such as
Concatenation and Bilinear Pooling. Our results indicated that by using the bilinear pooling
mechanism, it produced better results compared to the concatenation mechanism. |
author2 |
Jagath C Rajapakse |
author_facet |
Jagath C Rajapakse Tan, Jun Xian |
format |
Final Year Project |
author |
Tan, Jun Xian |
author_sort |
Tan, Jun Xian |
title |
Deep learning for fusing speech and text for detection of Alzheimer’s Disease |
title_short |
Deep learning for fusing speech and text for detection of Alzheimer’s Disease |
title_full |
Deep learning for fusing speech and text for detection of Alzheimer’s Disease |
title_fullStr |
Deep learning for fusing speech and text for detection of Alzheimer’s Disease |
title_full_unstemmed |
Deep learning for fusing speech and text for detection of Alzheimer’s Disease |
title_sort |
deep learning for fusing speech and text for detection of alzheimer’s disease |
publisher |
Nanyang Technological University |
publishDate |
2022 |
url |
https://hdl.handle.net/10356/156350 |
_version_ |
1731235742218190848 |