SWIN transformer for diabetic retinopathy detection
In the field of Machine Learning, Convolutional Neural Networks (CNNs) have been dominant in executing image classification tasks. Transformer models were first introduced in 2017 for Natural Language Processing tasks, where further development led to the introduction of Vision Transformers (ViTs) f...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2023
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/165923 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-165923 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1659232023-04-21T15:36:56Z SWIN transformer for diabetic retinopathy detection Ang, Elroy Wei Yong Jagath C Rajapakse School of Computer Science and Engineering Singapore Eye Research Institute ASJagath@ntu.edu.sg Engineering::Computer science and engineering In the field of Machine Learning, Convolutional Neural Networks (CNNs) have been dominant in executing image classification tasks. Transformer models were first introduced in 2017 for Natural Language Processing tasks, where further development led to the introduction of Vision Transformers (ViTs) for image classification. In the medical field, one of the many use cases of Artificial Intelligence is to detect diseases. Specific to eye diseases, Diabetic Retinopathy (DR) is a common disease that has been using CNNs to aid in its discovery or classification. While recent comparisons have shown that ViTs outperform CNNs on the ImageNet, none has been done on a DR dataset. In this paper, we aim to compare the performances of ViTs and its recent variants, and CNNs on detecting DR using a single standardized dataset. The dataset used for training is obtained from Kaggle, and there are two other separate external validation datasets. We demonstrate that the SWIN Transformer outperforms other architectures in this problem. Bachelor of Science in Data Science and Artificial Intelligence 2023-04-17T01:02:25Z 2023-04-17T01:02:25Z 2023 Final Year Project (FYP) Ang, E. W. Y. (2023). SWIN transformer for diabetic retinopathy detection. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/165923 https://hdl.handle.net/10356/165923 en SCSE22-0643 application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering |
spellingShingle |
Engineering::Computer science and engineering Ang, Elroy Wei Yong SWIN transformer for diabetic retinopathy detection |
description |
In the field of Machine Learning, Convolutional Neural Networks (CNNs) have been dominant in executing image classification tasks. Transformer models were first introduced in 2017 for Natural Language Processing tasks, where further development led to the introduction of Vision Transformers (ViTs) for image classification. In the medical field, one of the many use cases of Artificial Intelligence is to detect diseases. Specific to eye diseases, Diabetic Retinopathy (DR) is a common disease that has been using CNNs to aid in its discovery or classification. While recent comparisons have shown that ViTs outperform CNNs on the ImageNet, none has been done on a DR dataset. In this paper, we aim to compare the performances of ViTs and its recent variants, and CNNs on detecting DR using a single standardized dataset. The dataset used for training is obtained from Kaggle, and there are two other separate external validation datasets. We demonstrate that the SWIN Transformer outperforms other architectures in this problem. |
author2 |
Jagath C Rajapakse |
author_facet |
Jagath C Rajapakse Ang, Elroy Wei Yong |
format |
Final Year Project |
author |
Ang, Elroy Wei Yong |
author_sort |
Ang, Elroy Wei Yong |
title |
SWIN transformer for diabetic retinopathy detection |
title_short |
SWIN transformer for diabetic retinopathy detection |
title_full |
SWIN transformer for diabetic retinopathy detection |
title_fullStr |
SWIN transformer for diabetic retinopathy detection |
title_full_unstemmed |
SWIN transformer for diabetic retinopathy detection |
title_sort |
swin transformer for diabetic retinopathy detection |
publisher |
Nanyang Technological University |
publishDate |
2023 |
url |
https://hdl.handle.net/10356/165923 |
_version_ |
1764208113724948480 |