Augment image data using noise
Convolutional Neural Network (CNN) models for image classification have made strides in various fields such as identifying diseases in the medical industry. However, their performance is greatly affected by the data used to train them. Factors such as data quality, generalized data, and data quantit...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2024
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/176469 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-176469 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1764692024-05-17T15:44:16Z Augment image data using noise Muhammad Danish Bin Mohamad Nasir Wang Lipo School of Electrical and Electronic Engineering ELPWang@ntu.edu.sg Computer and Information Science Densenet CNN AUC SDV PSNR SSIM Rayleigh Uniform Laplace Negative-exponential Exponential Augment Noise Dataset Code Convolutional Neural Network (CNN) models for image classification have made strides in various fields such as identifying diseases in the medical industry. However, their performance is greatly affected by the data used to train them. Factors such as data quality, generalized data, and data quantity contribute to the performance of CNN models. Noisy, esoteric, and too little data will decrease the classification accuracy of these models. This paper studies the effects of noise augmentation, a method to combat the negative factors above, by injecting various noise types into an X-ray image dataset. The noise types used are Rayleigh, Uniform, Laplace, and Negative Exponential distributed noise. A pre-trained DenseNet121 model is used to conduct training and testing. The metric evaluations highlighted here are Loss, area-under-curve (AUC) score, peak signal-to-noise ratio (PSNR), and structural similarity index measure (SSIM). By comparing models trained with the dataset of different noise types using Loss and AUC score, we determined the most suitable noise type here to be Rayleigh distributed noise. Further model testing is done using Rayleigh noise-augmented dataset and original-trained dataset for observation along with similarity tests of the 2 datasets. This resulted in both models performing similarly based on their average AUC scores of around 0.75, and it is reflected in the PSNR and SSIM tests as well. Bachelor's degree 2024-05-17T01:31:56Z 2024-05-17T01:31:56Z 2024 Final Year Project (FYP) Muhammad Danish Bin Mohamad Nasir (2024). Augment image data using noise. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/176469 https://hdl.handle.net/10356/176469 en A3224-231 application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Computer and Information Science Densenet CNN AUC SDV PSNR SSIM Rayleigh Uniform Laplace Negative-exponential Exponential Augment Noise Dataset Code |
spellingShingle |
Computer and Information Science Densenet CNN AUC SDV PSNR SSIM Rayleigh Uniform Laplace Negative-exponential Exponential Augment Noise Dataset Code Muhammad Danish Bin Mohamad Nasir Augment image data using noise |
description |
Convolutional Neural Network (CNN) models for image classification have made strides in various fields such as identifying diseases in the medical industry. However, their performance is greatly affected by the data used to train them. Factors such as data quality, generalized data, and data quantity contribute to the performance of CNN models. Noisy, esoteric, and too little data will decrease the classification accuracy of these models. This paper studies the effects of noise augmentation, a method to combat the negative factors above, by injecting various noise types into an X-ray image dataset. The noise types used are Rayleigh, Uniform, Laplace, and Negative Exponential distributed noise. A pre-trained DenseNet121 model is used to conduct training and testing. The metric evaluations highlighted here are Loss, area-under-curve (AUC) score, peak signal-to-noise ratio (PSNR), and structural similarity index measure (SSIM). By comparing models trained with the dataset of different noise types using Loss and AUC score, we determined the most suitable noise type here to be Rayleigh distributed noise. Further model testing is done using Rayleigh noise-augmented dataset and original-trained dataset for observation along with similarity tests of the 2 datasets. This resulted in both models performing similarly based on their average AUC scores of around 0.75, and it is reflected in the PSNR and SSIM tests as well. |
author2 |
Wang Lipo |
author_facet |
Wang Lipo Muhammad Danish Bin Mohamad Nasir |
format |
Final Year Project |
author |
Muhammad Danish Bin Mohamad Nasir |
author_sort |
Muhammad Danish Bin Mohamad Nasir |
title |
Augment image data using noise |
title_short |
Augment image data using noise |
title_full |
Augment image data using noise |
title_fullStr |
Augment image data using noise |
title_full_unstemmed |
Augment image data using noise |
title_sort |
augment image data using noise |
publisher |
Nanyang Technological University |
publishDate |
2024 |
url |
https://hdl.handle.net/10356/176469 |
_version_ |
1806059823830138880 |