Augment image data using noise

Convolutional Neural Network (CNN) models for image classification have made strides in various fields such as identifying diseases in the medical industry. However, their performance is greatly affected by the data used to train them. Factors such as data quality, generalized data, and data quantit...

Full description

Saved in:
Bibliographic Details
Main Author: Muhammad Danish Bin Mohamad Nasir
Other Authors: Wang Lipo
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2024
Subjects:
CNN
AUC
SDV
Online Access:https://hdl.handle.net/10356/176469
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-176469
record_format dspace
spelling sg-ntu-dr.10356-1764692024-05-17T15:44:16Z Augment image data using noise Muhammad Danish Bin Mohamad Nasir Wang Lipo School of Electrical and Electronic Engineering ELPWang@ntu.edu.sg Computer and Information Science Densenet CNN AUC SDV PSNR SSIM Rayleigh Uniform Laplace Negative-exponential Exponential Augment Noise Dataset Code Convolutional Neural Network (CNN) models for image classification have made strides in various fields such as identifying diseases in the medical industry. However, their performance is greatly affected by the data used to train them. Factors such as data quality, generalized data, and data quantity contribute to the performance of CNN models. Noisy, esoteric, and too little data will decrease the classification accuracy of these models. This paper studies the effects of noise augmentation, a method to combat the negative factors above, by injecting various noise types into an X-ray image dataset. The noise types used are Rayleigh, Uniform, Laplace, and Negative Exponential distributed noise. A pre-trained DenseNet121 model is used to conduct training and testing. The metric evaluations highlighted here are Loss, area-under-curve (AUC) score, peak signal-to-noise ratio (PSNR), and structural similarity index measure (SSIM). By comparing models trained with the dataset of different noise types using Loss and AUC score, we determined the most suitable noise type here to be Rayleigh distributed noise. Further model testing is done using Rayleigh noise-augmented dataset and original-trained dataset for observation along with similarity tests of the 2 datasets. This resulted in both models performing similarly based on their average AUC scores of around 0.75, and it is reflected in the PSNR and SSIM tests as well. Bachelor's degree 2024-05-17T01:31:56Z 2024-05-17T01:31:56Z 2024 Final Year Project (FYP) Muhammad Danish Bin Mohamad Nasir (2024). Augment image data using noise. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/176469 https://hdl.handle.net/10356/176469 en A3224-231 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Computer and Information Science
Densenet
CNN
AUC
SDV
PSNR
SSIM
Rayleigh
Uniform
Laplace
Negative-exponential
Exponential
Augment
Noise
Dataset
Code
spellingShingle Computer and Information Science
Densenet
CNN
AUC
SDV
PSNR
SSIM
Rayleigh
Uniform
Laplace
Negative-exponential
Exponential
Augment
Noise
Dataset
Code
Muhammad Danish Bin Mohamad Nasir
Augment image data using noise
description Convolutional Neural Network (CNN) models for image classification have made strides in various fields such as identifying diseases in the medical industry. However, their performance is greatly affected by the data used to train them. Factors such as data quality, generalized data, and data quantity contribute to the performance of CNN models. Noisy, esoteric, and too little data will decrease the classification accuracy of these models. This paper studies the effects of noise augmentation, a method to combat the negative factors above, by injecting various noise types into an X-ray image dataset. The noise types used are Rayleigh, Uniform, Laplace, and Negative Exponential distributed noise. A pre-trained DenseNet121 model is used to conduct training and testing. The metric evaluations highlighted here are Loss, area-under-curve (AUC) score, peak signal-to-noise ratio (PSNR), and structural similarity index measure (SSIM). By comparing models trained with the dataset of different noise types using Loss and AUC score, we determined the most suitable noise type here to be Rayleigh distributed noise. Further model testing is done using Rayleigh noise-augmented dataset and original-trained dataset for observation along with similarity tests of the 2 datasets. This resulted in both models performing similarly based on their average AUC scores of around 0.75, and it is reflected in the PSNR and SSIM tests as well.
author2 Wang Lipo
author_facet Wang Lipo
Muhammad Danish Bin Mohamad Nasir
format Final Year Project
author Muhammad Danish Bin Mohamad Nasir
author_sort Muhammad Danish Bin Mohamad Nasir
title Augment image data using noise
title_short Augment image data using noise
title_full Augment image data using noise
title_fullStr Augment image data using noise
title_full_unstemmed Augment image data using noise
title_sort augment image data using noise
publisher Nanyang Technological University
publishDate 2024
url https://hdl.handle.net/10356/176469
_version_ 1806059823830138880