Synthesizing data for multiclass image classification
Image classification is a task used to identify what each individual image represents. Multiclass classification is a type of classification task which separates data into more than two classes. CNN, a class of deep neural network, is commonly used to analyse visual images. However, insufficient dat...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2020
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/145163 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-145163 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1451632023-07-07T18:03:32Z Synthesizing data for multiclass image classification Lee, Tian Fa Alex Chichung Kot School of Electrical and Electronic Engineering EACKOT@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Image classification is a task used to identify what each individual image represents. Multiclass classification is a type of classification task which separates data into more than two classes. CNN, a class of deep neural network, is commonly used to analyse visual images. However, insufficient data will lead to lower accuracy in image classification. To elevate the performance of CNN, it is necessary to increase the amount of quality training data. Research have been done in DCGAN, but they did not evaluate whether different types of synthetic data are useful for data augmentation of images. In this project, a CNN model is proposed to evaluate the accuracy of different types of synthetic data and investigate their usefulness in multiclass classification. Due to limitation in computational power and memory, datasets with 10 classes such as MNIST and CIFAR-10 are used in the experiment. Different sets of tools such as DCGAN and pre-trained deep learning models and computer vision algorithms are used to generate DCGAN-original, cartoonized and sketched version of the dataset respectively. They are then added to the training dataset of the CNN with its test accuracy evaluated. After analysing the results, DCGAN-original data has the highest performance as its test accuracy is highest among the three. Furthermore, there is a higher potential to improve the accuracy of the DCGAN-original dataset by fine-tuning the DCGAN. Bachelor of Engineering (Electrical and Electronic Engineering) 2020-12-14T08:23:52Z 2020-12-14T08:23:52Z 2020 Final Year Project (FYP) https://hdl.handle.net/10356/145163 en application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence |
spellingShingle |
Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Lee, Tian Fa Synthesizing data for multiclass image classification |
description |
Image classification is a task used to identify what each individual image represents. Multiclass classification is a type of classification task which separates data into more than two classes. CNN, a class of deep neural network, is commonly used to analyse visual images. However, insufficient data will lead to lower accuracy in image classification. To elevate the performance of CNN, it is necessary to increase the amount of quality training data. Research have been done in DCGAN, but they did not evaluate whether different types of synthetic data are useful for data augmentation of images. In this project, a CNN model is proposed to evaluate the accuracy of different types of synthetic data and investigate their usefulness in multiclass classification. Due to limitation in computational power and memory, datasets with 10 classes such as MNIST and CIFAR-10 are used in the experiment. Different sets of tools such as DCGAN and pre-trained deep learning models and computer vision algorithms are used to generate DCGAN-original, cartoonized and sketched version of the dataset respectively. They are then added to the training dataset of the CNN with its test accuracy evaluated. After analysing the results, DCGAN-original data has the highest performance as its test accuracy is highest among the three. Furthermore, there is a higher potential to improve the accuracy of the DCGAN-original dataset by fine-tuning the DCGAN. |
author2 |
Alex Chichung Kot |
author_facet |
Alex Chichung Kot Lee, Tian Fa |
format |
Final Year Project |
author |
Lee, Tian Fa |
author_sort |
Lee, Tian Fa |
title |
Synthesizing data for multiclass image classification |
title_short |
Synthesizing data for multiclass image classification |
title_full |
Synthesizing data for multiclass image classification |
title_fullStr |
Synthesizing data for multiclass image classification |
title_full_unstemmed |
Synthesizing data for multiclass image classification |
title_sort |
synthesizing data for multiclass image classification |
publisher |
Nanyang Technological University |
publishDate |
2020 |
url |
https://hdl.handle.net/10356/145163 |
_version_ |
1772825146067779584 |