Synthesizing data for multiclass image classification

Image classification is a task used to identify what each individual image represents. Multiclass classification is a type of classification task which separates data into more than two classes. CNN, a class of deep neural network, is commonly used to analyse visual images. However, insufficient dat...

Bibliographic Details
Main Author:	Lee, Tian Fa
Other Authors:	Alex Chichung Kot
Format:	Final Year Project
Language:	English
Published:	Nanyang Technological University 2020
Subjects:	Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Online Access:	https://hdl.handle.net/10356/145163
Institution:	Nanyang Technological University

id	sg-ntu-dr.10356-145163
record_format	dspace
spelling	sg-ntu-dr.10356-145163 2023-07-07T18:03:32Z Synthesizing data for multiclass image classification Lee, Tian Fa Alex Chichung Kot School of Electrical and Electronic Engineering EACKOT@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Bachelor of Engineering (Electrical and Electronic Engineering) 2020-12-14T08:23:52Z 2020-12-14T08:23:52Z 2020 Final Year Project (FYP) https://hdl.handle.net/10356/145163 en application/pdf Nanyang Technological University
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
description	Image classification is the task of identifying what each individual image represents. Multiclass classification is a type of classification task that separates data into more than two classes. The convolutional neural network (CNN), a class of deep neural network, is commonly used to analyse visual images. However, insufficient data leads to lower accuracy in image classification. To improve the performance of a CNN, it is necessary to increase the amount of quality training data. Research has been done on DCGAN, but it has not evaluated whether different types of synthetic data are useful for image data augmentation. In this project, a CNN model is proposed to evaluate the accuracy obtained with different types of synthetic data and to investigate their usefulness in multiclass classification. Due to limitations in computational power and memory, datasets with 10 classes, such as MNIST and CIFAR-10, are used in the experiment. Different sets of tools (DCGAN, pre-trained deep learning models, and computer vision algorithms) are used to generate the DCGAN-original, cartoonized, and sketched versions of the dataset, respectively. Each synthetic version is then added to the training dataset of the CNN, and the resulting test accuracy is evaluated. The results show that the DCGAN-original data performs best, giving the highest test accuracy of the three. Furthermore, the accuracy obtained with the DCGAN-original dataset could potentially be improved further by fine-tuning the DCGAN.
author2	Alex Chichung Kot
author_facet	Alex Chichung Kot Lee, Tian Fa
format	Final Year Project
author	Lee, Tian Fa
author_sort	Lee, Tian Fa
title	Synthesizing data for multiclass image classification
publisher	Nanyang Technological University
publishDate	2020
url	https://hdl.handle.net/10356/145163
_version_	1772825146067779584
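
The evaluation procedure outlined in the description (add each synthetic version to the training data, then compare test accuracy) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the project's code: a nearest-centroid classifier stands in for the CNN so that the example runs with the standard library alone, and every name here (`fit_centroids`, `predict`, `evaluate_accuracy`, `compare_variants`) is hypothetical.

```python
from collections import defaultdict
from typing import Dict, List, Sequence, Tuple

# A sample is a flattened image (feature vector) paired with its class label.
Sample = Tuple[Sequence[float], int]


def fit_centroids(train: List[Sample]) -> Dict[int, List[float]]:
    """Stand-in for CNN training (an assumption): one mean vector per class."""
    sums: Dict[int, List[float]] = {}
    counts: Dict[int, int] = defaultdict(int)
    for features, label in train:
        if label not in sums:
            sums[label] = [0.0] * len(features)
        for i, value in enumerate(features):
            sums[label][i] += value
        counts[label] += 1
    return {
        label: [s / counts[label] for s in total]
        for label, total in sums.items()
    }


def predict(centroids: Dict[int, List[float]], features: Sequence[float]) -> int:
    """Return the class whose centroid is nearest in squared Euclidean distance."""
    def squared_distance(label: int) -> float:
        return sum((c - f) ** 2 for c, f in zip(centroids[label], features))
    return min(centroids, key=squared_distance)


def evaluate_accuracy(train: List[Sample], test: List[Sample]) -> float:
    """Fit on `train`, then return the fraction of `test` classified correctly."""
    centroids = fit_centroids(train)
    correct = sum(predict(centroids, x) == y for x, y in test)
    return correct / len(test)


def compare_variants(
    original: List[Sample],
    variants: Dict[str, List[Sample]],
    test: List[Sample],
) -> Dict[str, float]:
    """Test accuracy for the original data alone and with each synthetic set added."""
    results = {"original only": evaluate_accuracy(original, test)}
    for name, synthetic in variants.items():
        # Augmentation step: synthetic samples are appended to the real ones.
        results[f"original + {name}"] = evaluate_accuracy(original + synthetic, test)
    return results
```

In the setting of the description, `variants` would hold the DCGAN-original, cartoonized, and sketched samples, and the held-out test set stays fixed across runs so that the accuracies are comparable.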
