Troika Generative Adversarial Network (T-GAN): A Synthetic Image Generator That Improves Neural Network Training for Handwriting Classification

Training an artificial neural network for handwriting classification requires a sufficiently sized annotated dataset in order to avoid overfitting. In the absence of sufficient instances, data augmentation techniques are normally considered. In this paper, we propose the troika generative adversaria...

全面介紹

Saved in:

書目詳細資料
Main Authors:	Milan, Joe Anthony M, Fernandez, Proceso L, Jr
格式:	text
出版:	Archīum Ateneo 2020
主題:	Handwriting Classification Generative Adversarial Networks Computer Sciences Databases and Information Systems
在線閱讀:	https://archium.ateneo.edu/discs-faculty-pubs/208 https://archium.ateneo.edu/cgi/viewcontent.cgi?article=1207&context=discs-faculty-pubs
標簽:	添加標簽沒有標簽, 成為第一個標記此記錄!
機構:	Ateneo De Manila University

id	ph-ateneo-arc.discs-faculty-pubs-1207
record_format	eprints
spelling	ph-ateneo-arc.discs-faculty-pubs-12072021-07-07T10:35:45Z Troika Generative Adversarial Network (T-GAN): A Synthetic Image Generator That Improves Neural Network Training for Handwriting Classification Milan, Joe Anthony M Fernandez, Proceso L, Jr Training an artificial neural network for handwriting classification requires a sufficiently sized annotated dataset in order to avoid overfitting. In the absence of sufficient instances, data augmentation techniques are normally considered. In this paper, we propose the troika generative adversarial network (T-GAN) for data augmentation to address the scarcity of publicly labeled handwriting datasets. T-GAN has three generator subnetworks architectured to have some weight-sharing in order to learn the joint distribution from three specific domains. We used T-GAN to augment the data from a subset of the IAM Handwriting Database. We then compared this with other data augmentation techniques by measuring the improvements brought by each technique to the handwriting classification accuracies in three types of artificial neural networks (ANNs): deep ANN, convolutional neural network (CNN), and deep CNN. The data augmentation technique involving the T-GAN yielded the highest accuracy improvements in each of the three ANN classifier types – outperforming the standard techniques of image rotation, affine transformation, and combination of these two – as well as the technique that uses another GAN-based model, the coupled GAN (CoGAN). Furthermore, a paired t-test between the 10-fold cross-validation results of the T-GAN and CoGAN, the second-best augmentation technique in this study, on a deep CNN-made classifier confirmed the superiority of the data augmentation technique that uses the T-GAN. Finally, when the generated synthetic data instances from the T-GAN were further enhanced using the pepper noise removal and median filter, the classification accuracy of the trained CNN and deep CNN classifiers were further improved to 93.54% and 95.45%, respectively. Each of these is a big improvement from the original accuracies of 67.43% and 68.32%, respectively of the 2 classifiers trained on the original unaugmented dataset. Thus, data augmentation using T-GAN – coupled with the mentioned two image noise removal techniques – can be a preferred pre-training technique for augmenting handwriting datasets with insufficient data samples. 2020-01-01T08:00:00Z text application/pdf https://archium.ateneo.edu/discs-faculty-pubs/208 https://archium.ateneo.edu/cgi/viewcontent.cgi?article=1207&context=discs-faculty-pubs Department of Information Systems & Computer Science Faculty Publications Archīum Ateneo Handwriting Classification Generative Adversarial Networks Computer Sciences Databases and Information Systems
institution	Ateneo De Manila University
building	Ateneo De Manila University Library
continent	Asia
country	Philippines Philippines
content_provider	Ateneo De Manila University Library
collection	archium.Ateneo Institutional Repository
topic	Handwriting Classification Generative Adversarial Networks Computer Sciences Databases and Information Systems
spellingShingle	Handwriting Classification Generative Adversarial Networks Computer Sciences Databases and Information Systems Milan, Joe Anthony M Fernandez, Proceso L, Jr Troika Generative Adversarial Network (T-GAN): A Synthetic Image Generator That Improves Neural Network Training for Handwriting Classification
description	Training an artificial neural network for handwriting classification requires a sufficiently sized annotated dataset in order to avoid overfitting. In the absence of sufficient instances, data augmentation techniques are normally considered. In this paper, we propose the troika generative adversarial network (T-GAN) for data augmentation to address the scarcity of publicly labeled handwriting datasets. T-GAN has three generator subnetworks architectured to have some weight-sharing in order to learn the joint distribution from three specific domains. We used T-GAN to augment the data from a subset of the IAM Handwriting Database. We then compared this with other data augmentation techniques by measuring the improvements brought by each technique to the handwriting classification accuracies in three types of artificial neural networks (ANNs): deep ANN, convolutional neural network (CNN), and deep CNN. The data augmentation technique involving the T-GAN yielded the highest accuracy improvements in each of the three ANN classifier types – outperforming the standard techniques of image rotation, affine transformation, and combination of these two – as well as the technique that uses another GAN-based model, the coupled GAN (CoGAN). Furthermore, a paired t-test between the 10-fold cross-validation results of the T-GAN and CoGAN, the second-best augmentation technique in this study, on a deep CNN-made classifier confirmed the superiority of the data augmentation technique that uses the T-GAN. Finally, when the generated synthetic data instances from the T-GAN were further enhanced using the pepper noise removal and median filter, the classification accuracy of the trained CNN and deep CNN classifiers were further improved to 93.54% and 95.45%, respectively. Each of these is a big improvement from the original accuracies of 67.43% and 68.32%, respectively of the 2 classifiers trained on the original unaugmented dataset. Thus, data augmentation using T-GAN – coupled with the mentioned two image noise removal techniques – can be a preferred pre-training technique for augmenting handwriting datasets with insufficient data samples.
format	text
author	Milan, Joe Anthony M Fernandez, Proceso L, Jr
author_facet	Milan, Joe Anthony M Fernandez, Proceso L, Jr
author_sort	Milan, Joe Anthony M
title	Troika Generative Adversarial Network (T-GAN): A Synthetic Image Generator That Improves Neural Network Training for Handwriting Classification
title_short	Troika Generative Adversarial Network (T-GAN): A Synthetic Image Generator That Improves Neural Network Training for Handwriting Classification
title_full	Troika Generative Adversarial Network (T-GAN): A Synthetic Image Generator That Improves Neural Network Training for Handwriting Classification
title_fullStr	Troika Generative Adversarial Network (T-GAN): A Synthetic Image Generator That Improves Neural Network Training for Handwriting Classification
title_full_unstemmed	Troika Generative Adversarial Network (T-GAN): A Synthetic Image Generator That Improves Neural Network Training for Handwriting Classification
title_sort	troika generative adversarial network (t-gan): a synthetic image generator that improves neural network training for handwriting classification
publisher	Archīum Ateneo
publishDate	2020
url	https://archium.ateneo.edu/discs-faculty-pubs/208 https://archium.ateneo.edu/cgi/viewcontent.cgi?article=1207&context=discs-faculty-pubs
_version_	1722366504614232064

Troika Generative Adversarial Network (T-GAN): A Synthetic Image Generator That Improves Neural Network Training for Handwriting Classification

相似書籍