Revolutionizing digit image recognition: pushing the limits with simple CNN and challenging image augmentation techniques on MNIST
This study aims to apply Convolutional Neural Networks (CNN) and image augmentation techniques in digit recognition using the MNIST dataset. We built a CNN model and experimented with various image augmentation techniques to improve digit recognition accuracy. The results showed that the use of CN...
Saved in:
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English English |
Published: |
Bright Publisher
2023
|
Subjects: | |
Online Access: | http://irep.iium.edu.my/107246/1/107246_Revolutionizing%20digit%20image%20recognition.pdf http://irep.iium.edu.my/107246/2/107246_Revolutionizing%20digit%20image%20recognition_SCOPUS.pdf http://irep.iium.edu.my/107246/ https://bright-journal.org/Journal/index.php/JADS/article/view/104 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Islam Antarabangsa Malaysia |
Language: | English English |
Summary: | This study aims to apply Convolutional Neural Networks (CNN) and image augmentation techniques in digit recognition using the MNIST
dataset. We built a CNN model and experimented with various image augmentation techniques to improve digit recognition accuracy. The
results showed that the use of CNN with image augmentation techniques was effective in improving digit recognition performance. In the data
collection stage, we used the MNIST dataset consisting of images of handwritten digits as training and testing data. After building the CNN
model, we apply image augmentation techniques such as rotation, shift, and flipping to the training data to enrich the data variety and prevent
overfitting. The evaluation results show that the CNN model that has been trained with image augmentation techniques produces significant
accuracy, with a maximum accuracy of 99.81%. We also performed an ensemble of several CNN models and found that this approach increased
the digit recognition accuracy to 99.79%. This research has the potential for further development. Recommendations for further research
include exploring more specific and complex image augmentation techniques, as well as using more challenging datasets. In addition, future
research may consider improvements to the CNN architecture used or combining it with other methods such as recurrent neural networks
(RNN). |
---|