A convolutional neural network approach to textual captcha solving

When trying to determine whether a user is a human or automated software, CAPTCHA is often employed as part of the verification process. However, due to the intricacy and security of the CAPTCHA, there are several methods for cracking or solving the CAPTCHA, one of which in a text- based CAPTCHA is...

Full description

Saved in:
Bibliographic Details
Main Authors: Leong, Yau Wah, Rechard Lee
Format: Proceedings
Language:English
English
Published: Faculty of Science & Natural Resources, UMS 2022
Subjects:
Online Access:https://eprints.ums.edu.my/id/eprint/40624/1/ABSTRACT.pdf
https://eprints.ums.edu.my/id/eprint/40624/2/FULL%20TEXT.pdf
https://eprints.ums.edu.my/id/eprint/40624/
https://www.ums.edu.my/fssa/index.php/research/conference-publication
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Malaysia Sabah
Language: English
English
id my.ums.eprints.40624
record_format eprints
spelling my.ums.eprints.406242024-08-13T02:23:20Z https://eprints.ums.edu.my/id/eprint/40624/ A convolutional neural network approach to textual captcha solving Leong, Yau Wah Rechard Lee Q350-390 Information theory When trying to determine whether a user is a human or automated software, CAPTCHA is often employed as part of the verification process. However, due to the intricacy and security of the CAPTCHA, there are several methods for cracking or solving the CAPTCHA, one of which in a text- based CAPTCHA is the use of noise interference to deter CAPTCHA solvers. Therefore, in this study, we built a Convolutional Neural Network model to decipher text-based CAPTCHAs and presented a pre-processing technique to lessen the impact of those noises. To reduce noise in the text-based CAPTCHAs, we incorporated picture binarization, morphological operation, and median filter as a pre- processing step. We then trained our own Convolutional Neural Network model to distinguish between 34 other classes of alphanumeric characters other than the letters 'I' and 'O on a dataset consisting of 16,000 pre-processed CAPTCHAs generated with Python's Image Captcha package, and then we used 4,000 pre-processed CAPTCHAs to evaluate our model's performance on text-based CAPTCHAs. With our pre-processing strategy, we were able to raise the success rate of text-based CAPTCHA solutions from 84.68% to 89.41%, a substantial improvement of 4.73%. The overall accuracy is 0.9724 or 97.24% for our model in classifying all the 34 alphanumeric characters in Image Captcha. Faculty of Science & Natural Resources, UMS 2022 Proceedings PeerReviewed text en https://eprints.ums.edu.my/id/eprint/40624/1/ABSTRACT.pdf text en https://eprints.ums.edu.my/id/eprint/40624/2/FULL%20TEXT.pdf Leong, Yau Wah and Rechard Lee (2022) A convolutional neural network approach to textual captcha solving. https://www.ums.edu.my/fssa/index.php/research/conference-publication
institution Universiti Malaysia Sabah
building UMS Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaysia Sabah
content_source UMS Institutional Repository
url_provider http://eprints.ums.edu.my/
language English
English
topic Q350-390 Information theory
spellingShingle Q350-390 Information theory
Leong, Yau Wah
Rechard Lee
A convolutional neural network approach to textual captcha solving
description When trying to determine whether a user is a human or automated software, CAPTCHA is often employed as part of the verification process. However, due to the intricacy and security of the CAPTCHA, there are several methods for cracking or solving the CAPTCHA, one of which in a text- based CAPTCHA is the use of noise interference to deter CAPTCHA solvers. Therefore, in this study, we built a Convolutional Neural Network model to decipher text-based CAPTCHAs and presented a pre-processing technique to lessen the impact of those noises. To reduce noise in the text-based CAPTCHAs, we incorporated picture binarization, morphological operation, and median filter as a pre- processing step. We then trained our own Convolutional Neural Network model to distinguish between 34 other classes of alphanumeric characters other than the letters 'I' and 'O on a dataset consisting of 16,000 pre-processed CAPTCHAs generated with Python's Image Captcha package, and then we used 4,000 pre-processed CAPTCHAs to evaluate our model's performance on text-based CAPTCHAs. With our pre-processing strategy, we were able to raise the success rate of text-based CAPTCHA solutions from 84.68% to 89.41%, a substantial improvement of 4.73%. The overall accuracy is 0.9724 or 97.24% for our model in classifying all the 34 alphanumeric characters in Image Captcha.
format Proceedings
author Leong, Yau Wah
Rechard Lee
author_facet Leong, Yau Wah
Rechard Lee
author_sort Leong, Yau Wah
title A convolutional neural network approach to textual captcha solving
title_short A convolutional neural network approach to textual captcha solving
title_full A convolutional neural network approach to textual captcha solving
title_fullStr A convolutional neural network approach to textual captcha solving
title_full_unstemmed A convolutional neural network approach to textual captcha solving
title_sort convolutional neural network approach to textual captcha solving
publisher Faculty of Science & Natural Resources, UMS
publishDate 2022
url https://eprints.ums.edu.my/id/eprint/40624/1/ABSTRACT.pdf
https://eprints.ums.edu.my/id/eprint/40624/2/FULL%20TEXT.pdf
https://eprints.ums.edu.my/id/eprint/40624/
https://www.ums.edu.my/fssa/index.php/research/conference-publication
_version_ 1809140079603482624