A convolutional neural network approach to textual captcha solving
When trying to determine whether a user is a human or automated software, CAPTCHA is often employed as part of the verification process. However, due to the intricacy and security of the CAPTCHA, there are several methods for cracking or solving the CAPTCHA, one of which in a text- based CAPTCHA is...
Saved in:
Main Authors: | , |
---|---|
Format: | Proceedings |
Language: | English English |
Published: |
Faculty of Science & Natural Resources, UMS
2022
|
Subjects: | |
Online Access: | https://eprints.ums.edu.my/id/eprint/40624/1/ABSTRACT.pdf https://eprints.ums.edu.my/id/eprint/40624/2/FULL%20TEXT.pdf https://eprints.ums.edu.my/id/eprint/40624/ https://www.ums.edu.my/fssa/index.php/research/conference-publication |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Malaysia Sabah |
Language: | English English |
id |
my.ums.eprints.40624 |
---|---|
record_format |
eprints |
spelling |
my.ums.eprints.406242024-08-13T02:23:20Z https://eprints.ums.edu.my/id/eprint/40624/ A convolutional neural network approach to textual captcha solving Leong, Yau Wah Rechard Lee Q350-390 Information theory When trying to determine whether a user is a human or automated software, CAPTCHA is often employed as part of the verification process. However, due to the intricacy and security of the CAPTCHA, there are several methods for cracking or solving the CAPTCHA, one of which in a text- based CAPTCHA is the use of noise interference to deter CAPTCHA solvers. Therefore, in this study, we built a Convolutional Neural Network model to decipher text-based CAPTCHAs and presented a pre-processing technique to lessen the impact of those noises. To reduce noise in the text-based CAPTCHAs, we incorporated picture binarization, morphological operation, and median filter as a pre- processing step. We then trained our own Convolutional Neural Network model to distinguish between 34 other classes of alphanumeric characters other than the letters 'I' and 'O on a dataset consisting of 16,000 pre-processed CAPTCHAs generated with Python's Image Captcha package, and then we used 4,000 pre-processed CAPTCHAs to evaluate our model's performance on text-based CAPTCHAs. With our pre-processing strategy, we were able to raise the success rate of text-based CAPTCHA solutions from 84.68% to 89.41%, a substantial improvement of 4.73%. The overall accuracy is 0.9724 or 97.24% for our model in classifying all the 34 alphanumeric characters in Image Captcha. Faculty of Science & Natural Resources, UMS 2022 Proceedings PeerReviewed text en https://eprints.ums.edu.my/id/eprint/40624/1/ABSTRACT.pdf text en https://eprints.ums.edu.my/id/eprint/40624/2/FULL%20TEXT.pdf Leong, Yau Wah and Rechard Lee (2022) A convolutional neural network approach to textual captcha solving. https://www.ums.edu.my/fssa/index.php/research/conference-publication |
institution |
Universiti Malaysia Sabah |
building |
UMS Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Malaysia Sabah |
content_source |
UMS Institutional Repository |
url_provider |
http://eprints.ums.edu.my/ |
language |
English English |
topic |
Q350-390 Information theory |
spellingShingle |
Q350-390 Information theory Leong, Yau Wah Rechard Lee A convolutional neural network approach to textual captcha solving |
description |
When trying to determine whether a user is a human or automated software, CAPTCHA is often employed as part of the verification process. However, due to the intricacy and security of the CAPTCHA, there are several methods for cracking or solving the CAPTCHA, one of which in a text- based CAPTCHA is the use of noise interference to deter CAPTCHA solvers. Therefore, in this study, we built a Convolutional Neural Network model to decipher text-based CAPTCHAs and presented a pre-processing technique to lessen the impact of those noises. To reduce noise in the text-based CAPTCHAs, we incorporated picture binarization, morphological operation, and median filter as a pre- processing step. We then trained our own Convolutional Neural Network model to distinguish between 34 other classes of alphanumeric characters other than the letters 'I' and 'O on a dataset consisting of 16,000 pre-processed CAPTCHAs generated with Python's Image Captcha package, and then we used 4,000 pre-processed CAPTCHAs to evaluate our model's performance on text-based CAPTCHAs. With our pre-processing strategy, we were able to raise the success rate of text-based CAPTCHA solutions from 84.68% to 89.41%, a substantial improvement of 4.73%. The overall accuracy is 0.9724 or 97.24% for our model in classifying all the 34 alphanumeric characters in Image Captcha. |
format |
Proceedings |
author |
Leong, Yau Wah Rechard Lee |
author_facet |
Leong, Yau Wah Rechard Lee |
author_sort |
Leong, Yau Wah |
title |
A convolutional neural network approach to textual captcha solving |
title_short |
A convolutional neural network approach to textual captcha solving |
title_full |
A convolutional neural network approach to textual captcha solving |
title_fullStr |
A convolutional neural network approach to textual captcha solving |
title_full_unstemmed |
A convolutional neural network approach to textual captcha solving |
title_sort |
convolutional neural network approach to textual captcha solving |
publisher |
Faculty of Science & Natural Resources, UMS |
publishDate |
2022 |
url |
https://eprints.ums.edu.my/id/eprint/40624/1/ABSTRACT.pdf https://eprints.ums.edu.my/id/eprint/40624/2/FULL%20TEXT.pdf https://eprints.ums.edu.my/id/eprint/40624/ https://www.ums.edu.my/fssa/index.php/research/conference-publication |
_version_ |
1809140079603482624 |