A convolutional neural network approach to textual captcha solving

When trying to determine whether a user is a human or automated software, CAPTCHA is often employed as part of the verification process. However, due to the intricacy and security of the CAPTCHA, there are several methods for cracking or solving the CAPTCHA, one of which in a text- based CAPTCHA is...

Full description

Saved in:

Bibliographic Details
Main Authors:	Leong, Yau Wah, Rechard Lee
Format:	Proceedings
Language:	English English
Published:	Faculty of Science & Natural Resources, UMS 2022
Subjects:	Q350-390 Information theory
Online Access:	https://eprints.ums.edu.my/id/eprint/40624/1/ABSTRACT.pdf https://eprints.ums.edu.my/id/eprint/40624/2/FULL%20TEXT.pdf https://eprints.ums.edu.my/id/eprint/40624/ https://www.ums.edu.my/fssa/index.php/research/conference-publication
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Universiti Malaysia Sabah
Language:	English English

id	my.ums.eprints.40624
record_format	eprints
spelling	my.ums.eprints.406242024-08-13T02:23:20Z https://eprints.ums.edu.my/id/eprint/40624/ A convolutional neural network approach to textual captcha solving Leong, Yau Wah Rechard Lee Q350-390 Information theory When trying to determine whether a user is a human or automated software, CAPTCHA is often employed as part of the verification process. However, due to the intricacy and security of the CAPTCHA, there are several methods for cracking or solving the CAPTCHA, one of which in a text- based CAPTCHA is the use of noise interference to deter CAPTCHA solvers. Therefore, in this study, we built a Convolutional Neural Network model to decipher text-based CAPTCHAs and presented a pre-processing technique to lessen the impact of those noises. To reduce noise in the text-based CAPTCHAs, we incorporated picture binarization, morphological operation, and median filter as a pre- processing step. We then trained our own Convolutional Neural Network model to distinguish between 34 other classes of alphanumeric characters other than the letters 'I' and 'O on a dataset consisting of 16,000 pre-processed CAPTCHAs generated with Python's Image Captcha package, and then we used 4,000 pre-processed CAPTCHAs to evaluate our model's performance on text-based CAPTCHAs. With our pre-processing strategy, we were able to raise the success rate of text-based CAPTCHA solutions from 84.68% to 89.41%, a substantial improvement of 4.73%. The overall accuracy is 0.9724 or 97.24% for our model in classifying all the 34 alphanumeric characters in Image Captcha. Faculty of Science & Natural Resources, UMS 2022 Proceedings PeerReviewed text en https://eprints.ums.edu.my/id/eprint/40624/1/ABSTRACT.pdf text en https://eprints.ums.edu.my/id/eprint/40624/2/FULL%20TEXT.pdf Leong, Yau Wah and Rechard Lee (2022) A convolutional neural network approach to textual captcha solving. https://www.ums.edu.my/fssa/index.php/research/conference-publication
institution	Universiti Malaysia Sabah
building	UMS Library
collection	Institutional Repository
continent	Asia
country	Malaysia
content_provider	Universiti Malaysia Sabah
content_source	UMS Institutional Repository
url_provider	http://eprints.ums.edu.my/
language	English English
topic	Q350-390 Information theory
spellingShingle	Q350-390 Information theory Leong, Yau Wah Rechard Lee A convolutional neural network approach to textual captcha solving
description	When trying to determine whether a user is a human or automated software, CAPTCHA is often employed as part of the verification process. However, due to the intricacy and security of the CAPTCHA, there are several methods for cracking or solving the CAPTCHA, one of which in a text- based CAPTCHA is the use of noise interference to deter CAPTCHA solvers. Therefore, in this study, we built a Convolutional Neural Network model to decipher text-based CAPTCHAs and presented a pre-processing technique to lessen the impact of those noises. To reduce noise in the text-based CAPTCHAs, we incorporated picture binarization, morphological operation, and median filter as a pre- processing step. We then trained our own Convolutional Neural Network model to distinguish between 34 other classes of alphanumeric characters other than the letters 'I' and 'O on a dataset consisting of 16,000 pre-processed CAPTCHAs generated with Python's Image Captcha package, and then we used 4,000 pre-processed CAPTCHAs to evaluate our model's performance on text-based CAPTCHAs. With our pre-processing strategy, we were able to raise the success rate of text-based CAPTCHA solutions from 84.68% to 89.41%, a substantial improvement of 4.73%. The overall accuracy is 0.9724 or 97.24% for our model in classifying all the 34 alphanumeric characters in Image Captcha.
format	Proceedings
author	Leong, Yau Wah Rechard Lee
author_facet	Leong, Yau Wah Rechard Lee
author_sort	Leong, Yau Wah
title	A convolutional neural network approach to textual captcha solving
title_short	A convolutional neural network approach to textual captcha solving
title_full	A convolutional neural network approach to textual captcha solving
title_fullStr	A convolutional neural network approach to textual captcha solving
title_full_unstemmed	A convolutional neural network approach to textual captcha solving
title_sort	convolutional neural network approach to textual captcha solving
publisher	Faculty of Science & Natural Resources, UMS
publishDate	2022
url	https://eprints.ums.edu.my/id/eprint/40624/1/ABSTRACT.pdf https://eprints.ums.edu.my/id/eprint/40624/2/FULL%20TEXT.pdf https://eprints.ums.edu.my/id/eprint/40624/ https://www.ums.edu.my/fssa/index.php/research/conference-publication
_version_	1809140079603482624

A convolutional neural network approach to textual captcha solving

Similar Items