DEVELOPMENT OF A TEXT-BASED ANTI-CHEAT COMPONENT USING ALBERT TO IDENTIFY CHEATING ACTIONS IN THE AUTOMATED INTERVIEW PROCESS
The online recruitment process is increasingly challenging in maintaining the integrity and honesty of candidates, especially with the use of AI tools like ChatGPT to answer interview questions. This research develops a text-based anti-cheat component using the ALBERT model (A Lite BERT) to ident...
Saved in:
Main Author: | |
---|---|
Format: | Final Project |
Language: | Indonesia |
Online Access: | https://digilib.itb.ac.id/gdl/view/85055 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Institut Teknologi Bandung |
Language: | Indonesia |
id |
id-itb.:85055 |
---|---|
spelling |
id-itb.:850552024-08-19T14:02:55ZDEVELOPMENT OF A TEXT-BASED ANTI-CHEAT COMPONENT USING ALBERT TO IDENTIFY CHEATING ACTIONS IN THE AUTOMATED INTERVIEW PROCESS Iryanto Prasethio, Yakobus Indonesia Final Project anti-cheat, ALBERT, sentiment, perplexity, fine-tuning, Logistic Regression, asynchronous, message queue. INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/85055 The online recruitment process is increasingly challenging in maintaining the integrity and honesty of candidates, especially with the use of AI tools like ChatGPT to answer interview questions. This research develops a text-based anti-cheat component using the ALBERT model (A Lite BERT) to identify cheating during automated interviews. ALBERT was chosen for its memory efficiency and its ability to understand the context and structure of language. To ensure reliable and effective detection, this research explores techniques for detecting the probability that a user's textual response was generated by AI and tests ALBERT's performance in classifying the source of the user's response. Additionally, this research includes methods for integrating the anti-cheat system with talent recruitment software. Several external indicators used include POS tags, sentence emotions and sentiment, readability, perplexity, backspace usage count, typo count, and cosine similarity for gpt-4o answers. Features found to have less influence on the model's inference results include unique word ratio, burstiness, cosine similarity for gpt-3.5-turbo and gemini-1-pro answers, and typing duration. The system implementation involves fine-tuning the ALBERT model and using Logistic Regression for classification, as well as integrating with intelligent recruitment systems asynchronously through a message queue. The best-trained model was ALBERT with a two- path classification head: the first path for ALBERT's own results and the second path for external indicators. The ALBERT model achieved an accuracy of 0.81, precision of 0.867, recall of 0.788, and an F1-Score of 0.82. The Logistic Regression model, on the other hand, achieved an accuracy of 0.783, precision of 0.73, recall of 0.73, and an F1-Score of 0.72. text |
institution |
Institut Teknologi Bandung |
building |
Institut Teknologi Bandung Library |
continent |
Asia |
country |
Indonesia Indonesia |
content_provider |
Institut Teknologi Bandung |
collection |
Digital ITB |
language |
Indonesia |
description |
The online recruitment process is increasingly challenging in maintaining the integrity and
honesty of candidates, especially with the use of AI tools like ChatGPT to answer interview
questions. This research develops a text-based anti-cheat component using the ALBERT model
(A Lite BERT) to identify cheating during automated interviews. ALBERT was chosen for its
memory efficiency and its ability to understand the context and structure of language.
To ensure reliable and effective detection, this research explores techniques for detecting the
probability that a user's textual response was generated by AI and tests ALBERT's performance
in classifying the source of the user's response. Additionally, this research includes methods
for integrating the anti-cheat system with talent recruitment software.
Several external indicators used include POS tags, sentence emotions and sentiment,
readability, perplexity, backspace usage count, typo count, and cosine similarity for gpt-4o
answers. Features found to have less influence on the model's inference results include unique
word ratio, burstiness, cosine similarity for gpt-3.5-turbo and gemini-1-pro answers, and typing
duration.
The system implementation involves fine-tuning the ALBERT model and using Logistic
Regression for classification, as well as integrating with intelligent recruitment systems
asynchronously through a message queue. The best-trained model was ALBERT with a two-
path classification head: the first path for ALBERT's own results and the second path for
external indicators. The ALBERT model achieved an accuracy of 0.81, precision of 0.867,
recall of 0.788, and an F1-Score of 0.82. The Logistic Regression model, on the other hand,
achieved an accuracy of 0.783, precision of 0.73, recall of 0.73, and an F1-Score of 0.72. |
format |
Final Project |
author |
Iryanto Prasethio, Yakobus |
spellingShingle |
Iryanto Prasethio, Yakobus DEVELOPMENT OF A TEXT-BASED ANTI-CHEAT COMPONENT USING ALBERT TO IDENTIFY CHEATING ACTIONS IN THE AUTOMATED INTERVIEW PROCESS |
author_facet |
Iryanto Prasethio, Yakobus |
author_sort |
Iryanto Prasethio, Yakobus |
title |
DEVELOPMENT OF A TEXT-BASED ANTI-CHEAT COMPONENT USING ALBERT TO IDENTIFY CHEATING ACTIONS IN THE AUTOMATED INTERVIEW PROCESS |
title_short |
DEVELOPMENT OF A TEXT-BASED ANTI-CHEAT COMPONENT USING ALBERT TO IDENTIFY CHEATING ACTIONS IN THE AUTOMATED INTERVIEW PROCESS |
title_full |
DEVELOPMENT OF A TEXT-BASED ANTI-CHEAT COMPONENT USING ALBERT TO IDENTIFY CHEATING ACTIONS IN THE AUTOMATED INTERVIEW PROCESS |
title_fullStr |
DEVELOPMENT OF A TEXT-BASED ANTI-CHEAT COMPONENT USING ALBERT TO IDENTIFY CHEATING ACTIONS IN THE AUTOMATED INTERVIEW PROCESS |
title_full_unstemmed |
DEVELOPMENT OF A TEXT-BASED ANTI-CHEAT COMPONENT USING ALBERT TO IDENTIFY CHEATING ACTIONS IN THE AUTOMATED INTERVIEW PROCESS |
title_sort |
development of a text-based anti-cheat component using albert to identify cheating actions in the automated interview process |
url |
https://digilib.itb.ac.id/gdl/view/85055 |
_version_ |
1822010592260521984 |