Towards explainable and semantically coherent claim extraction for an automated fact-checker

Misinformation and fake news spread everywhere through online social media platforms. Although Automatic Fact-Checkers and LLMs like ChatGPT have become popular and seem to be a promising solution to detect fake news, these models still have some limitations regarding their reliance on pre-existing...

Full description

Saved in:

Bibliographic Details
Main Author:	Yoswara, Jocelyn Valencia
Other Authors:	Erry Gunawan
Format:	Final Year Project
Language:	English
Published:	Nanyang Technological University 2024
Subjects:	Computer and Information Science Engineering Natural language processing Large language models Claim extraction Automatic fact checker Machine learning
Online Access:	https://hdl.handle.net/10356/176481
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-176481
record_format	dspace
spelling	sg-ntu-dr.10356-1764812024-05-17T15:45:38Z Towards explainable and semantically coherent claim extraction for an automated fact-checker Yoswara, Jocelyn Valencia Erry Gunawan School of Electrical and Electronic Engineering Institute for Infocomm Research, A*STAR EGUNAWAN@ntu.edu.sg Computer and Information Science Engineering Natural language processing Large language models Claim extraction Automatic fact checker Machine learning Misinformation and fake news spread everywhere through online social media platforms. Although Automatic Fact-Checkers and LLMs like ChatGPT have become popular and seem to be a promising solution to detect fake news, these models still have some limitations regarding their reliance on pre-existing knowledge, and concerns are raised about whether they can differentiate between truth and falsehood. As these concerns arise, the claim extraction process of obtaining claims made from various resources becomes one of the most crucial steps in an automatic fact-checker. Therefore, this project aims to enhance the claim extraction process of an automatic fact-checker, improve the accuracy of fact-checking, and mitigate the limitations associated with LLMs' reliance on outdated information. Firstly, a baseline model was created using SBert with an evidence retrieval system. Secondly, an evidence retrieval system was implemented using the Google Search API to gather evidence related to claims from online sources. Lastly, the GPT-4 model was utilized to verify claims based on the available evidence. The GPT-4 model outperforms the baseline model, achieving a 94% accuracy in claim verification. However, the baseline model provides more comprehensive insights into the coverage of the entire dataset, and it is also found that the evidence retrieval process significantly affects the model's accuracy and coverage of claim verification tasks. Hence, this project represents an enhancement in the claim extraction process while also identifying limitations and suggesting potential areas for further improvements to improve the effectiveness of claim extraction for an automatic fact-checker. Bachelor's degree 2024-05-17T02:20:17Z 2024-05-17T02:20:17Z 2024 Final Year Project (FYP) Yoswara, J. V. (2024). Towards explainable and semantically coherent claim extraction for an automated fact-checker. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/176481 https://hdl.handle.net/10356/176481 en B3054 -231 application/pdf Nanyang Technological University
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Computer and Information Science Engineering Natural language processing Large language models Claim extraction Automatic fact checker Machine learning
spellingShingle	Computer and Information Science Engineering Natural language processing Large language models Claim extraction Automatic fact checker Machine learning Yoswara, Jocelyn Valencia Towards explainable and semantically coherent claim extraction for an automated fact-checker
description	Misinformation and fake news spread everywhere through online social media platforms. Although Automatic Fact-Checkers and LLMs like ChatGPT have become popular and seem to be a promising solution to detect fake news, these models still have some limitations regarding their reliance on pre-existing knowledge, and concerns are raised about whether they can differentiate between truth and falsehood. As these concerns arise, the claim extraction process of obtaining claims made from various resources becomes one of the most crucial steps in an automatic fact-checker. Therefore, this project aims to enhance the claim extraction process of an automatic fact-checker, improve the accuracy of fact-checking, and mitigate the limitations associated with LLMs' reliance on outdated information. Firstly, a baseline model was created using SBert with an evidence retrieval system. Secondly, an evidence retrieval system was implemented using the Google Search API to gather evidence related to claims from online sources. Lastly, the GPT-4 model was utilized to verify claims based on the available evidence. The GPT-4 model outperforms the baseline model, achieving a 94% accuracy in claim verification. However, the baseline model provides more comprehensive insights into the coverage of the entire dataset, and it is also found that the evidence retrieval process significantly affects the model's accuracy and coverage of claim verification tasks. Hence, this project represents an enhancement in the claim extraction process while also identifying limitations and suggesting potential areas for further improvements to improve the effectiveness of claim extraction for an automatic fact-checker.
author2	Erry Gunawan
author_facet	Erry Gunawan Yoswara, Jocelyn Valencia
format	Final Year Project
author	Yoswara, Jocelyn Valencia
author_sort	Yoswara, Jocelyn Valencia
title	Towards explainable and semantically coherent claim extraction for an automated fact-checker
title_short	Towards explainable and semantically coherent claim extraction for an automated fact-checker
title_full	Towards explainable and semantically coherent claim extraction for an automated fact-checker
title_fullStr	Towards explainable and semantically coherent claim extraction for an automated fact-checker
title_full_unstemmed	Towards explainable and semantically coherent claim extraction for an automated fact-checker
title_sort	towards explainable and semantically coherent claim extraction for an automated fact-checker
publisher	Nanyang Technological University
publishDate	2024
url	https://hdl.handle.net/10356/176481
_version_	1814047191172382720

Towards explainable and semantically coherent claim extraction for an automated fact-checker

Similar Items