VALIDATION OF COVID-19 INFORMATION IN INDONESIAN LANGUAGE USING A NATURAL LANGUAGE INFERENCE AND KNOWLEDGE GRAPH-BASED APPROACH

COVID-19 has emerged as a global public health concern due to its rapid spread and high mortality rate. Effective community-level control of this disease hinges on the availability of accurate information. However, the massive spread of misinformation through the internet makes individuals suscep...

Full description

Saved in:
Bibliographic Details
Main Author: Purnama Muharram, Arief
Format: Theses
Language:Indonesia
Online Access:https://digilib.itb.ac.id/gdl/view/81541
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Institut Teknologi Bandung
Language: Indonesia
id id-itb.:81541
spelling id-itb.:815412024-06-29T05:06:49ZVALIDATION OF COVID-19 INFORMATION IN INDONESIAN LANGUAGE USING A NATURAL LANGUAGE INFERENCE AND KNOWLEDGE GRAPH-BASED APPROACH Purnama Muharram, Arief Indonesia Theses information validation, automated fact-checking system, natural language inference, knowledge graph, misinformation, COVID-19 INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/81541 COVID-19 has emerged as a global public health concern due to its rapid spread and high mortality rate. Effective community-level control of this disease hinges on the availability of accurate information. However, the massive spread of misinformation through the internet makes individuals susceptible to false information. Automated fact-checking systems can play a crucial role in assisting with the validation of information veracity in such cases. These systems typically involve a Natural Language Inference (NLI) approach to identify the semantic relationship between a proven-true premise sentence and a hypothesis sentence representing the claim to be verified. This semantic relationship can be entailment, contradiction, or neutral. An information item is considered true if an entailment relationship exists between the sentences. This research introduces the use of Knowledge Graphs (KGs) to enhance the performance of NLI in validating information veracity. KGs serve to augment factual information, enabling the model to draw conclusions about the truthfulness of information based on factual evidence. In the proposed model architecture, information from the KG is processed in a separate module and combined with the semantic relationship information between the premise and hypothesis sentences processed by the NLI module. Subsequently, the information is processed by a final module to draw conclusions. In the implementation of the proposed model architecture, NLI and KG inputs are processed separately, and then the respective representative vectors from these inputs are combined to form a final vector. The final vector is then used as input to the classifier to produce the final result. The resulting model is trained using the Indonesian COVID-19 NLI dataset and the Indonesian COVID-19 KG. The research findings demonstrate that using KGs within the proposed model architecture can improve NLI performance in validating information veracity. The highest achievable accuracy reaches 0,8616 in validating the COVID-19 information in Indonesian language. KGs provide additional information that can strengthen the validation of information veracity, rather than solely relying on the semantic relationships formed. Therefore, KGs can serve as a crucial component iv in an automated fact-checking system to validate information veracity based on factual evidence. text
institution Institut Teknologi Bandung
building Institut Teknologi Bandung Library
continent Asia
country Indonesia
Indonesia
content_provider Institut Teknologi Bandung
collection Digital ITB
language Indonesia
description COVID-19 has emerged as a global public health concern due to its rapid spread and high mortality rate. Effective community-level control of this disease hinges on the availability of accurate information. However, the massive spread of misinformation through the internet makes individuals susceptible to false information. Automated fact-checking systems can play a crucial role in assisting with the validation of information veracity in such cases. These systems typically involve a Natural Language Inference (NLI) approach to identify the semantic relationship between a proven-true premise sentence and a hypothesis sentence representing the claim to be verified. This semantic relationship can be entailment, contradiction, or neutral. An information item is considered true if an entailment relationship exists between the sentences. This research introduces the use of Knowledge Graphs (KGs) to enhance the performance of NLI in validating information veracity. KGs serve to augment factual information, enabling the model to draw conclusions about the truthfulness of information based on factual evidence. In the proposed model architecture, information from the KG is processed in a separate module and combined with the semantic relationship information between the premise and hypothesis sentences processed by the NLI module. Subsequently, the information is processed by a final module to draw conclusions. In the implementation of the proposed model architecture, NLI and KG inputs are processed separately, and then the respective representative vectors from these inputs are combined to form a final vector. The final vector is then used as input to the classifier to produce the final result. The resulting model is trained using the Indonesian COVID-19 NLI dataset and the Indonesian COVID-19 KG. The research findings demonstrate that using KGs within the proposed model architecture can improve NLI performance in validating information veracity. The highest achievable accuracy reaches 0,8616 in validating the COVID-19 information in Indonesian language. KGs provide additional information that can strengthen the validation of information veracity, rather than solely relying on the semantic relationships formed. Therefore, KGs can serve as a crucial component iv in an automated fact-checking system to validate information veracity based on factual evidence.
format Theses
author Purnama Muharram, Arief
spellingShingle Purnama Muharram, Arief
VALIDATION OF COVID-19 INFORMATION IN INDONESIAN LANGUAGE USING A NATURAL LANGUAGE INFERENCE AND KNOWLEDGE GRAPH-BASED APPROACH
author_facet Purnama Muharram, Arief
author_sort Purnama Muharram, Arief
title VALIDATION OF COVID-19 INFORMATION IN INDONESIAN LANGUAGE USING A NATURAL LANGUAGE INFERENCE AND KNOWLEDGE GRAPH-BASED APPROACH
title_short VALIDATION OF COVID-19 INFORMATION IN INDONESIAN LANGUAGE USING A NATURAL LANGUAGE INFERENCE AND KNOWLEDGE GRAPH-BASED APPROACH
title_full VALIDATION OF COVID-19 INFORMATION IN INDONESIAN LANGUAGE USING A NATURAL LANGUAGE INFERENCE AND KNOWLEDGE GRAPH-BASED APPROACH
title_fullStr VALIDATION OF COVID-19 INFORMATION IN INDONESIAN LANGUAGE USING A NATURAL LANGUAGE INFERENCE AND KNOWLEDGE GRAPH-BASED APPROACH
title_full_unstemmed VALIDATION OF COVID-19 INFORMATION IN INDONESIAN LANGUAGE USING A NATURAL LANGUAGE INFERENCE AND KNOWLEDGE GRAPH-BASED APPROACH
title_sort validation of covid-19 information in indonesian language using a natural language inference and knowledge graph-based approach
url https://digilib.itb.ac.id/gdl/view/81541
_version_ 1822997356375703552