Event detection for cyber security news articles
In recent years, there has been an increasing focus on using text mining techniques in the field of cyber security. Extensive studies have been conducted to improve the techniques that allow computers to understand and process language in this domain. One important task in this field is event detect...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2023
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/165914 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-165914 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1659142023-04-21T15:36:54Z Event detection for cyber security news articles Huang, Jovan Tian Chun Hui Siu Cheung School of Computer Science and Engineering ASSCHUI@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Document and text processing In recent years, there has been an increasing focus on using text mining techniques in the field of cyber security. Extensive studies have been conducted to improve the techniques that allow computers to understand and process language in this domain. One important task in this field is event detection, which involves identifying specific events or occurrences in text by using certain keywords or triggers. However, current methods often focus on understanding the text itself and do not pay enough attention to the meaning of the events being identified. In this study, we introduce a novel approach for cybersecurity event detection, referred to as the Label-Pivoting Model for Cybersecurity News Event Detection (LPCNED) model that is enhanced from the Semantic Pivoting Model for Effective Event Detection (SPEED) model. SPEED model demonstrated superior performance when compared to various robust baselines on benchmark datasets such as ACE 2005 for event detection. In LPCNED, we employed the pretrained NewsBERT language model to encode the combined representation of input sentences and labels. It employs the semantic meanings of predetermined event type labels to identify candidates for event triggers. The NewsBERT model provides domain-specific knowledge, drawing from popular news data sources, thereby enhancing the overall effectiveness of the model. Our experiments, conducted using the Cybersecurity Event Annotation Corpus (CEAC), the sole corpus available for cybersecurity news event extraction at the time of writing, demonstrate the robustness and efficacy of the LPCNED model despite limited data availability, even outperforming the BERT-CRF model used on the MAVEN dataset in the general domain and the SPEED model. These results indicate the potential utility of the LPCNED model for event detection in cybersecurity news articles and warrant further investigation in the task of event extraction. Bachelor of Engineering (Computer Science) 2023-04-16T23:51:52Z 2023-04-16T23:51:52Z 2023 Final Year Project (FYP) Huang, J. T. C. (2023). Event detection for cyber security news articles. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/165914 https://hdl.handle.net/10356/165914 en SCSE22-0487 application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering::Computing methodologies::Document and text processing |
spellingShingle |
Engineering::Computer science and engineering::Computing methodologies::Document and text processing Huang, Jovan Tian Chun Event detection for cyber security news articles |
description |
In recent years, there has been an increasing focus on using text mining techniques in the field of cyber security. Extensive studies have been conducted to improve the techniques that allow computers to understand and process language in this domain. One important task in this field is event detection, which involves identifying specific events or occurrences in text by using certain keywords or triggers. However, current methods often focus on understanding the text itself and do not pay enough attention to the meaning of the events being identified. In this study, we introduce a novel approach for cybersecurity event detection, referred to as the Label-Pivoting Model for Cybersecurity News Event Detection (LPCNED) model that is enhanced from the Semantic Pivoting Model for Effective Event Detection (SPEED) model. SPEED model demonstrated superior performance when compared to various robust baselines on benchmark datasets such as ACE 2005 for event detection. In LPCNED, we employed the pretrained NewsBERT language model to encode the combined representation of input sentences and labels. It employs the semantic meanings of predetermined event type labels to identify candidates for event triggers. The NewsBERT model provides domain-specific knowledge, drawing from popular news data sources, thereby enhancing the overall effectiveness of the model. Our experiments, conducted using the Cybersecurity Event Annotation Corpus (CEAC), the sole corpus available for cybersecurity news event extraction at the time of writing, demonstrate the robustness and efficacy of the LPCNED model despite limited data availability, even outperforming the BERT-CRF model used on the MAVEN dataset in the general domain and the SPEED model. These results indicate the potential utility of the LPCNED model for event detection in cybersecurity news articles and warrant further investigation in the task of event extraction. |
author2 |
Hui Siu Cheung |
author_facet |
Hui Siu Cheung Huang, Jovan Tian Chun |
format |
Final Year Project |
author |
Huang, Jovan Tian Chun |
author_sort |
Huang, Jovan Tian Chun |
title |
Event detection for cyber security news articles |
title_short |
Event detection for cyber security news articles |
title_full |
Event detection for cyber security news articles |
title_fullStr |
Event detection for cyber security news articles |
title_full_unstemmed |
Event detection for cyber security news articles |
title_sort |
event detection for cyber security news articles |
publisher |
Nanyang Technological University |
publishDate |
2023 |
url |
https://hdl.handle.net/10356/165914 |
_version_ |
1764208113326489600 |