ENHANCEMENTS OF HOT EVENT DETECTION METHOD FOR DETECTING EVENTS BASED ON NEWS DATA
Events are things that actually occur, attract attention, have a location and time, and have consequences such as the emergence of news reporting the event. Therefore, scientists have developed various methods to detect events from news documents in the hope that these events can be further utili...
Saved in:
Main Author: | |
---|---|
Format: | Final Project |
Language: | Indonesia |
Online Access: | https://digilib.itb.ac.id/gdl/view/53776 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Institut Teknologi Bandung |
Language: | Indonesia |
id |
id-itb.:53776 |
---|---|
spelling |
id-itb.:537762021-03-10T10:03:17ZENHANCEMENTS OF HOT EVENT DETECTION METHOD FOR DETECTING EVENTS BASED ON NEWS DATA Shandy Indonesia Final Project Enhancements, Detection, Events, News INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/53776 Events are things that actually occur, attract attention, have a location and time, and have consequences such as the emergence of news reporting the event. Therefore, scientists have developed various methods to detect events from news documents in the hope that these events can be further utilized, such as processing location and time. One such method is called Hot Event Detection, which was developed by Qi et al. This method was chosen because it is easy to understand, implement and enhance. However, like other event detection methods, it does not support the processing of location and time of events. Therefore, this final project adds these enhancements along with other enhancements such as event merging and keyword processing. Enhancement for keyword processing is divided into analyzing keyword extraction and semantic similarity. In this final project, enhancements to the Hot Event Detection method are implemented through a combination of various techniques. The location attribute is obtained accurately through the news text, the time attribute is obtained from the time of news publication. Event merging is implemented utilizing the DBSCAN clustering algorithm. The keyword extraction analysis uses the TF-IDF dictionary to check the relevance of these keywords and WordNet ontology is used for Semantic Similarity. Enhancements such as event merging and keyword processing successfully improve Hot Event Detection performance through better cluster quality. The enhancement of the addition of the location and time attributes of the event succeeded in obtaining good results with the correct threshold and attenuation configuration (71% precision or 79% recall), but deficiencies were still found that caused the precision or recall to not be higher, for example, clusterization errors. text |
institution |
Institut Teknologi Bandung |
building |
Institut Teknologi Bandung Library |
continent |
Asia |
country |
Indonesia Indonesia |
content_provider |
Institut Teknologi Bandung |
collection |
Digital ITB |
language |
Indonesia |
description |
Events are things that actually occur, attract attention, have a location and time, and
have consequences such as the emergence of news reporting the event. Therefore,
scientists have developed various methods to detect events from news documents
in the hope that these events can be further utilized, such as processing location and
time. One such method is called Hot Event Detection, which was developed by Qi
et al. This method was chosen because it is easy to understand, implement and
enhance. However, like other event detection methods, it does not support the
processing of location and time of events. Therefore, this final project adds these
enhancements along with other enhancements such as event merging and keyword
processing. Enhancement for keyword processing is divided into analyzing
keyword extraction and semantic similarity.
In this final project, enhancements to the Hot Event Detection method are
implemented through a combination of various techniques. The location attribute is
obtained accurately through the news text, the time attribute is obtained from the
time of news publication. Event merging is implemented utilizing the DBSCAN
clustering algorithm. The keyword extraction analysis uses the TF-IDF dictionary
to check the relevance of these keywords and WordNet ontology is used for
Semantic Similarity.
Enhancements such as event merging and keyword processing successfully
improve Hot Event Detection performance through better cluster quality. The
enhancement of the addition of the location and time attributes of the event
succeeded in obtaining good results with the correct threshold and attenuation
configuration (71% precision or 79% recall), but deficiencies were still found that
caused the precision or recall to not be higher, for example, clusterization errors.
|
format |
Final Project |
author |
Shandy |
spellingShingle |
Shandy ENHANCEMENTS OF HOT EVENT DETECTION METHOD FOR DETECTING EVENTS BASED ON NEWS DATA |
author_facet |
Shandy |
author_sort |
Shandy |
title |
ENHANCEMENTS OF HOT EVENT DETECTION METHOD FOR DETECTING EVENTS BASED ON NEWS DATA |
title_short |
ENHANCEMENTS OF HOT EVENT DETECTION METHOD FOR DETECTING EVENTS BASED ON NEWS DATA |
title_full |
ENHANCEMENTS OF HOT EVENT DETECTION METHOD FOR DETECTING EVENTS BASED ON NEWS DATA |
title_fullStr |
ENHANCEMENTS OF HOT EVENT DETECTION METHOD FOR DETECTING EVENTS BASED ON NEWS DATA |
title_full_unstemmed |
ENHANCEMENTS OF HOT EVENT DETECTION METHOD FOR DETECTING EVENTS BASED ON NEWS DATA |
title_sort |
enhancements of hot event detection method for detecting events based on news data |
url |
https://digilib.itb.ac.id/gdl/view/53776 |
_version_ |
1822929421615497216 |