Topic detection on social media and online news
In recent years, more and more people like to publish and obtain information on the Internet through various mobile phone applications, and this information mainly exists in the form of short texts. For example, when some natural disasters occur, someone has a Twitter or Facebook message. This messa...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis-Master by Coursework |
Language: | English |
Published: |
Nanyang Technological University
2021
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/152938 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-152938 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1529382023-07-04T16:27:34Z Topic detection on social media and online news Li, Yuhao Mao Kezhi School of Electrical and Electronic Engineering EKZMao@ntu.edu.sg Engineering::Electrical and electronic engineering In recent years, more and more people like to publish and obtain information on the Internet through various mobile phone applications, and this information mainly exists in the form of short texts. For example, when some natural disasters occur, someone has a Twitter or Facebook message. This message may be about or at the same time about "flood", "storm" and "earthquake". Use intelligent technology to classify these short texts for timely detection The occurrence of disasters and the judgment of disaster types are of great significance. Because of its automatic learning and high-dimensional features, deep learning technology has attracted many scholars to apply it to the field of text analysis. Therefore, this paper has carried out research on the text classification technology based on deep learning, and the main work is summarized as follows. In the first place, the research status of text classification technology at home and abroad is investigated, and the main technologies, which are based on deep learning, involved in text classification process are introduced. From text preprocessing and presentation, to CNN, RNN, Attention mechanism, and at last to models based on transformer. In view of the small number of disaster-related public social network message text datasets, a web crawler is used for data crawling, a new dataset is built, which is labeled for multi-class and multi-label classification task as well as the existing public datasets. Then, a deep learning-based text classification model CRNNA is established. The GloVe model is used to preprocess the text, and then the high-dimensional features were extracted through the convolutional neural network, and then the bidirectional LSTM is used in order to capture the contextual features and global features. In addition, Hierarchical Attention mechanism is introduced combined with LSTM layer, and finally the output is classified. Experiments have proved that it has achieved better performance than traditional methods on both multi-class and multi-label classification tasks. Master of Science (Computer Control and Automation) 2021-10-22T01:23:16Z 2021-10-22T01:23:16Z 2021 Thesis-Master by Coursework Li, Y. (2021). Topic detection on social media and online news. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/152938 https://hdl.handle.net/10356/152938 en application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Electrical and electronic engineering |
spellingShingle |
Engineering::Electrical and electronic engineering Li, Yuhao Topic detection on social media and online news |
description |
In recent years, more and more people like to publish and obtain information on the Internet through various mobile phone applications, and this information mainly exists in the form of short texts. For example, when some natural disasters occur, someone has a Twitter or Facebook message. This message may be about or at the same time about "flood", "storm" and "earthquake". Use intelligent technology to classify these short texts for timely detection The occurrence of disasters and the judgment of disaster types are of great significance.
Because of its automatic learning and high-dimensional features, deep learning technology has attracted many scholars to apply it to the field of text analysis. Therefore, this paper has carried out research on the text classification technology based on deep learning, and the main work is summarized as follows.
In the first place, the research status of text classification technology at home and abroad is investigated, and the main technologies, which are based on deep learning, involved in text classification process are introduced. From text preprocessing and presentation, to CNN, RNN, Attention mechanism, and at last to models based on transformer.
In view of the small number of disaster-related public social network message text datasets, a web crawler is used for data crawling, a new dataset is built, which is labeled for multi-class and multi-label classification task as well as the existing public datasets.
Then, a deep learning-based text classification model CRNNA is established. The GloVe model is used to preprocess the text, and then the high-dimensional features were extracted through the convolutional neural network, and then the bidirectional LSTM is used in order to capture the contextual features and global features. In addition, Hierarchical Attention mechanism is introduced combined with LSTM layer, and finally the output is classified. Experiments have proved that it has achieved better performance than traditional methods on both multi-class and multi-label classification tasks. |
author2 |
Mao Kezhi |
author_facet |
Mao Kezhi Li, Yuhao |
format |
Thesis-Master by Coursework |
author |
Li, Yuhao |
author_sort |
Li, Yuhao |
title |
Topic detection on social media and online news |
title_short |
Topic detection on social media and online news |
title_full |
Topic detection on social media and online news |
title_fullStr |
Topic detection on social media and online news |
title_full_unstemmed |
Topic detection on social media and online news |
title_sort |
topic detection on social media and online news |
publisher |
Nanyang Technological University |
publishDate |
2021 |
url |
https://hdl.handle.net/10356/152938 |
_version_ |
1772826076624453632 |