Topic detection on social media and online news

In recent years, more and more people like to publish and obtain information on the Internet through various mobile phone applications, and this information mainly exists in the form of short texts. For example, when some natural disasters occur, someone has a Twitter or Facebook message. This messa...

Full description

Saved in:
Bibliographic Details
Main Author: Li, Yuhao
Other Authors: Mao Kezhi
Format: Thesis-Master by Coursework
Language:English
Published: Nanyang Technological University 2021
Subjects:
Online Access:https://hdl.handle.net/10356/152938
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-152938
record_format dspace
spelling sg-ntu-dr.10356-1529382023-07-04T16:27:34Z Topic detection on social media and online news Li, Yuhao Mao Kezhi School of Electrical and Electronic Engineering EKZMao@ntu.edu.sg Engineering::Electrical and electronic engineering In recent years, more and more people like to publish and obtain information on the Internet through various mobile phone applications, and this information mainly exists in the form of short texts. For example, when some natural disasters occur, someone has a Twitter or Facebook message. This message may be about or at the same time about "flood", "storm" and "earthquake". Use intelligent technology to classify these short texts for timely detection The occurrence of disasters and the judgment of disaster types are of great significance. Because of its automatic learning and high-dimensional features, deep learning technology has attracted many scholars to apply it to the field of text analysis. Therefore, this paper has carried out research on the text classification technology based on deep learning, and the main work is summarized as follows. In the first place, the research status of text classification technology at home and abroad is investigated, and the main technologies, which are based on deep learning, involved in text classification process are introduced. From text preprocessing and presentation, to CNN, RNN, Attention mechanism, and at last to models based on transformer. In view of the small number of disaster-related public social network message text datasets, a web crawler is used for data crawling, a new dataset is built, which is labeled for multi-class and multi-label classification task as well as the existing public datasets. Then, a deep learning-based text classification model CRNNA is established. The GloVe model is used to preprocess the text, and then the high-dimensional features were extracted through the convolutional neural network, and then the bidirectional LSTM is used in order to capture the contextual features and global features. In addition, Hierarchical Attention mechanism is introduced combined with LSTM layer, and finally the output is classified. Experiments have proved that it has achieved better performance than traditional methods on both multi-class and multi-label classification tasks. Master of Science (Computer Control and Automation) 2021-10-22T01:23:16Z 2021-10-22T01:23:16Z 2021 Thesis-Master by Coursework Li, Y. (2021). Topic detection on social media and online news. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/152938 https://hdl.handle.net/10356/152938 en application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Electrical and electronic engineering
spellingShingle Engineering::Electrical and electronic engineering
Li, Yuhao
Topic detection on social media and online news
description In recent years, more and more people like to publish and obtain information on the Internet through various mobile phone applications, and this information mainly exists in the form of short texts. For example, when some natural disasters occur, someone has a Twitter or Facebook message. This message may be about or at the same time about "flood", "storm" and "earthquake". Use intelligent technology to classify these short texts for timely detection The occurrence of disasters and the judgment of disaster types are of great significance. Because of its automatic learning and high-dimensional features, deep learning technology has attracted many scholars to apply it to the field of text analysis. Therefore, this paper has carried out research on the text classification technology based on deep learning, and the main work is summarized as follows. In the first place, the research status of text classification technology at home and abroad is investigated, and the main technologies, which are based on deep learning, involved in text classification process are introduced. From text preprocessing and presentation, to CNN, RNN, Attention mechanism, and at last to models based on transformer. In view of the small number of disaster-related public social network message text datasets, a web crawler is used for data crawling, a new dataset is built, which is labeled for multi-class and multi-label classification task as well as the existing public datasets. Then, a deep learning-based text classification model CRNNA is established. The GloVe model is used to preprocess the text, and then the high-dimensional features were extracted through the convolutional neural network, and then the bidirectional LSTM is used in order to capture the contextual features and global features. In addition, Hierarchical Attention mechanism is introduced combined with LSTM layer, and finally the output is classified. Experiments have proved that it has achieved better performance than traditional methods on both multi-class and multi-label classification tasks.
author2 Mao Kezhi
author_facet Mao Kezhi
Li, Yuhao
format Thesis-Master by Coursework
author Li, Yuhao
author_sort Li, Yuhao
title Topic detection on social media and online news
title_short Topic detection on social media and online news
title_full Topic detection on social media and online news
title_fullStr Topic detection on social media and online news
title_full_unstemmed Topic detection on social media and online news
title_sort topic detection on social media and online news
publisher Nanyang Technological University
publishDate 2021
url https://hdl.handle.net/10356/152938
_version_ 1772826076624453632