Detecting interested topics on social media

The increasing amount of information on social media and news sites has made it challenging for readers to filter out irrelevant news. To address this issue, deep learning and NLP technology have been used to build text classifiers to categorize the latest news. This paper aims to develop an e...

Full description

Saved in:
Bibliographic Details
Main Author: Wang, Hongfei
Other Authors: Mao Kezhi
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2023
Subjects:
Online Access:https://hdl.handle.net/10356/167193
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
Description
Summary:The increasing amount of information on social media and news sites has made it challenging for readers to filter out irrelevant news. To address this issue, deep learning and NLP technology have been used to build text classifiers to categorize the latest news. This paper aims to develop an end-to-end system that can deliver different interesting topics of news to users. The system includes a news extraction API, news dataset collection, a model that can classify the latest news, and a UI interface to show different topic news to users. This paper compares the performance of different classification models, including TF IDF connected to Naive Bayes, LightGBM, KNearestNeighbours, Random Forest, XGBRF, and embedding layer developed with Tensorflow Keras connected to RNN, CNN, and LSTM. The best models was then applied to real-world data to filter daily latest news articles for users. The study updates the training dataset to keep up with vocabulary trends, ensuring better news categorization. Finally, a Streamlit framework was used to deploy the model into a web application with a good user experience. This project report provides a comprehensive guide for building a news classification system using deep learning models and deploying them on a web application.