Sentiment analysis and topic modelling of 2024 U.S. and Indonesian election tweets: a study of political discourse and public opinion

This study investigates the effectiveness of various deep learning architectures and statistical models in both sentiment analysis and the temporal analysis of online public discourse through topic modelling and sentiment forecasting of tweets related to the 2024 Indonesian and U.S. elections. Given...

Full description

Saved in:
Bibliographic Details
Main Author: Widawati, Elisia Brispalma
Other Authors: Jagath C Rajapakse
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2024
Subjects:
Online Access:https://hdl.handle.net/10356/181153
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
Description
Summary:This study investigates the effectiveness of various deep learning architectures and statistical models in both sentiment analysis and the temporal analysis of online public discourse through topic modelling and sentiment forecasting of tweets related to the 2024 Indonesian and U.S. elections. Given the increasing importance of social media platforms like X (Twitter) in shaping political discourse, this research aims to explore how different models perform across diverse linguistic contexts. The study employs Long Short-Term Memory (LSTM) networks, Transformer models (IndoBERTweet and BERTweet), and Large Language Models (LLM) like GPT-4o for sentiment analysis, BERTopic leveraging Transformers and LLM for topic modelling, and Seasonal AutoRegressive Integrated Moving Average (SARIMA) model for sentiment forecasting. Data was collected using the Tweet Harvest tool, focusing on tweets with specific keywords related to the elections, and analysed across different time periods to capture the evolution of public sentiment and key themes. The sentiment classification models were evaluated using accuracy, precision, recall, and F1-score metrics; the topic models were assessed for coherence and diversity; and the SARIMA models were evaluated by their fit and residual diagnostics. Results demonstrate that LLMs significantly outperform LSTM and Transformer models in sentiment classification; BERTopic successfully captures the dynamic shifts in conversations, highlighting the evolving focus on key election-related issues; and SARIMA models fairly reliably forecast sentiment trends, though they struggle with predicting extreme fluctuations. These findings underscore the importance of combining advanced LLMs and topic modelling techniques with forecasting to provide a nuanced understanding of public sentiment and discourse and inform future research and applications in these areas.