A Sentiment Analysis of Singapore Presidential Election 2011 using Twitter Data with Census Correction
Sentiment analysis is a new area in text analytics where it focuses on the analysis and understanding of the human emotions from the text patterns. This new form of analysis has been widely adopted in customer relationship management especially in the context of complaint management. However, sentim...
Saved in:
Main Authors: | , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2012
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/1436 https://ink.library.smu.edu.sg/context/sis_research/article/2435/viewcontent/1108.5520.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
Summary: | Sentiment analysis is a new area in text analytics where it focuses on the analysis and understanding of the human emotions from the text patterns. This new form of analysis has been widely adopted in customer relationship management especially in the context of complaint management. However, sentiment analysis using Twitter data has remained extremely difficult to manage due to sampling biasness. In this paper, we will discuss about the application of reweighting techniques in conjunction with online sentiment divisions to predict the vote percentage that individual presidential candidate in Singapore will receive in the Presidential Election 2011. There will be in depth discussion about the various aspects using sentiment analysis to predict outcomes as well as the potential pitfalls in the estimation due to the anonymous nature of the Internet. Our methodology was successful in predicting the top two contenders in a four-corner fight, and that there would be a thin margin between them. Our modified result was able to predict the winner with swing voters’ estimation using cluster analysis. However, the final predicted values still differ from actual values due to astroturfing, which is extremely difficult to estimate and will be recommended for future work. |
---|