SENTIMENT CLASSIFICATION OF CORPORATIONS IN NEWS ARTICLES USING WORD EMBEDDING

Sentiment of company influences an individual or society view of the company. This view affects individual's actions in stock trading transactions. These corporate sentiments can be contained in news articles in several factors related to the company. These factors are internal factors and exte...

Full description

Saved in:
Bibliographic Details
Main Author: AODYRA KHAIDIR - NIM : 13513063 , MUHAMMAD
Format: Final Project
Language:Indonesia
Online Access:https://digilib.itb.ac.id/gdl/view/23197
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Institut Teknologi Bandung
Language: Indonesia
id id-itb.:23197
spelling id-itb.:231972017-10-09T10:28:07ZSENTIMENT CLASSIFICATION OF CORPORATIONS IN NEWS ARTICLES USING WORD EMBEDDING AODYRA KHAIDIR - NIM : 13513063 , MUHAMMAD Indonesia Final Project INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/23197 Sentiment of company influences an individual or society view of the company. This view affects individual's actions in stock trading transactions. These corporate sentiments can be contained in news articles in several factors related to the company. These factors are internal factors and external factors of a company. Sentiments contained in the news article can be predicted with the sentiment classification. Sentiment classification facilitates the individual or society in determining their actions in stock trading. <br /> <br /> <br /> <br /> The problem of sentiment classification can be solved using supervised learning. To form a model of learning with supervised learning, a feature extraction method is performed to obtain representations of a sentence. Feature extraction methods that can be used are the features of n-gram and word embedding. Related research produces good accuracy using the Support Vector Machine and Convolutional Neural Network. The feature selection can be used to reduce the size of the vector space model so that the machine learning algorithm can run fast. Selection features also eliminate features that are not relevant in the feature extraction process. <br /> <br /> <br /> <br /> The formation of the learning model uses 6,358 sentences with three classes of sentiments, which are positive, negative and neutral. The experiments were experimental n-gram features and word embedding features. In the n-gram feature experiment, experiments were performed on the features of unigram, bigram, trigram, and all three combined. Each experiment was performed with three weights, namely the appearance of a word, term frequency (TF), and TF-IDF. In addition, feature selection is also performed for each of these n-gram experiments. In word embedding experiments, the experiments performed are tuning parameters, average word vector, and Convolutional Neural Network. Methods with word embedding and Convolutional Neural Network algorithms performed with information gain feature selection and word sentiment dictionary yield the best accuracy. The best accuracy produced is 85.729%. text
institution Institut Teknologi Bandung
building Institut Teknologi Bandung Library
continent Asia
country Indonesia
Indonesia
content_provider Institut Teknologi Bandung
collection Digital ITB
language Indonesia
description Sentiment of company influences an individual or society view of the company. This view affects individual's actions in stock trading transactions. These corporate sentiments can be contained in news articles in several factors related to the company. These factors are internal factors and external factors of a company. Sentiments contained in the news article can be predicted with the sentiment classification. Sentiment classification facilitates the individual or society in determining their actions in stock trading. <br /> <br /> <br /> <br /> The problem of sentiment classification can be solved using supervised learning. To form a model of learning with supervised learning, a feature extraction method is performed to obtain representations of a sentence. Feature extraction methods that can be used are the features of n-gram and word embedding. Related research produces good accuracy using the Support Vector Machine and Convolutional Neural Network. The feature selection can be used to reduce the size of the vector space model so that the machine learning algorithm can run fast. Selection features also eliminate features that are not relevant in the feature extraction process. <br /> <br /> <br /> <br /> The formation of the learning model uses 6,358 sentences with three classes of sentiments, which are positive, negative and neutral. The experiments were experimental n-gram features and word embedding features. In the n-gram feature experiment, experiments were performed on the features of unigram, bigram, trigram, and all three combined. Each experiment was performed with three weights, namely the appearance of a word, term frequency (TF), and TF-IDF. In addition, feature selection is also performed for each of these n-gram experiments. In word embedding experiments, the experiments performed are tuning parameters, average word vector, and Convolutional Neural Network. Methods with word embedding and Convolutional Neural Network algorithms performed with information gain feature selection and word sentiment dictionary yield the best accuracy. The best accuracy produced is 85.729%.
format Final Project
author AODYRA KHAIDIR - NIM : 13513063 , MUHAMMAD
spellingShingle AODYRA KHAIDIR - NIM : 13513063 , MUHAMMAD
SENTIMENT CLASSIFICATION OF CORPORATIONS IN NEWS ARTICLES USING WORD EMBEDDING
author_facet AODYRA KHAIDIR - NIM : 13513063 , MUHAMMAD
author_sort AODYRA KHAIDIR - NIM : 13513063 , MUHAMMAD
title SENTIMENT CLASSIFICATION OF CORPORATIONS IN NEWS ARTICLES USING WORD EMBEDDING
title_short SENTIMENT CLASSIFICATION OF CORPORATIONS IN NEWS ARTICLES USING WORD EMBEDDING
title_full SENTIMENT CLASSIFICATION OF CORPORATIONS IN NEWS ARTICLES USING WORD EMBEDDING
title_fullStr SENTIMENT CLASSIFICATION OF CORPORATIONS IN NEWS ARTICLES USING WORD EMBEDDING
title_full_unstemmed SENTIMENT CLASSIFICATION OF CORPORATIONS IN NEWS ARTICLES USING WORD EMBEDDING
title_sort sentiment classification of corporations in news articles using word embedding
url https://digilib.itb.ac.id/gdl/view/23197
_version_ 1821121002653876224