Organizing news articles and editorials through information extraction and sentiment analysis
How can we organize voluminous amount of news articles to facilitate better search options and analysis? We propose the use of natural language processing techniques, specifically information extraction and sentiment analysis, to allow easier data analysis on news articles and editorials. The propos...
Saved in:
Main Authors: | , , |
---|---|
Format: | text |
Published: |
Animo Repository
2016
|
Subjects: | |
Online Access: | https://animorepository.dlsu.edu.ph/faculty_research/3515 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | De La Salle University |
id |
oai:animorepository.dlsu.edu.ph:faculty_research-4517 |
---|---|
record_format |
eprints |
spelling |
oai:animorepository.dlsu.edu.ph:faculty_research-45172021-09-15T01:24:32Z Organizing news articles and editorials through information extraction and sentiment analysis Cheng, Charibeth Cagampan, Bernadyn Lim, Christine Diane How can we organize voluminous amount of news articles to facilitate better search options and analysis? We propose the use of natural language processing techniques, specifically information extraction and sentiment analysis, to allow easier data analysis on news articles and editorials. The proposed technique was tested on news documents written in Filipino. Grammar-based rules were formulated to extract pertinent information from the articles, and were automated through bootstrapping. The extracted information include the Filipino equivalent of the 5W user requirement proposed by Das et al. (2012) that answers the questions who, what, when, where, and why. Subsequently, the articles related through the 5Ws were analyzed based on their sentiment. Both information extraction and sentiment analysis were done at the article level. Collective results were presented visually. In designing the user interface, we considered (1) how the user would be able to find the articles he is looking for, (2) how he will immediately see the important points in the articles, as well as (3) the presenting the sentiment present in each articles and in the selected articles as a whole. To evaluate the performance of the information extraction and sentiment analysis, a gold standard was built to which the machine's output was compared. The visualization system was also subjectively rated according to usability. 2016-01-01T08:00:00Z text https://animorepository.dlsu.edu.ph/faculty_research/3515 Faculty Research Work Animo Repository Natural language generation (Computer science) Computational linguistics Sentiment analysis Text data mining Computer Sciences Software Engineering |
institution |
De La Salle University |
building |
De La Salle University Library |
continent |
Asia |
country |
Philippines Philippines |
content_provider |
De La Salle University Library |
collection |
DLSU Institutional Repository |
topic |
Natural language generation (Computer science) Computational linguistics Sentiment analysis Text data mining Computer Sciences Software Engineering |
spellingShingle |
Natural language generation (Computer science) Computational linguistics Sentiment analysis Text data mining Computer Sciences Software Engineering Cheng, Charibeth Cagampan, Bernadyn Lim, Christine Diane Organizing news articles and editorials through information extraction and sentiment analysis |
description |
How can we organize voluminous amount of news articles to facilitate better search options and analysis? We propose the use of natural language processing techniques, specifically information extraction and sentiment analysis, to allow easier data analysis on news articles and editorials. The proposed technique was tested on news documents written in Filipino. Grammar-based rules were formulated to extract pertinent information from the articles, and were automated through bootstrapping. The extracted information include the Filipino equivalent of the 5W user requirement proposed by Das et al. (2012) that answers the questions who, what, when, where, and why. Subsequently, the articles related through the 5Ws were analyzed based on their sentiment. Both information extraction and sentiment analysis were done at the article level. Collective results were presented visually. In designing the user interface, we considered (1) how the user would be able to find the articles he is looking for, (2) how he will immediately see the important points in the articles, as well as (3) the presenting the sentiment present in each articles and in the selected articles as a whole. To evaluate the performance of the information extraction and sentiment analysis, a gold standard was built to which the machine's output was compared. The visualization system was also subjectively rated according to usability. |
format |
text |
author |
Cheng, Charibeth Cagampan, Bernadyn Lim, Christine Diane |
author_facet |
Cheng, Charibeth Cagampan, Bernadyn Lim, Christine Diane |
author_sort |
Cheng, Charibeth |
title |
Organizing news articles and editorials through information extraction and sentiment analysis |
title_short |
Organizing news articles and editorials through information extraction and sentiment analysis |
title_full |
Organizing news articles and editorials through information extraction and sentiment analysis |
title_fullStr |
Organizing news articles and editorials through information extraction and sentiment analysis |
title_full_unstemmed |
Organizing news articles and editorials through information extraction and sentiment analysis |
title_sort |
organizing news articles and editorials through information extraction and sentiment analysis |
publisher |
Animo Repository |
publishDate |
2016 |
url |
https://animorepository.dlsu.edu.ph/faculty_research/3515 |
_version_ |
1767195926333489152 |