Organizing news articles and editorials through information extraction and sentiment analysis

How can we organize voluminous amount of news articles to facilitate better search options and analysis? We propose the use of natural language processing techniques, specifically information extraction and sentiment analysis, to allow easier data analysis on news articles and editorials. The propos...

Full description

Saved in:
Bibliographic Details
Main Authors: Cheng, Charibeth, Cagampan, Bernadyn, Lim, Christine Diane
Format: text
Published: Animo Repository 2016
Subjects:
Online Access:https://animorepository.dlsu.edu.ph/faculty_research/3515
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: De La Salle University
id oai:animorepository.dlsu.edu.ph:faculty_research-4517
record_format eprints
spelling oai:animorepository.dlsu.edu.ph:faculty_research-45172021-09-15T01:24:32Z Organizing news articles and editorials through information extraction and sentiment analysis Cheng, Charibeth Cagampan, Bernadyn Lim, Christine Diane How can we organize voluminous amount of news articles to facilitate better search options and analysis? We propose the use of natural language processing techniques, specifically information extraction and sentiment analysis, to allow easier data analysis on news articles and editorials. The proposed technique was tested on news documents written in Filipino. Grammar-based rules were formulated to extract pertinent information from the articles, and were automated through bootstrapping. The extracted information include the Filipino equivalent of the 5W user requirement proposed by Das et al. (2012) that answers the questions who, what, when, where, and why. Subsequently, the articles related through the 5Ws were analyzed based on their sentiment. Both information extraction and sentiment analysis were done at the article level. Collective results were presented visually. In designing the user interface, we considered (1) how the user would be able to find the articles he is looking for, (2) how he will immediately see the important points in the articles, as well as (3) the presenting the sentiment present in each articles and in the selected articles as a whole. To evaluate the performance of the information extraction and sentiment analysis, a gold standard was built to which the machine's output was compared. The visualization system was also subjectively rated according to usability. 2016-01-01T08:00:00Z text https://animorepository.dlsu.edu.ph/faculty_research/3515 Faculty Research Work Animo Repository Natural language generation (Computer science) Computational linguistics Sentiment analysis Text data mining Computer Sciences Software Engineering
institution De La Salle University
building De La Salle University Library
continent Asia
country Philippines
Philippines
content_provider De La Salle University Library
collection DLSU Institutional Repository
topic Natural language generation (Computer science)
Computational linguistics
Sentiment analysis
Text data mining
Computer Sciences
Software Engineering
spellingShingle Natural language generation (Computer science)
Computational linguistics
Sentiment analysis
Text data mining
Computer Sciences
Software Engineering
Cheng, Charibeth
Cagampan, Bernadyn
Lim, Christine Diane
Organizing news articles and editorials through information extraction and sentiment analysis
description How can we organize voluminous amount of news articles to facilitate better search options and analysis? We propose the use of natural language processing techniques, specifically information extraction and sentiment analysis, to allow easier data analysis on news articles and editorials. The proposed technique was tested on news documents written in Filipino. Grammar-based rules were formulated to extract pertinent information from the articles, and were automated through bootstrapping. The extracted information include the Filipino equivalent of the 5W user requirement proposed by Das et al. (2012) that answers the questions who, what, when, where, and why. Subsequently, the articles related through the 5Ws were analyzed based on their sentiment. Both information extraction and sentiment analysis were done at the article level. Collective results were presented visually. In designing the user interface, we considered (1) how the user would be able to find the articles he is looking for, (2) how he will immediately see the important points in the articles, as well as (3) the presenting the sentiment present in each articles and in the selected articles as a whole. To evaluate the performance of the information extraction and sentiment analysis, a gold standard was built to which the machine's output was compared. The visualization system was also subjectively rated according to usability.
format text
author Cheng, Charibeth
Cagampan, Bernadyn
Lim, Christine Diane
author_facet Cheng, Charibeth
Cagampan, Bernadyn
Lim, Christine Diane
author_sort Cheng, Charibeth
title Organizing news articles and editorials through information extraction and sentiment analysis
title_short Organizing news articles and editorials through information extraction and sentiment analysis
title_full Organizing news articles and editorials through information extraction and sentiment analysis
title_fullStr Organizing news articles and editorials through information extraction and sentiment analysis
title_full_unstemmed Organizing news articles and editorials through information extraction and sentiment analysis
title_sort organizing news articles and editorials through information extraction and sentiment analysis
publisher Animo Repository
publishDate 2016
url https://animorepository.dlsu.edu.ph/faculty_research/3515
_version_ 1767195926333489152