Summarizing Indonesian News Articles Using Graph Convolutional Network

Multi-document summarization transforms a set of related documents into one concise summary. Existing Indonesian news articles summarizations do not take relationships between sentences into account and heavily depends on Indonesian language tools and resources. In this paper, we employ Graph Convol...

Full description

Saved in:
Bibliographic Details
Main Authors: Garmastewira, Garmastewira, Khodra, Masayu Leylia
Format: Article
Language:English
Published: Universiti Utara Malaysia Press 2019
Subjects:
Online Access:https://repo.uum.edu.my/id/eprint/29121/1/JICT%2018%2003%202019%20345-365.pdf
https://repo.uum.edu.my/id/eprint/29121/
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Utara Malaysia
Language: English
id my.uum.repo.29121
record_format eprints
spelling my.uum.repo.291212023-01-29T01:30:33Z https://repo.uum.edu.my/id/eprint/29121/ Summarizing Indonesian News Articles Using Graph Convolutional Network Garmastewira, Garmastewira Khodra, Masayu Leylia T Technology (General) Multi-document summarization transforms a set of related documents into one concise summary. Existing Indonesian news articles summarizations do not take relationships between sentences into account and heavily depends on Indonesian language tools and resources. In this paper, we employ Graph Convolutional Network (GCN) which accepts word embedding sequence and sentence relationship graph as input for Indonesian news articles summarization. Our system is comprised of four main components, which are preprocess, graph construction, sentence scoring, and sentence selection components. Sentence scoring component is a neural network that uses Recurrent Neural Network (RNN) and GCN to produce the scores of all sentences. We use three different representation types for the sentence relationship graph. Sentence selection component then generates summary with two different techniques, which are by greedily choosing sentences with the highest scores and by using Maximum Marginal Relevance (MMR) technique. The evaluation shows that GCN summarizer with Personalized Discourse Graph (PDG) graph representation system achieves the best results with average ROUGE-2 recall score of 0.370 for 100-word summary and 0.378 for 200-word summary. Sentence selection using greedy technique gives better results for generating 100-word summary, while MMR performs better for generating 200-word summary. Universiti Utara Malaysia Press 2019 Article PeerReviewed application/pdf en https://repo.uum.edu.my/id/eprint/29121/1/JICT%2018%2003%202019%20345-365.pdf Garmastewira, Garmastewira and Khodra, Masayu Leylia (2019) Summarizing Indonesian News Articles Using Graph Convolutional Network. Journal of Information and Communication Technology, 18 (3). pp. 345-365. ISSN 2180-3862
institution Universiti Utara Malaysia
building UUM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Utara Malaysia
content_source UUM Institutional Repository
url_provider http://repo.uum.edu.my/
language English
topic T Technology (General)
spellingShingle T Technology (General)
Garmastewira, Garmastewira
Khodra, Masayu Leylia
Summarizing Indonesian News Articles Using Graph Convolutional Network
description Multi-document summarization transforms a set of related documents into one concise summary. Existing Indonesian news articles summarizations do not take relationships between sentences into account and heavily depends on Indonesian language tools and resources. In this paper, we employ Graph Convolutional Network (GCN) which accepts word embedding sequence and sentence relationship graph as input for Indonesian news articles summarization. Our system is comprised of four main components, which are preprocess, graph construction, sentence scoring, and sentence selection components. Sentence scoring component is a neural network that uses Recurrent Neural Network (RNN) and GCN to produce the scores of all sentences. We use three different representation types for the sentence relationship graph. Sentence selection component then generates summary with two different techniques, which are by greedily choosing sentences with the highest scores and by using Maximum Marginal Relevance (MMR) technique. The evaluation shows that GCN summarizer with Personalized Discourse Graph (PDG) graph representation system achieves the best results with average ROUGE-2 recall score of 0.370 for 100-word summary and 0.378 for 200-word summary. Sentence selection using greedy technique gives better results for generating 100-word summary, while MMR performs better for generating 200-word summary.
format Article
author Garmastewira, Garmastewira
Khodra, Masayu Leylia
author_facet Garmastewira, Garmastewira
Khodra, Masayu Leylia
author_sort Garmastewira, Garmastewira
title Summarizing Indonesian News Articles Using Graph Convolutional Network
title_short Summarizing Indonesian News Articles Using Graph Convolutional Network
title_full Summarizing Indonesian News Articles Using Graph Convolutional Network
title_fullStr Summarizing Indonesian News Articles Using Graph Convolutional Network
title_full_unstemmed Summarizing Indonesian News Articles Using Graph Convolutional Network
title_sort summarizing indonesian news articles using graph convolutional network
publisher Universiti Utara Malaysia Press
publishDate 2019
url https://repo.uum.edu.my/id/eprint/29121/1/JICT%2018%2003%202019%20345-365.pdf
https://repo.uum.edu.my/id/eprint/29121/
_version_ 1756687049054748672