Graph based extractive text summarization based on triangle counting approach

Currently, with the exponential rising quantity of automated textual data available on the Web, end users require the ability to get information in summary form, while keeping the most vital information in the document. As a result of this, the necessity for the creation of Summarization systems bec...

Full description

Saved in:
Bibliographic Details
Main Authors: Isiaka, Obasa Adekunle, Al-Khassawneh, Yazan Alaya Jameel, Salim, Naomie
Format: Conference or Workshop Item
Published: 2014
Subjects:
Online Access:http://eprints.utm.my/id/eprint/63429/
https://rome2014.sched.com/event/1pSKuAu/graph-based-extractive-text-summarization-based-on-triangle-counting-approach
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Teknologi Malaysia
id my.utm.63429
record_format eprints
spelling my.utm.634292017-05-29T03:58:18Z http://eprints.utm.my/id/eprint/63429/ Graph based extractive text summarization based on triangle counting approach Isiaka, Obasa Adekunle Al-Khassawneh, Yazan Alaya Jameel Salim, Naomie QA75 Electronic computers. Computer science Currently, with the exponential rising quantity of automated textual data available on the Web, end users require the ability to get information in summary form, while keeping the most vital information in the document. As a result of this, the necessity for the creation of Summarization systems became vital. Summarization systems, collect and focus on the most important ideas of the papers and help the users to find and understand the main ideas of the text faster and in a simpler way from the dispensation of information. Compelling set of such systems are those that create summaries of extracts. This type of summary, which is called Extractive Summarization, extracts the most applicable sentences from the main document. The used methods, usually assign a score for every sentence in the text, based on specific features. Then choose the most important sentences, according to the degree of score for each sentence. These features include but not limited to, the sentence length, its similarity with the title, the position of the sentence in the main document, and the frequency of the words in the sentence. Nevertheless, not have been achieved quality summaries corresponds with the ones made by humans, and therefore proposing the techniques continue to be raised, for the aims of improving the outcomes. One methodology for creating extractive summary is using the graph theory. One field in graph theory called graph pruning / reduction, which aims to find the greatest illustration of the original graph with less number of nodes and edges. This paper proposes a method of extractive summarization based on a graph reduction technique called the triangle counting approach. This method has three main phases. The first phase is graph representation, where nodes are the sentences and edges are the similarity between the sentences. The second phase is triangles construction, and the third phase is bit vector representation for the triangles nodes and finally create the summary based on the values of bit vector. The proposed method was evaluated, using ROUGE measures on the dataset DUC2002. The results showed that by using triangle counting as a reduction technique, it performs better than the state of the art methods. 2014 Conference or Workshop Item PeerReviewed Isiaka, Obasa Adekunle and Al-Khassawneh, Yazan Alaya Jameel and Salim, Naomie (2014) Graph based extractive text summarization based on triangle counting approach. In: International Conference for Technology and Science, 28-31 Oct, 2014, Italy. https://rome2014.sched.com/event/1pSKuAu/graph-based-extractive-text-summarization-based-on-triangle-counting-approach
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
topic QA75 Electronic computers. Computer science
spellingShingle QA75 Electronic computers. Computer science
Isiaka, Obasa Adekunle
Al-Khassawneh, Yazan Alaya Jameel
Salim, Naomie
Graph based extractive text summarization based on triangle counting approach
description Currently, with the exponential rising quantity of automated textual data available on the Web, end users require the ability to get information in summary form, while keeping the most vital information in the document. As a result of this, the necessity for the creation of Summarization systems became vital. Summarization systems, collect and focus on the most important ideas of the papers and help the users to find and understand the main ideas of the text faster and in a simpler way from the dispensation of information. Compelling set of such systems are those that create summaries of extracts. This type of summary, which is called Extractive Summarization, extracts the most applicable sentences from the main document. The used methods, usually assign a score for every sentence in the text, based on specific features. Then choose the most important sentences, according to the degree of score for each sentence. These features include but not limited to, the sentence length, its similarity with the title, the position of the sentence in the main document, and the frequency of the words in the sentence. Nevertheless, not have been achieved quality summaries corresponds with the ones made by humans, and therefore proposing the techniques continue to be raised, for the aims of improving the outcomes. One methodology for creating extractive summary is using the graph theory. One field in graph theory called graph pruning / reduction, which aims to find the greatest illustration of the original graph with less number of nodes and edges. This paper proposes a method of extractive summarization based on a graph reduction technique called the triangle counting approach. This method has three main phases. The first phase is graph representation, where nodes are the sentences and edges are the similarity between the sentences. The second phase is triangles construction, and the third phase is bit vector representation for the triangles nodes and finally create the summary based on the values of bit vector. The proposed method was evaluated, using ROUGE measures on the dataset DUC2002. The results showed that by using triangle counting as a reduction technique, it performs better than the state of the art methods.
format Conference or Workshop Item
author Isiaka, Obasa Adekunle
Al-Khassawneh, Yazan Alaya Jameel
Salim, Naomie
author_facet Isiaka, Obasa Adekunle
Al-Khassawneh, Yazan Alaya Jameel
Salim, Naomie
author_sort Isiaka, Obasa Adekunle
title Graph based extractive text summarization based on triangle counting approach
title_short Graph based extractive text summarization based on triangle counting approach
title_full Graph based extractive text summarization based on triangle counting approach
title_fullStr Graph based extractive text summarization based on triangle counting approach
title_full_unstemmed Graph based extractive text summarization based on triangle counting approach
title_sort graph based extractive text summarization based on triangle counting approach
publishDate 2014
url http://eprints.utm.my/id/eprint/63429/
https://rome2014.sched.com/event/1pSKuAu/graph-based-extractive-text-summarization-based-on-triangle-counting-approach
_version_ 1643655712361938944