Automatic document summarization from social media and online news

This dissertation proposes a new method for sentence embedding and document summarization. A topic model is used to modify the sentence embedding method SIF so that it captures information from the document itself instead of relying on an external corpus. Thus, the modification embeds the information of...

Bibliographic Details
Main Author: Feng, Zijian
Other Authors: Mao Kezhi
Format: Thesis-Master by Coursework
Language: English
Published: Nanyang Technological University 2020
Subjects: Engineering::Electrical and electronic engineering
Online Access: https://hdl.handle.net/10356/141154
Institution: Nanyang Technological University
id sg-ntu-dr.10356-141154
record_format dspace
spelling sg-ntu-dr.10356-141154 2023-07-04T16:35:53Z Automatic document summarization from social media and online news Feng, Zijian Mao Kezhi School of Electrical and Electronic Engineering EKZMao@ntu.edu.sg Engineering::Electrical and electronic engineering Master of Science (Signal Processing) 2020-06-04T07:59:21Z 2020-06-04T07:59:21Z 2020 Thesis-Master by Coursework https://hdl.handle.net/10356/141154 en application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Electrical and electronic engineering
description This dissertation proposes a new method for sentence embedding and document summarization. A topic model is used to modify the sentence embedding method SIF so that it captures information from the document itself instead of relying on an external corpus. The modification thus embeds information about the entire document into the sentence vectors, which benefits subsequent information extraction. A graph-based method then scores the sentences, and the highest-scoring sentences are selected to form the summary. The dissertation also tests the impact of varying the model's parameters. The experimental results show that the proposed model outperforms other classic and advanced models in semantic analysis and summary extraction, with strong robustness. The datasets used are drawn from social media and online news, which demonstrates the model's applicability to online information extraction.
author2 Mao Kezhi
author_facet Mao Kezhi
Feng, Zijian
format Thesis-Master by Coursework
author Feng, Zijian
author_sort Feng, Zijian
title Automatic document summarization from social media and online news
title_sort automatic document summarization from social media and online news
publisher Nanyang Technological University
publishDate 2020
url https://hdl.handle.net/10356/141154
_version_ 1772826934360670208