Proposal-driven segmentation for videos

Effectively utilizing the common information in a set of video frames is a vital aspect of video segmentation. However, existing methods that transport the common information from a prior frame to the current frame do not use it effectively. In order to address this issue...

Bibliographic Details
Main Authors: LI, Junliang, HE, Shengfeng, WONG, Hon-Cheng, LO, Sio-Long
Format: text
Language: English
Published: Institutional Knowledge at Singapore Management University 2019
Subjects: Segmentation; proposals; convolutional neural network (CNN); Information Security
Online Access: https://ink.library.smu.edu.sg/sis_research/7876
Institution: Singapore Management University
id sg-smu-ink.sis_research-8879
record_format dspace
spelling sg-smu-ink.sis_research-8879 2023-06-15T09:00:05Z Proposal-driven segmentation for videos LI, Junliang HE, Shengfeng WONG, Hon-Cheng LO, Sio-Long
2019-08-01T07:00:00Z text https://ink.library.smu.edu.sg/sis_research/7876 info:doi/10.1109/LSP.2019.2921654 Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Segmentation proposals convolutional neural network (CNN) Information Security
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Segmentation
proposals
convolutional neural network (CNN)
Information Security
description Effectively utilizing the common information in a set of video frames is a vital aspect of video segmentation. However, existing methods that transport the common information from a prior frame to the current frame do not use it effectively. To address this issue, in this letter we apply a new strategy that jointly segments objects through a convolutional neural network (CNN), building a proposal-driven framework that exploits the common information between two video frames by processing them simultaneously. Moreover, proposals from the video frames are found to be useful for refining the segmentation: their segmentation results are fused with those of the video frames. In our framework, proposals with features are generated by a Faster R-CNN, and the L2 loss function is used to establish proposal pairs among proposals from the two selected frames. A newly trained ResNet then keeps the proposal pairs that contain the same content, and the PSPNet segmentation model generates the segmentation results for both the frames and the proposals. Finally, the proposals' segmentation results are refined using the video frames' segmentation results. The VOT 2016 segmentation dataset, the DAVIS 2017 dataset, and the SegTrack v2 dataset were used for training and testing our framework. Experimental results show that our proposal-driven segmentation framework achieves higher accuracy on video segmentation challenges than existing video segmentation methods.
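The pairing and fusion steps named in the description can be illustrated with a minimal sketch. It is not the authors' implementation: the record gives no formulas, so everything here is an assumption, including the function names (`l2_distance`, `pair_proposals`, `fuse_masks`), the nearest-neighbour pairing rule, and the weighted-average fusion. Proposal features are plain lists of floats standing in for detector features, and the ResNet step that keeps only same-content pairs is not modelled.

```python
from math import sqrt


def l2_distance(a, b):
    """Euclidean (L2) distance between two equal-length feature vectors."""
    if len(a) != len(b):
        raise ValueError("feature vectors must have the same length")
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def pair_proposals(feats_a, feats_b):
    """Pair each proposal of frame A with its nearest proposal in frame B.

    Assumption: "nearest" means smallest L2 distance between feature vectors.
    Returns a list of (index_a, index_b, distance) triples.
    """
    pairs = []
    if not feats_b:
        return pairs
    for i, fa in enumerate(feats_a):
        # Distance from proposal i to every proposal of the other frame.
        candidates = [(j, l2_distance(fa, fb)) for j, fb in enumerate(feats_b)]
        j, dist = min(candidates, key=lambda item: item[1])
        pairs.append((i, j, dist))
    return pairs


def fuse_masks(frame_mask, proposal_mask, top_left, weight=0.5):
    """Blend a proposal's mask probabilities into a copy of the frame mask.

    Assumption: fusion is a per-pixel weighted average inside the proposal
    box; `weight` is a hypothetical blending factor. `top_left` is the
    (row, column) of the proposal box within the frame.
    """
    top, left = top_left
    fused = [row[:] for row in frame_mask]  # leave the input untouched
    for r, proposal_row in enumerate(proposal_mask):
        for c, p in enumerate(proposal_row):
            old = frame_mask[top + r][left + c]
            fused[top + r][left + c] = (1 - weight) * old + weight * p
    return fused
```

Calling `pair_proposals` on the feature lists of two frames gives candidate pairs that a verification model could then filter, and `fuse_masks` is applied once per proposal box. Both functions use only the standard library so the sketch runs without a deep-learning framework.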
format text
author LI, Junliang
HE, Shengfeng
WONG, Hon-Cheng
LO, Sio-Long
title Proposal-driven segmentation for videos