Video summarization via multiview representative selection

Bibliographic Details
Main Authors:	Meng, Jingjing, Wang, Suchen, Wang, Hongxing, Yuan, Junsong, Tan, Yap-Peng
Other Authors:	School of Electrical and Electronic Engineering
Format:	Article
Language:	English
Published:	2019
Subjects:	Video Summarization; Multi-view; DRNTU::Engineering::Electrical and electronic engineering
Online Access:	https://hdl.handle.net/10356/106096 http://hdl.handle.net/10220/48870 http://dx.doi.org/10.1109/TIP.2017.2789332
Institution:	Nanyang Technological University

Description
Summary:	Video contents are inherently heterogeneous. To exploit different feature modalities in a diverse video collection for video summarization, we propose to formulate the task as a multiview representative selection problem. The goal is to select visual elements that are representative of a video consistently across different views (i.e., feature modalities). We present the multiview sparse dictionary selection with centroid co-regularization method, which optimizes the representative selection in each view and enforces similarity among the view-specific selections by regularizing them towards a consensus selection. We also introduce a diversity regularizer to favor a selection of diverse representatives. The problem can be solved efficiently by alternating minimization with the fast iterative shrinkage thresholding algorithm (FISTA). Experiments on synthetic data and benchmark video datasets validate the effectiveness of the proposed approach for video summarization, in comparison with other video summarization methods and representative selection methods such as K-medoids, sparse dictionary selection, and multiview clustering.
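
Illustration (not part of the catalog record or the article): the summary names sparse dictionary selection and FISTA but does not state the objective function. The sketch below is a simplified single-view version under an assumed, commonly used formulation, min over X of 0.5 * ||Y - Y X||_F^2 + lam * ||X||_{2,1}, where the columns of Y are feature vectors and rows of X with large norm mark representatives. It omits the centroid co-regularization across views and the diversity regularizer described in the summary. All function names and parameters are hypothetical.

```python
import numpy as np


def row_soft_threshold(X, tau):
    """Proximal operator of tau * ||X||_{2,1}: shrink each row's L2 norm by tau."""
    norms = np.linalg.norm(X, axis=1, keepdims=True)
    scale = np.maximum(0.0, 1.0 - tau / np.maximum(norms, 1e-12))
    return scale * X


def objective(Y, X, lam):
    """Assumed objective: reconstruction error plus row-sparsity penalty."""
    residual = Y - Y @ X
    return 0.5 * np.sum(residual ** 2) + lam * np.sum(np.linalg.norm(X, axis=1))


def sparse_dictionary_selection(Y, lam, n_iter=300):
    """Single-view selection solved with FISTA (assumed formulation, see lead-in).

    Y is a (d, n) array whose columns are the n candidate elements.
    Returns an (n, n) coefficient matrix X; non-zero rows indicate selected elements.
    """
    n = Y.shape[1]
    G = Y.T @ Y                                  # Gram matrix, reused in every gradient
    L = max(np.linalg.norm(Y, 2) ** 2, 1e-12)    # Lipschitz constant of the gradient
    X = np.zeros((n, n))
    Z = X.copy()                                 # extrapolated point
    t = 1.0
    for _ in range(n_iter):
        grad = G @ Z - G                         # gradient of 0.5 * ||Y - Y Z||_F^2
        X_new = row_soft_threshold(Z - grad / L, lam / L)
        t_new = (1.0 + np.sqrt(1.0 + 4.0 * t * t)) / 2.0
        Z = X_new + ((t - 1.0) / t_new) * (X_new - X)   # momentum step
        X, t = X_new, t_new
    return X


def select_representatives(X, k):
    """Return indices of the k rows of X with the largest L2 norm."""
    scores = np.linalg.norm(X, axis=1)
    return np.argsort(scores)[::-1][:k]
```

A possible usage, assuming a (d, n) feature matrix Y with one column per frame or segment: call X = sparse_dictionary_selection(Y, lam) and then select_representatives(X, k) to obtain k candidate keyframe indices. A multiview variant in the spirit of the summary would run one such problem per feature modality and couple the per-view solutions through a shared consensus matrix; that coupling is not shown here.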

Similar Items