Semantic-filtered Soft-Split-Aware video captioning with audio-augmented feature

Automatic video description, or video captioning, is a challenging yet highly attractive task. It aims to combine video with text. Multiple neural-network-based methods have been proposed, using Convolutional Neural Networks (CNNs) to extract features and Recurrent Neural Networks (RNNs) to enc...

Bibliographic Details
Main Authors:	Xu, Yuecong; Yang, Jianfei; Mao, Kezhi
Other Authors:	School of Electrical and Electronic Engineering
Format:	Article
Language:	English
Published:	2021
Subjects:	Engineering::Electrical and electronic engineering; Video Captioning; Long Short-term Memory
Online Access:	https://hdl.handle.net/10356/151341
Institution:	Nanyang Technological University
