Semantic-filtered Soft-Split-Aware video captioning with audio-augmented feature
Automatic video description, or video captioning, is a challenging yet much attractive task. It aims to combine video with text. Multiple methods have been proposed based on neural networks, utilizing Convolutional Neural Networks (CNN) to extract features, and Recurrent Neural Networks (RNN) to enc...
Saved in:
Main Authors: | Xu, Yuecong, Yang, Jianfei, Mao, Kezhi |
---|---|
Other Authors: | School of Electrical and Electronic Engineering |
Format: | Article |
Language: | English |
Published: |
2021
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/151341 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
Similar Items
-
PERSONALIZED VISUAL INFORMATION CAPTIONING
by: WU SHUANG
Published: (2023) -
Cross-modal graph with meta concepts for video captioning
by: Wang, Hao, et al.
Published: (2022) -
A Fine-Grained Spatial-Temporal Attention Model for Video Captioning
by: Liu, A.-A., et al.
Published: (2021) -
Image captioning via semantic element embedding
by: ZHANG, Xiaodan, et al.
Published: (2020) -
基于深度学习的 LSTM 模型在 X 荧光光谱中的应用 = Application of an LSTM model based on deep learning through X-ray fluorescence spectroscopy
by: Tang, Lin, et al.
Published: (2024)