Semantic-filtered Soft-Split-Aware video captioning with audio-augmented feature

Semantic-filtered Soft-Split-Aware video captioning with audio-augmented feature

Automatic video description, or video captioning, is a challenging yet much attractive task. It aims to combine video with text. Multiple methods have been proposed based on neural networks, utilizing Convolutional Neural Networks (CNN) to extract features, and Recurrent Neural Networks (RNN) to enc...

Full description

Saved in:

Bibliographic Details
Main Authors:	Xu, Yuecong, Yang, Jianfei, Mao, Kezhi
Other Authors:	School of Electrical and Electronic Engineering
Format:	Article
Language:	English
Published:	2021
Subjects:	Engineering::Electrical and electronic engineering Video Captioning Long Short-term Memory
Online Access:	https://hdl.handle.net/10356/151341
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

Similar Items

PERSONALIZED VISUAL INFORMATION CAPTIONING
by: WU SHUANG
Published: (2023)

Cross-modal graph with meta concepts for video captioning
by: Wang, Hao, et al.
Published: (2022)

基于深度学习的 LSTM 模型在 X 荧光光谱中的应用 = Application of an LSTM model based on deep learning through X-ray fluorescence spectroscopy
by: Tang, Lin, et al.
Published: (2024)

Developing effective optimized machine learning approaches for settlement prediction of shallow foundation
by: Khajehzadeh, Mohammad, et al.
Published: (2024)

A Fine-Grained Spatial-Temporal Attention Model for Video Captioning
by: Liu, A.-A., et al.
Published: (2021)

Image captioning via semantic element embedding
by: ZHANG, Xiaodan, et al.
Published: (2020)

Short-Term photovoltaic power forecasting based on long short term memory neural network and attention mechanism
by: Zhou, H., et al.
Published: (2021)

Skeleton-based human action recognition with global context-aware attention LSTM networks
by: Liu, Jun, et al.
Published: (2020)

Stack-VS : stacked visual-semantic attention for image caption generation
by: Cheng, Ling, et al.
Published: (2021)

Learning generalized video memory for automatic video captioning
by: CHANG, Poo-Hee, et al.
Published: (2018)

Deconfounded image captioning: a causal retrospect
by: Yang, Xu, et al.
Published: (2022)

Context-aware visual policy network for fine-grained image captioning
by: Zha, Zheng-Jun, et al.
Published: (2022)

Classification of ECG anomaly with dynamically-biased LSTM for continuous cardiac monitoring
by: Hu, Jinhai, et al.
Published: (2024)

Dynamic captioning: Video accessibility enhancement for hearing impairment
by: Hong, R., et al.
Published: (2013)

Interactive change-aware transformer network for remote sensing image change captioning
by: Cai, Chen, et al.
Published: (2024)

Learning to collocate Visual-Linguistic Neural Modules for image captioning
by: Yang, Xu, et al.
Published: (2023)

The development of short-term and incidental memory of Children in Partially Urban Area
by: Chanya Hengtrakul
Published: (2012)

Sentic API for mental health detection
by: Yang, Willis Xianzu
Published: (2024)

Iconicity and anxiety levels: Short term memory recall effects
by: Acosta, Maximino P., et al.
Published: (1988)

Emergence of cortical network motifs for short-term memory during learning
by: Chia, Xin Wei, et al.
Published: (2024)

DISTINCTIVENESS AND MODALITY EFFECTS ON THE FORMATION OF SHORT-TERM FALSE MEMORIES
by: LIONEL LIM CHENG LIANG
Published: (2018)

Learning transferable perturbations for image captioning
by: WU, Hanjie, et al.
Published: (2022)

Keyword-driven image captioning via Context-dependent Bilateral LSTM
by: ZHANG, Xiaodan, et al.
Published: (2017)

Cross-modal graph with meta concepts for video captioning
by: WANG, Hao, et al.
Published: (2022)

Sleep after learning aids the consolidation of factual knowledge, but not relearning
by: Cousins, James N., et al.
Published: (2022)

Efficient implementation of activation functions for LSTM accelerators
by: Chong, Yi Sheng, et al.
Published: (2021)

The Implementation of Long-Short Term Memory for Tourism Industry in Malaysia
by: Taib, Siti Aishah Tsamienah, et al.
Published: (2025)

AmpSum: adaptive multiple-product summarization towards improving recommendation captions
by: TRUONG, Quoc Tuan, et al.
Published: (2022)

Quantum adaptive agents with efficient long-term memories
by: Elliott, Thomas J., et al.
Published: (2023)

Sleep deprivation accelerates delay-related loss of visual short-term memories without affecting precision
by: Wee, N., et al.
Published: (2014)

Low antibody titers five years after vaccination with the CYD-TDV dengue vaccine in both pre-immune and naïve vaccinees
by: Velumani, Sumathy, et al.
Published: (2016)

Not all trips are equal: Analyzing foursquare check-ins of trips and city visitors
by: CHONG, Wen Haw, et al.
Published: (2015)

More is better : precise and detailed image captioning using online positive recall and missing concepts mining
by: Zhang, Mingxing, et al.
Published: (2020)

Distilling the knowledge from handcrafted features for human activity recognition
by: Chen, Zhenghua, et al.
Published: (2019)

Topical co-attention networks for hashtag recommendation on microblogs
by: LI, Yang, et al.
Published: (2019)

Man-machine cooperative method based on deep learning in flexible manufacturing system
by: He, Chongshan
Published: (2024)

A novel nucleolar transcriptional activator ApLLP for long-term memory formation is intrinsically unstructured but functionally active
by: Liu, J., et al.
Published: (2011)

CgT-GAN: CLIP-guided text GAN for image captioning
by: YU, Jiarui, et al.
Published: (2023)

Presynaptic learning and memory with a persistent firing neuron and a habituating synapse: A model of short term persistent habituation
by: Ramanathan, K., et al.
Published: (2014)

Cholinergic augmentation modulates visual task performance in sleep-deprived young adults
by: Chuah, L.Y.M., et al.
Published: (2014)