Context-aware visual policy network for fine-grained image captioning

With the maturity of visual detection techniques, we are more ambitious in describing visual content with open-vocabulary, fine-grained and free-form language, i.e., the task of image captioning. In particular, we are interested in generating longer, richer and more fine-grained sentences and paragr...

Bibliographic Details
Main Authors: Zha, Zheng-Jun; Liu, Daqing; Zhang, Hanwang; Zhang, Yongdong; Wu, Feng
Other Authors: School of Computer Science and Engineering
Format: Article
Language: English
Published: 2022
Online Access: https://hdl.handle.net/10356/162628
Institution: Nanyang Technological University