Structure-aware multimodal feature fusion for RGB-D scene classification and beyond
While convolutional neural networks (CNNs) have been excellent for object recognition, the greater spatial variability in scene images typically means that the standard full-image CNN features are suboptimal for scene classification. In this article, we investigate a framework allowing greater spati...
Saved in:
Main Authors: | Wang, Anran, Cai, Jianfei, Lu, Jiwen, Cham, Tat-Jen |
---|---|
Other Authors: | School of Computer Science and Engineering |
Format: | Article |
Language: | English |
Published: |
2020
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/138263 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
Similar Items
-
Feature learning for RGB-D scene understanding
by: Wang, Anran
Published: (2016) -
Multimodal sentiment analysis using hierarchical fusion with context modeling
by: Majumder, Navonil, et al.
Published: (2020) -
Fusing pairwise modalities for emotion recognition in conversations
by: Fan, Chunxiao, et al.
Published: (2024) -
Multimodal fusion for multimedia analysis: A survey
by: Atrey, P.K., et al.
Published: (2013) -
Sentic maxine: Multimodal affective fusion and emotional paths
by: Hupont, I., et al.
Published: (2014)