Structure-aware multimodal feature fusion for RGB-D scene classification and beyond
While convolutional neural networks (CNNs) have been excellent for object recognition, the greater spatial variability in scene images typically means that the standard full-image CNN features are suboptimal for scene classification. In this article, we investigate a framework allowing greater spati...
Saved in:
Main Authors: | Wang, Anran, Cai, Jianfei, Lu, Jiwen, Cham, Tat-Jen |
---|---|
Other Authors: | School of Computer Science and Engineering |
Format: | Article |
Language: | English |
Published: |
2020
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/138263 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
Similar Items
-
Towards robust and efficient multimodal representation learning and fusion
by: Guo, Xiaobao
Published: (2025) -
Feature learning for RGB-D scene understanding
by: Wang, Anran
Published: (2016) -
Multimodal sentiment analysis using hierarchical fusion with context modeling
by: Majumder, Navonil, et al.
Published: (2020) -
Fusing pairwise modalities for emotion recognition in conversations
by: Fan, Chunxiao, et al.
Published: (2024) -
Exploring a multimodal fusion-based deep learning network for detecting facial palsy
by: OO, Heng Yim Nicole, et al.
Published: (2024)