Towards robust and efficient multimodal representation learning and fusion
In the past few years, multimodal learning has made significant progress. The goal of multimodal learning is to create models that can relate and process data from various modalities. One of the challenges is to learn useful representations efficiently given the heterogeneity of the data. Another is...
Saved in:
Main Author: | Guo, Xiaobao |
---|---|
Other Authors: | Kong Wai-Kin Adams |
Format: | Thesis-Doctor of Philosophy |
Language: | English |
Published: |
Nanyang Technological University
2025
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/182226 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
Similar Items
-
Multimodal fusion for multimedia analysis: A survey
by: Atrey, P.K., et al.
Published: (2013) -
Fusion of multimodal embeddings for ad-hoc video search
by: FRANCIS, Danny, et al.
Published: (2019) -
Query-document-dependent fusion: A case study of multimodal music retrieval
by: Li, Z., et al.
Published: (2014) -
Multimodal sentiment analysis using hierarchical fusion with context modeling
by: Majumder, Navonil, et al.
Published: (2020) -
Adaptive multimodal fusion based similarity measures in music information retrieval
by: ZHANG BINGJUN
Published: (2011)