Towards unbiased visual language reasoning and consistent segmentation
In recent years, significant advances have been made in standard recognition tasks such as classification, detection, and segmentation. To deepen visual understanding, a growing number of researchers have introduced text information for reasoning, as in image captioning, visual question answeri...
Main Author: | Huang, Jianqiang |
---|---|
Other Authors: | Hanwang Zhang |
Format: | Thesis-Doctor of Philosophy |
Language: | English |
Published: | Nanyang Technological University, 2023 |
Online Access: | https://hdl.handle.net/10356/169540 |
Institution: | Nanyang Technological University |
Similar Items
- Exploiting visual context and consistency for semantic segmentation
  by: Kang, Dang
  Published: (2018)
- Instance LSeg - exploring instance level information from visual language model
  by: Lin, Zixing
  Published: (2023)
- Reasoning over multiple human-human interaction activities
  by: Perez, Mauricio Lisboa
  Published: (2021)
- Towards unbiased visual emotion recognition via causal intervention
  by: Chen, Yuedong, et al.
  Published: (2023)
- Disentangling latent space of variational autoencoder with distribution dependent guarantees for out-of-distribution detection and reasoning
  by: Rahiminasab Zahra Reza (Zahra Rahiminasab)
  Published: (2024)