Towards unbiased visual language reasoning and consistent segmentation

Towards unbiased visual language reasoning and consistent segmentation

In recent years, we have made significant advances in standard recognition tasks such as classification, detection or segmentation. To further understand from vi- sion, more and more researchers pay attention to introduce text information for reasoning. Such as image caption, visual question answeri...

Full description

Saved in:

Bibliographic Details
Main Author:	Huang, Jianqiang
Other Authors:	Hanwang Zhang
Format:	Thesis-Doctor of Philosophy
Language:	English
Published:	Nanyang Technological University 2023
Subjects:	Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Engineering::Computer science and engineering::Computing methodologies::Image processing and computer vision
Online Access:	https://hdl.handle.net/10356/169540
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

Similar Items

Exploiting visual context and consistency for semantic segmentation
by: Kang, Dang
Published: (2018)

Instance LSeg - exploring instance level information from visual language model
by: Lin, Zixing
Published: (2023)

Reasoning over multiple human-human interaction activities
by: Perez, Mauricio Lisboa
Published: (2021)

Towards unbiased visual emotion recognition via causal intervention
by: Chen, Yuedong, et al.
Published: (2023)

Disentangling latent space of variational autoencoder with distribution dependent guarantees for out-of-distribution detection and reasoning
by: Rahiminasab Zahra Reza (Zahra Rahiminasab)
Published: (2024)

Semantic segmentation of delayered IC images with shape-variant convolution
by: Wang, Xue
Published: (2022)

Language-guided visual retrieval
by: He, Su
Published: (2021)

Learning to control visual data translation
by: Koksal, Ali
Published: (2023)

Learning visual representations without human supervision
by: Xie, Jiahao
Published: (2023)

An empirical study on adaptation methods for large-scale vision-language models
by: Wang, Annan
Published: (2023)

2D and 3D visual understanding with limited supervision
by: Wu, Zhonghua
Published: (2023)

Towards interpretable & robust face recognition
by: Pattra, Surya Paryanta
Published: (2022)

Towards interpretable & robust occluded facial recognition
by: Rachita, Agrawal
Published: (2023)

Towards deep neural networks robust to adversarial examples
by: Matyasko, Alexander
Published: (2020)

Pose- and Attribute-consistent Person Image Synthesis
by: XU, Cheng, et al.
Published: (2023)

RGBD indoor semantic segmentation with segmentation transformer
by: Choong, Han Yi
Published: (2022)

Co-saliency based visual object co-segmentation and co-localization
by: Jerripothula, Koteswar Rao
Published: (2017)

Background preservation for text-guided image editing
by: Huang, Runtao
Published: (2023)

Panoptic image segmentation
by: Chua, Shahrin Zong Da
Published: (2022)

Semantic segmentation with less annotation efforts
by: Zhang, Tianyi
Published: (2020)

Unsupervised domain adaptation for LiDAR segmentation
by: Kong, Lingdong
Published: (2022)

4D point cloud semantic segmentation
by: Shi, Hanyu
Published: (2023)

Towards high-quality panoptic segmentation
by: Chen, Chongsong
Published: (2020)

Embodied object hunt
by: Yeo, Zhi Hong
Published: (2020)

Training deep network models for accurate recognition of texts in scenes
by: Teo, Ren Jie
Published: (2020)

Object detection from satellite imagery
by: Seah, Yi Xuan
Published: (2020)

Photorealistic stylised image quality assessment database (PSIQAD) building and modelling
by: Low, Qing Ru
Published: (2020)

Understanding variations (variant & invariant) of classification tasks/targets
by: Wan, Tai Fong
Published: (2020)

Human pose estimation and action recognition based on monocular video inputs
by: Leong, Mei Chee
Published: (2020)

On the exploration of referenced-based super resolution for face images
by: Ong, Ming Yang
Published: (2020)

Deep learning based car license plate recognition
by: Ngo, Jason Jun Hao
Published: (2021)

Multi-degradation image super-resolution using texture-transfer
by: Susanto, Stephanie Audrey
Published: (2021)

Learning to see in the dark
by: Chen, Sihao
Published: (2021)

Learning to recognize objects by adaptive knowledge transfer
by: Tao, Qingyi
Published: (2021)

Video-based traffic analysis
by: Fong, Hao Wei
Published: (2021)

Using deep learning for quality control in cyber-manufacturing
by: Dai, Wenting
Published: (2022)

Skin cancer detection with deep learning
by: Gupta, Jay
Published: (2022)

Attack on training effort of deep learning
by: Ho, Tony Man Tung
Published: (2022)

Attack on prediction confidence of deep learning neural networks
by: Ng, Garyl Xuan
Published: (2022)

Automatic recognition of facial expressions
by: Gee, Cheng Mun
Published: (2020)