Visual relationship detection

Visual relationship detection is the process of pairing the objects in the image and identifying the relationships between the objects in the form of “object-predicate-object”, such as “person riding bike”. Although there had been many attempts to develop visual relationship detection, most may not...

Full description

Saved in:
Bibliographic Details
Main Author: Park, Kunyoung
Other Authors: Zhang Hanwang
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2022
Subjects:
Online Access:https://hdl.handle.net/10356/156480
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-156480
record_format dspace
spelling sg-ntu-dr.10356-1564802022-04-17T12:16:27Z Visual relationship detection Park, Kunyoung Zhang Hanwang School of Computer Science and Engineering hanwangzhang@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Visual relationship detection is the process of pairing the objects in the image and identifying the relationships between the objects in the form of “object-predicate-object”, such as “person riding bike”. Although there had been many attempts to develop visual relationship detection, most may not provide useful information about the image due to biasedness. For instance, predicates made by biased scene graph generation (SGG) such as “on” and “next to” do not provide useful information about the image as compared to predicates generated by unbiased SGG, such as “sitting on” and “in front of”. In this project, Total Direct Effect (TDE) in causal inference with counterfactual thinking method was explored and adopted on SGG to remove the biasedness. This implementation had shown significant improvement of accuracy measured with Mean Recall@K (mR@K) metric used in this project. The results of the visual relationship detection were also visualised and analysed. Bachelor of Engineering (Computer Science) 2022-04-17T12:16:27Z 2022-04-17T12:16:27Z 2022 Final Year Project (FYP) Park, K. (2022). Visual relationship detection. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/156480 https://hdl.handle.net/10356/156480 en SCSE21-0518 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
spellingShingle Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Park, Kunyoung
Visual relationship detection
description Visual relationship detection is the process of pairing the objects in the image and identifying the relationships between the objects in the form of “object-predicate-object”, such as “person riding bike”. Although there had been many attempts to develop visual relationship detection, most may not provide useful information about the image due to biasedness. For instance, predicates made by biased scene graph generation (SGG) such as “on” and “next to” do not provide useful information about the image as compared to predicates generated by unbiased SGG, such as “sitting on” and “in front of”. In this project, Total Direct Effect (TDE) in causal inference with counterfactual thinking method was explored and adopted on SGG to remove the biasedness. This implementation had shown significant improvement of accuracy measured with Mean Recall@K (mR@K) metric used in this project. The results of the visual relationship detection were also visualised and analysed.
author2 Zhang Hanwang
author_facet Zhang Hanwang
Park, Kunyoung
format Final Year Project
author Park, Kunyoung
author_sort Park, Kunyoung
title Visual relationship detection
title_short Visual relationship detection
title_full Visual relationship detection
title_fullStr Visual relationship detection
title_full_unstemmed Visual relationship detection
title_sort visual relationship detection
publisher Nanyang Technological University
publishDate 2022
url https://hdl.handle.net/10356/156480
_version_ 1731235708887105536