Visual relationship detection
Visual relationship detection is the process of pairing the objects in the image and identifying the relationships between the objects in the form of “object-predicate-object”, such as “person riding bike”. Although there had been many attempts to develop visual relationship detection, most may not...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2022
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/156480 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-156480 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1564802022-04-17T12:16:27Z Visual relationship detection Park, Kunyoung Zhang Hanwang School of Computer Science and Engineering hanwangzhang@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Visual relationship detection is the process of pairing the objects in the image and identifying the relationships between the objects in the form of “object-predicate-object”, such as “person riding bike”. Although there had been many attempts to develop visual relationship detection, most may not provide useful information about the image due to biasedness. For instance, predicates made by biased scene graph generation (SGG) such as “on” and “next to” do not provide useful information about the image as compared to predicates generated by unbiased SGG, such as “sitting on” and “in front of”. In this project, Total Direct Effect (TDE) in causal inference with counterfactual thinking method was explored and adopted on SGG to remove the biasedness. This implementation had shown significant improvement of accuracy measured with Mean Recall@K (mR@K) metric used in this project. The results of the visual relationship detection were also visualised and analysed. Bachelor of Engineering (Computer Science) 2022-04-17T12:16:27Z 2022-04-17T12:16:27Z 2022 Final Year Project (FYP) Park, K. (2022). Visual relationship detection. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/156480 https://hdl.handle.net/10356/156480 en SCSE21-0518 application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence |
spellingShingle |
Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Park, Kunyoung Visual relationship detection |
description |
Visual relationship detection is the process of pairing the objects in the image and identifying the relationships between the objects in the form of “object-predicate-object”, such as “person riding bike”. Although there had been many attempts to develop visual relationship detection, most may not provide useful information about the image due to biasedness. For instance, predicates made by biased scene graph generation (SGG) such as “on” and “next to” do not provide useful information about the image as compared to predicates generated by unbiased SGG, such as “sitting on” and “in front of”. In this project, Total Direct Effect (TDE) in causal inference with counterfactual thinking method was explored and adopted on SGG to remove the biasedness. This implementation had shown significant improvement of accuracy measured with Mean Recall@K (mR@K) metric used in this project. The results of the visual relationship detection were also visualised and analysed. |
author2 |
Zhang Hanwang |
author_facet |
Zhang Hanwang Park, Kunyoung |
format |
Final Year Project |
author |
Park, Kunyoung |
author_sort |
Park, Kunyoung |
title |
Visual relationship detection |
title_short |
Visual relationship detection |
title_full |
Visual relationship detection |
title_fullStr |
Visual relationship detection |
title_full_unstemmed |
Visual relationship detection |
title_sort |
visual relationship detection |
publisher |
Nanyang Technological University |
publishDate |
2022 |
url |
https://hdl.handle.net/10356/156480 |
_version_ |
1731235708887105536 |