A thorough benchmark and a new model for light field saliency detection
Compared with current RGB or RGB-D saliency detection datasets, those for light field saliency detection often suffer from many defects, e.g., insufficient data amount and diversity, incomplete data formats, and rough annotations, thus impeding the prosperity of this field. To settle these issues, w...
Saved in:
Main Authors: | , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2023
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/172179 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-172179 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1721792023-11-28T05:57:53Z A thorough benchmark and a new model for light field saliency detection Gao, Wei Fan, Songlin Li, Ge Lin, Weisi School of Computer Science and Engineering Engineering::Computer science and engineering Focal Stack Light Field Compared with current RGB or RGB-D saliency detection datasets, those for light field saliency detection often suffer from many defects, e.g., insufficient data amount and diversity, incomplete data formats, and rough annotations, thus impeding the prosperity of this field. To settle these issues, we elaborately build a large-scale light field dataset, dubbed PKU-LF, comprising 5,000 light fields and covering diverse indoor and outdoor scenes. Our PKU-LF provides all-inclusive representation formats of light fields and offers a unified platform for comparing algorithms utilizing different input formats. For sparking new vitality in saliency detection tasks, we present many unexplored scenarios (such as underwater and high-resolution scenes) and the richest annotations (such as scribble annotations, bounding boxes, object-/instance-level annotations, and edge annotations), on which many potential attention modeling tasks can be investigated. To facilitate the development of saliency detection, we systematically evaluate and analyze 16 representative 2D, 3D, and 4D methods on four existing datasets and the proposed dataset, furnishing a thorough benchmark. Furthermore, tailored to the distinct structural characteristics of light fields, a novel symmetric two-stream architecture (STSA) network is proposed to predict the saliency of light fields more accurately. Specifically, our STSA incorporates a focalness interweavement module (FIM) and three partial decoder modules (PDM). The former is designed to efficiently establish long-range dependencies across focal slices, while the latter aims to effectively aggregate the extracted coadjutant features in a mutual-enhancement way. Extensive experiments demonstrate that our method can significantly outperform the competitors. This work was supported in part by the Natural Science Foundation of China under Grants 62271013 and 62031013, in part by Guangdong Basic and Applied Basic Research Foundation under Grant 2019A1515012031, in part by Shenzhen Fundamental Research Program under Grant GXWD20201231165807007-20200806163656003, in part by Shenzhen Science and Technology Plan Basic Research Project under Grant JCYJ20190808161805519, and in part by The Major Key Project of PCL under Grant PCL2021A06. 2023-11-28T05:57:53Z 2023-11-28T05:57:53Z 2023 Journal Article Gao, W., Fan, S., Li, G. & Lin, W. (2023). A thorough benchmark and a new model for light field saliency detection. IEEE Transactions On Pattern Analysis and Machine Intelligence, 45(7), 8003-8019. https://dx.doi.org/10.1109/TPAMI.2023.3235415 0162-8828 https://hdl.handle.net/10356/172179 10.1109/TPAMI.2023.3235415 37276121 2-s2.0-85147222569 7 45 8003 8019 en IEEE Transactions on Pattern Analysis and Machine Intelligence © 2023 IEEE. All rights reserved. |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering Focal Stack Light Field |
spellingShingle |
Engineering::Computer science and engineering Focal Stack Light Field Gao, Wei Fan, Songlin Li, Ge Lin, Weisi A thorough benchmark and a new model for light field saliency detection |
description |
Compared with current RGB or RGB-D saliency detection datasets, those for light field saliency detection often suffer from many defects, e.g., insufficient data amount and diversity, incomplete data formats, and rough annotations, thus impeding the prosperity of this field. To settle these issues, we elaborately build a large-scale light field dataset, dubbed PKU-LF, comprising 5,000 light fields and covering diverse indoor and outdoor scenes. Our PKU-LF provides all-inclusive representation formats of light fields and offers a unified platform for comparing algorithms utilizing different input formats. For sparking new vitality in saliency detection tasks, we present many unexplored scenarios (such as underwater and high-resolution scenes) and the richest annotations (such as scribble annotations, bounding boxes, object-/instance-level annotations, and edge annotations), on which many potential attention modeling tasks can be investigated. To facilitate the development of saliency detection, we systematically evaluate and analyze 16 representative 2D, 3D, and 4D methods on four existing datasets and the proposed dataset, furnishing a thorough benchmark. Furthermore, tailored to the distinct structural characteristics of light fields, a novel symmetric two-stream architecture (STSA) network is proposed to predict the saliency of light fields more accurately. Specifically, our STSA incorporates a focalness interweavement module (FIM) and three partial decoder modules (PDM). The former is designed to efficiently establish long-range dependencies across focal slices, while the latter aims to effectively aggregate the extracted coadjutant features in a mutual-enhancement way. Extensive experiments demonstrate that our method can significantly outperform the competitors. |
author2 |
School of Computer Science and Engineering |
author_facet |
School of Computer Science and Engineering Gao, Wei Fan, Songlin Li, Ge Lin, Weisi |
format |
Article |
author |
Gao, Wei Fan, Songlin Li, Ge Lin, Weisi |
author_sort |
Gao, Wei |
title |
A thorough benchmark and a new model for light field saliency detection |
title_short |
A thorough benchmark and a new model for light field saliency detection |
title_full |
A thorough benchmark and a new model for light field saliency detection |
title_fullStr |
A thorough benchmark and a new model for light field saliency detection |
title_full_unstemmed |
A thorough benchmark and a new model for light field saliency detection |
title_sort |
thorough benchmark and a new model for light field saliency detection |
publishDate |
2023 |
url |
https://hdl.handle.net/10356/172179 |
_version_ |
1783955580039200768 |