Flip-invariant SIFT for copy and object detection
Scale-invariant feature transform (SIFT) feature has been widely accepted as an effective local keypoint descriptor for its invariance to rotation, scale, and lighting changes in images. However, it is also well known that SIFT, which is derived from directionally sensitive gradient fields, is not f...
Saved in:
Main Authors: | , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2013
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/6323 https://ink.library.smu.edu.sg/context/sis_research/article/7326/viewcontent/tip13_zhao.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-7326 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-73262021-11-23T05:10:24Z Flip-invariant SIFT for copy and object detection ZHAO, Wan-Lei NGO, Chong-wah Scale-invariant feature transform (SIFT) feature has been widely accepted as an effective local keypoint descriptor for its invariance to rotation, scale, and lighting changes in images. However, it is also well known that SIFT, which is derived from directionally sensitive gradient fields, is not flip invariant. In real-world applications, flip or flip-like transformations are commonly observed in images due to artificial flipping, opposite capturing viewpoint, or symmetric patterns of objects. This paper proposes a new descriptor, named flip-invariant SIFT (or F-SIFT), that preserves the original properties of SIFT while being tolerant to flips. F-SIFT starts by estimating the dominant curl of a local patch and then geometrically normalizes the patch by flipping before the computation of SIFT. We demonstrate the power of F-SIFT on three tasks: large-scale video copy detection, object recognition, and detection. In copy detection, a framework, which smartly indices the flip properties of F-SIFT for rapid filtering and weak geometric checking, is proposed. F-SIFT not only significantly improves the detection accuracy of SIFT, but also leads to a more than 50% savings in computational cost. In object recognition, we demonstrate the superiority of F-SIFT in dealing with flip transformation by comparing it to seven other descriptors. In object detection, we further show the ability of F-SIFT in describing symmetric objects. Consistent improvement across different kinds of keypoint detectors is observed for F-SIFT over the original SIFT. 2013-03-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/6323 info:doi/10.1109/TIP.2012.2226043 https://ink.library.smu.edu.sg/context/sis_research/article/7326/viewcontent/tip13_zhao.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Flip invariant scale-invariant feature transform (SIFT) geometric verification object detection video copy detection Computer Sciences Graphics and Human Computer Interfaces |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Flip invariant scale-invariant feature transform (SIFT) geometric verification object detection video copy detection Computer Sciences Graphics and Human Computer Interfaces |
spellingShingle |
Flip invariant scale-invariant feature transform (SIFT) geometric verification object detection video copy detection Computer Sciences Graphics and Human Computer Interfaces ZHAO, Wan-Lei NGO, Chong-wah Flip-invariant SIFT for copy and object detection |
description |
Scale-invariant feature transform (SIFT) feature has been widely accepted as an effective local keypoint descriptor for its invariance to rotation, scale, and lighting changes in images. However, it is also well known that SIFT, which is derived from directionally sensitive gradient fields, is not flip invariant. In real-world applications, flip or flip-like transformations are commonly observed in images due to artificial flipping, opposite capturing viewpoint, or symmetric patterns of objects. This paper proposes a new descriptor, named flip-invariant SIFT (or F-SIFT), that preserves the original properties of SIFT while being tolerant to flips. F-SIFT starts by estimating the dominant curl of a local patch and then geometrically normalizes the patch by flipping before the computation of SIFT. We demonstrate the power of F-SIFT on three tasks: large-scale video copy detection, object recognition, and detection. In copy detection, a framework, which smartly indices the flip properties of F-SIFT for rapid filtering and weak geometric checking, is proposed. F-SIFT not only significantly improves the detection accuracy of SIFT, but also leads to a more than 50% savings in computational cost. In object recognition, we demonstrate the superiority of F-SIFT in dealing with flip transformation by comparing it to seven other descriptors. In object detection, we further show the ability of F-SIFT in describing symmetric objects. Consistent improvement across different kinds of keypoint detectors is observed for F-SIFT over the original SIFT. |
format |
text |
author |
ZHAO, Wan-Lei NGO, Chong-wah |
author_facet |
ZHAO, Wan-Lei NGO, Chong-wah |
author_sort |
ZHAO, Wan-Lei |
title |
Flip-invariant SIFT for copy and object detection |
title_short |
Flip-invariant SIFT for copy and object detection |
title_full |
Flip-invariant SIFT for copy and object detection |
title_fullStr |
Flip-invariant SIFT for copy and object detection |
title_full_unstemmed |
Flip-invariant SIFT for copy and object detection |
title_sort |
flip-invariant sift for copy and object detection |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2013 |
url |
https://ink.library.smu.edu.sg/sis_research/6323 https://ink.library.smu.edu.sg/context/sis_research/article/7326/viewcontent/tip13_zhao.pdf |
_version_ |
1770575934275977216 |