Visual product search in mobile business

Visual Search Technology allows users to retrieve information regarding visual objects. With the recent development of smartphones, this function can be performed on mobiles and is known as Mobile Visual Search (MVS). This report focuses on some of the image processing and pattern recognition techni...

Full description

Saved in:
Bibliographic Details
Main Author: Vijay, Dalmia Devanshu
Other Authors: Yap, Kim Hui
Format: Final Year Project
Language:English
Published: 2014
Subjects:
Online Access:http://hdl.handle.net/10356/61006
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-61006
record_format dspace
spelling sg-ntu-dr.10356-610062023-07-07T17:09:06Z Visual product search in mobile business Vijay, Dalmia Devanshu Yap, Kim Hui School of Electrical and Electronic Engineering DRNTU::Engineering Visual Search Technology allows users to retrieve information regarding visual objects. With the recent development of smartphones, this function can be performed on mobiles and is known as Mobile Visual Search (MVS). This report focuses on some of the image processing and pattern recognition techniques using the Mobile Visual Search. A mobile visual search application has already been developed on the Android platform using the client-server architecture. It uses the image processing techniques like Bag-of-Words model, Scale-invariant Feature Transform (SIFT) detector and descriptor, Inverted Index, Vocabulary Tree and Geometric Verification. One major part of this image recognition process is known as Keypoint Detection. The current application uses the SIFT (Difference of Gaussian) keypoint detector. This report seeks to evaluate some other keypoint detection techniques like Harris Affine and Hessian Affine. First, preliminary analysis is performed on the Harris Affine, Hessian Affine and the SIFT (DoG) detectors using the 48 image database provided by the Visual Geometry Group (VGG) at Oxford. These detectors are evaluated across five different image transformations which are viewpoint change, scale change, blur, light change and JPEG compression. Across all these transformations Hessian Affine is found to be the most optimal detector using the criteria which is explained in the chapter 4 of the report. Since the current Mobile Visual Search (MVS) application is in the C programming language, a C version of the Hessian Affine detector is found and is integrated in the current code and pipeline. The performance of this new MVS pipeline using the Hessian Affine detector is tested against the old pipeline which uses the SIFT (DoG) detector using the NTU Landmark Database. The percentage of images successfully recognized using the SIFT (DoG) detector is 84.36% whereas the percentage of images successfully recognized using the Hessian detector is only 81.60%. Contrary to the preliminary analysis, the SIFT (DoG) detector outperforms the Hessian Affine detector by 2.76%. Bachelor of Engineering 2014-06-04T02:08:34Z 2014-06-04T02:08:34Z 2014 2014 Final Year Project (FYP) http://hdl.handle.net/10356/61006 en Nanyang Technological University 72 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering
spellingShingle DRNTU::Engineering
Vijay, Dalmia Devanshu
Visual product search in mobile business
description Visual Search Technology allows users to retrieve information regarding visual objects. With the recent development of smartphones, this function can be performed on mobiles and is known as Mobile Visual Search (MVS). This report focuses on some of the image processing and pattern recognition techniques using the Mobile Visual Search. A mobile visual search application has already been developed on the Android platform using the client-server architecture. It uses the image processing techniques like Bag-of-Words model, Scale-invariant Feature Transform (SIFT) detector and descriptor, Inverted Index, Vocabulary Tree and Geometric Verification. One major part of this image recognition process is known as Keypoint Detection. The current application uses the SIFT (Difference of Gaussian) keypoint detector. This report seeks to evaluate some other keypoint detection techniques like Harris Affine and Hessian Affine. First, preliminary analysis is performed on the Harris Affine, Hessian Affine and the SIFT (DoG) detectors using the 48 image database provided by the Visual Geometry Group (VGG) at Oxford. These detectors are evaluated across five different image transformations which are viewpoint change, scale change, blur, light change and JPEG compression. Across all these transformations Hessian Affine is found to be the most optimal detector using the criteria which is explained in the chapter 4 of the report. Since the current Mobile Visual Search (MVS) application is in the C programming language, a C version of the Hessian Affine detector is found and is integrated in the current code and pipeline. The performance of this new MVS pipeline using the Hessian Affine detector is tested against the old pipeline which uses the SIFT (DoG) detector using the NTU Landmark Database. The percentage of images successfully recognized using the SIFT (DoG) detector is 84.36% whereas the percentage of images successfully recognized using the Hessian detector is only 81.60%. Contrary to the preliminary analysis, the SIFT (DoG) detector outperforms the Hessian Affine detector by 2.76%.
author2 Yap, Kim Hui
author_facet Yap, Kim Hui
Vijay, Dalmia Devanshu
format Final Year Project
author Vijay, Dalmia Devanshu
author_sort Vijay, Dalmia Devanshu
title Visual product search in mobile business
title_short Visual product search in mobile business
title_full Visual product search in mobile business
title_fullStr Visual product search in mobile business
title_full_unstemmed Visual product search in mobile business
title_sort visual product search in mobile business
publishDate 2014
url http://hdl.handle.net/10356/61006
_version_ 1772828522711089152