Patch relational covariance distance similarity approach for image ranking in content-based image retrieval

© 2020 ACM. Content-Based Image Retrieval (CBIR) is an information retrieval framework for retrieving similar images based on objects in the images. Machine learning based CBIR consists of object detection, the majority of which rely on Convolutional Neural Network (CNN) as object detector, and imag...

Full description

Saved in:
Bibliographic Details
Main Authors: Piyavach Khunsongkiet, Jakramate Bootkrajang, Churee Techawut
Format: Conference Proceeding
Published: 2020
Subjects:
Online Access:https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85090913472&origin=inward
http://cmuir.cmu.ac.th/jspui/handle/6653943832/70419
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Chiang Mai University
Description
Summary:© 2020 ACM. Content-Based Image Retrieval (CBIR) is an information retrieval framework for retrieving similar images based on objects in the images. Machine learning based CBIR consists of object detection, the majority of which rely on Convolutional Neural Network (CNN) as object detector, and image similarity ranking. However, object detection with CNN requires expensive retraining when new set of the images is added to the database, while current ranking techniques focus on visual characteristics without considering object's spatial information. In this work, we propose a new CBIR framework to alleviate the aforementioned problems. We employ the Hierarchical Deep Convolutional Neural Network (HD-CNN) for single object detection. HD-CNN has been shown to be more efficient in model retraining on partitions of large dataset. In addition, a new similarity measure based on the covariance descriptor called Patch Relational Covariance Distance Similarity (PRCDS) is proposed. PRCDS summarizes the low-level visual features as well as object's spatial information (patch arrangement descriptor) to rank the candidate images from the HD-CNN. Finally, the proposed framework was validated on a subset of ImageNet dataset, and the experimental results showed that the ranking based on the newly proposed similarity measure is consistent with human perception.