Distribution-Based Similarity Measures for Multi-Dimensional Point Set Retrieval Applications

Bibliographic Details
Main Authors: SHAO, Jie; HUANG, Zi; SHEN, Heng Tao; SHEN, Jialie; ZHOU, Xiaofang
Format: text
Language: English
Published: Institutional Knowledge at Singapore Management University 2008
Subjects: Databases and Information Systems; Numerical Analysis and Scientific Computing
Online Access: https://ink.library.smu.edu.sg/sis_research/574
http://dx.doi.org/10.1145/1459359.1459417
Institution: Singapore Management University
id sg-smu-ink.sis_research-1573
record_format dspace
spelling sg-smu-ink.sis_research-1573 | 2010-09-24T08:24:04Z | 2008-10-27T07:00:00Z | info:doi/10.1145/1459359.1459417 | Research Collection School Of Computing and Information Systems | eng
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Databases and Information Systems
Numerical Analysis and Scientific Computing
description Effective and efficient similarity assessment remains one of the most fundamental problems in multimedia data analysis. When retrieving relevant items from a collection of objects that are each represented by a series of multivariate observations (e.g., searching a repository for video clips similar to a query example), satisfactory performance cannot be expected from many conventional similarity measures, which aggregate pairwise comparisons between elements. Correlation information among the individual elements has also been used to characterize each set of multi-dimensional points for ranked retrieval, but this relies on the unwarranted assumption that the underlying data distribution has a particular parametric form. Motivated by this observation, this paper introduces a novel collective gauge of relevance ranking that evaluates the probability that a point set is consistent with the same distribution as the query. Two non-parametric hypothesis tests from statistics are justified as a way to exploit the distributional discrepancy between samples when assessing the similarity of two ensembles of points. While our methodology is mainly presented in the context of video similarity search, it is flexible and can be easily adapted to other applications in which each object is represented by a generic multi-dimensional point set, such as human gesture recognition.
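The idea in the description, ranking point sets by how plausibly they share the query's distribution, can be illustrated with a small sketch. The record does not name the two non-parametric tests the paper uses, so the code below substitutes a permutation test on the energy distance as a stand-in; it is not the authors' method. The function names (`energy_statistic`, `permutation_p_value`, `rank_by_p_value`) and the parameters `n_perm` and `seed` are hypothetical and chosen only for illustration.

```python
import math
import random


def energy_statistic(xs, ys):
    """Energy distance between two point sets (assumed stand-in statistic).

    It is 0 when both sets are identical and grows as the samples diverge.
    """
    def mean_dist(a, b):
        # Average Euclidean distance over all pairs (p in a, q in b).
        return sum(math.dist(p, q) for p in a for q in b) / (len(a) * len(b))

    return 2 * mean_dist(xs, ys) - mean_dist(xs, xs) - mean_dist(ys, ys)


def permutation_p_value(xs, ys, n_perm=200, seed=0):
    """Estimate the p-value of the null hypothesis 'same distribution'.

    Labels are reshuffled n_perm times; the p-value is the share of
    reshuffles whose statistic is at least as large as the observed one.
    """
    rng = random.Random(seed)
    observed = energy_statistic(xs, ys)
    pooled = list(xs) + list(ys)
    n = len(xs)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)
        if energy_statistic(pooled[:n], pooled[n:]) >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly positive.
    return (hits + 1) / (n_perm + 1)


def rank_by_p_value(query, candidates, n_perm=200, seed=0):
    """Rank candidate point sets: a higher p-value means more query-like."""
    scored = [
        (name, permutation_p_value(query, points, n_perm, seed))
        for name, points in candidates.items()
    ]
    return sorted(scored, key=lambda item: item[1], reverse=True)
```

Calling `rank_by_p_value(query, {"clip_a": points_a, "clip_b": points_b})` returns `(name, p_value)` pairs in descending order of p-value, where each point set is a list of equal-length coordinate tuples (for example, per-frame feature vectors of a video clip). The permutation approach makes no parametric assumption about the data, but its cost grows quadratically with the number of points per set, so it is meant only as a small-scale illustration.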