Geometry of sample spaces

In statistics, independent, identically distributed random samples do not carry a natural ordering, and their statistics are typically invariant with respect to permutations of their order. Thus, an n-sample in a space M can be considered as an element of the quotient space of Mn modulo the permutat...

Full description

Saved in:
Bibliographic Details
Main Authors: Harms, Philipp, Michor, Peter W., Pennec, Xavier, Sommer, Stefan
Other Authors: School of Physical and Mathematical Sciences
Format: Article
Language:English
Published: 2023
Subjects:
Online Access:https://hdl.handle.net/10356/172874
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-172874
record_format dspace
spelling sg-ntu-dr.10356-1728742024-01-01T15:34:53Z Geometry of sample spaces Harms, Philipp Michor, Peter W. Pennec, Xavier Sommer, Stefan School of Physical and Mathematical Sciences Science::Mathematics Statistics on Metric Spaces Geometric Statistics In statistics, independent, identically distributed random samples do not carry a natural ordering, and their statistics are typically invariant with respect to permutations of their order. Thus, an n-sample in a space M can be considered as an element of the quotient space of Mn modulo the permutation group. The present paper takes this definition of sample space and the related concept of orbit types as a starting point for developing a geometric perspective on statistics. We aim at deriving a general mathematical setting for studying the behavior of empirical and population means in spaces ranging from smooth Riemannian manifolds to general stratified spaces. We fully describe the orbifold and path-metric structure of the sample space when M is a manifold or path-metric space, respectively. These results are non-trivial even when M is Euclidean. We show that the infinite sample space exists in a Gromov–Hausdorff type sense and coincides with the Wasserstein space of probability distributions on M. We exhibit Fréchet means and k-means as metric projections onto 1-skeleta or k-skeleta in Wasserstein space, and we define a new and more general notion of polymeans. This geometric characterization via metric projections applies equally to sample and population means, and we use it to establish asymptotic properties of polymeans such as consistency and asymptotic normality. Nanyang Technological University National Research Foundation (NRF) Published version The authors would like to thank François-Xavier Vialard for helpful discussions. P. Harms was funded by the National Research Foundation Singapore under the award NRF-NRFF13-2021-0012 and by Nanyang Technological University Singapore under the award NAP-SUG. X. Pennec was funded by the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program (grant agreement Nr. 786854 G-Statistics). He was also supported by the French government through the 3IA Côte d’Azur Investments ANR-19-P3IA-0002 managed by the French National Research Agency (ANR). S. Sommer is supported by the Villum Foundation Grants 40582 and the Novo Nordisk Foundation grant NNF18OC0052000. 2023-12-27T06:06:00Z 2023-12-27T06:06:00Z 2023 Journal Article Harms, P., Michor, P. W., Pennec, X. & Sommer, S. (2023). Geometry of sample spaces. Differential Geometry and Its Applications, 90, 102029-. https://dx.doi.org/10.1016/j.difgeo.2023.102029 0926-2245 https://hdl.handle.net/10356/172874 10.1016/j.difgeo.2023.102029 2-s2.0-85162109125 90 102029 en NRF-NRFF13-2021-0012 NAP-SUG Differential Geometry and its Applications © 2023 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Science::Mathematics
Statistics on Metric Spaces
Geometric Statistics
spellingShingle Science::Mathematics
Statistics on Metric Spaces
Geometric Statistics
Harms, Philipp
Michor, Peter W.
Pennec, Xavier
Sommer, Stefan
Geometry of sample spaces
description In statistics, independent, identically distributed random samples do not carry a natural ordering, and their statistics are typically invariant with respect to permutations of their order. Thus, an n-sample in a space M can be considered as an element of the quotient space of Mn modulo the permutation group. The present paper takes this definition of sample space and the related concept of orbit types as a starting point for developing a geometric perspective on statistics. We aim at deriving a general mathematical setting for studying the behavior of empirical and population means in spaces ranging from smooth Riemannian manifolds to general stratified spaces. We fully describe the orbifold and path-metric structure of the sample space when M is a manifold or path-metric space, respectively. These results are non-trivial even when M is Euclidean. We show that the infinite sample space exists in a Gromov–Hausdorff type sense and coincides with the Wasserstein space of probability distributions on M. We exhibit Fréchet means and k-means as metric projections onto 1-skeleta or k-skeleta in Wasserstein space, and we define a new and more general notion of polymeans. This geometric characterization via metric projections applies equally to sample and population means, and we use it to establish asymptotic properties of polymeans such as consistency and asymptotic normality.
author2 School of Physical and Mathematical Sciences
author_facet School of Physical and Mathematical Sciences
Harms, Philipp
Michor, Peter W.
Pennec, Xavier
Sommer, Stefan
format Article
author Harms, Philipp
Michor, Peter W.
Pennec, Xavier
Sommer, Stefan
author_sort Harms, Philipp
title Geometry of sample spaces
title_short Geometry of sample spaces
title_full Geometry of sample spaces
title_fullStr Geometry of sample spaces
title_full_unstemmed Geometry of sample spaces
title_sort geometry of sample spaces
publishDate 2023
url https://hdl.handle.net/10356/172874
_version_ 1787153692162523136