Topics in spectral analysis of large sample covariance matrices

This thesis addresses two topics concerning spectral properties of sample covariance matrices when the data dimensionality M scales proportionally with the sample size N. In the first part, we consider the left and right singular vectors u_i and v_i of an M×N data matrix Y=Σ^(1/2) X. We establish th...

Full description

Saved in:
Bibliographic Details
Main Author: Lin, Zeqin
Other Authors: Pan Guangming
Format: Thesis-Doctor of Philosophy
Language:English
Published: Nanyang Technological University 2024
Subjects:
Online Access:https://hdl.handle.net/10356/181489
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-181489
record_format dspace
spelling sg-ntu-dr.10356-1814892024-12-09T15:37:32Z Topics in spectral analysis of large sample covariance matrices Lin, Zeqin Pan Guangming School of Physical and Mathematical Sciences GMPAN@ntu.edu.sg Mathematical Sciences Random matrix theory High dimensional statistics Sample covariance matrices This thesis addresses two topics concerning spectral properties of sample covariance matrices when the data dimensionality M scales proportionally with the sample size N. In the first part, we consider the left and right singular vectors u_i and v_i of an M×N data matrix Y=Σ^(1/2) X. We establish the convergence in probability of the singular vector overlaps ⟨u_i,D_1 u_j⟩, ⟨v_i,D_2 v_j⟩ and ⟨u_i,D_3 v_j⟩ towards their deterministic counterparts, where the D_k's are general deterministic matrices with bounded operator norms. Building on these findings, we offer a more precise characterization of the loss associated with Ledoit and Wolf's nonlinear shrinkage estimators. The second part examines large signal-plus-noise data matrices of the form S+Σ^(1/2) X, where S is an M×N low-rank deterministic signal matrix and Σ^(1/2) X represents the noise matrix. Under general assumptions concerning the structure of (S,Σ) and the distribution of X, we establish the asymptotic joint distribution of the spiked singular values of the model when the signals are supercritical. It turns out that the asymptotic distributions exhibit nonuniversality in the sense of dependence on the distributions of X. As a corollary, we obtain the asymptotic distribution of the spiked eigenvalues associated with mixture models. Doctor of Philosophy 2024-12-05T01:55:06Z 2024-12-05T01:55:06Z 2024 Thesis-Doctor of Philosophy Lin, Z. (2024). Topics in spectral analysis of large sample covariance matrices. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/181489 https://hdl.handle.net/10356/181489 en This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Mathematical Sciences
Random matrix theory
High dimensional statistics
Sample covariance matrices
spellingShingle Mathematical Sciences
Random matrix theory
High dimensional statistics
Sample covariance matrices
Lin, Zeqin
Topics in spectral analysis of large sample covariance matrices
description This thesis addresses two topics concerning spectral properties of sample covariance matrices when the data dimensionality M scales proportionally with the sample size N. In the first part, we consider the left and right singular vectors u_i and v_i of an M×N data matrix Y=Σ^(1/2) X. We establish the convergence in probability of the singular vector overlaps ⟨u_i,D_1 u_j⟩, ⟨v_i,D_2 v_j⟩ and ⟨u_i,D_3 v_j⟩ towards their deterministic counterparts, where the D_k's are general deterministic matrices with bounded operator norms. Building on these findings, we offer a more precise characterization of the loss associated with Ledoit and Wolf's nonlinear shrinkage estimators. The second part examines large signal-plus-noise data matrices of the form S+Σ^(1/2) X, where S is an M×N low-rank deterministic signal matrix and Σ^(1/2) X represents the noise matrix. Under general assumptions concerning the structure of (S,Σ) and the distribution of X, we establish the asymptotic joint distribution of the spiked singular values of the model when the signals are supercritical. It turns out that the asymptotic distributions exhibit nonuniversality in the sense of dependence on the distributions of X. As a corollary, we obtain the asymptotic distribution of the spiked eigenvalues associated with mixture models.
author2 Pan Guangming
author_facet Pan Guangming
Lin, Zeqin
format Thesis-Doctor of Philosophy
author Lin, Zeqin
author_sort Lin, Zeqin
title Topics in spectral analysis of large sample covariance matrices
title_short Topics in spectral analysis of large sample covariance matrices
title_full Topics in spectral analysis of large sample covariance matrices
title_fullStr Topics in spectral analysis of large sample covariance matrices
title_full_unstemmed Topics in spectral analysis of large sample covariance matrices
title_sort topics in spectral analysis of large sample covariance matrices
publisher Nanyang Technological University
publishDate 2024
url https://hdl.handle.net/10356/181489
_version_ 1819112942673592320