Topics in spectral analysis of large sample covariance matrices
This thesis addresses two topics concerning spectral properties of sample covariance matrices when the data dimensionality M scales proportionally with the sample size N. In the first part, we consider the left and right singular vectors u_i and v_i of an M×N data matrix Y=Σ^(1/2) X. We establish th...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis-Doctor of Philosophy |
Language: | English |
Published: |
Nanyang Technological University
2024
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/181489 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-181489 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1814892024-12-09T15:37:32Z Topics in spectral analysis of large sample covariance matrices Lin, Zeqin Pan Guangming School of Physical and Mathematical Sciences GMPAN@ntu.edu.sg Mathematical Sciences Random matrix theory High dimensional statistics Sample covariance matrices This thesis addresses two topics concerning spectral properties of sample covariance matrices when the data dimensionality M scales proportionally with the sample size N. In the first part, we consider the left and right singular vectors u_i and v_i of an M×N data matrix Y=Σ^(1/2) X. We establish the convergence in probability of the singular vector overlaps ⟨u_i,D_1 u_j⟩, ⟨v_i,D_2 v_j⟩ and ⟨u_i,D_3 v_j⟩ towards their deterministic counterparts, where the D_k's are general deterministic matrices with bounded operator norms. Building on these findings, we offer a more precise characterization of the loss associated with Ledoit and Wolf's nonlinear shrinkage estimators. The second part examines large signal-plus-noise data matrices of the form S+Σ^(1/2) X, where S is an M×N low-rank deterministic signal matrix and Σ^(1/2) X represents the noise matrix. Under general assumptions concerning the structure of (S,Σ) and the distribution of X, we establish the asymptotic joint distribution of the spiked singular values of the model when the signals are supercritical. It turns out that the asymptotic distributions exhibit nonuniversality in the sense of dependence on the distributions of X. As a corollary, we obtain the asymptotic distribution of the spiked eigenvalues associated with mixture models. Doctor of Philosophy 2024-12-05T01:55:06Z 2024-12-05T01:55:06Z 2024 Thesis-Doctor of Philosophy Lin, Z. (2024). Topics in spectral analysis of large sample covariance matrices. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/181489 https://hdl.handle.net/10356/181489 en This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Mathematical Sciences Random matrix theory High dimensional statistics Sample covariance matrices |
spellingShingle |
Mathematical Sciences Random matrix theory High dimensional statistics Sample covariance matrices Lin, Zeqin Topics in spectral analysis of large sample covariance matrices |
description |
This thesis addresses two topics concerning spectral properties of sample covariance matrices when the data dimensionality M scales proportionally with the sample size N. In the first part, we consider the left and right singular vectors u_i and v_i of an M×N data matrix Y=Σ^(1/2) X. We establish the convergence in probability of the singular vector overlaps ⟨u_i,D_1 u_j⟩, ⟨v_i,D_2 v_j⟩ and ⟨u_i,D_3 v_j⟩ towards their deterministic counterparts, where the D_k's are general deterministic matrices with bounded operator norms. Building on these findings, we offer a more precise characterization of the loss associated with Ledoit and Wolf's nonlinear shrinkage estimators. The second part examines large signal-plus-noise data matrices of the form S+Σ^(1/2) X, where S is an M×N low-rank deterministic signal matrix and Σ^(1/2) X represents the noise matrix. Under general assumptions concerning the structure of (S,Σ) and the distribution of X, we establish the asymptotic joint distribution of the spiked singular values of the model when the signals are supercritical. It turns out that the asymptotic distributions exhibit nonuniversality in the sense of dependence on the distributions of X. As a corollary, we obtain the asymptotic distribution of the spiked eigenvalues associated with mixture models. |
author2 |
Pan Guangming |
author_facet |
Pan Guangming Lin, Zeqin |
format |
Thesis-Doctor of Philosophy |
author |
Lin, Zeqin |
author_sort |
Lin, Zeqin |
title |
Topics in spectral analysis of large sample covariance matrices |
title_short |
Topics in spectral analysis of large sample covariance matrices |
title_full |
Topics in spectral analysis of large sample covariance matrices |
title_fullStr |
Topics in spectral analysis of large sample covariance matrices |
title_full_unstemmed |
Topics in spectral analysis of large sample covariance matrices |
title_sort |
topics in spectral analysis of large sample covariance matrices |
publisher |
Nanyang Technological University |
publishDate |
2024 |
url |
https://hdl.handle.net/10356/181489 |
_version_ |
1819112942673592320 |