Random Matrix Analysis of Protein Families
Proteins are vital for almost all biochemical and cellular processes. Although there is an enormous growth in the protein sequence data, the statistical characterization, structure and function of many of these sequences are still unknown. The statistical and spectral analysis of the Pearson correla...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Conference or Workshop Item |
Published: |
2023
|
Subjects: | |
Online Access: | https://repository.li.mahidol.ac.th/handle/123456789/84633 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Mahidol University |
id |
th-mahidol.84633 |
---|---|
record_format |
dspace |
spelling |
th-mahidol.846332023-06-19T00:12:45Z Random Matrix Analysis of Protein Families Kumari R. Mahidol University Engineering Proteins are vital for almost all biochemical and cellular processes. Although there is an enormous growth in the protein sequence data, the statistical characterization, structure and function of many of these sequences are still unknown. The statistical and spectral analysis of the Pearson correlation matrices between positions based on physiochemical properties of amino acids of seven protein families is performed and compared with the random Wishart matrix model results. A detailed analysis shows that the protein families significantly diverge from the Marchenko-Pastur distribution with many eigenvalues (outliers) outside the Wishart lower and upper bound. It is shown that level spacing distribution of protein families is similar to the Gaussian orthogonal ensemble. Further, the number variance varies as log of the system size indicating the presence of long range correlations within the protein families. 2023-06-18T17:12:45Z 2023-06-18T17:12:45Z 2022-01-01 Conference Paper ECS Transactions Vol.107 No.1 (2022) , 18877-18891 10.1149/10701.18877ecst 19385862 19386737 2-s2.0-85133370159 https://repository.li.mahidol.ac.th/handle/123456789/84633 SCOPUS |
institution |
Mahidol University |
building |
Mahidol University Library |
continent |
Asia |
country |
Thailand Thailand |
content_provider |
Mahidol University Library |
collection |
Mahidol University Institutional Repository |
topic |
Engineering |
spellingShingle |
Engineering Kumari R. Random Matrix Analysis of Protein Families |
description |
Proteins are vital for almost all biochemical and cellular processes. Although there is an enormous growth in the protein sequence data, the statistical characterization, structure and function of many of these sequences are still unknown. The statistical and spectral analysis of the Pearson correlation matrices between positions based on physiochemical properties of amino acids of seven protein families is performed and compared with the random Wishart matrix model results. A detailed analysis shows that the protein families significantly diverge from the Marchenko-Pastur distribution with many eigenvalues (outliers) outside the Wishart lower and upper bound. It is shown that level spacing distribution of protein families is similar to the Gaussian orthogonal ensemble. Further, the number variance varies as log of the system size indicating the presence of long range correlations within the protein families. |
author2 |
Mahidol University |
author_facet |
Mahidol University Kumari R. |
format |
Conference or Workshop Item |
author |
Kumari R. |
author_sort |
Kumari R. |
title |
Random Matrix Analysis of Protein Families |
title_short |
Random Matrix Analysis of Protein Families |
title_full |
Random Matrix Analysis of Protein Families |
title_fullStr |
Random Matrix Analysis of Protein Families |
title_full_unstemmed |
Random Matrix Analysis of Protein Families |
title_sort |
random matrix analysis of protein families |
publishDate |
2023 |
url |
https://repository.li.mahidol.ac.th/handle/123456789/84633 |
_version_ |
1781416593953128448 |