Random Matrix Analysis of Protein Families

Proteins are vital for almost all biochemical and cellular processes. Although there is an enormous growth in the protein sequence data, the statistical characterization, structure and function of many of these sequences are still unknown. The statistical and spectral analysis of the Pearson correla...

Full description

Saved in:
Bibliographic Details
Main Authors: Rakhi Kumari, Pradeep Bhadola, Nivedita Deo
Other Authors: University of Delhi
Format: Conference or Workshop Item
Published: 2022
Subjects:
Online Access:https://repository.li.mahidol.ac.th/handle/123456789/73926
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Mahidol University
id th-mahidol.73926
record_format dspace
spelling th-mahidol.739262022-08-04T11:00:47Z Random Matrix Analysis of Protein Families Rakhi Kumari Pradeep Bhadola Nivedita Deo University of Delhi Mahidol University Engineering Proteins are vital for almost all biochemical and cellular processes. Although there is an enormous growth in the protein sequence data, the statistical characterization, structure and function of many of these sequences are still unknown. The statistical and spectral analysis of the Pearson correlation matrices between positions based on physiochemical properties of amino acids of seven protein families is performed and compared with the random Wishart matrix model results. A detailed analysis shows that the protein families significantly diverge from the Marchenko-Pastur distribution with many eigenvalues (outliers) outside the Wishart lower and upper bound. It is shown that level spacing distribution of protein families is similar to the Gaussian orthogonal ensemble. Further, the number variance varies as log of the system size indicating the presence of long range correlations within the protein families. 2022-08-04T04:00:47Z 2022-08-04T04:00:47Z 2022-01-01 Conference Paper ECS Transactions. Vol.107, No.1 (2022), 18877-18891 10.1149/10701.18877ecst 19385862 19386737 2-s2.0-85133370159 https://repository.li.mahidol.ac.th/handle/123456789/73926 Mahidol University SCOPUS https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85133370159&origin=inward
institution Mahidol University
building Mahidol University Library
continent Asia
country Thailand
Thailand
content_provider Mahidol University Library
collection Mahidol University Institutional Repository
topic Engineering
spellingShingle Engineering
Rakhi Kumari
Pradeep Bhadola
Nivedita Deo
Random Matrix Analysis of Protein Families
description Proteins are vital for almost all biochemical and cellular processes. Although there is an enormous growth in the protein sequence data, the statistical characterization, structure and function of many of these sequences are still unknown. The statistical and spectral analysis of the Pearson correlation matrices between positions based on physiochemical properties of amino acids of seven protein families is performed and compared with the random Wishart matrix model results. A detailed analysis shows that the protein families significantly diverge from the Marchenko-Pastur distribution with many eigenvalues (outliers) outside the Wishart lower and upper bound. It is shown that level spacing distribution of protein families is similar to the Gaussian orthogonal ensemble. Further, the number variance varies as log of the system size indicating the presence of long range correlations within the protein families.
author2 University of Delhi
author_facet University of Delhi
Rakhi Kumari
Pradeep Bhadola
Nivedita Deo
format Conference or Workshop Item
author Rakhi Kumari
Pradeep Bhadola
Nivedita Deo
author_sort Rakhi Kumari
title Random Matrix Analysis of Protein Families
title_short Random Matrix Analysis of Protein Families
title_full Random Matrix Analysis of Protein Families
title_fullStr Random Matrix Analysis of Protein Families
title_full_unstemmed Random Matrix Analysis of Protein Families
title_sort random matrix analysis of protein families
publishDate 2022
url https://repository.li.mahidol.ac.th/handle/123456789/73926
_version_ 1763487254234267648