Establishing edge labels for benchmarking gene co-expression networks of saccharomyces cerevisiae and homo sapiens

The reliance on one type of correlation to construct Gene Coexpression Networks (GCNs) using ensemble approaches hinders a comprehensive understanding of gene co-expressions. To address this, Lim et al. (unpublished) propose a Two-Tier Ensemble Aggregation (TEA) GCN, combining various GCNs genera...

全面介紹

Saved in:
書目詳細資料
主要作者: Antony Velankanni Jenet Princy
其他作者: Marek Mutwil
格式: Final Year Project
語言:English
出版: Nanyang Technological University 2024
主題:
在線閱讀:https://hdl.handle.net/10356/176362
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!
id sg-ntu-dr.10356-176362
record_format dspace
spelling sg-ntu-dr.10356-1763622024-05-20T15:33:23Z Establishing edge labels for benchmarking gene co-expression networks of saccharomyces cerevisiae and homo sapiens Antony Velankanni Jenet Princy Marek Mutwil School of Biological Sciences Lim Peng Ken mutwil@ntu.edu.sg Medicine, Health and Life Sciences The reliance on one type of correlation to construct Gene Coexpression Networks (GCNs) using ensemble approaches hinders a comprehensive understanding of gene co-expressions. To address this, Lim et al. (unpublished) propose a Two-Tier Ensemble Aggregation (TEA) GCN, combining various GCNs generated using different correlation coefficients and different dataset partitions for Arabidopsis thaliana. This project extends the evaluation of TEA-GCN to Saccharomyces cerevisiae and Homo sapiens through edge labelling and benchmarking TEA-GCN against other state-of-the-art ensemble methodologies. To this end, we used gene information from Metabolic pathways, Gene Ontology, and Transcription factors by utilising gene lists from public databases to generate labelled edges. These edges were further refined to obtain the best evaluation data for network performance assessment using Receiver Operating Characteristic and Precision-Recall curves. We show that TEA-GCN outperformed the state-of-the-art GCNs for both the species, in addition to Arabidopsis thaliana. This has demonstrated that the robustness of TEA-GCN’s methodology is generalisable across the diverse transcriptional programs underpinning not just fungal, plant, and mammalian species, but also single-cellular and multicellular organisms. Furthermore, this endeavour has resulted in first-ever comprehensive benchmarking data for Homo sapiens and Saccharomyces cerevisiae GCNs which will be instrumental to the development of more advanced GCN methods. Bachelor's degree 2024-05-16T01:11:01Z 2024-05-16T01:11:01Z 2024 Final Year Project (FYP) Antony Velankanni Jenet Princy (2024). Establishing edge labels for benchmarking gene co-expression networks of saccharomyces cerevisiae and homo sapiens. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/176362 https://hdl.handle.net/10356/176362 en application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Medicine, Health and Life Sciences
spellingShingle Medicine, Health and Life Sciences
Antony Velankanni Jenet Princy
Establishing edge labels for benchmarking gene co-expression networks of saccharomyces cerevisiae and homo sapiens
description The reliance on one type of correlation to construct Gene Coexpression Networks (GCNs) using ensemble approaches hinders a comprehensive understanding of gene co-expressions. To address this, Lim et al. (unpublished) propose a Two-Tier Ensemble Aggregation (TEA) GCN, combining various GCNs generated using different correlation coefficients and different dataset partitions for Arabidopsis thaliana. This project extends the evaluation of TEA-GCN to Saccharomyces cerevisiae and Homo sapiens through edge labelling and benchmarking TEA-GCN against other state-of-the-art ensemble methodologies. To this end, we used gene information from Metabolic pathways, Gene Ontology, and Transcription factors by utilising gene lists from public databases to generate labelled edges. These edges were further refined to obtain the best evaluation data for network performance assessment using Receiver Operating Characteristic and Precision-Recall curves. We show that TEA-GCN outperformed the state-of-the-art GCNs for both the species, in addition to Arabidopsis thaliana. This has demonstrated that the robustness of TEA-GCN’s methodology is generalisable across the diverse transcriptional programs underpinning not just fungal, plant, and mammalian species, but also single-cellular and multicellular organisms. Furthermore, this endeavour has resulted in first-ever comprehensive benchmarking data for Homo sapiens and Saccharomyces cerevisiae GCNs which will be instrumental to the development of more advanced GCN methods.
author2 Marek Mutwil
author_facet Marek Mutwil
Antony Velankanni Jenet Princy
format Final Year Project
author Antony Velankanni Jenet Princy
author_sort Antony Velankanni Jenet Princy
title Establishing edge labels for benchmarking gene co-expression networks of saccharomyces cerevisiae and homo sapiens
title_short Establishing edge labels for benchmarking gene co-expression networks of saccharomyces cerevisiae and homo sapiens
title_full Establishing edge labels for benchmarking gene co-expression networks of saccharomyces cerevisiae and homo sapiens
title_fullStr Establishing edge labels for benchmarking gene co-expression networks of saccharomyces cerevisiae and homo sapiens
title_full_unstemmed Establishing edge labels for benchmarking gene co-expression networks of saccharomyces cerevisiae and homo sapiens
title_sort establishing edge labels for benchmarking gene co-expression networks of saccharomyces cerevisiae and homo sapiens
publisher Nanyang Technological University
publishDate 2024
url https://hdl.handle.net/10356/176362
_version_ 1814047278962311168