Establishing edge labels for benchmarking gene co-expression networks of saccharomyces cerevisiae and homo sapiens

The reliance on one type of correlation to construct Gene Coexpression Networks (GCNs) using ensemble approaches hinders a comprehensive understanding of gene co-expressions. To address this, Lim et al. (unpublished) propose a Two-Tier Ensemble Aggregation (TEA) GCN, combining various GCNs genera...

Full description

Saved in:
Bibliographic Details
Main Author: Antony Velankanni Jenet Princy
Other Authors: Marek Mutwil
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2024
Subjects:
Online Access:https://hdl.handle.net/10356/176362
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-176362
record_format dspace
spelling sg-ntu-dr.10356-1763622024-05-20T15:33:23Z Establishing edge labels for benchmarking gene co-expression networks of saccharomyces cerevisiae and homo sapiens Antony Velankanni Jenet Princy Marek Mutwil School of Biological Sciences Lim Peng Ken mutwil@ntu.edu.sg Medicine, Health and Life Sciences The reliance on one type of correlation to construct Gene Coexpression Networks (GCNs) using ensemble approaches hinders a comprehensive understanding of gene co-expressions. To address this, Lim et al. (unpublished) propose a Two-Tier Ensemble Aggregation (TEA) GCN, combining various GCNs generated using different correlation coefficients and different dataset partitions for Arabidopsis thaliana. This project extends the evaluation of TEA-GCN to Saccharomyces cerevisiae and Homo sapiens through edge labelling and benchmarking TEA-GCN against other state-of-the-art ensemble methodologies. To this end, we used gene information from Metabolic pathways, Gene Ontology, and Transcription factors by utilising gene lists from public databases to generate labelled edges. These edges were further refined to obtain the best evaluation data for network performance assessment using Receiver Operating Characteristic and Precision-Recall curves. We show that TEA-GCN outperformed the state-of-the-art GCNs for both the species, in addition to Arabidopsis thaliana. This has demonstrated that the robustness of TEA-GCN’s methodology is generalisable across the diverse transcriptional programs underpinning not just fungal, plant, and mammalian species, but also single-cellular and multicellular organisms. Furthermore, this endeavour has resulted in first-ever comprehensive benchmarking data for Homo sapiens and Saccharomyces cerevisiae GCNs which will be instrumental to the development of more advanced GCN methods. Bachelor's degree 2024-05-16T01:11:01Z 2024-05-16T01:11:01Z 2024 Final Year Project (FYP) Antony Velankanni Jenet Princy (2024). Establishing edge labels for benchmarking gene co-expression networks of saccharomyces cerevisiae and homo sapiens. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/176362 https://hdl.handle.net/10356/176362 en application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Medicine, Health and Life Sciences
spellingShingle Medicine, Health and Life Sciences
Antony Velankanni Jenet Princy
Establishing edge labels for benchmarking gene co-expression networks of saccharomyces cerevisiae and homo sapiens
description The reliance on one type of correlation to construct Gene Coexpression Networks (GCNs) using ensemble approaches hinders a comprehensive understanding of gene co-expressions. To address this, Lim et al. (unpublished) propose a Two-Tier Ensemble Aggregation (TEA) GCN, combining various GCNs generated using different correlation coefficients and different dataset partitions for Arabidopsis thaliana. This project extends the evaluation of TEA-GCN to Saccharomyces cerevisiae and Homo sapiens through edge labelling and benchmarking TEA-GCN against other state-of-the-art ensemble methodologies. To this end, we used gene information from Metabolic pathways, Gene Ontology, and Transcription factors by utilising gene lists from public databases to generate labelled edges. These edges were further refined to obtain the best evaluation data for network performance assessment using Receiver Operating Characteristic and Precision-Recall curves. We show that TEA-GCN outperformed the state-of-the-art GCNs for both the species, in addition to Arabidopsis thaliana. This has demonstrated that the robustness of TEA-GCN’s methodology is generalisable across the diverse transcriptional programs underpinning not just fungal, plant, and mammalian species, but also single-cellular and multicellular organisms. Furthermore, this endeavour has resulted in first-ever comprehensive benchmarking data for Homo sapiens and Saccharomyces cerevisiae GCNs which will be instrumental to the development of more advanced GCN methods.
author2 Marek Mutwil
author_facet Marek Mutwil
Antony Velankanni Jenet Princy
format Final Year Project
author Antony Velankanni Jenet Princy
author_sort Antony Velankanni Jenet Princy
title Establishing edge labels for benchmarking gene co-expression networks of saccharomyces cerevisiae and homo sapiens
title_short Establishing edge labels for benchmarking gene co-expression networks of saccharomyces cerevisiae and homo sapiens
title_full Establishing edge labels for benchmarking gene co-expression networks of saccharomyces cerevisiae and homo sapiens
title_fullStr Establishing edge labels for benchmarking gene co-expression networks of saccharomyces cerevisiae and homo sapiens
title_full_unstemmed Establishing edge labels for benchmarking gene co-expression networks of saccharomyces cerevisiae and homo sapiens
title_sort establishing edge labels for benchmarking gene co-expression networks of saccharomyces cerevisiae and homo sapiens
publisher Nanyang Technological University
publishDate 2024
url https://hdl.handle.net/10356/176362
_version_ 1814047278962311168