Establishing edge labels for benchmarking gene co-expression networks of saccharomyces cerevisiae and homo sapiens
The reliance on one type of correlation to construct Gene Coexpression Networks (GCNs) using ensemble approaches hinders a comprehensive understanding of gene co-expressions. To address this, Lim et al. (unpublished) propose a Two-Tier Ensemble Aggregation (TEA) GCN, combining various GCNs genera...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2024
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/176362 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-176362 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1763622024-05-20T15:33:23Z Establishing edge labels for benchmarking gene co-expression networks of saccharomyces cerevisiae and homo sapiens Antony Velankanni Jenet Princy Marek Mutwil School of Biological Sciences Lim Peng Ken mutwil@ntu.edu.sg Medicine, Health and Life Sciences The reliance on one type of correlation to construct Gene Coexpression Networks (GCNs) using ensemble approaches hinders a comprehensive understanding of gene co-expressions. To address this, Lim et al. (unpublished) propose a Two-Tier Ensemble Aggregation (TEA) GCN, combining various GCNs generated using different correlation coefficients and different dataset partitions for Arabidopsis thaliana. This project extends the evaluation of TEA-GCN to Saccharomyces cerevisiae and Homo sapiens through edge labelling and benchmarking TEA-GCN against other state-of-the-art ensemble methodologies. To this end, we used gene information from Metabolic pathways, Gene Ontology, and Transcription factors by utilising gene lists from public databases to generate labelled edges. These edges were further refined to obtain the best evaluation data for network performance assessment using Receiver Operating Characteristic and Precision-Recall curves. We show that TEA-GCN outperformed the state-of-the-art GCNs for both the species, in addition to Arabidopsis thaliana. This has demonstrated that the robustness of TEA-GCN’s methodology is generalisable across the diverse transcriptional programs underpinning not just fungal, plant, and mammalian species, but also single-cellular and multicellular organisms. Furthermore, this endeavour has resulted in first-ever comprehensive benchmarking data for Homo sapiens and Saccharomyces cerevisiae GCNs which will be instrumental to the development of more advanced GCN methods. Bachelor's degree 2024-05-16T01:11:01Z 2024-05-16T01:11:01Z 2024 Final Year Project (FYP) Antony Velankanni Jenet Princy (2024). Establishing edge labels for benchmarking gene co-expression networks of saccharomyces cerevisiae and homo sapiens. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/176362 https://hdl.handle.net/10356/176362 en application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Medicine, Health and Life Sciences |
spellingShingle |
Medicine, Health and Life Sciences Antony Velankanni Jenet Princy Establishing edge labels for benchmarking gene co-expression networks of saccharomyces cerevisiae and homo sapiens |
description |
The reliance on one type of correlation to construct Gene Coexpression Networks (GCNs)
using ensemble approaches hinders a comprehensive understanding of gene co-expressions.
To address this, Lim et al. (unpublished) propose a Two-Tier Ensemble Aggregation (TEA)
GCN, combining various GCNs generated using different correlation coefficients and
different dataset partitions for Arabidopsis thaliana. This project extends the evaluation of
TEA-GCN to Saccharomyces cerevisiae and Homo sapiens through edge labelling and
benchmarking TEA-GCN against other state-of-the-art ensemble methodologies. To this end,
we used gene information from Metabolic pathways, Gene Ontology, and Transcription
factors by utilising gene lists from public databases to generate labelled edges. These edges
were further refined to obtain the best evaluation data for network performance assessment
using Receiver Operating Characteristic and Precision-Recall curves. We show that
TEA-GCN outperformed the state-of-the-art GCNs for both the species, in addition to
Arabidopsis thaliana. This has demonstrated that the robustness of TEA-GCN’s methodology
is generalisable across the diverse transcriptional programs underpinning not just fungal,
plant, and mammalian species, but also single-cellular and multicellular organisms.
Furthermore, this endeavour has resulted in first-ever comprehensive benchmarking data for
Homo sapiens and Saccharomyces cerevisiae GCNs which will be instrumental to the
development of more advanced GCN methods. |
author2 |
Marek Mutwil |
author_facet |
Marek Mutwil Antony Velankanni Jenet Princy |
format |
Final Year Project |
author |
Antony Velankanni Jenet Princy |
author_sort |
Antony Velankanni Jenet Princy |
title |
Establishing edge labels for benchmarking gene co-expression networks of saccharomyces cerevisiae and homo sapiens |
title_short |
Establishing edge labels for benchmarking gene co-expression networks of saccharomyces cerevisiae and homo sapiens |
title_full |
Establishing edge labels for benchmarking gene co-expression networks of saccharomyces cerevisiae and homo sapiens |
title_fullStr |
Establishing edge labels for benchmarking gene co-expression networks of saccharomyces cerevisiae and homo sapiens |
title_full_unstemmed |
Establishing edge labels for benchmarking gene co-expression networks of saccharomyces cerevisiae and homo sapiens |
title_sort |
establishing edge labels for benchmarking gene co-expression networks of saccharomyces cerevisiae and homo sapiens |
publisher |
Nanyang Technological University |
publishDate |
2024 |
url |
https://hdl.handle.net/10356/176362 |
_version_ |
1814047278962311168 |