Feature importance network reveals novel functional relationships between biological features in Arabidopsis thaliana
Understanding how the different cellular components are working together to form a living cell requires multidisciplinary approaches combining molecular and computational biology. Machine learning shows great potential in life sciences, as it can find novel relationships between biological features....
Saved in:
Main Authors: | , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2023
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/164538 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-164538 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1645382023-02-28T17:13:38Z Feature importance network reveals novel functional relationships between biological features in Arabidopsis thaliana Ng, Jonathan Wei Xiong Chua, Swee Kwang Mutwil, Marek School of Biological Sciences Science::Biological sciences Machine Learning Random Forest Understanding how the different cellular components are working together to form a living cell requires multidisciplinary approaches combining molecular and computational biology. Machine learning shows great potential in life sciences, as it can find novel relationships between biological features. Here, we constructed a dataset of 11,801 gene features for 31,522 Arabidopsis thaliana genes and developed a machine learning workflow to identify linked features. The detected linked features are visualised as a Feature Important Network (FIN), which can be mined to reveal a variety of novel biological insights pertaining to gene function. We demonstrate how FIN can be used to generate novel insights into gene function. To make this network easily accessible to the scientific community, we present the FINder database, available at finder.plant.tools. Ministry of Education (MOE) Nanyang Technological University Published version MM is supported by a NTU Start-Up Grant and Singaporean Ministry of Education grant MOE2018-T2-2-053, while JN is supported by an NTU PhD stipend. 2023-01-31T06:14:12Z 2023-01-31T06:14:12Z 2022 Journal Article Ng, J. W. X., Chua, S. K. & Mutwil, M. (2022). Feature importance network reveals novel functional relationships between biological features in Arabidopsis thaliana. Frontiers in Plant Science, 13, 944992-. https://dx.doi.org/10.3389/fpls.2022.944992 1664-462X https://hdl.handle.net/10356/164538 10.3389/fpls.2022.944992 36212273 2-s2.0-85140099696 13 944992 en MOE2018-T2-2-053 NTU-SUG Frontiers in Plant Science © 2022 Ng, Chua and Mutwil. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Science::Biological sciences Machine Learning Random Forest |
spellingShingle |
Science::Biological sciences Machine Learning Random Forest Ng, Jonathan Wei Xiong Chua, Swee Kwang Mutwil, Marek Feature importance network reveals novel functional relationships between biological features in Arabidopsis thaliana |
description |
Understanding how the different cellular components are working together to form a living cell requires multidisciplinary approaches combining molecular and computational biology. Machine learning shows great potential in life sciences, as it can find novel relationships between biological features. Here, we constructed a dataset of 11,801 gene features for 31,522 Arabidopsis thaliana genes and developed a machine learning workflow to identify linked features. The detected linked features are visualised as a Feature Important Network (FIN), which can be mined to reveal a variety of novel biological insights pertaining to gene function. We demonstrate how FIN can be used to generate novel insights into gene function. To make this network easily accessible to the scientific community, we present the FINder database, available at finder.plant.tools. |
author2 |
School of Biological Sciences |
author_facet |
School of Biological Sciences Ng, Jonathan Wei Xiong Chua, Swee Kwang Mutwil, Marek |
format |
Article |
author |
Ng, Jonathan Wei Xiong Chua, Swee Kwang Mutwil, Marek |
author_sort |
Ng, Jonathan Wei Xiong |
title |
Feature importance network reveals novel functional relationships between biological features in Arabidopsis thaliana |
title_short |
Feature importance network reveals novel functional relationships between biological features in Arabidopsis thaliana |
title_full |
Feature importance network reveals novel functional relationships between biological features in Arabidopsis thaliana |
title_fullStr |
Feature importance network reveals novel functional relationships between biological features in Arabidopsis thaliana |
title_full_unstemmed |
Feature importance network reveals novel functional relationships between biological features in Arabidopsis thaliana |
title_sort |
feature importance network reveals novel functional relationships between biological features in arabidopsis thaliana |
publishDate |
2023 |
url |
https://hdl.handle.net/10356/164538 |
_version_ |
1759857894348554240 |