Constrained neuro fuzzy inference methodology for explainable personalised modelling with applications on gene expression data
Interpretable machine learning models for gene expression datasets are important for understanding the decision-making process of a classifier and gaining insights on the underlying molecular processes of genetic conditions. Interpretable models can potentially support early diagnosis before full di...
Saved in:
Main Authors: | , , , , , , , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2023
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/169391 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-169391 |
---|---|
record_format |
dspace |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Science::Biological sciences Algorithm Bipolar Disorder |
spellingShingle |
Science::Biological sciences Algorithm Bipolar Disorder Singh, Balkaran Doborjeh, Maryam Doborjeh, Zohreh Budhraja, Sugam Tan, Samuel Sumich, Alexander Goh, Wilson Lee, Jimmy Lai, Edmund Kasabov, Nikola Constrained neuro fuzzy inference methodology for explainable personalised modelling with applications on gene expression data |
description |
Interpretable machine learning models for gene expression datasets are important for understanding the decision-making process of a classifier and gaining insights on the underlying molecular processes of genetic conditions. Interpretable models can potentially support early diagnosis before full disease manifestation. This is particularly important yet, challenging for mental health. We hypothesise this is due to extreme heterogeneity issues which may be overcome and explained by personalised modelling techniques. Thus far, most machine learning methods applied to gene expression datasets, including deep neural networks, lack personalised interpretability. This paper proposes a new methodology named personalised constrained neuro fuzzy inference (PCNFI) for learning personalised rules from high dimensional datasets which are structurally and semantically interpretable. Case studies on two mental health related datasets (schizophrenia and bipolar disorders) have shown that the relatively short and simple personalised fuzzy rules provided enhanced interpretability as well as better classification performance compared to other commonly used machine learning methods. Performance test on a cancer dataset also showed that PCNFI matches previous benchmarks. Insights from our approach also indicated the importance of two genes (ATRX and TSPAN2) as possible biomarkers for early differentiation of ultra-high risk, bipolar and healthy individuals. These genes are linked to cognitive ability and impulsive behaviour. Our findings suggest a significant starting point for further research into the biological role of cognitive and impulsivity-related differences. With potential applications across bio-medical research, the proposed PCNFI method is promising for diagnosis, prognosis, and the design of personalised treatment plans for better outcomes in the future. |
author2 |
School of Biological Sciences |
author_facet |
School of Biological Sciences Singh, Balkaran Doborjeh, Maryam Doborjeh, Zohreh Budhraja, Sugam Tan, Samuel Sumich, Alexander Goh, Wilson Lee, Jimmy Lai, Edmund Kasabov, Nikola |
format |
Article |
author |
Singh, Balkaran Doborjeh, Maryam Doborjeh, Zohreh Budhraja, Sugam Tan, Samuel Sumich, Alexander Goh, Wilson Lee, Jimmy Lai, Edmund Kasabov, Nikola |
author_sort |
Singh, Balkaran |
title |
Constrained neuro fuzzy inference methodology for explainable personalised modelling with applications on gene expression data |
title_short |
Constrained neuro fuzzy inference methodology for explainable personalised modelling with applications on gene expression data |
title_full |
Constrained neuro fuzzy inference methodology for explainable personalised modelling with applications on gene expression data |
title_fullStr |
Constrained neuro fuzzy inference methodology for explainable personalised modelling with applications on gene expression data |
title_full_unstemmed |
Constrained neuro fuzzy inference methodology for explainable personalised modelling with applications on gene expression data |
title_sort |
constrained neuro fuzzy inference methodology for explainable personalised modelling with applications on gene expression data |
publishDate |
2023 |
url |
https://hdl.handle.net/10356/169391 |
_version_ |
1773551406978957312 |
spelling |
sg-ntu-dr.10356-1693912023-07-17T15:31:59Z Constrained neuro fuzzy inference methodology for explainable personalised modelling with applications on gene expression data Singh, Balkaran Doborjeh, Maryam Doborjeh, Zohreh Budhraja, Sugam Tan, Samuel Sumich, Alexander Goh, Wilson Lee, Jimmy Lai, Edmund Kasabov, Nikola School of Biological Sciences Lee Kong Chian School of Medicine (LKCMedicine) Center for Biomedical Informatics Science::Biological sciences Algorithm Bipolar Disorder Interpretable machine learning models for gene expression datasets are important for understanding the decision-making process of a classifier and gaining insights on the underlying molecular processes of genetic conditions. Interpretable models can potentially support early diagnosis before full disease manifestation. This is particularly important yet, challenging for mental health. We hypothesise this is due to extreme heterogeneity issues which may be overcome and explained by personalised modelling techniques. Thus far, most machine learning methods applied to gene expression datasets, including deep neural networks, lack personalised interpretability. This paper proposes a new methodology named personalised constrained neuro fuzzy inference (PCNFI) for learning personalised rules from high dimensional datasets which are structurally and semantically interpretable. Case studies on two mental health related datasets (schizophrenia and bipolar disorders) have shown that the relatively short and simple personalised fuzzy rules provided enhanced interpretability as well as better classification performance compared to other commonly used machine learning methods. Performance test on a cancer dataset also showed that PCNFI matches previous benchmarks. Insights from our approach also indicated the importance of two genes (ATRX and TSPAN2) as possible biomarkers for early differentiation of ultra-high risk, bipolar and healthy individuals. These genes are linked to cognitive ability and impulsive behaviour. Our findings suggest a significant starting point for further research into the biological role of cognitive and impulsivity-related differences. With potential applications across bio-medical research, the proposed PCNFI method is promising for diagnosis, prognosis, and the design of personalised treatment plans for better outcomes in the future. National Medical Research Council (NMRC) National Research Foundation (NRF) Published version This research is supported by the MBIE Catalyst: Strategic—New Zealand-Singapore Data Science Research Programme Funding and the National Research Foundation, Singapore under its Industry Alignment Fund—Pre-positioning (IAF-PP) Funding Initiative. The LYRIKS study was supported by the National Research Foundation Singapore under the National Medical Research Council Translational and Clinical Research Flagship Programme (NMRC/TCR/003/2008). 2023-07-17T07:52:03Z 2023-07-17T07:52:03Z 2023 Journal Article Singh, B., Doborjeh, M., Doborjeh, Z., Budhraja, S., Tan, S., Sumich, A., Goh, W., Lee, J., Lai, E. & Kasabov, N. (2023). Constrained neuro fuzzy inference methodology for explainable personalised modelling with applications on gene expression data. Scientific Reports, 13(1), 456-. https://dx.doi.org/10.1038/s41598-022-27132-8 2045-2322 https://hdl.handle.net/10356/169391 10.1038/s41598-022-27132-8 36624117 2-s2.0-85145956479 1 13 456 en NMRC/TCR/003/2008 Scientific Reports © 2023 The Author(s). This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. application/pdf |