Protocol to identify functional doppelgängers and verify biomedical gene expression data using doppelgangerIdentifier
Functional doppelgängers (FDs) are independently derived sample pairs that confound machine learning model (ML) performance when assorted across training and validation sets. Here, we detail the use of doppelgangerIdentifier (DI), providing software installation, data preparation, doppelgänger ident...
Saved in:
Main Authors: | , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2023
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/164598 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-164598 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1645982023-02-28T17:13:49Z Protocol to identify functional doppelgängers and verify biomedical gene expression data using doppelgangerIdentifier Wang, Li Rong Fan, Xiuyi Goh, Wilson Wen Bin School of Computer Science and Engineering Lee Kong Chian School of Medicine (LKCMedicine) School of Biological Sciences Centre for Biomedical Informatics Engineering::Computer science and engineering Science::Biological sciences Gene Expression Machine Learning Functional doppelgängers (FDs) are independently derived sample pairs that confound machine learning model (ML) performance when assorted across training and validation sets. Here, we detail the use of doppelgangerIdentifier (DI), providing software installation, data preparation, doppelgänger identification, and functional testing steps. We demonstrate examples with biomedical gene expression data. We also provide guidelines for the selection of user-defined function arguments. For complete details on the use and execution of this protocol, please refer to Wang et al. (2022). Ministry of Education (MOE) National Research Foundation (NRF) Published version This research/project is supported by the National Research Foundation, Singapore under its Industry Alignment Fund – Pre-positioning (IAF-PP) Funding Initiative. Any opinions, findings and conclusions or recommendations expressed in this material are those of the author(s) and do not reflect the views of National Research Foundation, Singapore. W.W.B.G. also acknowledges support from a Ministry of Education (MOE), Singapore Tier 1 grant (grant no. RG35/20). 2023-02-06T05:37:02Z 2023-02-06T05:37:02Z 2022 Journal Article Wang, L. R., Fan, X. & Goh, W. W. B. (2022). Protocol to identify functional doppelgängers and verify biomedical gene expression data using doppelgangerIdentifier. STAR Protocols, 3(4), 101783-. https://dx.doi.org/10.1016/j.xpro.2022.101783 2666-1667 https://hdl.handle.net/10356/164598 10.1016/j.xpro.2022.101783 36317174 2-s2.0-85140458047 4 3 101783 en RG35/20 STAR Protocols © 2022 The Author(s). This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering Science::Biological sciences Gene Expression Machine Learning |
spellingShingle |
Engineering::Computer science and engineering Science::Biological sciences Gene Expression Machine Learning Wang, Li Rong Fan, Xiuyi Goh, Wilson Wen Bin Protocol to identify functional doppelgängers and verify biomedical gene expression data using doppelgangerIdentifier |
description |
Functional doppelgängers (FDs) are independently derived sample pairs that confound machine learning model (ML) performance when assorted across training and validation sets. Here, we detail the use of doppelgangerIdentifier (DI), providing software installation, data preparation, doppelgänger identification, and functional testing steps. We demonstrate examples with biomedical gene expression data. We also provide guidelines for the selection of user-defined function arguments. For complete details on the use and execution of this protocol, please refer to Wang et al. (2022). |
author2 |
School of Computer Science and Engineering |
author_facet |
School of Computer Science and Engineering Wang, Li Rong Fan, Xiuyi Goh, Wilson Wen Bin |
format |
Article |
author |
Wang, Li Rong Fan, Xiuyi Goh, Wilson Wen Bin |
author_sort |
Wang, Li Rong |
title |
Protocol to identify functional doppelgängers and verify biomedical gene expression data using doppelgangerIdentifier |
title_short |
Protocol to identify functional doppelgängers and verify biomedical gene expression data using doppelgangerIdentifier |
title_full |
Protocol to identify functional doppelgängers and verify biomedical gene expression data using doppelgangerIdentifier |
title_fullStr |
Protocol to identify functional doppelgängers and verify biomedical gene expression data using doppelgangerIdentifier |
title_full_unstemmed |
Protocol to identify functional doppelgängers and verify biomedical gene expression data using doppelgangerIdentifier |
title_sort |
protocol to identify functional doppelgängers and verify biomedical gene expression data using doppelgangeridentifier |
publishDate |
2023 |
url |
https://hdl.handle.net/10356/164598 |
_version_ |
1759853683444547584 |