Dealing with confounders in omics analysis

The Anna Karenina effect is a manifestation of the theory-practice gap that exists when theoretical statistics are applied on real-world data. In the course of analyzing biological data for differential features such as genes or proteins, it derives from the situation where the null hypothesis is re...

Full description

Saved in:
Bibliographic Details
Main Authors: Goh, Wilson Wen Bin, Wong, Limsoon
Other Authors: School of Biological Sciences
Format: Article
Language:English
Published: 2020
Subjects:
Online Access:https://hdl.handle.net/10356/142047
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-142047
record_format dspace
spelling sg-ntu-dr.10356-1420472020-06-15T05:07:37Z Dealing with confounders in omics analysis Goh, Wilson Wen Bin Wong, Limsoon School of Biological Sciences Science::Biological sciences Statistics Feature Selection The Anna Karenina effect is a manifestation of the theory-practice gap that exists when theoretical statistics are applied on real-world data. In the course of analyzing biological data for differential features such as genes or proteins, it derives from the situation where the null hypothesis is rejected for extraneous reasons (or confounders), rather than because the alternative hypothesis is relevant to the disease phenotype. The mechanics of applying statistical tests therefore must address and resolve confounders. It is inadequate to simply rely on manipulating the P-value. We discuss three mechanistic elements (hypothesis statement construction, null distribution appropriateness, and test-statistic construction) and suggest how they can be designed to foil the Anna Karenina effect to select phenotypically relevant biological features. 2020-06-15T05:07:37Z 2020-06-15T05:07:37Z 2018 Journal Article Goh, W. W. B., & Wong, L. (2018). Dealing with confounders in omics analysis. Trends in biotechnology, 36(5), 488-498. doi:10.1016/j.tibtech.2018.01.013 0167-7799 https://hdl.handle.net/10356/142047 10.1016/j.tibtech.2018.01.013 29475622 2-s2.0-85042200175 5 36 488 498 en Trends in biotechnology © 2018 Elsevier Ltd. All rights reserved.
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic Science::Biological sciences
Statistics
Feature Selection
spellingShingle Science::Biological sciences
Statistics
Feature Selection
Goh, Wilson Wen Bin
Wong, Limsoon
Dealing with confounders in omics analysis
description The Anna Karenina effect is a manifestation of the theory-practice gap that exists when theoretical statistics are applied on real-world data. In the course of analyzing biological data for differential features such as genes or proteins, it derives from the situation where the null hypothesis is rejected for extraneous reasons (or confounders), rather than because the alternative hypothesis is relevant to the disease phenotype. The mechanics of applying statistical tests therefore must address and resolve confounders. It is inadequate to simply rely on manipulating the P-value. We discuss three mechanistic elements (hypothesis statement construction, null distribution appropriateness, and test-statistic construction) and suggest how they can be designed to foil the Anna Karenina effect to select phenotypically relevant biological features.
author2 School of Biological Sciences
author_facet School of Biological Sciences
Goh, Wilson Wen Bin
Wong, Limsoon
format Article
author Goh, Wilson Wen Bin
Wong, Limsoon
author_sort Goh, Wilson Wen Bin
title Dealing with confounders in omics analysis
title_short Dealing with confounders in omics analysis
title_full Dealing with confounders in omics analysis
title_fullStr Dealing with confounders in omics analysis
title_full_unstemmed Dealing with confounders in omics analysis
title_sort dealing with confounders in omics analysis
publishDate 2020
url https://hdl.handle.net/10356/142047
_version_ 1681059368181694464