Examining the practical limits of batch effect-correction algorithms : when should you care about batch effects?

Batch effects are technical sources of variation and can confound analysis. While many performance ranking exercises have been conducted to establish the best batch effect-correction algorithm (BECA), we hold the viewpoint that the notion of best is context-dependent. Moreover, alternative questions...

全面介紹

Saved in:

書目詳細資料
Main Authors:	Zhou, Longjian, Sue, Andrew Chi-Hau, Goh, Wilson Wen Bin
其他作者:	School of Biological Sciences
格式:	Article
語言:	English
出版:	2021
主題:	Science::Biological sciences Batch Effects Bioinformatics
在線閱讀:	https://hdl.handle.net/10356/150368
標簽:	添加標簽沒有標簽, 成為第一個標記此記錄!

id	sg-ntu-dr.10356-150368
record_format	dspace
spelling	sg-ntu-dr.10356-1503682023-02-28T16:57:27Z Examining the practical limits of batch effect-correction algorithms : when should you care about batch effects? Zhou, Longjian Sue, Andrew Chi-Hau Goh, Wilson Wen Bin School of Biological Sciences Science::Biological sciences Batch Effects Bioinformatics Batch effects are technical sources of variation and can confound analysis. While many performance ranking exercises have been conducted to establish the best batch effect-correction algorithm (BECA), we hold the viewpoint that the notion of best is context-dependent. Moreover, alternative questions beyond the simplistic notion of "best" are also interesting: are BECAs robust against various degrees of confounding and if so, what is the limit? Using two different methods for simulating class (phenotype) and batch effects and taking various representative datasets across both genomics (RNA-Seq) and proteomics platforms, we demonstrate that under situations where sample classes and batch factors are moderately confounded, most BECAs are remarkably robust and only weakly affected by upstream normalization procedures. This observation is consistently supported across the multitude of test datasets. BECAs do have limits: When sample classes and batch factors are strongly confounded, BECA performance declines, with variable performance in precision, recall and also batch correction. We also report that while conventional normalization methods have minimal impact on batch effect correction, they do not affect downstream statistical feature selection, and in strongly confounded scenarios, may even outperform BECAs. In other words, removing batch effects is no guarantee of optimal functional analysis. Overall, this study suggests that simplistic performance ranking exercises are quite trivial, and all BECAs are compromises in some context or another. National Research Foundation (NRF) Accepted version WWBG gratefully acknowledges Limsoon Wong, National University of Singapore, for inspiring this work. WWBG gratefully acknowledges support from the National Research Foundation of Singapore, NRF-NSFC (Grant No. NRF2018NRF-NSFC003SB-006). 2021-05-24T00:46:24Z 2021-05-24T00:46:24Z 2019 Journal Article Zhou, L., Sue, A. C. & Goh, W. W. B. (2019). Examining the practical limits of batch effect-correction algorithms : when should you care about batch effects?. Journal of Genetics and Genomics, 46(9), 433-443. https://dx.doi.org/10.1016/j.jgg.2019.08.002 1673-8527 0000-0002-4480-4073 0000-0003-3863-7501 https://hdl.handle.net/10356/150368 10.1016/j.jgg.2019.08.002 31611172 2-s2.0-85073164824 9 46 433 443 en NRF2018NRF-NSFC003SB-006 Journal of Genetics and Genomics © 2019 Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, and Genetics Society of China. All rights reserved. This paper was published by Elsevier Limited and Science Press in Journal of Genetics and Genomics and is made available with permission of Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, and Genetics Society of China. application/pdf
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Science::Biological sciences Batch Effects Bioinformatics
spellingShingle	Science::Biological sciences Batch Effects Bioinformatics Zhou, Longjian Sue, Andrew Chi-Hau Goh, Wilson Wen Bin Examining the practical limits of batch effect-correction algorithms : when should you care about batch effects?
description	Batch effects are technical sources of variation and can confound analysis. While many performance ranking exercises have been conducted to establish the best batch effect-correction algorithm (BECA), we hold the viewpoint that the notion of best is context-dependent. Moreover, alternative questions beyond the simplistic notion of "best" are also interesting: are BECAs robust against various degrees of confounding and if so, what is the limit? Using two different methods for simulating class (phenotype) and batch effects and taking various representative datasets across both genomics (RNA-Seq) and proteomics platforms, we demonstrate that under situations where sample classes and batch factors are moderately confounded, most BECAs are remarkably robust and only weakly affected by upstream normalization procedures. This observation is consistently supported across the multitude of test datasets. BECAs do have limits: When sample classes and batch factors are strongly confounded, BECA performance declines, with variable performance in precision, recall and also batch correction. We also report that while conventional normalization methods have minimal impact on batch effect correction, they do not affect downstream statistical feature selection, and in strongly confounded scenarios, may even outperform BECAs. In other words, removing batch effects is no guarantee of optimal functional analysis. Overall, this study suggests that simplistic performance ranking exercises are quite trivial, and all BECAs are compromises in some context or another.
author2	School of Biological Sciences
author_facet	School of Biological Sciences Zhou, Longjian Sue, Andrew Chi-Hau Goh, Wilson Wen Bin
format	Article
author	Zhou, Longjian Sue, Andrew Chi-Hau Goh, Wilson Wen Bin
author_sort	Zhou, Longjian
title	Examining the practical limits of batch effect-correction algorithms : when should you care about batch effects?
title_short	Examining the practical limits of batch effect-correction algorithms : when should you care about batch effects?
title_full	Examining the practical limits of batch effect-correction algorithms : when should you care about batch effects?
title_fullStr	Examining the practical limits of batch effect-correction algorithms : when should you care about batch effects?
title_full_unstemmed	Examining the practical limits of batch effect-correction algorithms : when should you care about batch effects?
title_sort	examining the practical limits of batch effect-correction algorithms : when should you care about batch effects?
publishDate	2021
url	https://hdl.handle.net/10356/150368
_version_	1759853841833000960

Examining the practical limits of batch effect-correction algorithms : when should you care about batch effects?

相似書籍