LFCseq: a nonparametric approach for differential expression analysis of RNA-seq data

Background: With the advances in high-throughput DNA sequencing technologies, RNA-seq has rapidly emerged as a powerful tool for the quantitative analysis of gene expression and transcript variant discovery. In comparative experiments, differential expression analysis is commonly performed on RNA-se...

Full description

Saved in:
Bibliographic Details
Main Authors: Lin, Bingqing, Zhang, Li-Feng, Chen, Xin
Other Authors: School of Biological Sciences
Format: Article
Language:English
Published: 2015
Subjects:
Online Access:https://hdl.handle.net/10356/80204
http://hdl.handle.net/10220/38890
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-80204
record_format dspace
spelling sg-ntu-dr.10356-802042023-02-28T16:59:08Z LFCseq: a nonparametric approach for differential expression analysis of RNA-seq data Lin, Bingqing Zhang, Li-Feng Chen, Xin School of Biological Sciences School of Physical and Mathematical Sciences Differential expression Nonparametric RNA-seq Background: With the advances in high-throughput DNA sequencing technologies, RNA-seq has rapidly emerged as a powerful tool for the quantitative analysis of gene expression and transcript variant discovery. In comparative experiments, differential expression analysis is commonly performed on RNA-seq data to identify genes/features that are differentially expressed between biological conditions. Most existing statistical methods for differential expression analysis are parametric and assume either Poisson distribution or negative binomial distribution on gene read counts. However, violation of distributional assumptions or a poor estimation of parameters often leads to unreliable results. Results: In this paper, we introduce a new nonparametric approach called LFCseq that uses log fold changes as a differential expression test statistic. To test each gene for differential expression, LFCseq estimates a null probability distribution of count changes from a selected set of genes with similar expression strength. In contrast, the nonparametric NOISeq approach relies on a null distribution estimated from all genes within an experimental condition regardless of their expression levels. Conclusion: Through extensive simulation study and RNA-seq real data analysis, we demonstrate that the proposed approach could well rank the differentially expressed genes ahead of non-differentially expressed genes, thereby achieving a much improved overall performance for differential expression analysis. NMRC (Natl Medical Research Council, S’pore) Published version 2015-12-02T04:36:55Z 2019-12-06T13:42:52Z 2015-12-02T04:36:55Z 2019-12-06T13:42:52Z 2014 Journal Article Lin, B., Zhang, L.-F., & Chen, X. (2014). LFCseq: a nonparametric approach for differential expression analysis of RNA-seq data. BMC Genomics, 15(S7). 1471-2164 https://hdl.handle.net/10356/80204 http://hdl.handle.net/10220/38890 10.1186/1471-2164-15-S10-S7 25560842 en BMC Genomics © 2014 Lin et al.; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. 9 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Differential expression
Nonparametric
RNA-seq
spellingShingle Differential expression
Nonparametric
RNA-seq
Lin, Bingqing
Zhang, Li-Feng
Chen, Xin
LFCseq: a nonparametric approach for differential expression analysis of RNA-seq data
description Background: With the advances in high-throughput DNA sequencing technologies, RNA-seq has rapidly emerged as a powerful tool for the quantitative analysis of gene expression and transcript variant discovery. In comparative experiments, differential expression analysis is commonly performed on RNA-seq data to identify genes/features that are differentially expressed between biological conditions. Most existing statistical methods for differential expression analysis are parametric and assume either Poisson distribution or negative binomial distribution on gene read counts. However, violation of distributional assumptions or a poor estimation of parameters often leads to unreliable results. Results: In this paper, we introduce a new nonparametric approach called LFCseq that uses log fold changes as a differential expression test statistic. To test each gene for differential expression, LFCseq estimates a null probability distribution of count changes from a selected set of genes with similar expression strength. In contrast, the nonparametric NOISeq approach relies on a null distribution estimated from all genes within an experimental condition regardless of their expression levels. Conclusion: Through extensive simulation study and RNA-seq real data analysis, we demonstrate that the proposed approach could well rank the differentially expressed genes ahead of non-differentially expressed genes, thereby achieving a much improved overall performance for differential expression analysis.
author2 School of Biological Sciences
author_facet School of Biological Sciences
Lin, Bingqing
Zhang, Li-Feng
Chen, Xin
format Article
author Lin, Bingqing
Zhang, Li-Feng
Chen, Xin
author_sort Lin, Bingqing
title LFCseq: a nonparametric approach for differential expression analysis of RNA-seq data
title_short LFCseq: a nonparametric approach for differential expression analysis of RNA-seq data
title_full LFCseq: a nonparametric approach for differential expression analysis of RNA-seq data
title_fullStr LFCseq: a nonparametric approach for differential expression analysis of RNA-seq data
title_full_unstemmed LFCseq: a nonparametric approach for differential expression analysis of RNA-seq data
title_sort lfcseq: a nonparametric approach for differential expression analysis of rna-seq data
publishDate 2015
url https://hdl.handle.net/10356/80204
http://hdl.handle.net/10220/38890
_version_ 1759854215724793856