A Unified Model for Joint Normalization and Differential Gene Expression Detection in RNA-Seq Data

Liu Kefei; Ye Jieping; Yang Yang; Shen Li; Jiang Hui

Abstract

RNA sequencing (RNA-seq) is becoming increasingly popular for quantifying gene expression levels. Because RNA-seq measurements are relative in nature, between-sample normalization is an essential step in differential expression (DE) analysis. In existing DE detection algorithms, the normalization step is usually ad hoc and performed only once, prior to DE detection. This may be suboptimal: ideally, normalization should be based on non-DE genes only and should therefore be coupled with DE detection. We propose a unified statistical model for joint normalization and DE detection of RNA-seq data. Sample-specific normalization factors are modeled as unknown parameters in the gene-wise linear models and estimated jointly with the regression coefficients. By imposing a sparsity-inducing L1 penalty (or a mixed L1/L2 penalty for multiple treatment conditions) on the regression coefficients, we formulate the problem as a penalized least-squares regression problem and apply the augmented Lagrangian method to solve it. Simulation and real-data studies show that the proposed model and algorithms perform better than or comparably to existing methods in terms of detection power and false-positive rate. The performance gain increases with larger sample size or higher signal-to-noise ratio, and is more significant when a large proportion of genes are differentially expressed in an asymmetric manner.
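
The idea of estimating normalization factors jointly with sparse regression coefficients can be illustrated with a small sketch. Everything below is an assumption made for illustration and is not taken from the paper: a two-group design, log-scale expression values, the model y[g, i] = alpha[g] + d[i] + beta[g] * x[i] + noise, and the hypothetical names `fit_joint`, `simulate`, and `lam`. The solver is plain block coordinate descent, swapped in for brevity; it is not the augmented Lagrangian method the abstract describes.

```python
import numpy as np

def soft_threshold(z, t):
    """Elementwise soft-thresholding, the proximal map of t * |.|."""
    return np.sign(z) * np.maximum(np.abs(z) - t, 0.0)

def objective(Y, x, alpha, d, beta, lam):
    """0.5 * residual sum of squares + lam * L1 norm of beta."""
    R = Y - alpha[:, None] - d[None, :] - np.outer(beta, x)
    return 0.5 * np.sum(R ** 2) + lam * np.sum(np.abs(beta))

def fit_joint(Y, x, lam, n_iter=200):
    """Block coordinate descent over (alpha, beta, d).

    Y: genes x samples matrix of log-expression values (assumed layout).
    x: 0/1 treatment indicator per sample.
    """
    n_genes, n_samples = Y.shape
    alpha = Y.mean(axis=1)
    beta = np.zeros(n_genes)
    d = np.zeros(n_samples)
    xx = float(x @ x)
    history = []
    for _ in range(n_iter):
        # Gene baselines: exact least-squares update given d and beta.
        alpha = (Y - d[None, :] - np.outer(beta, x)).mean(axis=1)
        # Treatment effects: closed-form lasso update for each gene.
        R = Y - alpha[:, None] - d[None, :]
        beta = soft_threshold(R @ x, lam) / xx
        # Normalization factors: exact update given alpha and beta.
        d = (Y - alpha[:, None] - np.outer(beta, x)).mean(axis=0)
        history.append(objective(Y, x, alpha, d, beta, lam))
    # alpha and d are only defined up to a common shift; report d centered.
    shift = d.mean()
    return alpha + shift, d - shift, beta, np.array(history)

def simulate(seed=0, n_genes=200, n_de=20, effect=2.0, noise_sd=0.1):
    """Toy data: 3 control + 3 treated samples; first n_de genes up-regulated."""
    rng = np.random.default_rng(seed)
    x = np.array([0.0, 0.0, 0.0, 1.0, 1.0, 1.0])
    alpha = rng.normal(5.0, 1.0, n_genes)
    d = rng.normal(0.0, 0.3, x.size)
    beta = np.zeros(n_genes)
    beta[:n_de] = effect  # asymmetric DE: all effects point upward
    noise = rng.normal(0.0, noise_sd, (n_genes, x.size))
    Y = alpha[:, None] + d[None, :] + np.outer(beta, x) + noise
    return Y, x, d, beta

Y, x, d_true, beta_true = simulate()
alpha_hat, d_hat, beta_hat, history = fit_joint(Y, x, lam=0.5)
print("genes flagged as DE:", int(np.sum(beta_hat != 0)))
```

In this sketch the L1 penalty sets most `beta_hat` entries exactly to zero, so the per-sample factors `d_hat` end up driven mainly by genes treated as non-DE, which is the coupling the abstract argues for. The penalty weight `lam` is hand-picked here; a larger value flags fewer genes.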

Bibliographic details

Source
IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2019, No. 2, pp. 442-454 (13 pages)
Authors
Liu Kefei; Ye Jieping; Yang Yang; Shen Li; Jiang Hui
Author affiliations

Indiana Univ Sch Med, Dept Radiol & Imaging Sci, Indianapolis, IN 46202 USA;

Univ Michigan, Dept Computat Med & Bioinformat, Ann Arbor, MI 48109 USA;

Beihang Univ, Sch Engn & Comp Sci, Beijing 100191, Peoples R China;

Indiana Univ Sch Med, Dept Radiol & Imaging Sci, Indianapolis, IN 46202 USA;

Univ Michigan, Dept Biostat, Ann Arbor, MI 48109 USA;

Original format: PDF
Language: English
Keywords
RNA-seq; differential expression analysis; normalization; linear regression; L1-norm regularization; augmented Lagrangian method;
