...
首页> 外文期刊>Bioinformatics >TIGAR: transcript isoform abundance estimation method with gapped alignment of RNA-Seq data by variational Bayesian inference
【24h】

TIGAR: transcript isoform abundance estimation method with gapped alignment of RNA-Seq data by variational Bayesian inference

机译:TIGAR:利用变分贝叶斯推断对RNA-Seq数据进行空位比对的转录亚型丰度估计方法

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Motivation: Many human genes express multiple transcript isoforms through alternative splicing, which greatly increases diversity of protein function. Although RNA sequencing (RNA-Seq) technologies have been widely used in measuring amounts of transcribed mRNA, accurate estimation of transcript isoform abundances from RNA-Seq data is challenging because reads often map to more than one transcript isoforms or paralogs whose sequences are similar to each other. Results: We propose a statistical method to estimate transcript isoform abundances from RNA-Seq data. Our method can handle gapped alignments of reads against reference sequences so that it allows insertion or deletion errors within reads. The proposed method optimizes the number of transcript isoforms by variational Bayesian inference through an iterative procedure, and its convergence is guaranteed under a stopping criterion. On simulated datasets, our method outperformed the comparable quantification methods in inferring transcript isoform abundances, and at the same time its rate of convergence was faster than that of the expectation maximization algorithm. We also applied our method to RNA-Seq data of human cell line samples, and showed that our prediction result was more consistent among technical replicates than those of other methods.
机译:动机:许多人类基因通过选择性剪接表达多种转录异构体,这大大增加了蛋白质功能的多样性。尽管RNA测序(RNA-Seq)技术已广泛用于测量转录的mRNA的量,但是从RNA-Seq数据中准确估计转录本同工型的丰度仍具有挑战性,因为读段经常映射到序列与之相似的多个转录本同工型或旁系同源物。彼此。结果:我们提出了一种统计方法,可根据RNA-Seq数据估算转录本亚型的丰度。我们的方法可以处理针对参考序列的读段的空位比对,从而允许读段内出现插入或缺失错误。所提出的方法通过迭代过程通过变分贝叶斯推断来优化转录本同工型的数量,并在停止条件下保证了其收敛性。在模拟数据集上,我们的方法在推断笔录异构体丰度方面优于可比的量化方法,同时其收敛速度比期望最大化算法快。我们还将我们的方法应用于人类细胞系样品的RNA-Seq数据,并表明我们的预测结果在技术重复中比其他方法更为一致。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号