首页> 外文期刊>BMC Bioinformatics >A novel min-cost flow method for estimating transcript expression with RNA-Seq
【24h】

A novel min-cost flow method for estimating transcript expression with RNA-Seq

机译:RNA-Seq估算转录本表达的新型最小成本流方法

获取原文
           

摘要

BackgroundThrough transcription and alternative splicing, a gene can be transcribed into different RNA sequences (isoforms), depending on the individual, on the tissue the cell is in, or in response to some stimuli. Recent RNA-Seq technology allows for new high-throughput ways for isoform identification and quantification based on short reads, and various methods have been put forward for this non-trivial problem.ResultsIn this paper we propose a novel radically different method based on minimum-cost network flows. This has a two-fold advantage: on the one hand, it translates the problem as an established one in the field of network flows, which can be solved in polynomial time, with different existing solvers; on the other hand, it is general enough to encompass many of the previous proposals under the least sum of squares model. Our method works as follows: in order to find the transcripts which best explain, under a given fitness model, a splicing graph resulting from an RNA-Seq experiment, we find a min-cost flow in an offset flow network, under an equivalent cost model. Under very weak assumptions on the fitness model, the optimal flow can be computed in polynomial time. Parsimoniously splitting the flow back into few path transcripts can be done with any of the heuristics and approximations available from the theory of network flows. In the present implementation, we choose the simple strategy of repeatedly removing the heaviest path.ConclusionsWe proposed a new very general method based on network flows for a multiassembly problem arising from isoform identification and quantification with RNA-Seq. Experimental results on prediction accuracy show that our method is very competitive with popular tools such as Cufflinks and IsoLasso. Our tool, called Traph (Transcrips in gRAPHs), is available at: http://www.cs.helsinki.fi/gsa/traph/.
机译:背景通过转录和选择性剪接,可以将基因转录为不同的RNA序列(同工型),具体取决于个体,细胞所处的组织或对某些刺激的反应。最近的RNA-Seq技术为基于短读段的异构体鉴定和定量提供了新的高通量方法,并且针对此非平凡问题提出了各种方法。结果在本文中,我们提出了一种基于极小值的根本不同的新方法。成本网络流量。这具有两方面的优势:一方面,它可以将问题转换为网络流领域中的既定问题,可以在多项式时间内使用现有的不同求解器进行求解;另一方面,在最小平方和模型下包含许多先前的建议已经足够通用了。我们的方法的工作原理如下:为了找到在给定适应性模型下最能解释由RNA-Seq实验产生的剪接图的转录本,我们在等价成本下找到了偏移流量网络中的最小成本流量模型。在适应性模型的非常弱的假设下,可以在多项式时间内计算出最佳流量。可以使用网络流量理论中可用的任何启发式方法和近似方法,将流量简化为几条路径记录。在当前的实现中,我们选择重复删除最重路径的简单策略。结论我们针对基于RNA-Seq的同工型识别和定量化引起的多装配问题,提出了一种基于网络流的非常通用的新方法。关于预测准确性的实验结果表明,我们的方法与诸如Cufflinks和IsoLasso的流行工具非常有竞争力。我们的工具称为Traph(gRAPH中的转录),可从以下网址获得:http://www.cs.helsinki.fi/gsa/traph/。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号