...
首页> 外文期刊>Methods and Protocols >Combining Multiple RNA-Seq Data Analysis Algorithms Using Machine Learning Improves Differential Isoform Expression Analysis
【24h】

Combining Multiple RNA-Seq Data Analysis Algorithms Using Machine Learning Improves Differential Isoform Expression Analysis

机译:使用机器学习结合多个RNA-SEQ数据分析算法改善了差分同种型表达分析

获取原文
           

摘要

RNA sequencing has become the standard technique for high resolution genome-wide monitoring of gene expression. As such, it often comprises the first step towards understanding complex molecular mechanisms driving various phenotypes, spanning organ development to disease genesis, monitoring and progression. An advantage of RNA sequencing is its ability to capture complex transcriptomic events such as alternative splicing which results in alternate isoform abundance. At the same time, this advantage remains algorithmically and computationally challenging, especially with the emergence of even higher resolution technologies such as single-cell RNA sequencing. Although several algorithms have been proposed for the effective detection of differential isoform expression from RNA-Seq data, no widely accepted golden standards have been established. This fact is further compounded by the significant differences in the output of different algorithms when applied on the same data. In addition, many of the proposed algorithms remain scarce and poorly maintained. Driven by these challenges, we developed a novel integrative approach that effectively combines the most widely used algorithms for differential transcript and isoform analysis using state-of-the-art machine learning techniques. We demonstrate its usability by applying it on simulated data based on several organisms, and using several performance metrics; we conclude that our strategy outperforms the application of the individual algorithms. Finally, our approach is implemented as an R Shiny application, with the underlying data analysis pipelines also available as docker containers.
机译:RNA测序已成为基因表达的高分辨率基因组监测的标准技术。因此,它通常包括迈向理解复杂的分子机制驾驶各种表型的第一步,跨越器官发育到疾病起因,监测和进展。 RNA测序的优点是其能够捕获复杂的转录组物事件,例如替代剪接,这导致交替的同种型丰度。与此同时,这种优势仍然算法和计算上具有挑战性,特别是在出现甚至更高的分辨率技术,例如单细胞RNA测序。虽然已经提出了有效检测来自RNA-SEQ数据的差分异构表达的几种算法,但没有建立广泛接受的黄金标准。该事实进一步通过在相同数据上应用时不同算法的输出的显着差异进行了复杂的。此外,许多所提出的算法仍然稀缺,维护不佳。通过这些挑战推动,我们开发了一种新型一致的方法,有效地结合了使用最先进的机器学习技术的差分转录物和同种型分析的最广泛使用的算法。我们通过基于若干生物的模拟数据应用,并使用几个性能指标来展示其可用性;我们得出结论,我们的策略优于个别算法的应用。最后,我们的方法是实现为R闪亮的应用程序,底层数据分析管道也可作为Docker容器提供。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号