首页> 外文期刊>Nucleic Acids Research >NCLscan: accurate identification of non-co-linear transcripts (fusion, trans-splicing and circular RNA) with a good balance between sensitivity and precision
【24h】

NCLscan: accurate identification of non-co-linear transcripts (fusion, trans-splicing and circular RNA) with a good balance between sensitivity and precision

机译:NCLscan:准确识别非共线转录本(融合,反式剪接和环状RNA),并在灵敏度和精度之间取得良好的平衡

获取原文
获取原文并翻译 | 示例
           

摘要

Analysis of RNA-seq data often detects numerous 'non-co-linear' (NCL) transcripts, which comprised sequence segments that are topologically inconsistent with their corresponding DNA sequences in the reference genome. However, detection of NCL transcripts involves two major challenges: removal of false positives arising from alignment artifacts and discrimination between different types of NCL transcripts (trans-spliced, circular or fusion transcripts). Here, we developed a new NCL-transcriptdetecting method ('NCLscan'), which utilized a stepwise alignment strategy to almost completely eliminate false calls (>98% precision) without sacrificing true positives, enabling NCLscan outperform 18 other publicly-available tools (including fusion-and circular-RNA-detecting tools) in terms of sensitivity and precision, regardless of the generation strategy of simulated dataset, type of intragenic or intergenic NCL event, read depth of coverage, read length or expression level of NCL transcript. With the high accuracy, NCLscan was applied to distinguishing between trans-spliced, circular and fusion transcripts on the basis of poly(A)-and nonpoly(A)-selected RNA-seq data. We showed that circular RNAs were expressed more ubiquitously, more abundantly and less cell type-specifically than trans-spliced and fusion transcripts. Our study thus describes a robust pipeline for the discovery of NCL transcripts, and sheds light on the fundamental biology of these noncanonical RNA events in human transcriptome.
机译:RNA-seq数据的分析通常会检测到许多“非线性”(NCL)转录本,其中包含与参考基因组中的相应DNA序列在拓扑上不一致的序列段。然而,NCL转录本的检测涉及两个主要挑战:消除由比对伪像引起的假阳性和区分不同类型的NCL转录本(反式剪接,环状或融合式转录本)。在这里,我们开发了一种新的NCL文字检测方法('NCLscan'),该方法利用逐步对齐策略几乎完全消除了误报(> 98%精度),而又不牺牲真实的肯定,从而使NCLscan的性能优于其他18种公开可用的工具(包括融合和环状RNA检测工具),无论模拟数据集的生成策略,基因内或基因间NCL事件的类型,覆盖的读取深度,读取长度或NCL转录本的表达水平如何。 NCLscan具有很高的准确性,可用于根据poly(A)和nonpoly(A)选择的RNA-seq数据区分反转录的,环状的和融合的转录本。我们显示,与反式剪接和融合转录本相比,环状RNA更普遍,更丰富且细胞类型特异性更少。因此,我们的研究描述了用于NCL转录本发现的强大流程,并阐明了人类转录组中这些非规范RNA事件的基础生物学。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号