...
首页> 外文期刊>Bioinformatics >PASSion: a pattern growth algorithm-based pipeline for splice junction detection in paired-end RNA-Seq data
【24h】

PASSion: a pattern growth algorithm-based pipeline for splice junction detection in paired-end RNA-Seq data

机译:PASSion:基于模式增长算法的管道,用于配对末端RNA-Seq数据中的剪接连接检测

获取原文
获取原文并翻译 | 示例
           

摘要

Motivation: RNA-seq is a powerful technology for the study of transcriptome profiles that uses deep- sequencing technologies. Moreover, it may be used for cellular phenotyping and help establishing the etiology of diseases characterized by abnormal splicing patterns. In RNA- Seq, the exact nature of splicing events is buried in the reads that span exon- exon boundaries. The accurate and efficient mapping of these reads to the reference genome is a major challenge. Results: We developed PASSion, a pattern growth algorithm-based pipeline for splice site detection in paired-end RNA-Seq reads. Comparing the performance of PASSion to three existing RNA-Seq analysis pipelines, TopHat, MapSplice and HMMSplicer, revealed that PASSion is competitive with these packages. Moreover, the performance of PASSion is not affected by read length and coverage. It performs better than the other three approaches when detecting junctions in highly abundant transcripts. PASSion has the ability to detect junctions that do not have known splicing motifs, which cannot be found by the other tools. Of the two public RNA-Seq datasets, PASSion predicted similar to 137 000 and 173 000 splicing events, of which on average 82 are known junctions annotated in the Ensembl transcript database and 18% are novel. In addition, our package can discover differential and shared splicing patterns among multiple samples.
机译:动机:RNA-seq是一种使用深度测序技术研究转录组图谱的强大技术。此外,它可用于细胞表型分析并帮助建立以异常剪接模式为特征的疾病的病因。在RNA-Seq中,剪接事件的确切性质掩盖在跨外显子边界的读段中。将这些读段准确而有效地映射到参考基因组是一个重大挑战。结果:我们开发了PASSion,这是一种基于模式增长算法的流水线,可用于配对末端RNA-Seq读数中的剪接位点检测。将PASSion的性能与三个现有的RNA-Seq分析管道TopHat,MapSplice和HMMSplicer进行比较,发现PASSion在这些包装上具有竞争力。此外,PASSion的性能不受读取长度和覆盖范围的影响。当检测高度丰富的转录本中的连接点时,它的性能比其他三种方法更好。 PASSion具有检测没有已知剪接基序的接合点的能力,而其他工具无法找到这些接合点。在两个公共RNA-Seq数据集中,PASSion预测类似于137 000和173 000的剪接事件,其中Ensembl转录本数据库中标注的平均已知接合处为82个,而新颖的则为18%。此外,我们的程序包还可以发现多个样本之间的差异和共享剪接模式。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号