首页> 美国卫生研究院文献>Nucleic Acids Research >UnSplicer: mapping spliced RNA-seq reads in compact genomes and filtering noisy splicing
【2h】

UnSplicer: mapping spliced RNA-seq reads in compact genomes and filtering noisy splicing

机译:UnSplicer:在紧凑的基因组中绘制剪接的RNA-seq读图并过滤有噪声的剪接

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Accurate mapping of spliced RNA-Seq reads to genomic DNA has been known as a challenging problem. Despite significant efforts invested in developing efficient algorithms, with the human genome as a primary focus, the best solution is still not known. A recently introduced tool, TrueSight, has demonstrated better performance compared with earlier developed algorithms such as TopHat and MapSplice. To improve detection of splice junctions, TrueSight uses information on statistical patterns of nucleotide ordering in intronic and exonic DNA. This line of research led to yet another new algorithm, UnSplicer, designed for eukaryotic species with compact genomes where functional alternative splicing is likely to be dominated by splicing noise. Genome-specific parameters of the new algorithm are generated by GeneMark-ES, an ab initio gene prediction algorithm based on unsupervised training. UnSplicer shares several components with TrueSight; the difference lies in the training strategy and the classification algorithm. We tested UnSplicer on RNA-Seq data sets of Arabidopsis thaliana, Caenorhabditis elegans, Cryptococcus neoformans and Drosophila melanogaster. We have shown that splice junctions inferred by UnSplicer are in better agreement with knowledge accumulated on these well-studied genomes than predictions made by earlier developed tools.
机译:已知将剪接的RNA-Seq读数准确定位到基因组DNA是一个具有挑战性的问题。尽管投入了大量精力开发有效的算法,但以人类基因组为主要焦点,最佳解决方案仍然未知。与诸如TopHat和MapSplice之类的较早开发的算法相比,最近推出的TrueSight工具具有更好的性能。为了改善对剪接点的检测,TrueSight使用有关内含子和外显子DNA中核苷酸排序的统计模式的信息。这一系列研究导致了另一种新算法,即UnSplicer,它是为基因组紧凑的真核生物设计的,其中功能性可变剪接可能主要由剪接噪声主导。新算法的基因组特定参数由GeneMark-ES生成,GeneMark-ES是一种基于无监督训练的从头算基因的预测算法。 UnSplicer与TrueSight共享多个组件。区别在于训练策略和分类算法。我们在拟南芥,秀丽隐杆线虫,新隐球菌和黑腹果蝇的RNA-Seq数据集上测试了UnSplicer。我们已经表明,与早期开发的工具所做的预测相比,UnSplicer推断的剪接点与在这些经过充分研究的基因组上积累的知识更加吻合。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号