首页> 美国卫生研究院文献>Nucleic Acids Research >Integration of mapped RNA-Seq reads into automatic training of eukaryotic gene finding algorithm
【2h】

Integration of mapped RNA-Seq reads into automatic training of eukaryotic gene finding algorithm

机译:将映射的RNA-Seq读码整合到真核基因发现算法的自动训练中

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

We present a new approach to automatic training of a eukaryotic ab initio gene finding algorithm. With the advent of Next-Generation Sequencing, automatic training has become paramount, allowing genome annotation pipelines to keep pace with the speed of genome sequencing. Earlier we developed GeneMark-ES, currently the only gene finding algorithm for eukaryotic genomes that performs automatic training in unsupervised ab initio mode. The new algorithm, GeneMark-ET augments GeneMark-ES with a novel method that integrates RNA-Seq read alignments into the self-training procedure. Use of ‘assembled’ RNA-Seq transcripts is far from trivial; significant error rate of assembly was revealed in recent assessments. We demonstrated in computational experiments that the proposed method of incorporation of ‘unassembled’ RNA-Seq reads improves the accuracy of gene prediction; particularly, for the 1.3 GB genome of Aedes aegypti the mean value of prediction Sensitivity and Specificity at the gene level increased over GeneMark-ES by 24.5%. In the current surge of genomic data when the need for accurate sequence annotation is higher than ever, GeneMark-ET will be a valuable addition to the narrow arsenal of automatic gene prediction tools.
机译:我们提出了一种自动训练真核生物从头算基因寻找算法的新方法。随着下一代测序的出现,自动训练已变得至关重要,这使得基因组注释流程可以跟上基因组测序的速度。较早之前,我们开发了GeneMark-ES,这是目前唯一在无监督的从头开始模式下进行自动训练的真核基因组基因发现算法。新算法GeneMark-ET通过将RNA-Seq读数比对整合到自训练程序中的新方法增强了GeneMark-ES。使用“组装的” RNA-Seq转录本并非易事。最近的评估显示出明显的组装错误率。我们在计算实验中证明,提出的“未组装” RNA-Seq读段整合方法可提高基因预测的准确性;特别是,对于埃及伊蚊的1.3 GB基因组,在基因水平上的预测敏感性和特异性的平均值比GeneMark-ES高出24.5%。在当前的基因组数据激增时,对精确序列注释的需求比以往任何时候都高,GeneMark-ET将成为自动基因预测工具的缩影。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号